WO2003056837A1 - Method for resizing images in block dct domain - Google Patents

Method for resizing images in block dct domain Download PDF

Info

Publication number
WO2003056837A1
WO2003056837A1 PCT/KR2002/002083 KR0202083W WO03056837A1 WO 2003056837 A1 WO2003056837 A1 WO 2003056837A1 KR 0202083 W KR0202083 W KR 0202083W WO 03056837 A1 WO03056837 A1 WO 03056837A1
Authority
WO
WIPO (PCT)
Prior art keywords
sample
samples
dct
image
resizing
Prior art date
Application number
PCT/KR2002/002083
Other languages
French (fr)
Inventor
Hyun-Wook Park
Young-Seo Park
Seung-Kyun Oh
Original Assignee
Ontimetek Inc.
Korea Advanced Institute Of Science And Technology
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
Filing date
Publication date
Application filed by Ontimetek Inc., Korea Advanced Institute Of Science And Technology filed Critical Ontimetek Inc.
Priority to AU2002354281A priority Critical patent/AU2002354281A1/en
Publication of WO2003056837A1 publication Critical patent/WO2003056837A1/en

Links

Classifications

    • HELECTRICITY
    • H04ELECTRIC COMMUNICATION TECHNIQUE
    • H04NPICTORIAL COMMUNICATION, e.g. TELEVISION
    • H04N19/00Methods or arrangements for coding, decoding, compressing or decompressing digital video signals
    • H04N19/10Methods or arrangements for coding, decoding, compressing or decompressing digital video signals using adaptive coding
    • H04N19/102Methods or arrangements for coding, decoding, compressing or decompressing digital video signals using adaptive coding characterised by the element, parameter or selection affected or controlled by the adaptive coding
    • H04N19/132Sampling, masking or truncation of coding units, e.g. adaptive resampling, frame skipping, frame interpolation or high-frequency transform coefficient masking
    • HELECTRICITY
    • H04ELECTRIC COMMUNICATION TECHNIQUE
    • H04NPICTORIAL COMMUNICATION, e.g. TELEVISION
    • H04N19/00Methods or arrangements for coding, decoding, compressing or decompressing digital video signals
    • H04N19/48Methods or arrangements for coding, decoding, compressing or decompressing digital video signals using compressed domain processing techniques other than decoding, e.g. modification of transform coefficients, variable length coding [VLC] data or run-length data
    • HELECTRICITY
    • H04ELECTRIC COMMUNICATION TECHNIQUE
    • H04NPICTORIAL COMMUNICATION, e.g. TELEVISION
    • H04N19/00Methods or arrangements for coding, decoding, compressing or decompressing digital video signals
    • H04N19/50Methods or arrangements for coding, decoding, compressing or decompressing digital video signals using predictive coding
    • H04N19/59Methods or arrangements for coding, decoding, compressing or decompressing digital video signals using predictive coding involving spatial sub-sampling or interpolation, e.g. alteration of picture size or resolution
    • HELECTRICITY
    • H04ELECTRIC COMMUNICATION TECHNIQUE
    • H04NPICTORIAL COMMUNICATION, e.g. TELEVISION
    • H04N19/00Methods or arrangements for coding, decoding, compressing or decompressing digital video signals
    • H04N19/60Methods or arrangements for coding, decoding, compressing or decompressing digital video signals using transform coding
    • HELECTRICITY
    • H04ELECTRIC COMMUNICATION TECHNIQUE
    • H04NPICTORIAL COMMUNICATION, e.g. TELEVISION
    • H04N19/00Methods or arrangements for coding, decoding, compressing or decompressing digital video signals
    • H04N19/60Methods or arrangements for coding, decoding, compressing or decompressing digital video signals using transform coding
    • H04N19/625Methods or arrangements for coding, decoding, compressing or decompressing digital video signals using transform coding using discrete cosine transform [DCT]
    • HELECTRICITY
    • H04ELECTRIC COMMUNICATION TECHNIQUE
    • H04NPICTORIAL COMMUNICATION, e.g. TELEVISION
    • H04N19/00Methods or arrangements for coding, decoding, compressing or decompressing digital video signals
    • H04N19/90Methods or arrangements for coding, decoding, compressing or decompressing digital video signals using coding techniques not provided for in groups H04N19/10-H04N19/85, e.g. fractals

Definitions

  • the present invention relates to a method for reducing or increasing size of an image using symmetric convolution in block discrete cosine transform (hereinafter, "DCT") domain.
  • DCT block discrete cosine transform
  • VOD video on demand
  • resizing of an image requires two operations: sampling rate change and low pass filtering. For reducing an image size, an input image is low pass filtered and then down-sampled (Decimation); while for increasing an image size, an input image is up- sampled and then low pass filtered (Interpolation), which operations commonly are performed in the pixel domain.
  • Decimation down-sampled
  • Interpolation low pass filtered
  • compressed images are normally discrete cosine transformed (DCT)
  • DCT discrete cosine transformed
  • the present invention conceived to solve the above problems, aims to provide an image resizing method in the DCT domain, which method is capable of resizing an image in any size.
  • Still another objective of the present invention is to provide an image resizing method requiring a reduced amount of calculations.
  • a DCT generally used in reducing an image size is of type 2, the multiplication- convolution properties of which can be summarized as follows:
  • Equation 3 the weighting function u(k) is as Equation 3, below.
  • W(n) which is a random function here, can be a filtering function in a upsampling/downsampling process as per the present invention.
  • one 8xM sample is generated by values resulting from the M-number symmetrical convolutions to each sample, to subsequently generate one 8-sample DCT block through taking first eight samples from the 8xM sample.
  • one 8-sample DCT block can also be generated by adding each sample to one another after first eight samples of values resulting from the M-number symmetrical convolutions have been taken.
  • window functions, w / (n) ⁇ 6 and W 2 (n) ⁇ for cutting off desired parts from the signals in (b) as well as their respective DCT coefficients, W ⁇ (k) ⁇ , and W 2 (k) ⁇ ( s are shown.
  • a multiplication of (b) signal in the spatial domain by each the window function yields a (d) signal, which corresponds to a signal resulting from a symmetric convolution of the DCT coefficients of (b) with the DCT coefficients of each window function in the DCT domain. It can be seen here that the window functions used in the upsampling are the
  • Fig. 4 shows a block diagram of an apparatus for performing the above process in accordance with the present invention.
  • Equation 19 The above upsampling process can be expressed in matrices by Equations 19 and 20.
  • a window function as well as a DCT coefficient of an image for convolution in the DCT domain comprise zeroes, and convolution in the DCT domain is performed symmetrically, it can be easily expressed in matrices and a further simplification of the window functions within a prescribed image quality can be achieved.
  • Equation 21 By using the simplified Equation 21, a 60% reduction in calculations can be reached in comparison to cases where Equations 11 and 12 are used.
  • Fig. 5b illustrates resizing of an image by 3/2-times, i.e. 1.5 fold resizing.
  • each ofthe 8-sample signal is first upsampled to three 8-sample signals, and then, each two 8-sample signals are downsampled to form one 8-sample signal, with which procedure a 3/2-times resizing ofthe image is achieved.
  • 8xL samples are generated by adding (L-l)x8 zero samples at the end of an 8-sample in a 8-sample DCT block. Then, these 8xL samples are symmetrically convoluted with the DCT coefficients of the L window functions, to yield L symmetrical convolution values of 8xL samples.
  • An L-fold upsampling is completed by generating L DCT blocks of 8 samples after every L th sample has been taken from these values.
  • 8xM samples are generated by inserting a zero sample at the end of each sample of the DCT blocks upsampled, and each of 8xM samples of adjacent M DCT blocks are symmetrically convoluted with the DCT coefficient of the M window functions, to generate M results of the symmetrical convolution.
  • a L/M-times resizing of an image is completed by generating one 8-sample DCT through addition of all the upper 8-samples of such results.
  • the present invention provides a resized DCT block in
  • the present invention provides an image resizing method capable of resizing an image in a size at random, such as resizing by L/M-times through upsampling an input image by L-times and then downsampling the same by 1/M-times. Furthermore, the present invention provides an image resizing method, wherein the amount of calculations is substantially reduced through simplification of the window functions.

Landscapes

  • Engineering & Computer Science (AREA)
  • Multimedia (AREA)
  • Signal Processing (AREA)
  • Physics & Mathematics (AREA)
  • Discrete Mathematics (AREA)
  • General Physics & Mathematics (AREA)
  • Image Processing (AREA)
  • Compression Or Coding Systems Of Tv Signals (AREA)
  • Compression Of Band Width Or Redundancy In Fax (AREA)
  • Compression, Expansion, Code Conversion, And Decoders (AREA)

Abstract

A method for reducing or increasing image size using symmetric convolution in block DCT domain is provided. In down-sampling method, zeros are added to the end of block samples to be symmetrically convoluted with DCT coefficients of M window functions, the resulting values of such convolution are added together, and first eight samples thereof are down-sampled by 1/M-time; while in up-sampling method, eight samples of 0 are padded to eight samples of one block, which are symmetrically convoluted with DCT coefficients of L-number window functions, and then up-sampled by L-times taking only even samples from the resulting values of the convolution. Resizing of image by L/M-times can be effected through down-sampling by 1/M after the image has been up-sampled by L-times. The quantity of calculations can substantially be reduced by simplifying the window functions using a matrix equation.

Description

METHOD FOR RESIZING IMAGES IN BLOCK DCT DOMAIN
Technical Field of the Invention
The present invention relates to a method for reducing or increasing size of an image using symmetric convolution in block discrete cosine transform (hereinafter, "DCT") domain.
Background of the Invention
With wide spreading use of digital images the need for an efficient tool for manipulating such images grows as well. A fundamental and important tool for such manipulation is resizing of images. In particular, resizing of image is inevitable in cases where various imaging means shall be supported by a single image data. For example, a video on demand (hereinafter, "VOD") service shall be able to support various differently sized imaging means such as computer monitors, HDTVs, general TVs, as well as LCD screens of mobile phones. Since generating and storing separate images fitting to each of such imaging means would be almost impossible, it would be effective that a single image data is generated and stored, to be used in each of the above imaging means after an appropriate resizing thereof.
Generally, resizing of an image requires two operations: sampling rate change and low pass filtering. For reducing an image size, an input image is low pass filtered and then down-sampled (Decimation); while for increasing an image size, an input image is up- sampled and then low pass filtered (Interpolation), which operations commonly are performed in the pixel domain.
However, since almost all still images and moving images are stored in compressed forms, a conventional image resizing process comprises three steps of decompressing, resizing, and compressing an image, requiring much time.
In order to overcome such shortcomings, different methods for directly resizing compressed images have been disclosed. Since compressed images are normally discrete cosine transformed (DCT), resizing an image means here resizing in the DCT domain.
One of such methods is disclosed in USP 5,708,732 (Fast DCT Domain Downsampling and Inverse Motion Compensation), wherein an image is downsampled by
1/n through generating one 8x8 DCT block from the average of n 8x8 DCT blocks.
However, a problem with such method is that, though it allows achieving a reduction rate of 1/n, it is incapable of achieving a reduction rate at random, of m/n.
Another method is disclosed in USP 5,845,015 (Method and Apparatus for Resizing Images Using the Discrete Cosine Transform), wherein a 4x4 DCT block is generated in a manner that low pass filtering is performed by multiplying the DCT coefficients with the transform coefficients of the low pass filter, and then, the low pass filtered coefficients thus calculated are folded over to be subsequently subtracted. However, a problem with this method is that the DCT block thus produced cannot be used immediately in the DCT domain, for it is not of 8x8 size as used in a general compression process.
Detailed Description of the Invention The present invention, conceived to solve the above problems, aims to provide an image resizing method in the DCT domain, which method is capable of resizing an image in any size.
Another objective of the present invention is to provide an image resizing method in the DCT domain, wherein the resized DCT block is of 8x8 size as used in general compression standards.
Still another objective of the present invention is to provide an image resizing method requiring a reduced amount of calculations.
Brief Description of the Drawings
Fig. 1 is a diagram showing the concept of two-fold downsampling in accordance with the present invention.
Fig. 2 is a block diagram of an apparatus for performing two-fold downsampling in accordance with the present invention.
Fig. 3 is a diagram showing the concept of two-fold upsampling in accordance with the present invention.
Fig. 4 is a block diagram of an apparatus for performing two-fold upsampling in accordance with the present invention. Figs. 5a and 5b are concept diagrams showing a method for image resizing in a resizing rate at random.
Description of the Preferred Embodiments In downloading method as per the present invention, zeros are inserted to the end of each sample of a block to be symmetrically convoluted with DCT coefficients of M window functions, the resulting values of such convolution are added together, and first eight samples thereof are downsampled by 1/M-time; while in upsampling method as per the present invention, eight samples of 0 are padded to eight samples of one block, which are symmetrically convoluted with DCT coefficients of L window functions, and then upsampled by L-time taking only even samples from the resulting values of the convolution. An image is resized by L/M after it has been upsampled by L-time and then downsampled by 1/M-time. The quantity of calculations can substantially be reduced by simplifying the window functions using a matrix equation.
Now, a discussion on the relational expressions of multiplication-convolution properties in the DCT domain follows, such properties being fundamental for embodying a filter for use in the above downsampling/ upsampling processes. A description of the multiplication-convolution in the DCT domain is given in the article, "Symmetric Convolution and the Discrete Sine and Cosine Transforms" by S. A. Martucci (In: IEEE Trans. Signal Processing, vol. 42, No. 5, May 1994).
A DCT generally used in reducing an image size is of type 2, the multiplication- convolution properties of which can be summarized as follows:
Type 2 DCT of a sequence x(n) having size N can be expressed in Equation 1 , while its inverse transformation is expressed in Equation 2.
[Equation 1]
X( k) , ioτO≤k, n<N
Figure imgf000006_0001
[Equation 2] πk n-
*(«) = CϊβHX k)} = -jj *J£Ϊ0u(k)X(k)cos JT 41-) ' for0≤/fe, «<N.
Here, the weighting function u(k) is as Equation 3, below.
[Equation 3]
u( k) _ = / f 11//22.. * k== 00 ] uK*} { 1, l≤k<N I
The multiplication-convolution properties of a DCT in Equation 1 through Equation 3 is defined by Equation 4, below.
[Equation 4]
x( n) x w(n) = C l { C2t{X( k)) <g)C2e{ uK «)} }
W(n), which is a random function here, can be a filtering function in a upsampling/downsampling process as per the present invention.
However, Θ s _ which represents a symmetric convolution, can be defined by
Equation 5, below.
[Equation 5]
Figure imgf000007_0001
Xe and We are anti-symmetric extensions of X(k) and W(k), respectively, wherein, e.g., Xe(k) is defined by X(k) by Equation 6.
[Equation 6]
( X(k) , Q≤k< N i
0. k= N
*, (*) = ' - X(2N- k) , N<k<2N}
Thus, if an arbitrary filter in the spatial domain is given, a counterpart filter in the DCT domain can easily be obtained using Equation 4.
Since a DCT adopted in actual image compression is a type 2 DCT in orthogonal form, the total calculation requires a scaling factor.
The relation between a type 2 DCT and a general DCT can be expressed as follows:
(*r for 0 ≤ k,n < N
Figure imgf000008_0001
Figure imgf000008_0002
Equation 4 can be summarized as Formula 7 below, in view ofthe orthogonal form.
[Equation 7]
x(n) x tυ(n) = C2 ' 0r l Λo{ k)(F(k)C2θriho{X(k))<S>F(k)C2ortho{ u* n)))}
FXk) 2 N. k= 0 , J~2N, l≤k< N \
G(k) = l/F(k)
A description of the method for resizing images in accordance with the present invention is given below from the above described background.
Fig. 1 is a diagram showing the concept of two-fold downsampling in accordance with the present invention, wherein the left column shows examples of digital image signals in the spatial domain, while the right column shows the coefficients generated by applying a (forward) DCT to these signals. i (n)% and xi(n) in (a) are 8 sample signals ofthe adjacent blocks, while X\ (k)% and
Xι(k)%, are their respective DCT coefficients. » in (b) are symmetrical expansions in spatial domain of xι(n)$ and x∑fnjs, respectively, corresponding respectively
to . , which are generated by adding zeros to each end of the corresponding DCT coefficient samples in the DCT domain. In (c), the window functions, >ι(n)i6 and 2(n)κ, for cutting off of desired parts from the signals in (b) as well as their corresponding DCT coefficients,
Figure imgf000009_0001
and W2(k)ιβ are shown.
Signals indicated in (d) are generated by multiplying the signals in (b) with the window functions in the spatial domain, which correspond to the values obtained by symmetrical convolution of DCT coefficients in (b) with DCT coefficients of the window functions in the DCT domain. As shown in (e), the sum of these two signals after the windowing, x(n)ιβ is the same as the original signal, i.e. the signal in (a), which can be expressed in the DCT domain by Equation 8 below.
[Equation 8 ]
X(k)l6 = ,(*)I6 ®5 Wx (k) 6 +X2(k)x6 <8>5 W2(k 6
Accordingly, since the 16 samples in (e) are the same as those obtained through an application of DCT to the adjacent two blocks, a two-fold downsampling can be achieved by taking eight coefficients from the low frequency side.
Fig. 2 shows a block diagram of an apparatus for performing the above two-fold downsampling.
The zero inserters 11, 12 output two 16-sample signals by inserting zeros at the end of each sample of the inputted 8-sample signals Xi and X2 of the two adjacent blocks. The symmetric convolution calculators 13, 14 perform a symmetric convolution of these 16-sample signals to the window function W| and W2, respectively. The adder 15 outputs a result as Equation 8 by adding results outputted by the symmetric convolution calculators 13, 14. The first 8-sample extractor 17 takes first 8 samples from the 16-sample signals resulting from Equation 8, i.e. 8 samples from the low frequency side, and then, outputs two-fold downsampled signal Z.
The window functions, Wι(k)ιβ and Wι(k)i6 can be obtained using Equation 1 , results of which calculation are shown in Table 1.
[Table 1]
Figure imgf000011_0001
As shown in Table 1, since the coefficients contain a lot of zeros and the convolution is performed symmetrically, the window functions can be summarized in 8x8 matrices as follows:
[Equation 9]
XMtQsWM** for0≤*<8 * W,"Xl
[Equation 10] 01
[ς\ uoμenbg]
Λ['(ύlx
Figure imgf000012_0001
*(ς)zx *(p)zx * τx *(z)τx
Figure imgf000012_0002
°(o)zx] =
[H uoμtmbg]
91
Figure imgf000012_0003
[ t uoμtmbg]
(9*(π) -,l<«!*>- "ti> >- ("(coin* "(i) »- ("tit)!* ♦ "to"*)-
Figure imgf000012_0004
"αi'j-
("(ci)1*-"**)'*)- (,,(tιι f-"U),)- <•'«!)!«♦ «<«!*>- (*,<it) *"(i>!*)- ("(«)'« + "(D ϊ- "(ι) >- *'tt) -
<*'t - " D - <"(ti) ;ΛV o
(*,(ϊi> +,,(tιϊ,Λ>-
Figure imgf000012_0005
01
[31 uoμenbg]
»■(«>'_« "(si) -"<f>'- «l(tι)lΛ. «'(ι) "(tD t'Ot »l(ιι)!_ι ♦»'(*) 'a "<»>l- ♦ "ID1* "ω'V o o α "(o) α o
"<π)'.« -"<«>',- ι)|j-"ωlι 1(tι) »"(t) »'(cι>!-«* •'<.)!ιι»*'(ιl1-i «"((>'-« "«>'.-
"(iD'Λ-'O 'Λ "<S|) |»"<6>'.. "(«>!« ♦»'<.)!• «l(ll)lΛ» "(t) "C6)'*»"(t) "ω' *"!!)1* *l(f>I-»« »,<ι) ι l(t) rpΛΛ o o o o o . o "(o) o
"(tl)lΛ*"(tl)'-l »'(CI)'j(*»'<ll) l »l(lll1Λ»"<6) "(SlU*"!.)1* »'(.)'*♦ •'(()!« ,l(fl +"<C>- »Hc>,-»**,(ι>l« "<l)!*r o o o o o o o "(o)
[π uoμenbg] g
:S OIJOJ SB si oi purj 6 suoμenbg ui xureui qoB
XSIP'Λ S = rxSΣ»ΛV = 8>;y50J°j " ^ε^® 'z(%)zx
C80Z0/Z0HX/I3d Z.C89S0/C0 OΛV
Figure imgf000013_0001
Consequently, a downsampling (zor) can be expressed in matrix form by Equation
16.
[Equation 16]
Zc (3), Z° (4)8 Zor(5)8 Z'(6)t Zor(7)i] T
Figure imgf000013_0002
}.
Here, the scaling factors, F and G can be expressed by Equations 17 and 18, respectively.
[Equation 17]
Figure imgf000013_0003
[Equation 18]
Figure imgf000014_0001
The quantity of calculations for an upsampling / downsampling can substantially be reduced in comparison to a conventional image resizing method, considering the zero coefficients in the symmetrical convolutions in matrix form as in the above Equation 16 and calculating the part in {}of Equation 16 in advance.
In the same manner, 8 samples of the adjacent three blocks can be three-fold downsampled by three window functions. Here, left 8 samples of first window function are 1 , while the remaining 16 samples thereof are 0; middle 8 samples of the second window function are 1, while the remaining 16 samples thereof are 0; and right 8 samples of the third window function are 1. while the remaining 16 samples thereof are 0. The DCT coefficients of these window functions can be obtained with Formula 1 above.
A 1/M-time downsampling of an image can be performed by expanded application of the above method, i.e. 8-samples of the adjacent M blocks can be downsampled by M window functions in the same manner.
To elaborate, 8xM samples are generated by inserting a zero sample at the end of each sample of 8-sample DCT blocks, and then, each of the 8xM samples of adjacent M- number DCT blocks is symmetrically convoluted with the DCT coefficients of M-number window functions. 1/M-time downsampling is completed when an 8-sample DCT is generated by adding first eight samples of values resulting from the above M-number symmetrical convolutions. Here, the M-number window function is a function having a size of 8xM samples, and the n' window function thereof has 1 for its nth eight sample while the remaining samples thereof are all zeros.
In order to generate one 8-sample DCT by adding first eight samples of values resulting from the M-number symmetrical convolutions, one 8xM sample is generated by values resulting from the M-number symmetrical convolutions to each sample, to subsequently generate one 8-sample DCT block through taking first eight samples from the 8xM sample. Alternatively, one 8-sample DCT block can also be generated by adding each sample to one another after first eight samples of values resulting from the M-number symmetrical convolutions have been taken.
Next, the concept of two-fold upsampling in accordance with the present invention is explained referring to Fig. 3. In Fig. 3, the left column exemplifies digital image signals in the spatial domain, while the right column indicates DCT coefficients of these signals. x(n)s in (a) is an 8-sample signal of a block, while X(k)g is DCT coefficient of x(n)s. Y(k)ιβ in (b) represents a signal with added eight zeros at high frequency side of X(k)s, while y(n)i6 represents a signal appearing in the spatial domain at that time. It can be seen that the signal in the spatial domain is an upsampled 16-sample signal. In (c), window functions, w/(n)ι6 and W2(n)ιβ for cutting off desired parts from the signals in (b) as well as their respective DCT coefficients, Wι(k)κ, and W2(k)\(s are shown. A multiplication of (b) signal in the spatial domain by each the window function yields a (d) signal, which corresponds to a signal resulting from a symmetric convolution of the DCT coefficients of (b) with the DCT coefficients of each window function in the DCT domain. It can be seen here that the window functions used in the upsampling are the
same as that used in the downsampling. From the first 8-sample of
Figure imgf000015_0001
and the last 8- sample of
Figure imgf000016_0001
as in (e) can be obtained, which procedure is the same as that of taking even samples of (d) signal in the DCT domain.
Fig. 4 shows a block diagram of an apparatus for performing the above process in accordance with the present invention.
A zero padder 21 pads zeroes to each 8-sample signal X in one block, at an 8- sample high frequency section, and outputs 16-sample signals, each of which is then symmetrically convoluted with the window functions Wi, W2 by the symmetrical convolution calculators 23, 24. After that, the even sample extractors 25, 26 extract even samples from outputs of the symmetrical convolution calculators 23, 24 and output two 8- sample signals Yi, Y , with which procedure a two fold upsampling process is completed.
The above upsampling process can be expressed in matrices by Equations 19 and 20.
[Equation 19] y°r = G W 2 Fx0f = Rx" ; R=G W,B, F
[Equation 20]
y? = GSWa α2 Fxor = GSW,uiSFxor = SG ^FSx"^ SRSxor
Here, ? > ? are results of the respective upsampled 8-samples, while the other matrices are the same as those in case ofthe downsampling.
In the same manner, a three-fold upsampling can be preformed on each 8-sample signal in a block with three different window functions as in a three-fold downsampling.
To generalize: First, 8xL samples are generated by adding (L-l)x8 zero samples at the end of an 8-sample DCT block. Then, these 8xL samples are symmetrically convoluted with the DCT coefficients of the L window functions, to yield L symmetrical convolution values of 8xL samples. An L-fold upsampling is completed by generating L DCT blocks of 8 samples after every Lth sample has been taken from these values. Here, the L window functions are functions having a size of 8xL samples, and the nth window function thereof has 1 for its nth 8-sample while the remaining samples thereof are all zeros.
Since a window function as well as a DCT coefficient of an image for convolution in the DCT domain comprise zeroes, and convolution in the DCT domain is performed symmetrically, it can be easily expressed in matrices and a further simplification of the window functions within a prescribed image quality can be achieved.
For example, a matrix as Equation 21 can be obtained in case window functions for downsampling are employed in a manner that several coefficients thereof are substituted by zeros.
[Equation 21]
Figure imgf000018_0001
By using the simplified Equation 21, a 60% reduction in calculations can be reached in comparison to cases where Equations 11 and 12 are used.
As next, resizing of images at a resizing ratio at random is described below.
In order to achieve a L/M-times resizing of an image, the image is downsampled by 1/M after it has been upsampled by L-times, as described above. An example of such process is illustrated in Fig. 5a. Since the Equation 16 as well as Equations 19 and 20 can easily be transformed for an upsampling/downsampling to a desired size, corresponding window functions can also be easily obtained by the method of the present invention.
Fig. 5b illustrates resizing of an image by 3/2-times, i.e. 1.5 fold resizing. Here, each ofthe 8-sample signal is first upsampled to three 8-sample signals, and then, each two 8-sample signals are downsampled to form one 8-sample signal, with which procedure a 3/2-times resizing ofthe image is achieved.
To generalize: First, 8xL samples are generated by adding (L-l)x8 zero samples at the end of an 8-sample in a 8-sample DCT block. Then, these 8xL samples are symmetrically convoluted with the DCT coefficients of the L window functions, to yield L symmetrical convolution values of 8xL samples. An L-fold upsampling is completed by generating L DCT blocks of 8 samples after every Lth sample has been taken from these values. After that, 8xM samples are generated by inserting a zero sample at the end of each sample of the DCT blocks upsampled, and each of 8xM samples of adjacent M DCT blocks are symmetrically convoluted with the DCT coefficient of the M window functions, to generate M results of the symmetrical convolution. A L/M-times resizing of an image is completed by generating one 8-sample DCT through addition of all the upper 8-samples of such results.
Here, the L window functions are functions having a size of 8xL samples, and the nth window function thereof has 1 for its n' 8-sample while the remaining samples thereof are all zeros. And, the M window functions are functions having a size of 8xM samples, and the mth window function thereof has 1 for its m' 8-sample while the remaining samples thereof are all zeros.
Industrial Applicability
As described above, since the present invention provides a resized DCT block in
8x8 size as used in general compression standards, no inverse DCT transform is required.
In addition, since the present invention provides an image resizing method capable of resizing an image in a size at random, such as resizing by L/M-times through upsampling an input image by L-times and then downsampling the same by 1/M-times. Furthermore, the present invention provides an image resizing method, wherein the amount of calculations is substantially reduced through simplification of the window functions.

Claims

What is claimed is:
1. An image resizing method comprising: a first step wherein 8xM samples are generated by inserting a zero sample at the end of each sample of 8-sample DCT block; a second step wherein each 8xM sample of adjacent M DCT blocks are symmetrically convoluted with the DCT coefficient of the M window functions; and a third step wherein one 8-sample DCT is generated through addition of all the upper 8-samples of values resulting from said M symmetrical convolutions, whereby said M window functions are functions having a size of 8xM samples, and nlh window function thereof has 1 for its nl 8-sample while the remaining samples thereof are all zeros.
2. The image resizing method of Claim 1, wherein said third step comprises steps of: generating one 8xM sample through addition of values for each sample resulting from said M symmetrical convolutions; and generating one 8-sample DCT block through taking the upper 8-samples from said one 8xM sample.
3. The image resizing method of Claim 1, wherein said third step comprises steps of: taking the upper 8-samples from values yielded through said M symmetrical convolutions; and generating one 8-sample DCT block through addition of said M-number 8-samples.
4. An image resizing method comprising the steps of: generating 8xL samples through padding (L-l)x8 zero samples at the end of an 8- sample DCT block; calculating symmetrical convolutions of said 8xL samples and the DCT coefficients of the L-number window functions; and generating L-number 8-sample DCT blocks by taking every L1 sample out of values resulting from said L-number symmetrical convolutions of 8xL samples, whereby said L-number window functions are functions having a size of 8xL samples, and nth window function thereof has 1 for its nth 8-sample while the remaining samples thereof are all zeros.
5. An image resizing method comprising: a first step wherein 8xL samples are generated by padding (L-l)x8 zero samples at the end of 8-samples of an 8-sample DCT block; a second step wherein said 8xL samples are symmetrically convoluted with the DCT coefficients of the L-number window functions; a third step wherein L-number 8-sample DCT blocks are generated by taking every Lth sample out of values resulting from said L-number symmetrical convolutions of 8xL samples; a fourth step wherein 8xM samples are generated by inserting a zero sample at the end of each sample of 8-sample DCT block of said third step; a fifth step wherein each of 8xM samples of adjacent M-number DCT blocks are symmetrically convoluted with the DCT coefficient of the M-number window functions; and a sixth step wherein one 8-sample DCT is generated through addition of all the upper 8-samples of values resulting from said M-number symmetrical convolutions, whereby said L-number window functions are functions having a size of 8xL samples, and nth window function thereof has 1 for its nl 8-sample while the remaining samples thereof are all zeros, while said M-number window functions are functions having a size of 8xM samples, and mth window function thereof has 1 for its ml 8-sample while the remaining samples thereof are all zeros
6. The image resizing method of Calim 5 wherein said sixth step comprises the steps of: generating one 8xM sample through addition of values for each samples resulting from said M-number symmetrical convolutions; and generating one 8-sample DCT block through taking the upper 8-samples from said one 8xM sample.
7. The image resizing method of Claim 5 wherein said sixth step comprises the steps of: taking the upper 8-samples from values yielded through said M-number symmetrical convolutions; and generating one 8-sample DCT block through addition of said M-number 8-samples.
8. A method for resizing an image through downsampling to one 8-sample DCT block, z=[Z(0)8 Z(l)8 Z(2)8 Z(3)8 Z(4)8 Z(5)8 Z(6)8 Z(7)8 ]τ from two 8-sample DCT blocks,
Figure imgf000023_0001
^(2)8 X2(3)s X2(A)S X2(5)& X2(6)s X2(7)8 ]τ , wherein said z is obtained from the matrix:
zor=G{ w F T + SW SFx j
whereby,
Figure imgf000023_0002
Figure imgf000023_0003
Figure imgf000023_0004
Figure imgf000024_0001
*i<IJ>l6
0 l<°)|6 o - o 0 0 0 0
W,β. r,(J),6 »ϊ (1
0
Figure imgf000024_0002
3T, (S)l6 ei(])M*r,(),β r,(i)w«»}(*)|t »i<ι>u ♦ r,fli),«
Figure imgf000024_0003
r,(i)lt«r,05)l( »"ι<7>i6-»i<l5>ι« »'ι<*li6-rι(|3li6
Figure imgf000024_0004
|(')l& »i<'>lβt lWl. »ϊ(»16+ l<">l« »l<">lβ.+ ,»|0*lβ
Figure imgf000024_0005
lri«»l«-'»i<">l6 ■'iWlt-'lO'llβ l<7>!6 -*'ι"|)ι«. , and
W,(Λ)i6 = [16.0000, 10.2023, 0, -3.4449, 0, 2.1214, 0, -1.5763, 0, 1.2936, 0, -1.1339,0,1.0450,0,-1.0048].
9. A method for resizing an image through downsampling to one 8-sample DCT block, z=[Z(0)8 Z(l)8 Z(2)8 Z(3)8 Z(4)8 Z(5)8 Z(6)8 Z(7)8 ]τ from two 8-sample DCT blocks, x,=[^ι(0)8 ,(1)8 X\(2)% ,(3)8 ,(4)8 ,(5)8 ,(6)8 Z,(7)8 ]τand x2=[^2(0)8 Λf2(l)8 X2(2)8 X2(3)8 ^2(4)8 X2(5)t 2(6)8 2(7)8 ]T , wherein said z is obtained from the matrix:
zor = G { modFx \r + S W J2 modSFx 2°r
whereby,
Figure imgf000024_0006
Figure imgf000025_0001
Figure imgf000025_0002
Figure imgf000025_0003
and
W,(*)i6 = [16.0000, 10.2023, 0, -3.4449, 0, 2.1214, 0, -1.5763, 0, 1.2936, 0, -1.1339, 0, 1.0450, 0, -1.0048].
10. A method for resizing an image through upsampling to two 8-sample DCT blocks, y,=[7,(0)8 7,(1)8 7,(2)8 7,(3)8 7,(4)8 7,(5)8 Y.(6), 7,(7)8 ]τ and y2=[72(0)8 72(1)8 72(2)8 72(3)8 72(4)8 72(5)8 Y2(6)s 72(7)8 ]τ from one 8-sample DCT block, x=[ΛΪ0)8 X(\)s X(2)i X(3)8 X(A)& X(5)& X(6)& X(7)8 ) wherein said y, and y2 are obtained respectively from the matrices: or d2, i GW , Fx
Figure imgf000026_0001
whereby,
Figure imgf000026_0002
Figure imgf000026_0003
Figure imgf000026_0004
Figure imgf000027_0001
, and
W,0fc),6 = [16.0000, 10.2023, 0, -3.4449, 0, 2.1214, 0, -1.5763, 0, 1.2936, 0, 1.1339, 0, 1.0450, 0, -1.0048].
11. A method for resizing an image through upsampling to two 8-sample DCT blocks, y,=[7,(0)8 7,(1)8 7,(2)8 7,(3)8 7,(4)8 7,(5)8 7,(6)8 7,(7)8 ]τ and y2=[72(0)8 72(1)8 72(2)8 72(3)8 72(4)8 72(5)8 Y2(6)& 72(7)8 ]τ from one 8-sample DCT block, x=[X(0)& X(\)& X(2)t X(3)& X(A)s X(5)8 X(6)8 X(7)s ] wherein said y, and y2 are obtained respectively from the matrices: v 1or = GW d2 .Fx01"
Figure imgf000027_0002
whereby,
Figure imgf000027_0003
Figure imgf000028_0001
Figure imgf000028_0002
Figure imgf000028_0003
W,(*)16 = [16.0000, 10.2023, 0, -3.4449, 0, 2.1214, 0, -1.5763, 0, 1.2936, 0, 1.1339,0, 1.0450,0,-1.0048].
PCT/KR2002/002083 2001-12-22 2002-11-08 Method for resizing images in block dct domain WO2003056837A1 (en)

Priority Applications (1)

Application Number Priority Date Filing Date Title
AU2002354281A AU2002354281A1 (en) 2001-12-22 2002-11-08 Method for resizing images in block dct domain

Applications Claiming Priority (2)

Application Number Priority Date Filing Date Title
KR2001/83146 2001-12-22
KR1020010083146A KR20030053108A (en) 2001-12-22 2001-12-22 Method for resizing images in block-DCT domain

Publications (1)

Publication Number Publication Date
WO2003056837A1 true WO2003056837A1 (en) 2003-07-10

Family

ID=19717434

Family Applications (1)

Application Number Title Priority Date Filing Date
PCT/KR2002/002083 WO2003056837A1 (en) 2001-12-22 2002-11-08 Method for resizing images in block dct domain

Country Status (3)

Country Link
KR (1) KR20030053108A (en)
AU (1) AU2002354281A1 (en)
WO (1) WO2003056837A1 (en)

Cited By (6)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
WO2008148206A1 (en) * 2007-06-04 2008-12-11 Research In Motion Limited Method and device for selecting transform matrices for down-sampling dct image using learning with forgetting algorithm
WO2008148205A1 (en) * 2007-06-04 2008-12-11 Research In Motion Limited Method and device for down-sampling a dct image in the dct domain
WO2008148207A1 (en) * 2007-06-04 2008-12-11 Research In Motion Limited Method and device for selecting optimal transform matrices for down-sampling dct image
US7876976B2 (en) * 2006-02-23 2011-01-25 Seoul National University Industry Foundation Method and apparatus for resizing images using discrete cosine transform
US8233727B2 (en) 2007-10-19 2012-07-31 Research In Motion Limited Arbitrary ratio image resizing in the DCT domain
CN105263027A (en) * 2015-09-17 2016-01-20 东南大学 Down-sampling method and up-sampling method of video frames, and transmission processing method

Families Citing this family (2)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
KR100948413B1 (en) * 2008-03-05 2010-03-19 경희대학교 산학협력단 Method and apparatus for up/down sampling of image block in the integer DCT domain
WO2010085125A2 (en) * 2009-01-22 2010-07-29 삼성전자 주식회사 Method and device for transformation of image and method and device for reverse transformation of image

Citations (1)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
US5845015A (en) * 1995-10-12 1998-12-01 Sarnoff Corporation Method and apparatus for resizing images using the discrete cosine transform

Family Cites Families (3)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CA2171335A1 (en) * 1993-11-30 1995-06-08 Munib A. Wober Coding methods and apparatus for scaling and filtering images using discrete cosine transforms
JPH09322165A (en) * 1996-05-31 1997-12-12 Sony Corp Image decoding device and method, and image reproducing device
US6222944B1 (en) * 1998-05-07 2001-04-24 Sarnoff Corporation Down-sampling MPEG image decoder

Patent Citations (1)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
US5845015A (en) * 1995-10-12 1998-12-01 Sarnoff Corporation Method and apparatus for resizing images using the discrete cosine transform

Non-Patent Citations (2)

* Cited by examiner, † Cited by third party
Title
MARTUCCI STEPHEN A., MERSEREAU RUSSEL M.: "New approach to block filtering of images using symmetric convolution and the DST or DCT", INTERNATIONAL SYMPOSIUM ON CIRCUITS AND SYSTEM (ISCAS) CONFERENCE, 1993, pages 259 - 262 *
MARTUCCI STEPHEN A.: "Symmetric convolution and the discrete sine and cosine transforms", IEEE TRANSACTIONS ON SIGNAL PROCESSING, vol. 42, no. 5, May 1994 (1994-05-01), pages 1038 - 1051, XP000863873 *

Cited By (15)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
US7876976B2 (en) * 2006-02-23 2011-01-25 Seoul National University Industry Foundation Method and apparatus for resizing images using discrete cosine transform
US8094956B2 (en) 2007-06-04 2012-01-10 Research In Motion Limited Method and device for down-sampling a DCT image in the DCT domain
US8687905B2 (en) 2007-06-04 2014-04-01 Blackberry Limited Method and device for selecting optimal transform matrices for down-sampling DCT image
WO2008148205A1 (en) * 2007-06-04 2008-12-11 Research In Motion Limited Method and device for down-sampling a dct image in the dct domain
US7978934B2 (en) 2007-06-04 2011-07-12 Research In Motion Limited Method and device for selecting optimal transform matrices for down-sampling DCT image
US7995850B2 (en) 2007-06-04 2011-08-09 Research In Motion Limited Method and device for selecting transform matrices for down-sampling DCT image using learning with forgetting algorithm
WO2008148206A1 (en) * 2007-06-04 2008-12-11 Research In Motion Limited Method and device for selecting transform matrices for down-sampling dct image using learning with forgetting algorithm
US8233703B2 (en) 2007-06-04 2012-07-31 Research In Motion Limited Method and device for selecting transform matrices for down-sampling DCT image using learning with forgetting algorithm
US8903168B2 (en) 2007-06-04 2014-12-02 Blackberry Limited Method and device for selecting transform matrices for down-sampling DCT image using learning with forgetting algorithm
US8233732B2 (en) 2007-06-04 2012-07-31 Research In Motion Limited Method and device for selecting optimal transform matrices for down-sampling DCT image
US8792742B2 (en) 2007-06-04 2014-07-29 Blackberry Limited Method and device for down-sampling a DCT image in the DCT domain
WO2008148207A1 (en) * 2007-06-04 2008-12-11 Research In Motion Limited Method and device for selecting optimal transform matrices for down-sampling dct image
US8478059B2 (en) 2007-10-19 2013-07-02 Research In Motion Limited Arbitrary ratio image resizing in the DCT domain
US8233727B2 (en) 2007-10-19 2012-07-31 Research In Motion Limited Arbitrary ratio image resizing in the DCT domain
CN105263027A (en) * 2015-09-17 2016-01-20 东南大学 Down-sampling method and up-sampling method of video frames, and transmission processing method

Also Published As

Publication number Publication date
AU2002354281A1 (en) 2003-07-15
KR20030053108A (en) 2003-06-28

Similar Documents

Publication Publication Date Title
US6263119B1 (en) Method and apparatus for resizing images using the discrete trigonometric transform
KR100388377B1 (en) Image Encoding Device Decoding Device
DE69532640T2 (en) ADAPTIVE MULTIPORT VIDEO DESIGN AND BRIDGE SYSTEM
EP0781052B1 (en) Universal MPEG decoder with scalable picture size
US20020041632A1 (en) Picture decoding method and apparatus
US5835160A (en) Sampling rate conversion using digital differential analyzers
JPH08107568A (en) Method and equipment for scaling video picture
EP1041511A2 (en) System and method for image scaling
US20020140853A1 (en) Sampling rate conversion apparatus and method thereof
JP2000504520A (en) Improved sample ratio conversion
Park et al. Design and analysis of an image resizing filter in the block-DCT domain
WO2003056837A1 (en) Method for resizing images in block dct domain
JP2002500455A (en) High-speed IDCT / downsampling combined operation method and apparatus
US8417066B2 (en) Method and system for polyphase filtering by combining IIR and FIR filters and its applications in video scaling
US20030021486A1 (en) Method and apparatus for image scaling
JP2000244743A (en) Filter bank constitution method and filter bank device
JP2000165664A (en) Resolution converter for image and resolution conversing method for image
US6934420B1 (en) Wave image compression
US5483474A (en) D-dimensional, fractional bandwidth signal processing apparatus
KR100629310B1 (en) Apparatus, and associated method, for altering the resolution of a digital image
EP0700016A1 (en) Improvements in and relating to filters
KR100287866B1 (en) Vertical image format conversion device and digital receiving system using same
US20030102991A1 (en) Digital sample frequency converter
US20040258317A1 (en) Apparatus for converting conversion encoding coefficients using sampling/resolution conversion
US8068685B2 (en) Method for resizing images using integer DCT coefficients

Legal Events

Date Code Title Description
AK Designated states

Kind code of ref document: A1

Designated state(s): AE AG AL AM AT AU AZ BA BB BG BR BY BZ CA CH CN CO CR CU CZ DE DK DM DZ EC EE ES FI GB GD GE GH GM HR HU ID IL IN IS JP KE KG KP KZ LC LK LR LS LT LU LV MA MD MG MK MN MW MX MZ NO NZ OM PH PL PT RO RU SC SD SE SG SI SK SL TJ TM TN TR TT TZ UA UG US UZ VC VN YU ZA ZM ZW

AL Designated countries for regional patents

Kind code of ref document: A1

Designated state(s): GH GM KE LS MW MZ SD SL SZ TZ UG ZM ZW AM AZ BY KG KZ MD RU TJ TM AT BE BG CH CY CZ DE DK EE ES FI FR GB GR IE IT LU MC NL PT SE SK TR BF BJ CF CG CI CM GA GN GQ GW ML MR NE SN TD TG

121 Ep: the epo has been informed by wipo that ep was designated in this application
DFPE Request for preliminary examination filed prior to expiration of 19th month from priority date (pct application filed before 20040101)
32PN Ep: public notification in the ep bulletin as address of the adressee cannot be established

Free format text: NOTING OF LOSS OF RIGHTS PURSUANT TO RULE 69(1) EPC

122 Ep: pct application non-entry in european phase
NENP Non-entry into the national phase

Ref country code: JP

WWW Wipo information: withdrawn in national office

Country of ref document: JP