WO2017045344A1 - 一种视频帧的下采样方法和上采样方法以及传输处理方法 - Google Patents
一种视频帧的下采样方法和上采样方法以及传输处理方法 Download PDFInfo
- Publication number
- WO2017045344A1 WO2017045344A1 PCT/CN2016/073415 CN2016073415W WO2017045344A1 WO 2017045344 A1 WO2017045344 A1 WO 2017045344A1 CN 2016073415 W CN2016073415 W CN 2016073415W WO 2017045344 A1 WO2017045344 A1 WO 2017045344A1
- Authority
- WO
- WIPO (PCT)
- Prior art keywords
- frequency component
- video frame
- low
- cosine transform
- matrix
- Prior art date
Links
Images
Classifications
-
- H—ELECTRICITY
- H04—ELECTRIC COMMUNICATION TECHNIQUE
- H04N—PICTORIAL COMMUNICATION, e.g. TELEVISION
- H04N19/00—Methods or arrangements for coding, decoding, compressing or decompressing digital video signals
- H04N19/50—Methods or arrangements for coding, decoding, compressing or decompressing digital video signals using predictive coding
- H04N19/59—Methods or arrangements for coding, decoding, compressing or decompressing digital video signals using predictive coding involving spatial sub-sampling or interpolation, e.g. alteration of picture size or resolution
-
- H—ELECTRICITY
- H04—ELECTRIC COMMUNICATION TECHNIQUE
- H04N—PICTORIAL COMMUNICATION, e.g. TELEVISION
- H04N19/00—Methods or arrangements for coding, decoding, compressing or decompressing digital video signals
- H04N19/60—Methods or arrangements for coding, decoding, compressing or decompressing digital video signals using transform coding
- H04N19/625—Methods or arrangements for coding, decoding, compressing or decompressing digital video signals using transform coding using discrete cosine transform [DCT]
Definitions
- the invention belongs to the field of image processing, and in particular relates to a downsampling method, an upsampling method and a transmission processing method for a video frame.
- DCT Discrete Cosine Transform
- Mitra of the University of California, Santa Barbara proposed a sub-band DCT approximation scheme
- Park of Seoul National University proposed a modular representation scheme for non-reverse DCT
- the State University of New York Chen proposed the DCT-Wiener scheme, in which the Wiener filter It is used to estimate high-frequency components
- Siu of the Hong Kong Polytechnic University proposes a self-learning k-NN MMSE estimation scheme to improve PSNR
- Mitra applies the correlation between DCT coefficients to improve visual quality.
- these schemes are based on the assumption that most of the energy is concentrated on the DCT low-frequency coefficients, but when this assumption is not met, the performance of these schemes will deteriorate.
- the premise of using compressed sensing theory is that the compressed sampled signal must be sparse, because Therefore, the image sampling processing using the compressed sensing must first perform the sparse transform on the image, but often after the image is sparsely transformed, the amount of data of the image will become larger, resulting in a large bandwidth occupied by the image transmission.
- the applicant found that the spatial components corresponding to the high-frequency DCT coefficients are sparse, that is, when the low-frequency coefficients of the video frames are removed and the high-frequency components are preserved, only the video frames are in the video frames. Fast changing areas and edge areas.
- the present invention provides a downsampling method and an upsampling method for video frames, which perform downsampling and corresponding video frame data by utilizing the sparsity of the spatial domain corresponding to the high frequency coefficients of the DCT. Upsampling achieves low data throughput and high reconstruction accuracy.
- the method for downsampling a video frame in the present invention includes the following steps:
- the present invention also discloses an upsampling method for a video frame, including the following steps:
- the present invention further provides a method for processing a video frame, including the following steps:
- the transmitting end down-samples the low-frequency component and the high-frequency component in the video frame according to the above-mentioned downsampling method to obtain the down sampling result T lf of the low-frequency component and the down-sampling result T hf of the high-frequency component;
- the receiving end receives the down sampling result T lf of the low frequency component and the down sampling result T hf of the high frequency component, and performs the restoration of the video frame by using the above up sampling method.
- the present invention most of the energy of the video frame is concentrated on the low frequency coefficient of the discrete cosine transform and the air space corresponding to the high frequency coefficient is sparse. These two properties are first used in the downsampling of the video frame.
- the components and the high-frequency components are separated, and the DCT low-frequency coefficients of the video frame are truncated and retained.
- the spatial components corresponding to the high-frequency DCT coefficients are compressed and sampled by the compressed sensing theory, and the low-frequency components and the high-frequency components are respectively downsampled and retained.
- the edge information corresponding to the high frequency coefficient and the information of the rapidly changing region correspondingly, in the process of upsampling the video frame, for the low frequency component and the high frequency component of the down sampling,
- the inverse discrete cosine transform and the l 1 norm reconstruction algorithm are used to reconstruct the low frequency component and the high frequency component respectively, and the reserved DCT low frequency coefficients are zeroed to the same data size as the original image, and reconstructed by the compressed sensing reconstruction algorithm.
- the spatial component corresponding to the high-frequency DCT coefficient is added, which is the high-similar reconstructed image of the original image, which is retained in the downsampling.
- upsampling the reconstructed image can be reduced to achieve high accuracy.
- the method of the present invention has a higher similarity to the original video frame at the same downsampling ratio as the conventional scheme.
- FIG. 1 is a flow chart of a video frame up and down sampling scheme of the present invention
- FIG. 2 is an exploded view of the high frequency component and the low frequency component of the video frame DCT;
- FIG. 2(a) is the original video frame;
- FIG. 2(b) is a spatial domain diagram of the original video frame retaining the discrete cosine transform low frequency coefficient and the high frequency coefficient zero padding
- Figure 2 (c) is a spatial map corresponding to the high frequency coefficients of the discrete cosine transform of the original video frame;
- FIG. 3 is a flow chart of a video frame downsampling method of the present invention.
- FIG. 4 is a flow chart of a method for sampling a video frame of the present invention.
- FIG. 6 is a simulation comparison diagram of SSIM data of the present invention and the existing hybrid interpolation method under different downsampling ratio conditions.
- the up-and-down sampling process is divided into the high-frequency part and the low-frequency part of the video frame, respectively, using the discrete cosine transform energy concentrated in the low-frequency coefficient and the high-frequency coefficient corresponding to the spatial domain is sparse Two properties.
- Figure 2(b) is more ambiguous than Figure 2(a), but basically the video frame can be determined.
- the content is sufficient to see that the airspace map corresponding to the low-frequency coefficients concentrates most of the energy in the original video frame.
- the grayscale value of the video frame in Figure 2(c) is mostly zero or close to zero (corresponding to the black part in the figure). ), only the sparse value exists in the edge region, it can be seen that the spatial domain corresponding to the high frequency component of the video frame is sparse.
- the video uses video frames as a processing unit, and the video frames are represented in the form of a matrix.
- the elements of the matrix are pixels, and the video frames are used as images.
- the length and width are also in pixels.
- the mentioned video frame and video frame data are the same concept, and both represent a matrix of a certain video frame.
- the video frame transmitting end uses the discrete cosine transform and the compressed sensing to downsample the low frequency component and the high frequency component of the video frame, respectively, including the following steps:
- C 1Dm and C 1Dn are both one-dimensional discrete cosine transform matrices.
- M D is the number of image columns after truncation Truncated matrix is a column
- R represents the set of real numbers
- the image width N D are truncated columns
- E is the identity matrix.
- ⁇ is the measurement matrix [DLDonoho, "Compressed sensing," IEEE Transactions on Information Theory, vol. 52, no. 4, pp. 1289–1306, 2006], and the measurement matrix is a definition in the theory of compressed sensing.
- Quantity is a kind of random matrix, and satisfies the finite equidistant property (RIP) as the measurement matrix.
- RIP finite equidistant property
- the sampled data is sent to the video receiving end.
- the receiving end separately samples the low frequency component and the high frequency component of the video frame by using discrete cosine transform and compressed sensing, and then performs The restored video frame data is integrated, as shown in FIG. 4, the method includes the following steps:
- U c and U r are the upsampled line zero complement matrix and the downsampled column complement zero matrix, respectively, and E is an identity matrix.
- ⁇ ' is the reconstructed ⁇
- mat() is the inverse of vec(), which converts the vector into a matrix. It represents the 0 norm of the vector ⁇ , which is the number of non-zero elements in the vector ⁇ , argmin( ⁇ ) is the minimum value, st is the constraint condition, ⁇ is the measurement matrix, and T hf is the high-frequency component downsampled. results, I hf 'is the reconstructed discrete cosine transform high-frequency component corresponding to the spatial data.
- the original video frame matrix I' can be restored by adding the reconstructed spatial domain high-frequency component to the low-frequency spatial domain component obtained by inverse discrete cosine transform:
- the simulation of the SSIM data using the peak signal-to-noise ratio PSNR and the structural similarity is performed at different sampling rates of the lower upsampling method of the present invention and the existing downsampling method.
- PSNR and SSIM are both quantitative evaluations of the similarity between the two images. The higher the value, the higher the similarity.
- the two pictures compared with the method of the present invention are: the original video frame and Upsampled restored image; the two images compared by the existing downsampling method are: the original video frame and its corresponding upsampled restored image.
- the peak signal-to-noise ratio (PSNR) of the method of the present invention is 1.239 dB higher than the discrete cosine transform-Wiener interpolation method at different downsampling rates. It can be seen that as the sampling rate increases, The peak signal-to-noise ratio (PSNR) of the inventive method has a higher rate of improvement than the discrete cosine transform-Wiener interpolation method, and the hybrid interpolation method has a small increase in the peak signal-to-noise ratio PSNR as the downsampling rate increases.
- the present invention In terms of the method, the performance of the low-frequency component down-sampling data fixed (fixed DCT cut-off rate) is higher than that of the high-frequency component down-sampled data (fixed compression sampling rate).
- the structural similarity measurement SSIM of the method of the present invention is higher than the discrete cosine transform-Wiener interpolation method by 0.0067 or more at different downsampling rates.
- the same down sampling rate is used to compare different images by using the method of the present invention and DCT Wiener hybrid interpolation and bicubic interpolation.
- the method has a downsampling rate of 0.5 and is processed for different images (12 images are listed in the table).
- the method of the present invention is at least 2.5 dB higher than the DCT Wiener hybrid interpolation method and the bicubic interpolation;
- the method of the present invention is at least 0.05 higher than the DCT Wiener hybrid interpolation method and the bicubic interpolation average, which is sufficient to see the method of the present invention. Stability and general applicability.
Landscapes
- Engineering & Computer Science (AREA)
- Multimedia (AREA)
- Signal Processing (AREA)
- Physics & Mathematics (AREA)
- Discrete Mathematics (AREA)
- General Physics & Mathematics (AREA)
- Compression Or Coding Systems Of Tv Signals (AREA)
- Compression, Expansion, Code Conversion, And Decoders (AREA)
Abstract
Description
Claims (4)
- 一种视频帧的下采样方法,其特征在于,包括以下步骤:(1)将视频帧Ioriginal进行离散余弦变换得到视频帧Ioriginal在余弦变换域下的系数IDCT;(2)将所述余弦变换域下的系数IDCT中的高频系数利用截断矩阵进行截断,得到低频成分的下采样结果Tlf;(3)将所述余弦变换域下的系数IDCT中的低频系数置零得到高频成分,并将高频成分通过逆离散余弦变换回到空间域Ihf;(4)将所述空间域Ihf排列成向量,然后利用测量矩阵进行压缩采样,得到高频成分的下采样结果Thf。
- 一种视频帧的上采样方法,其特征在于,包括以下步骤:(1)将低频成分的下采样结果Tlf中的高频系数补零,然后使用逆离散余弦变换回到空间域,得到低频空间域成分Ilf';(2)将高频成分的下采样结果Thf利用压缩感知重构算法重构,并排列成一个矩阵,得到空间域高频成分矩阵I'hf;(3)将所述低频空间域成分Ilf'和空间域高频成分I'hf进行叠加恢复出原视频帧I'。
- 一种视频帧的传输处理方法,其特征在于,包括以下步骤:(1)发送端对输入的视频帧按照权利要求1所述的下采样方法对视频帧中的低频成分和高频成分分别进行下采样得到低频成分的下采样结果Tlf和高频成分的下采样结果Thf;(2)将所述低频成分的下采样结果Tlf和高频成分的下采样结果Thf进行发送;(3)接收端接收到所述低频成分的下采样结果Tlf和高频成分的下采样结果Thf,并利用权利要求2所述的上采样方法进行视频帧的还原。
Applications Claiming Priority (2)
Application Number | Priority Date | Filing Date | Title |
---|---|---|---|
CN201510593694.9A CN105263027B (zh) | 2015-09-17 | 2015-09-17 | 一种视频帧的下采样方法和上采样方法以及传输处理方法 |
CN201510593694.9 | 2015-09-17 |
Publications (1)
Publication Number | Publication Date |
---|---|
WO2017045344A1 true WO2017045344A1 (zh) | 2017-03-23 |
Family
ID=55102508
Family Applications (1)
Application Number | Title | Priority Date | Filing Date |
---|---|---|---|
PCT/CN2016/073415 WO2017045344A1 (zh) | 2015-09-17 | 2016-02-04 | 一种视频帧的下采样方法和上采样方法以及传输处理方法 |
Country Status (2)
Country | Link |
---|---|
CN (1) | CN105263027B (zh) |
WO (1) | WO2017045344A1 (zh) |
Cited By (3)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
CN112106304A (zh) * | 2018-05-11 | 2020-12-18 | 三菱电机株式会社 | 通信系统、通信方法及存储介质 |
CN114025168A (zh) * | 2021-09-30 | 2022-02-08 | 浙江大华技术股份有限公司 | 视频图像的处理方法、处理设备、电子设备以及存储介质 |
CN114155874A (zh) * | 2021-12-09 | 2022-03-08 | 云知声智能科技股份有限公司 | 一种特征提取方法、装置、电子设备和存储介质 |
Families Citing this family (5)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
CN105263027B (zh) * | 2015-09-17 | 2019-01-29 | 东南大学 | 一种视频帧的下采样方法和上采样方法以及传输处理方法 |
CN109040116B (zh) * | 2018-09-06 | 2020-03-27 | 广州宏途教育网络科技有限公司 | 一种基于云端服务器的视频会议系统 |
CN109413152B (zh) * | 2018-09-25 | 2021-02-26 | 上海瑾盛通信科技有限公司 | 图像处理方法、装置、存储介质及电子设备 |
CN114372995A (zh) * | 2020-10-14 | 2022-04-19 | 晶晨半导体(上海)股份有限公司 | 显示驱动方法、模块及芯片、电子设备和存储介质 |
WO2022109910A1 (zh) * | 2020-11-26 | 2022-06-02 | 深圳市大疆创新科技有限公司 | 视频编码方法、装置、编码器和存储介质 |
Citations (5)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
CN102378011A (zh) * | 2010-08-12 | 2012-03-14 | 华为技术有限公司 | 一种图像上采样的方法、装置和系统 |
CN103098464A (zh) * | 2010-08-04 | 2013-05-08 | Nxp股份有限公司 | 在频域中具有下采样的视频解码器 |
WO2014173970A1 (en) * | 2013-04-26 | 2014-10-30 | Fraunhofer-Gesellschaft zur Förderung der angewandten Forschung e.V. | Upsampling and signal enhancement |
CN104469389A (zh) * | 2014-12-09 | 2015-03-25 | 上海交通大学 | 基于变换下采样的低码率视频编码方法及系统 |
CN105263027A (zh) * | 2015-09-17 | 2016-01-20 | 东南大学 | 一种视频帧的下采样方法和上采样方法以及传输处理方法 |
Family Cites Families (3)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
KR20030053108A (ko) * | 2001-12-22 | 2003-06-28 | 한국과학기술원 | 블록 이산코사인변환 영역에서의 영상 크기 변환 방법 |
CN102281443A (zh) * | 2011-08-22 | 2011-12-14 | 天津师范大学 | 基于优化的层式离散余弦变换的压缩感知图像处理方法 |
CN104883579B (zh) * | 2015-06-08 | 2017-11-14 | 电子科技大学 | 一种基于空‑频域的联合视频图像的上采样方法 |
-
2015
- 2015-09-17 CN CN201510593694.9A patent/CN105263027B/zh active Active
-
2016
- 2016-02-04 WO PCT/CN2016/073415 patent/WO2017045344A1/zh active Application Filing
Patent Citations (5)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
CN103098464A (zh) * | 2010-08-04 | 2013-05-08 | Nxp股份有限公司 | 在频域中具有下采样的视频解码器 |
CN102378011A (zh) * | 2010-08-12 | 2012-03-14 | 华为技术有限公司 | 一种图像上采样的方法、装置和系统 |
WO2014173970A1 (en) * | 2013-04-26 | 2014-10-30 | Fraunhofer-Gesellschaft zur Förderung der angewandten Forschung e.V. | Upsampling and signal enhancement |
CN104469389A (zh) * | 2014-12-09 | 2015-03-25 | 上海交通大学 | 基于变换下采样的低码率视频编码方法及系统 |
CN105263027A (zh) * | 2015-09-17 | 2016-01-20 | 东南大学 | 一种视频帧的下采样方法和上采样方法以及传输处理方法 |
Cited By (4)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
CN112106304A (zh) * | 2018-05-11 | 2020-12-18 | 三菱电机株式会社 | 通信系统、通信方法及存储介质 |
CN112106304B (zh) * | 2018-05-11 | 2022-03-25 | 三菱电机株式会社 | 通信系统、通信方法及存储介质 |
CN114025168A (zh) * | 2021-09-30 | 2022-02-08 | 浙江大华技术股份有限公司 | 视频图像的处理方法、处理设备、电子设备以及存储介质 |
CN114155874A (zh) * | 2021-12-09 | 2022-03-08 | 云知声智能科技股份有限公司 | 一种特征提取方法、装置、电子设备和存储介质 |
Also Published As
Publication number | Publication date |
---|---|
CN105263027A (zh) | 2016-01-20 |
CN105263027B (zh) | 2019-01-29 |
Similar Documents
Publication | Publication Date | Title |
---|---|---|
WO2017045344A1 (zh) | 一种视频帧的下采样方法和上采样方法以及传输处理方法 | |
CN106097251B (zh) | 非均匀稀疏采样视频超分辨率方法 | |
Sandić-Stanković et al. | DIBR-synthesized image quality assessment based on morphological multi-scale approach | |
US9830682B2 (en) | Upsampling and signal enhancement | |
CN104883579B (zh) | 一种基于空‑频域的联合视频图像的上采样方法 | |
CN113450280A (zh) | 一种由粗到细地融合时空信息的压缩视频质量增强方法 | |
Bagawade Ramdas et al. | Wavelet transform techniques for image resolution enhancement: a study | |
Harikrishna et al. | Satellite image resolution enhancement using dwt technique | |
CN102842123B (zh) | 图像上采样中边缘清晰度提升的稀疏域残差补偿修正方法 | |
Park et al. | Hybrid image upsampling method in the discrete cosine transform domain | |
Pandey et al. | Block wise image compression & Reduced Blocks Artifacts Using Discrete Cosine Transform | |
CN105392014A (zh) | 一种优化的小波变换图像压缩方法 | |
Tai et al. | A fast algorithm for single image super resolution in both wavelet and spatial domain | |
CN104683818A (zh) | 基于双正交不变集多小波的图像压缩方法 | |
Yang et al. | A new image super-resolution method in the wavelet domain | |
Kapinaiah et al. | Block DCT to wavelet transcoding in transform domain | |
EP2645712A1 (en) | Image downsampling | |
Nahar | A compression original image based on the DDWT technique and enhancement SNR | |
Viswanath et al. | Wavelet transcoding in the block discrete cosine transform space | |
Pokle et al. | Lossy Image Compression using Discrete Cosine Transform | |
KR101650897B1 (ko) | 저해상도 컨텐츠를 위한 윈도우 사이즈 주밍 방법 및 그 장치 | |
Zhizhong et al. | Translation invariance-based super resolution method for mixed resolution multiview video | |
Nahar et al. | A Novel Improve and Compression for the Medical Image Technique Based on the Double Density Wavelet | |
Xiong et al. | Super-resolution for low quality thumbnail images | |
Dhara et al. | A novel hybrid technique in spatio-frequecncy domain for better quality image and video up-sampling |
Legal Events
Date | Code | Title | Description |
---|---|---|---|
121 | Ep: the epo has been informed by wipo that ep was designated in this application |
Ref document number: 16845462 Country of ref document: EP Kind code of ref document: A1 |
|
NENP | Non-entry into the national phase |
Ref country code: DE |
|
122 | Ep: pct application non-entry in european phase |
Ref document number: 16845462 Country of ref document: EP Kind code of ref document: A1 |
|
122 | Ep: pct application non-entry in european phase |
Ref document number: 16845462 Country of ref document: EP Kind code of ref document: A1 |
|
32PN | Ep: public notification in the ep bulletin as address of the adressee cannot be established |
Free format text: NOTING OF LOSS OF RIGHTS PURSUANT TO RULE 112(1) EPC (EPO FORM 1205A DATED 14.12.2018) |
|
122 | Ep: pct application non-entry in european phase |
Ref document number: 16845462 Country of ref document: EP Kind code of ref document: A1 |