CN114982248A - Enhancing 360-degree video using convolutional neural network (CNN)-based filter - Google Patents
Enhancing 360-degree video using convolutional neural network (CNN)-based filter
- Publication number
- CN114982248A (application CN202080093560.XA)
- Authority
- CN
- China
- Prior art keywords
- cnn
- artifacts
- video
- viewport
- based filter
- Prior art date
- Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
- Pending
Concept ID | Concept | Type | Query Match | Sections | Count |
---|---|---|---|---|---|
238000013527 | convolutional neural network | Methods | 0.000 | title, claims, abstract, description | 148 |
230000002708 | enhancing effect | Effects | 0.000 | title, claims, abstract, description | 12 |
238000000034 | method | Methods | 0.000 | claims, description | 61 |
238000012549 | training | Methods | 0.000 | claims, description | 26 |
230000000903 | blocking effect | Effects | 0.000 | claims, description | 20 |
239000010410 | layer | Substances | 0.000 | description | 37 |
230000006835 | compression | Effects | 0.000 | description | 24 |
238000007906 | compression | Methods | 0.000 | description | 24 |
238000010586 | diagram | Methods | 0.000 | description | 16 |
230000008569 | process | Effects | 0.000 | description | 9 |
238000012545 | processing | Methods | 0.000 | description | 9 |
230000009466 | transformation | Effects | 0.000 | description | 4 |
230000003044 | adaptive effect | Effects | 0.000 | description | 3 |
238000005516 | engineering process | Methods | 0.000 | description | 3 |
230000006870 | function | Effects | 0.000 | description | 3 |
238000013507 | mapping | Methods | 0.000 | description | 2 |
230000004913 | activation | Effects | 0.000 | description | 1 |
230000008901 | benefit | Effects | 0.000 | description | 1 |
230000005540 | biological transmission | Effects | 0.000 | description | 1 |
239000000872 | buffer | Substances | 0.000 | description | 1 |
238000007796 | conventional method | Methods | 0.000 | description | 1 |
239000002346 | layers by function | Substances | 0.000 | description | 1 |
238000002156 | mixing | Methods | 0.000 | description | 1 |
230000003287 | optical effect | Effects | 0.000 | description | 1 |
238000012805 | post-processing | Methods | 0.000 | description | 1 |
238000013139 | quantization | Methods | 0.000 | description | 1 |
239000007787 | solid | Substances | 0.000 | description | 1 |
210000003813 | thumb | Anatomy | 0.000 | description | 1 |
238000000844 | transformations | Methods | 0.000 | description | 1 |
230000000007 | visual effect | Effects | 0.000 | description | 1 |
Classifications
-
- H—ELECTRICITY
- H04—ELECTRIC COMMUNICATION TECHNIQUE
- H04N—PICTORIAL COMMUNICATION, e.g. TELEVISION
- H04N23/00—Cameras or camera modules comprising electronic image sensors; Control thereof
- H04N23/60—Control of cameras or camera modules
- H04N23/698—Control of cameras or camera modules for achieving an enlarged field of view, e.g. panoramic image capture
-
- H—ELECTRICITY
- H04—ELECTRIC COMMUNICATION TECHNIQUE
- H04N—PICTORIAL COMMUNICATION, e.g. TELEVISION
- H04N19/00—Methods or arrangements for coding, decoding, compressing or decompressing digital video signals
- H04N19/50—Methods or arrangements for coding, decoding, compressing or decompressing digital video signals using predictive coding
- H04N19/597—Methods or arrangements for coding, decoding, compressing or decompressing digital video signals using predictive coding specially adapted for multi-view video sequence encoding
-
- G—PHYSICS
- G06—COMPUTING; CALCULATING OR COUNTING
- G06N—COMPUTING ARRANGEMENTS BASED ON SPECIFIC COMPUTATIONAL MODELS
- G06N3/00—Computing arrangements based on biological models
- G06N3/02—Neural networks
- G06N3/08—Learning methods
-
- G—PHYSICS
- G06—COMPUTING; CALCULATING OR COUNTING
- G06T—IMAGE DATA PROCESSING OR GENERATION, IN GENERAL
- G06T5/00—Image enhancement or restoration
- G06T5/20—Image enhancement or restoration using local operators
-
- G—PHYSICS
- G06—COMPUTING; CALCULATING OR COUNTING
- G06T—IMAGE DATA PROCESSING OR GENERATION, IN GENERAL
- G06T5/00—Image enhancement or restoration
- G06T5/70—Denoising; Smoothing
-
- G—PHYSICS
- G06—COMPUTING; CALCULATING OR COUNTING
- G06T—IMAGE DATA PROCESSING OR GENERATION, IN GENERAL
- G06T5/00—Image enhancement or restoration
- G06T5/77—Retouching; Inpainting; Scratch removal
-
- G—PHYSICS
- G06—COMPUTING; CALCULATING OR COUNTING
- G06T—IMAGE DATA PROCESSING OR GENERATION, IN GENERAL
- G06T9/00—Image coding
- G06T9/002—Image coding using neural networks
-
- H—ELECTRICITY
- H04—ELECTRIC COMMUNICATION TECHNIQUE
- H04N—PICTORIAL COMMUNICATION, e.g. TELEVISION
- H04N19/00—Methods or arrangements for coding, decoding, compressing or decompressing digital video signals
- H04N19/80—Details of filtering operations specially adapted for video compression, e.g. for pixel interpolation
-
- H—ELECTRICITY
- H04—ELECTRIC COMMUNICATION TECHNIQUE
- H04N—PICTORIAL COMMUNICATION, e.g. TELEVISION
- H04N19/00—Methods or arrangements for coding, decoding, compressing or decompressing digital video signals
- H04N19/85—Methods or arrangements for coding, decoding, compressing or decompressing digital video signals using pre-processing or post-processing specially adapted for video compression
- H04N19/86—Methods or arrangements for coding, decoding, compressing or decompressing digital video signals using pre-processing or post-processing specially adapted for video compression involving reduction of coding artifacts, e.g. of blockiness
-
- G—PHYSICS
- G06—COMPUTING; CALCULATING OR COUNTING
- G06T—IMAGE DATA PROCESSING OR GENERATION, IN GENERAL
- G06T2207/00—Indexing scheme for image analysis or image enhancement
- G06T2207/20—Special algorithmic details
- G06T2207/20081—Training; Learning
-
- G—PHYSICS
- G06—COMPUTING; CALCULATING OR COUNTING
- G06T—IMAGE DATA PROCESSING OR GENERATION, IN GENERAL
- G06T2207/00—Indexing scheme for image analysis or image enhancement
- G06T2207/20—Special algorithmic details
- G06T2207/20084—Artificial neural networks [ANN]
Landscapes
- Engineering & Computer Science (AREA)
- Multimedia (AREA)
- Signal Processing (AREA)
- Physics & Mathematics (AREA)
- Theoretical Computer Science (AREA)
- General Physics & Mathematics (AREA)
- Artificial Intelligence (AREA)
- Evolutionary Computation (AREA)
- Biomedical Technology (AREA)
- Molecular Biology (AREA)
- Health & Medical Sciences (AREA)
- Biophysics (AREA)
- Computational Linguistics (AREA)
- Data Mining & Analysis (AREA)
- General Health & Medical Sciences (AREA)
- Life Sciences & Earth Sciences (AREA)
- Computing Systems (AREA)
- General Engineering & Computer Science (AREA)
- Mathematical Physics (AREA)
- Software Systems (AREA)
- Image Processing (AREA)
- Compression Or Coding Systems Of Tv Signals (AREA)
Applications Claiming Priority (1)
Application Number | Priority Date | Filing Date | Title |
---|---|---|---|
PCT/CN2020/075548 WO2021163845A1 (en) | 2020-02-17 | 2020-02-17 | Enhancing 360-degree video using convolutional neural network (cnn) -based filter |
Publications (1)
Publication Number | Publication Date |
---|---|
CN114982248A (zh) | 2022-08-30 |
Family
ID=77390304
Family Applications (1)
Application Number | Title | Priority Date | Filing Date |
---|---|---|---|
CN202080093560.XA (Pending; published as CN114982248A, zh) | Enhancing 360-degree video using convolutional neural network (CNN)-based filter | 2020-02-17 | 2020-02-17 |
Country Status (5)
Country | Link |
---|---|
US (1) | US20230054523A1 (ko) |
EP (1) | EP4107966A4 (ko) |
KR (1) | KR20220140706A (ko) |
CN (1) | CN114982248A (ko) |
WO (1) | WO2021163845A1 (ko) |
Families Citing this family (1)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
EP3916633A1 (de) * | 2020-05-25 | 2021-12-01 | Sick Ag | Camera and method for processing image data |
Family Cites Families (26)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
US9576214B1 (en) * | 2012-01-23 | 2017-02-21 | Hrl Laboratories, Llc | Robust object recognition from moving platforms by combining form and motion detection with bio-inspired classification |
JP6983862B2 (ja) * | 2016-07-08 | 2021-12-17 | Vid Scale, Inc. | 360-degree video coding using geometry projection |
US20180176468A1 (en) * | 2016-12-19 | 2018-06-21 | Qualcomm Incorporated | Preferred rendering of signalled regions-of-interest or viewports in virtual reality video |
KR102676278B1 (ko) * | 2017-02-20 | 2024-06-19 | Samsung Electronics Co., Ltd. | Electronic device and method for displaying 360-degree image in the electronic device |
US10616482B2 (en) * | 2017-03-10 | 2020-04-07 | Gopro, Inc. | Image quality assessment |
US10607329B2 (en) * | 2017-03-13 | 2020-03-31 | Adobe Inc. | Illumination estimation from a single image |
KR20200005539A (ko) * | 2017-04-11 | 2020-01-15 | Vid Scale, Inc. | 360-degree video coding using face continuity |
US20180338160A1 (en) * | 2017-05-18 | 2018-11-22 | Mediatek Inc. | Method and Apparatus for Reduction of Artifacts in Coded Virtual-Reality Images |
US20190387212A1 (en) * | 2017-05-26 | 2019-12-19 | Lg Electronics Inc. | 360 video processing method and apparatus therefor |
US10484682B2 (en) * | 2017-07-03 | 2019-11-19 | Qualcomm Incorporated | Reference picture derivation and motion compensation for 360-degree video coding |
US10798417B2 (en) * | 2017-07-05 | 2020-10-06 | Qualcomm Incorporated | Deblock filtering for 360-degree video coding |
US11432010B2 (en) * | 2017-12-19 | 2022-08-30 | Vid Scale, Inc. | Face discontinuity filtering for 360-degree video coding |
US11212438B2 (en) * | 2018-02-14 | 2021-12-28 | Qualcomm Incorporated | Loop filter padding for 360-degree video coding |
US10721465B2 (en) * | 2018-02-14 | 2020-07-21 | Qualcomm Incorporated | Motion compensation for cubemap packed frames |
WO2019170154A1 (en) * | 2018-03-09 | 2019-09-12 | Mediatek Inc. | De-blocking method for reconstructed projection-based frame that employs projection layout of 360-degree virtual reality projection |
US20190289327A1 (en) * | 2018-03-13 | 2019-09-19 | Mediatek Inc. | Method and Apparatus of Loop Filtering for VR360 Videos |
US11272209B2 (en) * | 2018-04-03 | 2022-03-08 | Samsung Electronics Co., Ltd. | Methods and apparatus for determining adjustment parameter during encoding of spherical multimedia content |
KR102022648B1 (ko) * | 2018-08-10 | 2019-09-19 | Samsung Electronics Co., Ltd. | Electronic device, control method thereof, and control method of server |
US10744936B1 (en) * | 2019-06-10 | 2020-08-18 | Ambarella International Lp | Using camera data to automatically change the tint of transparent materials |
US11416002B1 (en) * | 2019-06-11 | 2022-08-16 | Ambarella International Lp | Robotic vacuum with mobile security function |
IT201900011403A1 (it) * | 2019-07-10 | 2021-01-10 | Ambarella Int Lp | Detecting illegal use of phone to prevent the driver from getting a fine |
CA3146773A1 (en) * | 2019-07-11 | 2021-01-14 | Beijing Bytedance Network Technology Co., Ltd. | Sample padding in adaptive loop filtering |
US11193312B1 (en) * | 2019-09-30 | 2021-12-07 | Ambarella International Lp | Child safety lock |
US11109152B2 (en) * | 2019-10-28 | 2021-08-31 | Ambarella International Lp | Optimize the audio capture during conference call in cars |
CN113439439A (zh) * | 2019-12-17 | 2021-09-24 | Xris Corporation | Method for encoding/decoding image signal and device therefor |
US11343485B1 (en) * | 2020-08-24 | 2022-05-24 | Ambarella International Lp | Virtual horizontal stereo camera |
2020
- 2020-02-17: CN application CN202080093560.XA (CN114982248A), status: Pending (active)
- 2020-02-17: WO application PCT/CN2020/075548 (WO2021163845A1), status: unknown
- 2020-02-17: US application US17/793,348 (US20230054523A1), status: Pending (active)
- 2020-02-17: KR application KR1020227024293A (KR20220140706A), status: unknown
- 2020-02-17: EP application EP20920220.9A (EP4107966A4), status: Withdrawn (not active)
Also Published As
Publication number | Publication date |
---|---|
WO2021163845A1 (en) | 2021-08-26 |
EP4107966A1 (en) | 2022-12-28 |
US20230054523A1 (en) | 2023-02-23 |
EP4107966A4 (en) | 2023-07-26 |
KR20220140706A (ko) | 2022-10-18 |
Similar Documents
Publication | Title | Publication Date |
---|---|---|
CN112204993B (zh) | Adaptive panoramic video streaming using overlapping partitioned sections | |
CN107454468B (zh) | Method, apparatus and stream for formatting immersive video | |
US10798389B2 (en) | Method and apparatus for content-aware point cloud compression using HEVC tiles | |
US11004173B2 (en) | Method for processing projection-based frame that includes at least one projection face packed in 360-degree virtual reality projection layout | |
US20190281273A1 (en) | Adaptive loop filtering method for reconstructed projection-based frame that employs projection layout of 360-degree virtual reality projection | |
US20190045212A1 (en) | METHOD AND APPARATUS FOR PREDICTIVE CODING OF 360° VIDEO | |
JP2020532212A (ja) | Layered scene decomposition codec system and method | |
JP2017530626A (ja) | Simultaneous localization and mapping for video coding | |
US11138460B2 (en) | Image processing method and apparatus | |
US11069026B2 (en) | Method for processing projection-based frame that includes projection faces packed in cube-based projection layout with padding | |
US11159811B2 (en) | Partitioning of coded point cloud data | |
US11451836B2 (en) | Techniques and apparatus for PCM patch creation using Morton codes | |
JP7371691B2 (ja) | Point cloud coding using homography transform | |
US11494870B2 (en) | Method and apparatus for reducing artifacts in projection-based frame | |
CN102272793A (zh) | Method and system for scaling compressed image frames | |
WO2021163845A1 (en) | Enhancing 360-degree video using convolutional neural network (cnn) -based filter | |
CN113452870B (zh) | Video processing method and apparatus | |
EP3895425A1 (en) | Immersive video bitstream processing | |
CN106664387B (zh) | Computer device and method for post-processing video image frames, and computer-readable medium | |
US10922783B2 (en) | Cube-based projection method that applies different mapping functions to different square projection faces, different axes, and/or different locations of axis | |
Groth et al. | Wavelet-Based Fast Decoding of 360 Videos | |
US11727536B2 (en) | Method and apparatus for geometric smoothing | |
Groth et al. | Wavelet-Based Fast Decoding of 360-Degree Videos |
Legal Events
Date | Code | Title | Description |
---|---|---|---|
PB01 | Publication | ||
SE01 | Entry into force of request for substantive examination | ||