JP2024501331A - ビデオコーディング中にフィルタ処理するための複数のニューラルネットワークモデル - Google Patents
ビデオコーディング中にフィルタ処理するための複数のニューラルネットワークモデル Download PDFInfo
- Publication number
- JP2024501331A JP2024501331A JP2023539890A JP2023539890A JP2024501331A JP 2024501331 A JP2024501331 A JP 2024501331A JP 2023539890 A JP2023539890 A JP 2023539890A JP 2023539890 A JP2023539890 A JP 2023539890A JP 2024501331 A JP2024501331 A JP 2024501331A
- Authority
- JP
- Japan
- Prior art keywords
- data
- unit
- neural network
- filtering
- video
- Prior art date
- Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
- Pending
Links
Images
Classifications
-
- H—ELECTRICITY
- H04—ELECTRIC COMMUNICATION TECHNIQUE
- H04N—PICTORIAL COMMUNICATION, e.g. TELEVISION
- H04N19/00—Methods or arrangements for coding, decoding, compressing or decompressing digital video signals
- H04N19/85—Methods or arrangements for coding, decoding, compressing or decompressing digital video signals using pre-processing or post-processing specially adapted for video compression
- H04N19/86—Methods or arrangements for coding, decoding, compressing or decompressing digital video signals using pre-processing or post-processing specially adapted for video compression involving reduction of coding artifacts, e.g. of blockiness
-
- G—PHYSICS
- G06—COMPUTING OR CALCULATING; COUNTING
- G06T—IMAGE DATA PROCESSING OR GENERATION, IN GENERAL
- G06T9/00—Image coding
- G06T9/002—Image coding using neural networks
-
- G—PHYSICS
- G06—COMPUTING OR CALCULATING; COUNTING
- G06N—COMPUTING ARRANGEMENTS BASED ON SPECIFIC COMPUTATIONAL MODELS
- G06N3/00—Computing arrangements based on biological models
- G06N3/02—Neural networks
- G06N3/04—Architecture, e.g. interconnection topology
- G06N3/045—Combinations of networks
-
- G—PHYSICS
- G06—COMPUTING OR CALCULATING; COUNTING
- G06N—COMPUTING ARRANGEMENTS BASED ON SPECIFIC COMPUTATIONAL MODELS
- G06N3/00—Computing arrangements based on biological models
- G06N3/02—Neural networks
- G06N3/04—Architecture, e.g. interconnection topology
- G06N3/0464—Convolutional networks [CNN, ConvNet]
-
- H—ELECTRICITY
- H04—ELECTRIC COMMUNICATION TECHNIQUE
- H04N—PICTORIAL COMMUNICATION, e.g. TELEVISION
- H04N19/00—Methods or arrangements for coding, decoding, compressing or decompressing digital video signals
- H04N19/10—Methods or arrangements for coding, decoding, compressing or decompressing digital video signals using adaptive coding
- H04N19/102—Methods or arrangements for coding, decoding, compressing or decompressing digital video signals using adaptive coding characterised by the element, parameter or selection affected or controlled by the adaptive coding
- H04N19/103—Selection of coding mode or of prediction mode
- H04N19/11—Selection of coding mode or of prediction mode among a plurality of spatial predictive coding modes
-
- H—ELECTRICITY
- H04—ELECTRIC COMMUNICATION TECHNIQUE
- H04N—PICTORIAL COMMUNICATION, e.g. TELEVISION
- H04N19/00—Methods or arrangements for coding, decoding, compressing or decompressing digital video signals
- H04N19/10—Methods or arrangements for coding, decoding, compressing or decompressing digital video signals using adaptive coding
- H04N19/102—Methods or arrangements for coding, decoding, compressing or decompressing digital video signals using adaptive coding characterised by the element, parameter or selection affected or controlled by the adaptive coding
- H04N19/117—Filters, e.g. for pre-processing or post-processing
-
- H—ELECTRICITY
- H04—ELECTRIC COMMUNICATION TECHNIQUE
- H04N—PICTORIAL COMMUNICATION, e.g. TELEVISION
- H04N19/00—Methods or arrangements for coding, decoding, compressing or decompressing digital video signals
- H04N19/10—Methods or arrangements for coding, decoding, compressing or decompressing digital video signals using adaptive coding
- H04N19/134—Methods or arrangements for coding, decoding, compressing or decompressing digital video signals using adaptive coding characterised by the element, parameter or criterion affecting or controlling the adaptive coding
- H04N19/157—Assigned coding mode, i.e. the coding mode being predefined or preselected to be further used for selection of another element or parameter
-
- H—ELECTRICITY
- H04—ELECTRIC COMMUNICATION TECHNIQUE
- H04N—PICTORIAL COMMUNICATION, e.g. TELEVISION
- H04N19/00—Methods or arrangements for coding, decoding, compressing or decompressing digital video signals
- H04N19/10—Methods or arrangements for coding, decoding, compressing or decompressing digital video signals using adaptive coding
- H04N19/169—Methods or arrangements for coding, decoding, compressing or decompressing digital video signals using adaptive coding characterised by the coding unit, i.e. the structural portion or semantic portion of the video signal being the object or the subject of the adaptive coding
- H04N19/17—Methods or arrangements for coding, decoding, compressing or decompressing digital video signals using adaptive coding characterised by the coding unit, i.e. the structural portion or semantic portion of the video signal being the object or the subject of the adaptive coding the unit being an image region, e.g. an object
- H04N19/176—Methods or arrangements for coding, decoding, compressing or decompressing digital video signals using adaptive coding characterised by the coding unit, i.e. the structural portion or semantic portion of the video signal being the object or the subject of the adaptive coding the unit being an image region, e.g. an object the region being a block, e.g. a macroblock
-
- H—ELECTRICITY
- H04—ELECTRIC COMMUNICATION TECHNIQUE
- H04N—PICTORIAL COMMUNICATION, e.g. TELEVISION
- H04N19/00—Methods or arrangements for coding, decoding, compressing or decompressing digital video signals
- H04N19/60—Methods or arrangements for coding, decoding, compressing or decompressing digital video signals using transform coding
- H04N19/61—Methods or arrangements for coding, decoding, compressing or decompressing digital video signals using transform coding in combination with predictive coding
- H04N19/619—Methods or arrangements for coding, decoding, compressing or decompressing digital video signals using transform coding in combination with predictive coding the transform being operated outside the prediction loop
-
- H—ELECTRICITY
- H04—ELECTRIC COMMUNICATION TECHNIQUE
- H04N—PICTORIAL COMMUNICATION, e.g. TELEVISION
- H04N19/00—Methods or arrangements for coding, decoding, compressing or decompressing digital video signals
- H04N19/80—Details of filtering operations specially adapted for video compression, e.g. for pixel interpolation
- H04N19/82—Details of filtering operations specially adapted for video compression, e.g. for pixel interpolation involving filtering within a prediction loop
-
- H—ELECTRICITY
- H04—ELECTRIC COMMUNICATION TECHNIQUE
- H04N—PICTORIAL COMMUNICATION, e.g. TELEVISION
- H04N19/00—Methods or arrangements for coding, decoding, compressing or decompressing digital video signals
- H04N19/90—Methods or arrangements for coding, decoding, compressing or decompressing digital video signals using coding techniques not provided for in groups H04N19/10-H04N19/85, e.g. fractals
Landscapes
- Engineering & Computer Science (AREA)
- Multimedia (AREA)
- Signal Processing (AREA)
- Physics & Mathematics (AREA)
- Theoretical Computer Science (AREA)
- Artificial Intelligence (AREA)
- Evolutionary Computation (AREA)
- General Physics & Mathematics (AREA)
- Biomedical Technology (AREA)
- Life Sciences & Earth Sciences (AREA)
- Health & Medical Sciences (AREA)
- Biophysics (AREA)
- Computational Linguistics (AREA)
- Data Mining & Analysis (AREA)
- General Health & Medical Sciences (AREA)
- Molecular Biology (AREA)
- Computing Systems (AREA)
- General Engineering & Computer Science (AREA)
- Mathematical Physics (AREA)
- Software Systems (AREA)
- Compression Or Coding Systems Of Tv Signals (AREA)
Applications Claiming Priority (5)
| Application Number | Priority Date | Filing Date | Title |
|---|---|---|---|
| US202163133733P | 2021-01-04 | 2021-01-04 | |
| US63/133,733 | 2021-01-04 | ||
| US17/566,282 | 2021-12-30 | ||
| US17/566,282 US12327384B2 (en) | 2021-01-04 | 2021-12-30 | Multiple neural network models for filtering during video coding |
| PCT/US2022/011021 WO2022147494A1 (en) | 2021-01-04 | 2022-01-03 | Multiple neural network models for filtering during video coding |
Publications (2)
| Publication Number | Publication Date |
|---|---|
| JP2024501331A true JP2024501331A (ja) | 2024-01-11 |
| JPWO2022147494A5 JPWO2022147494A5 (enExample) | 2024-12-24 |
Family
ID=80050929
Family Applications (1)
| Application Number | Title | Priority Date | Filing Date |
|---|---|---|---|
| JP2023539890A Pending JP2024501331A (ja) | 2021-01-04 | 2022-01-03 | ビデオコーディング中にフィルタ処理するための複数のニューラルネットワークモデル |
Country Status (6)
| Country | Link |
|---|---|
| US (1) | US20250218052A1 (enExample) |
| EP (1) | EP4272448A1 (enExample) |
| JP (1) | JP2024501331A (enExample) |
| KR (1) | KR20230129015A (enExample) |
| BR (1) | BR112023012685A2 (enExample) |
| WO (1) | WO2022147494A1 (enExample) |
Cited By (1)
| Publication number | Priority date | Publication date | Assignee | Title |
|---|---|---|---|---|
| JP2025516483A (ja) * | 2022-09-19 | 2025-05-30 | ▲騰▼▲訊▼科技(深▲セン▼)有限公司 | マルチメディアデータ処理方法及びその装置、機器、並びにプログラム |
Families Citing this family (5)
| Publication number | Priority date | Publication date | Assignee | Title |
|---|---|---|---|---|
| US20230023579A1 (en) * | 2021-07-07 | 2023-01-26 | Lemon, Inc. | Configurable Neural Network Model Depth In Neural Network-Based Video Coding |
| WO2024078598A1 (en) * | 2022-10-13 | 2024-04-18 | Douyin Vision Co., Ltd. | Method, apparatus, and medium for video processing |
| CN120051988A (zh) * | 2022-10-13 | 2025-05-27 | 抖音视界有限公司 | 用于视频处理的方法、装置和介质 |
| WO2025058218A1 (ko) * | 2023-09-13 | 2025-03-20 | 삼성전자 주식회사 | 필터링된 옵티컬 플로우를 이용한 영상의 부호화 방법 및 장치, 및 영상의 복호화 방법 및 장치 |
| WO2025170428A1 (en) * | 2024-02-07 | 2025-08-14 | Samsung Electronics Co., Ltd. | System and method for encoding and decoding video-codec using artificial intelligence-based in-loop filtering model |
Citations (2)
| Publication number | Priority date | Publication date | Assignee | Title |
|---|---|---|---|---|
| WO2019031410A1 (ja) * | 2017-08-10 | 2019-02-14 | シャープ株式会社 | 画像フィルタ装置、画像復号装置、および画像符号化装置 |
| JP2019201256A (ja) * | 2018-05-14 | 2019-11-21 | シャープ株式会社 | 画像フィルタ装置 |
-
2022
- 2022-01-03 WO PCT/US2022/011021 patent/WO2022147494A1/en not_active Ceased
- 2022-01-03 BR BR112023012685A patent/BR112023012685A2/pt unknown
- 2022-01-03 JP JP2023539890A patent/JP2024501331A/ja active Pending
- 2022-01-03 EP EP22701075.8A patent/EP4272448A1/en active Pending
- 2022-01-03 KR KR1020237021763A patent/KR20230129015A/ko active Pending
-
2025
- 2025-03-20 US US19/085,414 patent/US20250218052A1/en active Pending
Patent Citations (2)
| Publication number | Priority date | Publication date | Assignee | Title |
|---|---|---|---|---|
| WO2019031410A1 (ja) * | 2017-08-10 | 2019-02-14 | シャープ株式会社 | 画像フィルタ装置、画像復号装置、および画像符号化装置 |
| JP2019201256A (ja) * | 2018-05-14 | 2019-11-21 | シャープ株式会社 | 画像フィルタ装置 |
Cited By (1)
| Publication number | Priority date | Publication date | Assignee | Title |
|---|---|---|---|---|
| JP2025516483A (ja) * | 2022-09-19 | 2025-05-30 | ▲騰▼▲訊▼科技(深▲セン▼)有限公司 | マルチメディアデータ処理方法及びその装置、機器、並びにプログラム |
Also Published As
| Publication number | Publication date |
|---|---|
| WO2022147494A1 (en) | 2022-07-07 |
| US20250218052A1 (en) | 2025-07-03 |
| BR112023012685A2 (pt) | 2023-12-05 |
| KR20230129015A (ko) | 2023-09-05 |
| EP4272448A1 (en) | 2023-11-08 |
Similar Documents
| Publication | Publication Date | Title |
|---|---|---|
| US11825101B2 (en) | Joint-component neural network based filtering during video coding | |
| US11206400B2 (en) | Low-frequency non-separable transform (LFNST) simplifications | |
| JP2023542841A (ja) | ビデオコーディング中にフィルタ処理するための複数のニューラルネットワークモデル | |
| TWI862578B (zh) | 適應性迴路濾波器組之索引發信 | |
| JP2023542840A (ja) | ビデオコーディングのためのフィルタ処理プロセス | |
| US12327384B2 (en) | Multiple neural network models for filtering during video coding | |
| CN113940069A (zh) | 用于视频译码中的低频不可分离变换的变换和最后有效系数位置信令 | |
| US11778213B2 (en) | Activation function design in neural network-based filtering process for video coding | |
| US20210136422A1 (en) | Merge estimation region for multi-type-tree block structure | |
| TWI840427B (zh) | 用於置零轉換之掃描及最後係數位置寫碼 | |
| JP7423647B2 (ja) | 異なるクロマフォーマットを使用した三角予測ユニットモードでのビデオコーディング | |
| US20250218052A1 (en) | Multiple neural network models for filtering during video coding | |
| US12149707B2 (en) | Intra block copy prediction restrictions in video coding | |
| CN114223202A (zh) | 低频不可分离变换(lfnst)信令 | |
| US11310519B2 (en) | Deblocking of subblock boundaries for affine motion compensated coding | |
| US12439038B2 (en) | Reduced complexity multi-mode neural network filtering of video data | |
| US12432344B2 (en) | Intra chroma mode list construction for video coding | |
| CN111602395A (zh) | 用于视频译码的量化组 | |
| CN114128298A (zh) | 调色板模式下的增量量化参数(qp)信令 | |
| US12309400B2 (en) | Fixed bit depth processing for cross-component linear model (CCLM) mode in video coding | |
| CN114175643A (zh) | 调色板和预测模式信令 | |
| US11729381B2 (en) | Deblocking filter parameter signaling | |
| JP2023544046A (ja) | 高ビット深度ビデオコーディングのためのライスパラメータ値の適応的な導出 | |
| KR20230075443A (ko) | 상이한 비트 심도에서 비디오 데이터의 코딩을 위한 적응적 루프 필터링의 동작 비트 심도의 제한 | |
| US20240015312A1 (en) | Neural network based filtering process for multiple color components in video coding |
Legal Events
| Date | Code | Title | Description |
|---|---|---|---|
| A521 | Request for written amendment filed |
Free format text: JAPANESE INTERMEDIATE CODE: A523 Effective date: 20241216 |
|
| A621 | Written request for application examination |
Free format text: JAPANESE INTERMEDIATE CODE: A621 Effective date: 20241216 |
|
| A977 | Report on retrieval |
Free format text: JAPANESE INTERMEDIATE CODE: A971007 Effective date: 20250922 |
|
| A131 | Notification of reasons for refusal |
Free format text: JAPANESE INTERMEDIATE CODE: A131 Effective date: 20250930 |