TW202145792A - 使用深度學習的並行化的速率失真最佳化量化 - Google Patents
使用深度學習的並行化的速率失真最佳化量化 Download PDFInfo
- Publication number
- TW202145792A TW202145792A TW110113490A TW110113490A TW202145792A TW 202145792 A TW202145792 A TW 202145792A TW 110113490 A TW110113490 A TW 110113490A TW 110113490 A TW110113490 A TW 110113490A TW 202145792 A TW202145792 A TW 202145792A
- Authority
- TW
- Taiwan
- Prior art keywords
- transform coefficient
- coefficients
- determining
- block
- video
- Prior art date
Links
Images
Classifications
-
- H—ELECTRICITY
- H04—ELECTRIC COMMUNICATION TECHNIQUE
- H04N—PICTORIAL COMMUNICATION, e.g. TELEVISION
- H04N19/00—Methods or arrangements for coding, decoding, compressing or decompressing digital video signals
- H04N19/10—Methods or arrangements for coding, decoding, compressing or decompressing digital video signals using adaptive coding
- H04N19/169—Methods or arrangements for coding, decoding, compressing or decompressing digital video signals using adaptive coding characterised by the coding unit, i.e. the structural portion or semantic portion of the video signal being the object or the subject of the adaptive coding
- H04N19/17—Methods or arrangements for coding, decoding, compressing or decompressing digital video signals using adaptive coding characterised by the coding unit, i.e. the structural portion or semantic portion of the video signal being the object or the subject of the adaptive coding the unit being an image region, e.g. an object
- H04N19/176—Methods or arrangements for coding, decoding, compressing or decompressing digital video signals using adaptive coding characterised by the coding unit, i.e. the structural portion or semantic portion of the video signal being the object or the subject of the adaptive coding the unit being an image region, e.g. an object the region being a block, e.g. a macroblock
-
- H—ELECTRICITY
- H04—ELECTRIC COMMUNICATION TECHNIQUE
- H04N—PICTORIAL COMMUNICATION, e.g. TELEVISION
- H04N19/00—Methods or arrangements for coding, decoding, compressing or decompressing digital video signals
- H04N19/10—Methods or arrangements for coding, decoding, compressing or decompressing digital video signals using adaptive coding
- H04N19/102—Methods or arrangements for coding, decoding, compressing or decompressing digital video signals using adaptive coding characterised by the element, parameter or selection affected or controlled by the adaptive coding
- H04N19/124—Quantisation
-
- G—PHYSICS
- G05—CONTROLLING; REGULATING
- G05B—CONTROL OR REGULATING SYSTEMS IN GENERAL; FUNCTIONAL ELEMENTS OF SUCH SYSTEMS; MONITORING OR TESTING ARRANGEMENTS FOR SUCH SYSTEMS OR ELEMENTS
- G05B13/00—Adaptive control systems, i.e. systems automatically adjusting themselves to have a performance which is optimum according to some preassigned criterion
-
- G—PHYSICS
- G06—COMPUTING OR CALCULATING; COUNTING
- G06F—ELECTRIC DIGITAL DATA PROCESSING
- G06F30/00—Computer-aided design [CAD]
- G06F30/20—Design optimisation, verification or simulation
- G06F30/27—Design optimisation, verification or simulation using machine learning, e.g. artificial intelligence, neural networks, support vector machines [SVM] or training a model
-
- G—PHYSICS
- G06—COMPUTING OR CALCULATING; COUNTING
- G06N—COMPUTING ARRANGEMENTS BASED ON SPECIFIC COMPUTATIONAL MODELS
- G06N3/00—Computing arrangements based on biological models
- G06N3/02—Neural networks
-
- G—PHYSICS
- G06—COMPUTING OR CALCULATING; COUNTING
- G06N—COMPUTING ARRANGEMENTS BASED ON SPECIFIC COMPUTATIONAL MODELS
- G06N3/00—Computing arrangements based on biological models
- G06N3/02—Neural networks
- G06N3/04—Architecture, e.g. interconnection topology
- G06N3/045—Combinations of networks
-
- G—PHYSICS
- G06—COMPUTING OR CALCULATING; COUNTING
- G06N—COMPUTING ARRANGEMENTS BASED ON SPECIFIC COMPUTATIONAL MODELS
- G06N3/00—Computing arrangements based on biological models
- G06N3/02—Neural networks
- G06N3/04—Architecture, e.g. interconnection topology
- G06N3/0464—Convolutional networks [CNN, ConvNet]
-
- G—PHYSICS
- G06—COMPUTING OR CALCULATING; COUNTING
- G06N—COMPUTING ARRANGEMENTS BASED ON SPECIFIC COMPUTATIONAL MODELS
- G06N3/00—Computing arrangements based on biological models
- G06N3/02—Neural networks
- G06N3/04—Architecture, e.g. interconnection topology
- G06N3/047—Probabilistic or stochastic networks
-
- G—PHYSICS
- G06—COMPUTING OR CALCULATING; COUNTING
- G06N—COMPUTING ARRANGEMENTS BASED ON SPECIFIC COMPUTATIONAL MODELS
- G06N3/00—Computing arrangements based on biological models
- G06N3/02—Neural networks
- G06N3/08—Learning methods
- G06N3/084—Backpropagation, e.g. using gradient descent
-
- G—PHYSICS
- G06—COMPUTING OR CALCULATING; COUNTING
- G06N—COMPUTING ARRANGEMENTS BASED ON SPECIFIC COMPUTATIONAL MODELS
- G06N3/00—Computing arrangements based on biological models
- G06N3/02—Neural networks
- G06N3/08—Learning methods
- G06N3/09—Supervised learning
-
- G—PHYSICS
- G06—COMPUTING OR CALCULATING; COUNTING
- G06N—COMPUTING ARRANGEMENTS BASED ON SPECIFIC COMPUTATIONAL MODELS
- G06N5/00—Computing arrangements using knowledge-based models
- G06N5/01—Dynamic search techniques; Heuristics; Dynamic trees; Branch-and-bound
-
- G—PHYSICS
- G06—COMPUTING OR CALCULATING; COUNTING
- G06T—IMAGE DATA PROCESSING OR GENERATION, IN GENERAL
- G06T9/00—Image coding
- G06T9/002—Image coding using neural networks
-
- H—ELECTRICITY
- H04—ELECTRIC COMMUNICATION TECHNIQUE
- H04N—PICTORIAL COMMUNICATION, e.g. TELEVISION
- H04N19/00—Methods or arrangements for coding, decoding, compressing or decompressing digital video signals
- H04N19/10—Methods or arrangements for coding, decoding, compressing or decompressing digital video signals using adaptive coding
- H04N19/102—Methods or arrangements for coding, decoding, compressing or decompressing digital video signals using adaptive coding characterised by the element, parameter or selection affected or controlled by the adaptive coding
- H04N19/13—Adaptive entropy coding, e.g. adaptive variable length coding [AVLC] or context adaptive binary arithmetic coding [CABAC]
-
- H—ELECTRICITY
- H04—ELECTRIC COMMUNICATION TECHNIQUE
- H04N—PICTORIAL COMMUNICATION, e.g. TELEVISION
- H04N19/00—Methods or arrangements for coding, decoding, compressing or decompressing digital video signals
- H04N19/10—Methods or arrangements for coding, decoding, compressing or decompressing digital video signals using adaptive coding
- H04N19/134—Methods or arrangements for coding, decoding, compressing or decompressing digital video signals using adaptive coding characterised by the element, parameter or criterion affecting or controlling the adaptive coding
- H04N19/146—Data rate or code amount at the encoder output
- H04N19/147—Data rate or code amount at the encoder output according to rate distortion criteria
-
- H—ELECTRICITY
- H04—ELECTRIC COMMUNICATION TECHNIQUE
- H04N—PICTORIAL COMMUNICATION, e.g. TELEVISION
- H04N19/00—Methods or arrangements for coding, decoding, compressing or decompressing digital video signals
- H04N19/10—Methods or arrangements for coding, decoding, compressing or decompressing digital video signals using adaptive coding
- H04N19/169—Methods or arrangements for coding, decoding, compressing or decompressing digital video signals using adaptive coding characterised by the coding unit, i.e. the structural portion or semantic portion of the video signal being the object or the subject of the adaptive coding
- H04N19/18—Methods or arrangements for coding, decoding, compressing or decompressing digital video signals using adaptive coding characterised by the coding unit, i.e. the structural portion or semantic portion of the video signal being the object or the subject of the adaptive coding the unit being a set of transform coefficients
-
- H—ELECTRICITY
- H04—ELECTRIC COMMUNICATION TECHNIQUE
- H04N—PICTORIAL COMMUNICATION, e.g. TELEVISION
- H04N19/00—Methods or arrangements for coding, decoding, compressing or decompressing digital video signals
- H04N19/46—Embedding additional information in the video signal during the compression process
- H04N19/463—Embedding additional information in the video signal during the compression process by compressing encoding parameters before transmission
-
- H—ELECTRICITY
- H04—ELECTRIC COMMUNICATION TECHNIQUE
- H04N—PICTORIAL COMMUNICATION, e.g. TELEVISION
- H04N19/00—Methods or arrangements for coding, decoding, compressing or decompressing digital video signals
- H04N19/48—Methods or arrangements for coding, decoding, compressing or decompressing digital video signals using compressed domain processing techniques other than decoding, e.g. modification of transform coefficients, variable length coding [VLC] data or run-length data
-
- G—PHYSICS
- G06—COMPUTING OR CALCULATING; COUNTING
- G06N—COMPUTING ARRANGEMENTS BASED ON SPECIFIC COMPUTATIONAL MODELS
- G06N3/00—Computing arrangements based on biological models
- G06N3/02—Neural networks
- G06N3/08—Learning methods
- G06N3/082—Learning methods modifying the architecture, e.g. adding, deleting or silencing nodes or connections
-
- H—ELECTRICITY
- H04—ELECTRIC COMMUNICATION TECHNIQUE
- H04N—PICTORIAL COMMUNICATION, e.g. TELEVISION
- H04N19/00—Methods or arrangements for coding, decoding, compressing or decompressing digital video signals
- H04N19/10—Methods or arrangements for coding, decoding, compressing or decompressing digital video signals using adaptive coding
- H04N19/102—Methods or arrangements for coding, decoding, compressing or decompressing digital video signals using adaptive coding characterised by the element, parameter or selection affected or controlled by the adaptive coding
- H04N19/117—Filters, e.g. for pre-processing or post-processing
-
- H—ELECTRICITY
- H04—ELECTRIC COMMUNICATION TECHNIQUE
- H04N—PICTORIAL COMMUNICATION, e.g. TELEVISION
- H04N19/00—Methods or arrangements for coding, decoding, compressing or decompressing digital video signals
- H04N19/10—Methods or arrangements for coding, decoding, compressing or decompressing digital video signals using adaptive coding
- H04N19/134—Methods or arrangements for coding, decoding, compressing or decompressing digital video signals using adaptive coding characterised by the element, parameter or criterion affecting or controlling the adaptive coding
- H04N19/157—Assigned coding mode, i.e. the coding mode being predefined or preselected to be further used for selection of another element or parameter
-
- H—ELECTRICITY
- H04—ELECTRIC COMMUNICATION TECHNIQUE
- H04N—PICTORIAL COMMUNICATION, e.g. TELEVISION
- H04N19/00—Methods or arrangements for coding, decoding, compressing or decompressing digital video signals
- H04N19/60—Methods or arrangements for coding, decoding, compressing or decompressing digital video signals using transform coding
- H04N19/61—Methods or arrangements for coding, decoding, compressing or decompressing digital video signals using transform coding in combination with predictive coding
-
- H—ELECTRICITY
- H04—ELECTRIC COMMUNICATION TECHNIQUE
- H04N—PICTORIAL COMMUNICATION, e.g. TELEVISION
- H04N19/00—Methods or arrangements for coding, decoding, compressing or decompressing digital video signals
- H04N19/80—Details of filtering operations specially adapted for video compression, e.g. for pixel interpolation
-
- H—ELECTRICITY
- H04—ELECTRIC COMMUNICATION TECHNIQUE
- H04Q—SELECTING
- H04Q2213/00—Indexing scheme relating to selecting arrangements in general and for multiplex systems
- H04Q2213/13343—Neural networks
-
- H—ELECTRICITY
- H04—ELECTRIC COMMUNICATION TECHNIQUE
- H04Q—SELECTING
- H04Q2213/00—Indexing scheme relating to selecting arrangements in general and for multiplex systems
- H04Q2213/343—Neural network
-
- Y—GENERAL TAGGING OF NEW TECHNOLOGICAL DEVELOPMENTS; GENERAL TAGGING OF CROSS-SECTIONAL TECHNOLOGIES SPANNING OVER SEVERAL SECTIONS OF THE IPC; TECHNICAL SUBJECTS COVERED BY FORMER USPC CROSS-REFERENCE ART COLLECTIONS [XRACs] AND DIGESTS
- Y10—TECHNICAL SUBJECTS COVERED BY FORMER USPC
- Y10S—TECHNICAL SUBJECTS COVERED BY FORMER USPC CROSS-REFERENCE ART COLLECTIONS [XRACs] AND DIGESTS
- Y10S706/00—Data processing: artificial intelligence
Landscapes
- Engineering & Computer Science (AREA)
- Physics & Mathematics (AREA)
- Theoretical Computer Science (AREA)
- Multimedia (AREA)
- Signal Processing (AREA)
- Evolutionary Computation (AREA)
- General Physics & Mathematics (AREA)
- Artificial Intelligence (AREA)
- Software Systems (AREA)
- General Engineering & Computer Science (AREA)
- Health & Medical Sciences (AREA)
- Computational Linguistics (AREA)
- Data Mining & Analysis (AREA)
- Mathematical Physics (AREA)
- Computing Systems (AREA)
- Biophysics (AREA)
- Molecular Biology (AREA)
- General Health & Medical Sciences (AREA)
- Biomedical Technology (AREA)
- Life Sciences & Earth Sciences (AREA)
- Computer Vision & Pattern Recognition (AREA)
- Medical Informatics (AREA)
- Probability & Statistics with Applications (AREA)
- Computer Hardware Design (AREA)
- Geometry (AREA)
- Automation & Control Theory (AREA)
- Compression Or Coding Systems Of Tv Signals (AREA)
- Compression, Expansion, Code Conversion, And Decoders (AREA)
Applications Claiming Priority (6)
| Application Number | Priority Date | Filing Date | Title |
|---|---|---|---|
| US202063011685P | 2020-04-17 | 2020-04-17 | |
| US63/011,685 | 2020-04-17 | ||
| US202063034618P | 2020-06-04 | 2020-06-04 | |
| US63/034,618 | 2020-06-04 | ||
| US17/070,589 | 2020-10-14 | ||
| US17/070,589 US12058348B2 (en) | 2020-04-17 | 2020-10-14 | Parallelized rate-distortion optimized quantization using deep learning |
Publications (1)
| Publication Number | Publication Date |
|---|---|
| TW202145792A true TW202145792A (zh) | 2021-12-01 |
Family
ID=78082393
Family Applications (1)
| Application Number | Title | Priority Date | Filing Date |
|---|---|---|---|
| TW110113490A TW202145792A (zh) | 2020-04-17 | 2021-04-15 | 使用深度學習的並行化的速率失真最佳化量化 |
Country Status (9)
| Country | Link |
|---|---|
| US (1) | US12058348B2 (https=) |
| EP (1) | EP4136837A1 (https=) |
| JP (1) | JP7642671B2 (https=) |
| KR (1) | KR20230007313A (https=) |
| CN (1) | CN115336266B (https=) |
| BR (1) | BR112022020125A2 (https=) |
| PH (1) | PH12022552241A1 (https=) |
| TW (1) | TW202145792A (https=) |
| WO (1) | WO2021211270A1 (https=) |
Cited By (1)
| Publication number | Priority date | Publication date | Assignee | Title |
|---|---|---|---|---|
| TWI894907B (zh) * | 2023-07-31 | 2025-08-21 | 大陸商中國銀聯股份有限公司 | 用於訓練矩陣生成器的方法、系統和電腦可讀儲存媒體、用於構建對比矩陣的方法、系統和電腦可讀儲存媒體以及層次分析方法、層次分析系統和電腦可讀儲存媒體 |
Families Citing this family (15)
| Publication number | Priority date | Publication date | Assignee | Title |
|---|---|---|---|---|
| US11490083B2 (en) | 2020-02-05 | 2022-11-01 | Qualcomm Incorporated | Learned low-complexity adaptive quantization for video compression |
| US20220215265A1 (en) * | 2021-01-04 | 2022-07-07 | Tencent America LLC | Method and apparatus for end-to-end task-oriented latent compression with deep reinforcement learning |
| US12244792B2 (en) * | 2021-03-30 | 2025-03-04 | Sony Interactive Entertainment Europe Limited | Processing image data |
| US11368349B1 (en) * | 2021-11-15 | 2022-06-21 | King Abdulaziz University | Convolutional neural networks based computationally efficient method for equalization in FBMC-OQAM system |
| JP7825447B2 (ja) * | 2022-02-14 | 2026-03-06 | 日本放送協会 | 符号化装置、プログラム、及びモデル生成方法 |
| WO2023169501A1 (en) * | 2022-03-09 | 2023-09-14 | Beijing Bytedance Network Technology Co., Ltd. | Method, apparatus, and medium for visual data processing |
| US20230306239A1 (en) * | 2022-03-25 | 2023-09-28 | Tencent America LLC | Online training-based encoder tuning in neural image compression |
| US20230316588A1 (en) * | 2022-03-29 | 2023-10-05 | Tencent America LLC | Online training-based encoder tuning with multi model selection in neural image compression |
| US12231183B2 (en) * | 2022-04-29 | 2025-02-18 | Qualcomm Incorporated | Machine learning for beam predictions with confidence indications |
| CN114708436B (zh) * | 2022-06-02 | 2022-09-02 | 深圳比特微电子科技有限公司 | 语义分割模型的训练方法、语义分割方法、装置和介质 |
| CN115209147B (zh) * | 2022-09-15 | 2022-12-27 | 深圳沛喆微电子有限公司 | 摄像头视频传输带宽优化方法、装置、设备及存储介质 |
| CN116366846B (zh) * | 2023-03-14 | 2025-11-11 | 北京百度网讯科技有限公司 | 视频编码方法、装置以及设备 |
| WO2025114549A1 (en) * | 2023-12-01 | 2025-06-05 | Fraunhofer-Gesellschaft zur Förderung der angewandten Forschung e.V. | Block-based codec supporting transform coefficient prediction and/or transform improvement |
| US20250307133A1 (en) * | 2024-03-28 | 2025-10-02 | Advanced Micro Devices, Inc. | Offloading Quantization of Directional Blocked Data Formats to Near-Memory Units |
| WO2025238505A1 (en) * | 2024-05-13 | 2025-11-20 | Imax Corporation | Large multimodal model-based video encoding optimization |
Family Cites Families (23)
| Publication number | Priority date | Publication date | Assignee | Title |
|---|---|---|---|---|
| KR20030009575A (ko) * | 2001-06-26 | 2003-02-05 | 박광훈 | 신경망 분류기를 이용한 동영상 전송률 제어 장치 및 그방법 |
| US7620103B2 (en) | 2004-12-10 | 2009-11-17 | Lsi Corporation | Programmable quantization dead zone and threshold for standard-based H.264 and/or VC1 video encoding |
| US7889790B2 (en) | 2005-12-20 | 2011-02-15 | Sharp Laboratories Of America, Inc. | Method and apparatus for dynamically adjusting quantization offset values |
| US7995649B2 (en) | 2006-04-07 | 2011-08-09 | Microsoft Corporation | Quantization adjustment based on texture level |
| US8767834B2 (en) | 2007-03-09 | 2014-07-01 | Sharp Laboratories Of America, Inc. | Methods and systems for scalable-to-non-scalable bit-stream rewriting |
| ES2681209T3 (es) | 2009-09-10 | 2018-09-12 | Guangdong Oppo Mobile Telecommunications Corp., Ltd. | Técnicas de aceleración para una cuantificación optimizada de tasa de distorsión |
| US8170110B2 (en) * | 2009-10-16 | 2012-05-01 | Hong Kong Applied Science and Technology Research Institute Company Limited | Method and apparatus for zoom motion estimation |
| KR101492930B1 (ko) | 2010-09-14 | 2015-02-23 | 블랙베리 리미티드 | 변환 도메인 내의 어댑티브 필터링을 이용한 데이터 압축 방법 및 장치 |
| BR112013007023A2 (pt) * | 2010-09-28 | 2017-07-25 | Samsung Electronics Co Ltd | método de codificação de vídeo e método de decodificação de vídeo |
| US8553769B2 (en) * | 2011-01-19 | 2013-10-08 | Blackberry Limited | Method and device for improved multi-layer data compression |
| US9521410B2 (en) | 2012-04-26 | 2016-12-13 | Qualcomm Incorporated | Quantization parameter (QP) coding in video coding |
| US9213556B2 (en) * | 2012-07-30 | 2015-12-15 | Vmware, Inc. | Application directed user interface remoting using video encoding techniques |
| US9560386B2 (en) | 2013-02-21 | 2017-01-31 | Mozilla Corporation | Pyramid vector quantization for video coding |
| US9294766B2 (en) | 2013-09-09 | 2016-03-22 | Apple Inc. | Chroma quantization in video coding |
| US10057578B2 (en) * | 2014-10-07 | 2018-08-21 | Qualcomm Incorporated | QP derivation and offset for adaptive color transform in video coding |
| EP3545679B1 (en) * | 2016-12-02 | 2022-08-24 | Huawei Technologies Co., Ltd. | Apparatus and method for encoding an image |
| US10721471B2 (en) * | 2017-10-26 | 2020-07-21 | Intel Corporation | Deep learning based quantization parameter estimation for video encoding |
| KR102941657B1 (ko) * | 2018-02-08 | 2026-03-20 | 한국전자통신연구원 | 신경망에 기반하는 비디오 부호화 및 비디오 복호화를 위한 방법 및 장치 |
| EP3633990B1 (en) * | 2018-10-02 | 2021-10-27 | Nokia Technologies Oy | An apparatus and method for using a neural network in video coding |
| JP2020088740A (ja) * | 2018-11-29 | 2020-06-04 | ピクシブ株式会社 | 画像処理装置、画像処理方法及び画像処理プログラム |
| US12505580B2 (en) * | 2019-07-02 | 2025-12-23 | Telefonaktiebolaget Lm Ericsson (Publ) | Inference processing of data |
| US11496769B2 (en) * | 2019-09-27 | 2022-11-08 | Apple Inc. | Neural network based image set compression |
| CN112819699B (zh) * | 2019-11-15 | 2024-11-05 | 北京金山云网络技术有限公司 | 视频处理方法、装置及电子设备 |
-
2020
- 2020-10-14 US US17/070,589 patent/US12058348B2/en active Active
-
2021
- 2021-03-23 KR KR1020227032350A patent/KR20230007313A/ko active Pending
- 2021-03-23 JP JP2022557846A patent/JP7642671B2/ja active Active
- 2021-03-23 PH PH1/2022/552241A patent/PH12022552241A1/en unknown
- 2021-03-23 BR BR112022020125A patent/BR112022020125A2/pt unknown
- 2021-03-23 EP EP21718744.2A patent/EP4136837A1/en active Pending
- 2021-03-23 CN CN202180022319.2A patent/CN115336266B/zh active Active
- 2021-03-23 WO PCT/US2021/023680 patent/WO2021211270A1/en not_active Ceased
- 2021-04-15 TW TW110113490A patent/TW202145792A/zh unknown
Cited By (1)
| Publication number | Priority date | Publication date | Assignee | Title |
|---|---|---|---|---|
| TWI894907B (zh) * | 2023-07-31 | 2025-08-21 | 大陸商中國銀聯股份有限公司 | 用於訓練矩陣生成器的方法、系統和電腦可讀儲存媒體、用於構建對比矩陣的方法、系統和電腦可讀儲存媒體以及層次分析方法、層次分析系統和電腦可讀儲存媒體 |
Also Published As
| Publication number | Publication date |
|---|---|
| US12058348B2 (en) | 2024-08-06 |
| JP2023522575A (ja) | 2023-05-31 |
| CN115336266B (zh) | 2025-09-23 |
| BR112022020125A2 (pt) | 2022-11-29 |
| WO2021211270A1 (en) | 2021-10-21 |
| KR20230007313A (ko) | 2023-01-12 |
| EP4136837A1 (en) | 2023-02-22 |
| JP7642671B2 (ja) | 2025-03-10 |
| PH12022552241A1 (en) | 2024-03-11 |
| CN115336266A (zh) | 2022-11-11 |
| US20210329267A1 (en) | 2021-10-21 |
Similar Documents
| Publication | Publication Date | Title |
|---|---|---|
| JP7642671B2 (ja) | 深層学習を使用する並列化されたレートひずみ最適量子化 | |
| CN116965029A (zh) | 使用卷积神经网络对图像进行译码的装置和方法 | |
| US20230362378A1 (en) | Video coding method and apparatus | |
| JP2023544562A (ja) | イントラ予測方法及び装置 | |
| CN116349225B (zh) | 视频解码方法和装置、电子设备和存储介质 | |
| US11849113B2 (en) | Quantization constrained neural image coding | |
| TW202133620A (zh) | 用於視訊壓縮的學習低複雜度自我調整量化 | |
| JP7704475B2 (ja) | 符号化及び復号化方法並びに装置 | |
| US12177473B2 (en) | Video coding using optical flow and residual predictors | |
| CN115735359A (zh) | 用于神经图像压缩中的内容自适应在线训练的方法和设备 | |
| US11893783B2 (en) | Apparatus and method for transceiving feature map extracted using MPEG-VCM | |
| KR20240002416A (ko) | Npu를 이용한 머신 분석을 위한 비트스트림 포맷 | |
| KR20230003227A (ko) | 신경 이미지 압축에서의 스케일링 인자 및/또는 오프셋에 의한 컨텐츠-적응적 온라인 훈련 | |
| KR20250020478A (ko) | 크로마 샘플의 크로스 성분 예측 | |
| WO2022063265A1 (zh) | 帧间预测方法及装置 | |
| WO2023039859A1 (zh) | 视频编解码方法、设备、系统、及存储介质 | |
| CN117981319A (zh) | 用于基于块的视频编解码的符号预测 | |
| TW202450297A (zh) | 用於視頻寫碼的具有可分離卷積和多尺度增強的基於神經網路的迴路內濾波器架構 | |
| WO2023092404A1 (zh) | 视频编解码方法、设备、系统、及存储介质 | |
| CN118235412A (zh) | 基于块的视频编解码的符号预测 | |
| CN118476228A (zh) | 用于基于块的视频编解码的符号预测 | |
| TW202606296A (zh) | 具有複雜度降低的輸入特徵提取的基於nn的迴路內濾波器(ilf)架構 | |
| TW202541491A (zh) | 用於視訊寫碼之具有整數變換器模組之基於resnet的迴路內濾波器 | |
| TW202510566A (zh) | 用於基於神經網路的視頻寫碼工具的複雜度降低的方法 | |
| TW202529440A (zh) | 用於視訊寫碼之具有注意力模組之基於ResNet的迴路內濾波器 |