TW202335498A - 使用量化熵譯碼分佈參數的神經網路媒體壓縮 - Google Patents
使用量化熵譯碼分佈參數的神經網路媒體壓縮 Download PDFInfo
- Publication number
- TW202335498A TW202335498A TW112101937A TW112101937A TW202335498A TW 202335498 A TW202335498 A TW 202335498A TW 112101937 A TW112101937 A TW 112101937A TW 112101937 A TW112101937 A TW 112101937A TW 202335498 A TW202335498 A TW 202335498A
- Authority
- TW
- Taiwan
- Prior art keywords
- data element
- media
- probability distribution
- distribution function
- code vector
- Prior art date
Links
Classifications
-
- H—ELECTRICITY
- H04—ELECTRIC COMMUNICATION TECHNIQUE
- H04N—PICTORIAL COMMUNICATION, e.g. TELEVISION
- H04N19/00—Methods or arrangements for coding, decoding, compressing or decompressing digital video signals
- H04N19/90—Methods or arrangements for coding, decoding, compressing or decompressing digital video signals using coding techniques not provided for in groups H04N19/10-H04N19/85, e.g. fractals
- H04N19/91—Entropy coding, e.g. variable length coding [VLC] or arithmetic coding
-
- H—ELECTRICITY
- H04—ELECTRIC COMMUNICATION TECHNIQUE
- H04N—PICTORIAL COMMUNICATION, e.g. TELEVISION
- H04N19/00—Methods or arrangements for coding, decoding, compressing or decompressing digital video signals
- H04N19/10—Methods or arrangements for coding, decoding, compressing or decompressing digital video signals using adaptive coding
- H04N19/102—Methods or arrangements for coding, decoding, compressing or decompressing digital video signals using adaptive coding characterised by the element, parameter or selection affected or controlled by the adaptive coding
- H04N19/13—Adaptive entropy coding, e.g. adaptive variable length coding [AVLC] or context adaptive binary arithmetic coding [CABAC]
-
- G—PHYSICS
- G06—COMPUTING OR CALCULATING; COUNTING
- G06N—COMPUTING ARRANGEMENTS BASED ON SPECIFIC COMPUTATIONAL MODELS
- G06N3/00—Computing arrangements based on biological models
- G06N3/02—Neural networks
- G06N3/04—Architecture, e.g. interconnection topology
- G06N3/045—Combinations of networks
- G06N3/0455—Auto-encoder networks; Encoder-decoder networks
-
- G—PHYSICS
- G06—COMPUTING OR CALCULATING; COUNTING
- G06N—COMPUTING ARRANGEMENTS BASED ON SPECIFIC COMPUTATIONAL MODELS
- G06N3/00—Computing arrangements based on biological models
- G06N3/02—Neural networks
- G06N3/04—Architecture, e.g. interconnection topology
- G06N3/047—Probabilistic or stochastic networks
-
- G—PHYSICS
- G06—COMPUTING OR CALCULATING; COUNTING
- G06N—COMPUTING ARRANGEMENTS BASED ON SPECIFIC COMPUTATIONAL MODELS
- G06N3/00—Computing arrangements based on biological models
- G06N3/02—Neural networks
- G06N3/08—Learning methods
-
- H—ELECTRICITY
- H04—ELECTRIC COMMUNICATION TECHNIQUE
- H04N—PICTORIAL COMMUNICATION, e.g. TELEVISION
- H04N19/00—Methods or arrangements for coding, decoding, compressing or decompressing digital video signals
- H04N19/10—Methods or arrangements for coding, decoding, compressing or decompressing digital video signals using adaptive coding
- H04N19/102—Methods or arrangements for coding, decoding, compressing or decompressing digital video signals using adaptive coding characterised by the element, parameter or selection affected or controlled by the adaptive coding
- H04N19/124—Quantisation
-
- H—ELECTRICITY
- H04—ELECTRIC COMMUNICATION TECHNIQUE
- H04N—PICTORIAL COMMUNICATION, e.g. TELEVISION
- H04N19/00—Methods or arrangements for coding, decoding, compressing or decompressing digital video signals
- H04N19/10—Methods or arrangements for coding, decoding, compressing or decompressing digital video signals using adaptive coding
- H04N19/134—Methods or arrangements for coding, decoding, compressing or decompressing digital video signals using adaptive coding characterised by the element, parameter or criterion affecting or controlling the adaptive coding
-
- H—ELECTRICITY
- H04—ELECTRIC COMMUNICATION TECHNIQUE
- H04N—PICTORIAL COMMUNICATION, e.g. TELEVISION
- H04N19/00—Methods or arrangements for coding, decoding, compressing or decompressing digital video signals
- H04N19/10—Methods or arrangements for coding, decoding, compressing or decompressing digital video signals using adaptive coding
- H04N19/134—Methods or arrangements for coding, decoding, compressing or decompressing digital video signals using adaptive coding characterised by the element, parameter or criterion affecting or controlling the adaptive coding
- H04N19/136—Incoming video signal characteristics or properties
-
- H—ELECTRICITY
- H04—ELECTRIC COMMUNICATION TECHNIQUE
- H04N—PICTORIAL COMMUNICATION, e.g. TELEVISION
- H04N19/00—Methods or arrangements for coding, decoding, compressing or decompressing digital video signals
- H04N19/42—Methods or arrangements for coding, decoding, compressing or decompressing digital video signals characterised by implementation details or hardware specially adapted for video compression or decompression, e.g. dedicated software implementation
-
- H—ELECTRICITY
- H04—ELECTRIC COMMUNICATION TECHNIQUE
- H04N—PICTORIAL COMMUNICATION, e.g. TELEVISION
- H04N19/00—Methods or arrangements for coding, decoding, compressing or decompressing digital video signals
- H04N19/46—Embedding additional information in the video signal during the compression process
- H04N19/463—Embedding additional information in the video signal during the compression process by compressing encoding parameters before transmission
-
- H—ELECTRICITY
- H04—ELECTRIC COMMUNICATION TECHNIQUE
- H04N—PICTORIAL COMMUNICATION, e.g. TELEVISION
- H04N19/00—Methods or arrangements for coding, decoding, compressing or decompressing digital video signals
- H04N19/90—Methods or arrangements for coding, decoding, compressing or decompressing digital video signals using coding techniques not provided for in groups H04N19/10-H04N19/85, e.g. fractals
- H04N19/94—Vector quantisation
Landscapes
- Engineering & Computer Science (AREA)
- Signal Processing (AREA)
- Multimedia (AREA)
- Physics & Mathematics (AREA)
- Theoretical Computer Science (AREA)
- Evolutionary Computation (AREA)
- General Engineering & Computer Science (AREA)
- Biophysics (AREA)
- Computational Linguistics (AREA)
- Data Mining & Analysis (AREA)
- Artificial Intelligence (AREA)
- General Health & Medical Sciences (AREA)
- Molecular Biology (AREA)
- Computing Systems (AREA)
- Biomedical Technology (AREA)
- General Physics & Mathematics (AREA)
- Mathematical Physics (AREA)
- Software Systems (AREA)
- Life Sciences & Earth Sciences (AREA)
- Health & Medical Sciences (AREA)
- Probability & Statistics with Applications (AREA)
- Compression Or Coding Systems Of Tv Signals (AREA)
- Compression, Expansion, Code Conversion, And Decoders (AREA)
Applications Claiming Priority (4)
| Application Number | Priority Date | Filing Date | Title |
|---|---|---|---|
| US202263267857P | 2022-02-11 | 2022-02-11 | |
| US63/267,857 | 2022-02-11 | ||
| US17/814,426 | 2022-07-22 | ||
| US17/814,426 US11876969B2 (en) | 2022-02-11 | 2022-07-22 | Neural-network media compression using quantized entropy coding distribution parameters |
Publications (1)
| Publication Number | Publication Date |
|---|---|
| TW202335498A true TW202335498A (zh) | 2023-09-01 |
Family
ID=85278445
Family Applications (1)
| Application Number | Title | Priority Date | Filing Date |
|---|---|---|---|
| TW112101937A TW202335498A (zh) | 2022-02-11 | 2023-01-17 | 使用量化熵譯碼分佈參數的神經網路媒體壓縮 |
Country Status (6)
| Country | Link |
|---|---|
| US (1) | US12470715B2 (enExample) |
| EP (1) | EP4476914A1 (enExample) |
| JP (1) | JP2025508344A (enExample) |
| KR (1) | KR20240149891A (enExample) |
| TW (1) | TW202335498A (enExample) |
| WO (1) | WO2023154594A1 (enExample) |
Families Citing this family (4)
| Publication number | Priority date | Publication date | Assignee | Title |
|---|---|---|---|---|
| JP7665776B2 (ja) * | 2021-03-26 | 2025-04-21 | ドルビー ラボラトリーズ ライセンシング コーポレイション | ニューラルネットワークを用いた画像及びビデオコーディングにおける潜時特徴の多分布エントロピーモデリング |
| US12323634B2 (en) * | 2022-02-11 | 2025-06-03 | Qualcomm Incorporated | Entropy coding for neural-based media compression |
| US12501050B2 (en) | 2023-03-09 | 2025-12-16 | Qualcomm Incorporated | Efficient warping-based neural video codec |
| FR3163515A1 (fr) * | 2024-06-17 | 2025-12-19 | Orange | Procédés de codage et de décodage, dispositifs et programme d’ordinateur correspondants. |
Family Cites Families (32)
| Publication number | Priority date | Publication date | Assignee | Title |
|---|---|---|---|---|
| JP3413720B2 (ja) * | 1998-06-26 | 2003-06-09 | ソニー株式会社 | 画像符号化方法及び装置、並びに画像復号方法及び装置 |
| US6421467B1 (en) * | 1999-05-28 | 2002-07-16 | Texas Tech University | Adaptive vector quantization/quantizer |
| US7760911B2 (en) | 2005-09-15 | 2010-07-20 | Sarnoff Corporation | Method and system for segment-based optical flow estimation |
| JP2010524383A (ja) | 2007-04-09 | 2010-07-15 | エルジー エレクトロニクス インコーポレイティド | ビデオ信号処理方法及び装置 |
| US8107550B2 (en) * | 2008-09-23 | 2012-01-31 | Alcatel Lucent | Methods for precoding signals for transmission in wireless MIMO system |
| WO2012122299A1 (en) * | 2011-03-07 | 2012-09-13 | Xiph. Org. | Bit allocation and partitioning in gain-shape vector quantization for audio coding |
| JP5845801B2 (ja) * | 2011-10-18 | 2016-01-20 | ソニー株式会社 | 画像処理装置、画像処理方法、及び、プログラム |
| JP6315980B2 (ja) * | 2013-12-24 | 2018-04-25 | 株式会社東芝 | デコーダ、デコード方法およびプログラム |
| US10075692B2 (en) | 2015-01-28 | 2018-09-11 | Hfi Innovation Inc. | Method of simple intra mode for video coding |
| KR102251828B1 (ko) * | 2015-09-02 | 2021-05-13 | 삼성전자주식회사 | 율―왜곡 최적화 기반의 양자화 방법 및 그 장치 |
| WO2018015963A1 (en) * | 2016-07-21 | 2018-01-25 | Ramot At Tel-Aviv University Ltd. | Method and system for comparing sequences |
| GB2586941B (en) | 2017-08-01 | 2022-06-22 | Displaylink Uk Ltd | Reducing judder using motion vectors |
| KR102285064B1 (ko) * | 2017-10-30 | 2021-08-04 | 한국전자통신연구원 | 은닉 변수를 이용하는 영상 및 신경망 압축을 위한 방법 및 장치 |
| US11257254B2 (en) * | 2018-07-20 | 2022-02-22 | Google Llc | Data compression using conditional entropy models |
| US11109065B2 (en) * | 2018-09-26 | 2021-08-31 | Google Llc | Video encoding by providing geometric proxies |
| WO2020068498A1 (en) | 2018-09-27 | 2020-04-02 | Google Llc | Data compression using integer neural networks |
| US10652581B1 (en) | 2019-02-27 | 2020-05-12 | Google Llc | Entropy coding in image and video compression using machine learning |
| US20200356835A1 (en) * | 2019-05-09 | 2020-11-12 | LGN Innovations Limited | Sensor-Action Fusion System for Optimising Sensor Measurement Collection from Multiple Sensors |
| US11532155B1 (en) * | 2019-07-09 | 2022-12-20 | ACME Atronomatic, LLC | Methods and devices for earth remote sensing using stereoscopic hyperspectral imaging in the visible (VIS) and infrared (IR) bands |
| US11374952B1 (en) * | 2019-09-27 | 2022-06-28 | Amazon Technologies, Inc. | Detecting anomalous events using autoencoders |
| KR102287947B1 (ko) | 2019-10-28 | 2021-08-09 | 삼성전자주식회사 | 영상의 ai 부호화 및 ai 복호화 방법, 및 장치 |
| US11256967B2 (en) * | 2020-01-27 | 2022-02-22 | Kla Corporation | Characterization system and method with guided defect discovery |
| US11776679B2 (en) * | 2020-03-10 | 2023-10-03 | The Board Of Trustees Of The Leland Stanford Junior University | Methods for risk map prediction in AI-based MRI reconstruction |
| WO2022098737A1 (en) * | 2020-11-03 | 2022-05-12 | Sri International | Longitudinal datasets and machine learning models for menopause state and anomaly predictions |
| EP4250729A4 (en) | 2021-02-22 | 2024-05-01 | Samsung Electronics Co., Ltd. | AI-BASED IMAGE ENCODING AND DECODING APPARATUS AND METHOD THEREFOR |
| EP4318376A4 (en) | 2021-05-24 | 2024-05-22 | Samsung Electronics Co., Ltd. | Ai-based frame interpolation method and device |
| US20230185953A1 (en) * | 2021-12-14 | 2023-06-15 | Sap Se | Selecting differential privacy parameters in neural networks |
| US11599972B1 (en) | 2021-12-22 | 2023-03-07 | Deep Render Ltd. | Method and system for lossy image or video encoding, transmission and decoding |
| US12307674B2 (en) * | 2022-02-03 | 2025-05-20 | GE Precision Healthcare LLC | Low latency interactive segmentation of medical images within a web-based deployment architecture |
| US12323634B2 (en) | 2022-02-11 | 2025-06-03 | Qualcomm Incorporated | Entropy coding for neural-based media compression |
| US11876969B2 (en) | 2022-02-11 | 2024-01-16 | Qualcomm Incorporated | Neural-network media compression using quantized entropy coding distribution parameters |
| US11825090B1 (en) | 2022-07-12 | 2023-11-21 | Qualcomm Incorporated | Bit-rate estimation for video coding with machine learning enhancement |
-
2023
- 2023-01-12 EP EP23705836.7A patent/EP4476914A1/en active Pending
- 2023-01-12 KR KR1020247026353A patent/KR20240149891A/ko active Pending
- 2023-01-12 WO PCT/US2023/060543 patent/WO2023154594A1/en not_active Ceased
- 2023-01-12 JP JP2024545801A patent/JP2025508344A/ja active Pending
- 2023-01-17 TW TW112101937A patent/TW202335498A/zh unknown
- 2023-12-08 US US18/534,073 patent/US12470715B2/en active Active
Also Published As
| Publication number | Publication date |
|---|---|
| EP4476914A1 (en) | 2024-12-18 |
| KR20240149891A (ko) | 2024-10-15 |
| JP2025508344A (ja) | 2025-03-26 |
| WO2023154594A1 (en) | 2023-08-17 |
| US20240121392A1 (en) | 2024-04-11 |
| US12470715B2 (en) | 2025-11-11 |
Similar Documents
| Publication | Publication Date | Title |
|---|---|---|
| US11876969B2 (en) | Neural-network media compression using quantized entropy coding distribution parameters | |
| TW202335498A (zh) | 使用量化熵譯碼分佈參數的神經網路媒體壓縮 | |
| TW202349967A (zh) | 用於基於神經的媒體壓縮的熵譯碼 | |
| JP2013530610A (ja) | ランダムアクセス機能による画像圧縮方法 | |
| US12444090B2 (en) | Point cloud attribute prediction method and apparatus, and related device | |
| US20240129473A1 (en) | Probability estimation in multi-symbol entropy coding | |
| US20240242467A1 (en) | Video encoding and decoding method, encoder, decoder and storage medium | |
| JP7823216B2 (ja) | エンコード方法および装置、デコード方法および装置、デバイス、記憶媒体、ならびにコンピュータ・プログラム・プロダクト | |
| TW202446067A (zh) | 基於神經網路的視頻寫碼中的滑動窗口率失真優化 | |
| CN121056649A (zh) | 一种使用包括子网的神经网络编码或解码图像的方法和装置 | |
| TW202446075A (zh) | 高效的基於扭曲的神經視頻編解碼器 | |
| CN118633290A (zh) | 使用量化熵编解码分布参数的神经网络媒体压缩 | |
| WO2023202158A1 (zh) | 视频编码方法及装置 | |
| WO2022258055A1 (zh) | 点云属性信息编码方法、解码方法、装置及相关设备 | |
| Sheng et al. | High efficiency lossless image recompression algorithm with asymmetric numeral systems for real-time mobile application | |
| CN116405663A (zh) | 图像编解码方法、装置、电子设备及存储介质 | |
| CN116233426A (zh) | 属性量化、反量化方法、装置及设备 | |
| CN112188216B (zh) | 视频数据的编码方法、装置、计算机设备及存储介质 | |
| Ye et al. | Variable-rate learned image compression with integer-arithmetic-only inference | |
| US20210289206A1 (en) | Block-based spatial activity measures for pictures | |
| TW202349966A (zh) | 濾波方法、濾波模型訓練方法及相關裝置 | |
| WO2026068212A1 (en) | Gaussian pyramid based decomposition for hybrid inr network | |
| WO2025087214A1 (zh) | 解码器、编码器、图像编码、图像解码方法及存储介质 | |
| TW202433928A (zh) | 編解碼網路模型的量化方法和相關裝置 | |
| WO2024207244A1 (zh) | 点云的编解码方法、码流、编码器、解码器以及存储介质 |