CN117897954A - 用于组合式无损和有损编码的机器视频编码(vcm)编码器和解码器 - Google Patents
用于组合式无损和有损编码的机器视频编码(vcm)编码器和解码器 Download PDFInfo
- Publication number
- CN117897954A CN117897954A CN202280055653.2A CN202280055653A CN117897954A CN 117897954 A CN117897954 A CN 117897954A CN 202280055653 A CN202280055653 A CN 202280055653A CN 117897954 A CN117897954 A CN 117897954A
- Authority
- CN
- China
- Prior art keywords
- encoder
- video
- vcm
- decoder
- sub
- Prior art date
- Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
- Pending
Links
Classifications
-
- H—ELECTRICITY
- H04—ELECTRIC COMMUNICATION TECHNIQUE
- H04N—PICTORIAL COMMUNICATION, e.g. TELEVISION
- H04N21/00—Selective content distribution, e.g. interactive television or video on demand [VOD]
- H04N21/20—Servers specifically adapted for the distribution of content, e.g. VOD servers; Operations thereof
- H04N21/23—Processing of content or additional data; Elementary server operations; Server middleware
- H04N21/235—Processing of additional data, e.g. scrambling of additional data or processing content descriptors
- H04N21/2355—Processing of additional data, e.g. scrambling of additional data or processing content descriptors involving reformatting operations of additional data, e.g. HTML pages
-
- H—ELECTRICITY
- H04—ELECTRIC COMMUNICATION TECHNIQUE
- H04N—PICTORIAL COMMUNICATION, e.g. TELEVISION
- H04N19/00—Methods or arrangements for coding, decoding, compressing or decompressing digital video signals
- H04N19/44—Decoders specially adapted therefor, e.g. video decoders which are asymmetric with respect to the encoder
-
- G—PHYSICS
- G06—COMPUTING OR CALCULATING; COUNTING
- G06N—COMPUTING ARRANGEMENTS BASED ON SPECIFIC COMPUTATIONAL MODELS
- G06N3/00—Computing arrangements based on biological models
- G06N3/02—Neural networks
- G06N3/08—Learning methods
-
- G—PHYSICS
- G06—COMPUTING OR CALCULATING; COUNTING
- G06V—IMAGE OR VIDEO RECOGNITION OR UNDERSTANDING
- G06V10/00—Arrangements for image or video recognition or understanding
- G06V10/70—Arrangements for image or video recognition or understanding using pattern recognition or machine learning
- G06V10/77—Processing image or video features in feature spaces; using data integration or data reduction, e.g. principal component analysis [PCA] or independent component analysis [ICA] or self-organising maps [SOM]; Blind source separation
- G06V10/7715—Feature extraction, e.g. by transforming the feature space, e.g. multi-dimensional scaling [MDS]; Mappings, e.g. subspace methods
-
- G—PHYSICS
- G06—COMPUTING OR CALCULATING; COUNTING
- G06V—IMAGE OR VIDEO RECOGNITION OR UNDERSTANDING
- G06V10/00—Arrangements for image or video recognition or understanding
- G06V10/70—Arrangements for image or video recognition or understanding using pattern recognition or machine learning
- G06V10/82—Arrangements for image or video recognition or understanding using pattern recognition or machine learning using neural networks
-
- G—PHYSICS
- G06—COMPUTING OR CALCULATING; COUNTING
- G06V—IMAGE OR VIDEO RECOGNITION OR UNDERSTANDING
- G06V20/00—Scenes; Scene-specific elements
- G06V20/40—Scenes; Scene-specific elements in video content
- G06V20/46—Extracting features or characteristics from the video content, e.g. video fingerprints, representative shots or key frames
-
- H—ELECTRICITY
- H04—ELECTRIC COMMUNICATION TECHNIQUE
- H04N—PICTORIAL COMMUNICATION, e.g. TELEVISION
- H04N19/00—Methods or arrangements for coding, decoding, compressing or decompressing digital video signals
- H04N19/10—Methods or arrangements for coding, decoding, compressing or decompressing digital video signals using adaptive coding
- H04N19/189—Methods or arrangements for coding, decoding, compressing or decompressing digital video signals using adaptive coding characterised by the adaptation method, adaptation tool or adaptation type used for the adaptive coding
-
- H—ELECTRICITY
- H04—ELECTRIC COMMUNICATION TECHNIQUE
- H04N—PICTORIAL COMMUNICATION, e.g. TELEVISION
- H04N19/00—Methods or arrangements for coding, decoding, compressing or decompressing digital video signals
- H04N19/46—Embedding additional information in the video signal during the compression process
-
- H—ELECTRICITY
- H04—ELECTRIC COMMUNICATION TECHNIQUE
- H04N—PICTORIAL COMMUNICATION, e.g. TELEVISION
- H04N19/00—Methods or arrangements for coding, decoding, compressing or decompressing digital video signals
- H04N19/60—Methods or arrangements for coding, decoding, compressing or decompressing digital video signals using transform coding
- H04N19/625—Methods or arrangements for coding, decoding, compressing or decompressing digital video signals using transform coding using discrete cosine transform [DCT]
-
- H—ELECTRICITY
- H04—ELECTRIC COMMUNICATION TECHNIQUE
- H04N—PICTORIAL COMMUNICATION, e.g. TELEVISION
- H04N19/00—Methods or arrangements for coding, decoding, compressing or decompressing digital video signals
- H04N19/70—Methods or arrangements for coding, decoding, compressing or decompressing digital video signals characterised by syntax aspects related to video coding, e.g. related to compression standards
-
- H—ELECTRICITY
- H04—ELECTRIC COMMUNICATION TECHNIQUE
- H04N—PICTORIAL COMMUNICATION, e.g. TELEVISION
- H04N19/00—Methods or arrangements for coding, decoding, compressing or decompressing digital video signals
- H04N19/85—Methods or arrangements for coding, decoding, compressing or decompressing digital video signals using pre-processing or post-processing specially adapted for video compression
-
- H—ELECTRICITY
- H04—ELECTRIC COMMUNICATION TECHNIQUE
- H04N—PICTORIAL COMMUNICATION, e.g. TELEVISION
- H04N19/00—Methods or arrangements for coding, decoding, compressing or decompressing digital video signals
- H04N19/90—Methods or arrangements for coding, decoding, compressing or decompressing digital video signals using coding techniques not provided for in groups H04N19/10-H04N19/85, e.g. fractals
-
- H—ELECTRICITY
- H04—ELECTRIC COMMUNICATION TECHNIQUE
- H04N—PICTORIAL COMMUNICATION, e.g. TELEVISION
- H04N21/00—Selective content distribution, e.g. interactive television or video on demand [VOD]
- H04N21/20—Servers specifically adapted for the distribution of content, e.g. VOD servers; Operations thereof
- H04N21/23—Processing of content or additional data; Elementary server operations; Server middleware
- H04N21/234—Processing of video elementary streams, e.g. splicing of video streams or manipulating encoded video stream scene graphs
- H04N21/2343—Processing of video elementary streams, e.g. splicing of video streams or manipulating encoded video stream scene graphs involving reformatting operations of video signals for distribution or compliance with end-user requests or end-user device requirements
-
- H—ELECTRICITY
- H04—ELECTRIC COMMUNICATION TECHNIQUE
- H04N—PICTORIAL COMMUNICATION, e.g. TELEVISION
- H04N21/00—Selective content distribution, e.g. interactive television or video on demand [VOD]
- H04N21/20—Servers specifically adapted for the distribution of content, e.g. VOD servers; Operations thereof
- H04N21/23—Processing of content or additional data; Elementary server operations; Server middleware
- H04N21/236—Assembling of a multiplex stream, e.g. transport stream, by combining a video stream with other content or additional data, e.g. inserting a URL [Uniform Resource Locator] into a video stream, multiplexing software data into a video stream; Remultiplexing of multiplex streams; Insertion of stuffing bits into the multiplex stream, e.g. to obtain a constant bit-rate; Assembling of a packetised elementary stream
- H04N21/23614—Multiplexing of additional data and video streams
-
- H—ELECTRICITY
- H04—ELECTRIC COMMUNICATION TECHNIQUE
- H04N—PICTORIAL COMMUNICATION, e.g. TELEVISION
- H04N21/00—Selective content distribution, e.g. interactive television or video on demand [VOD]
- H04N21/40—Client devices specifically adapted for the reception of or interaction with content, e.g. set-top-box [STB]; Operations thereof
- H04N21/43—Processing of content or additional data, e.g. demultiplexing additional data from a digital video stream; Elementary client operations, e.g. monitoring of home network or synchronising decoder's clock; Client middleware
- H04N21/435—Processing of additional data, e.g. decrypting of additional data, reconstructing software from modules extracted from the transport stream
- H04N21/4355—Processing of additional data, e.g. decrypting of additional data, reconstructing software from modules extracted from the transport stream involving reformatting operations of additional data, e.g. HTML pages on a television screen
-
- H—ELECTRICITY
- H04—ELECTRIC COMMUNICATION TECHNIQUE
- H04N—PICTORIAL COMMUNICATION, e.g. TELEVISION
- H04N21/00—Selective content distribution, e.g. interactive television or video on demand [VOD]
- H04N21/40—Client devices specifically adapted for the reception of or interaction with content, e.g. set-top-box [STB]; Operations thereof
- H04N21/43—Processing of content or additional data, e.g. demultiplexing additional data from a digital video stream; Elementary client operations, e.g. monitoring of home network or synchronising decoder's clock; Client middleware
- H04N21/44—Processing of video elementary streams, e.g. splicing a video clip retrieved from local storage with an incoming video stream or rendering scenes according to encoded video stream scene graphs
- H04N21/4402—Processing of video elementary streams, e.g. splicing a video clip retrieved from local storage with an incoming video stream or rendering scenes according to encoded video stream scene graphs involving reformatting operations of video signals for household redistribution, storage or real-time display
Landscapes
- Engineering & Computer Science (AREA)
- Multimedia (AREA)
- Signal Processing (AREA)
- Theoretical Computer Science (AREA)
- Physics & Mathematics (AREA)
- General Physics & Mathematics (AREA)
- Evolutionary Computation (AREA)
- General Health & Medical Sciences (AREA)
- Computer Vision & Pattern Recognition (AREA)
- Health & Medical Sciences (AREA)
- Software Systems (AREA)
- Computing Systems (AREA)
- Artificial Intelligence (AREA)
- Databases & Information Systems (AREA)
- Medical Informatics (AREA)
- Life Sciences & Earth Sciences (AREA)
- Molecular Biology (AREA)
- Mathematical Physics (AREA)
- Data Mining & Analysis (AREA)
- Computational Linguistics (AREA)
- Biophysics (AREA)
- General Engineering & Computer Science (AREA)
- Biomedical Technology (AREA)
- Discrete Mathematics (AREA)
- Compression Or Coding Systems Of Tv Signals (AREA)
- Compression Of Band Width Or Redundancy In Fax (AREA)
Applications Claiming Priority (3)
| Application Number | Priority Date | Filing Date | Title |
|---|---|---|---|
| US202163208241P | 2021-06-08 | 2021-06-08 | |
| US63/208,241 | 2021-06-08 | ||
| PCT/US2022/031726 WO2022260900A1 (en) | 2021-06-08 | 2022-06-01 | Video coding for machines (vcm) encoder and decoder for combined lossless and lossy encoding |
Publications (1)
| Publication Number | Publication Date |
|---|---|
| CN117897954A true CN117897954A (zh) | 2024-04-16 |
Family
ID=84425305
Family Applications (1)
| Application Number | Title | Priority Date | Filing Date |
|---|---|---|---|
| CN202280055653.2A Pending CN117897954A (zh) | 2021-06-08 | 2022-06-01 | 用于组合式无损和有损编码的机器视频编码(vcm)编码器和解码器 |
Country Status (7)
| Country | Link |
|---|---|
| US (1) | US20240114185A1 (https=) |
| EP (1) | EP4352963A4 (https=) |
| JP (1) | JP2024521572A (https=) |
| KR (1) | KR20240051104A (https=) |
| CN (1) | CN117897954A (https=) |
| BR (1) | BR112023025493A2 (https=) |
| WO (1) | WO2022260900A1 (https=) |
Cited By (1)
| Publication number | Priority date | Publication date | Assignee | Title |
|---|---|---|---|---|
| CN119484823A (zh) * | 2025-01-13 | 2025-02-18 | 中南大学 | 基于图像频域特征的vvc编码单元快速划分方法和系统 |
Families Citing this family (3)
| Publication number | Priority date | Publication date | Assignee | Title |
|---|---|---|---|---|
| CN116366841B (zh) * | 2021-12-28 | 2025-11-11 | 维沃移动通信有限公司 | 环路滤波方法及终端 |
| US12549772B2 (en) | 2023-04-11 | 2026-02-10 | Alibaba Innovation Private Limited | Object mask information for supplemental enhancement information message |
| WO2025154982A1 (ko) * | 2024-01-17 | 2025-07-24 | 삼성전자 주식회사 | 영상 복호화 장치 및 방법, 및 영상 부호화 장치 및 방법 |
Family Cites Families (19)
| Publication number | Priority date | Publication date | Assignee | Title |
|---|---|---|---|---|
| US5659634A (en) * | 1994-09-29 | 1997-08-19 | Xerox Corporation | Apparatus and method for encoding and reconstructing image data |
| US8848802B2 (en) * | 2009-09-04 | 2014-09-30 | Stmicroelectronics International N.V. | System and method for object based parametric video coding |
| US10244246B2 (en) * | 2012-02-02 | 2019-03-26 | Texas Instruments Incorporated | Sub-pictures for pixel rate balancing on multi-core platforms |
| US10248663B1 (en) * | 2017-03-03 | 2019-04-02 | Descartes Labs, Inc. | Geo-visual search |
| WO2019110149A1 (en) * | 2017-12-08 | 2019-06-13 | Huawei Technologies Co., Ltd. | Cluster refinement for texture synthesis in video coding |
| US10397518B1 (en) * | 2018-01-16 | 2019-08-27 | Amazon Technologies, Inc. | Combining encoded video streams |
| CN112673625A (zh) * | 2018-09-10 | 2021-04-16 | 华为技术有限公司 | 混合视频以及特征编码和解码 |
| EP3939318A1 (en) * | 2019-03-11 | 2022-01-19 | VID SCALE, Inc. | Sub-picture bitstream extraction and reposition |
| US11410275B2 (en) * | 2019-09-23 | 2022-08-09 | Tencent America LLC | Video coding for machine (VCM) based system and method for video super resolution (SR) |
| CN115210715A (zh) * | 2020-01-07 | 2022-10-18 | 诺基亚技术有限公司 | 用于神经网络的压缩表示的高级语法 |
| EP3863022A1 (en) * | 2020-02-06 | 2021-08-11 | Siemens Healthcare GmbH | Method and system for automatically characterizing liver tissue of a patient, computer program and electronically readable storage medium |
| US11375204B2 (en) * | 2020-04-07 | 2022-06-28 | Nokia Technologies Oy | Feature-domain residual for video coding for machines |
| TWI798714B (zh) * | 2020-06-09 | 2023-04-11 | 弗勞恩霍夫爾協會 | 時間移動向量預測、層間參考及時間子層指示的視訊寫碼技術 |
| US11451790B2 (en) * | 2020-10-09 | 2022-09-20 | Tencent America LLC | Method and apparatus in video coding for machines |
| US20250090036A1 (en) * | 2020-10-21 | 2025-03-20 | Bruce Hopenfeld | Multichannel Heartbeat Detection by Temporal Pattern Search |
| US12003719B2 (en) * | 2020-11-26 | 2024-06-04 | Electronics And Telecommunications Research Institute | Method, apparatus and storage medium for image encoding/decoding using segmentation map |
| US11516478B2 (en) * | 2020-12-30 | 2022-11-29 | Hyundai Motor Company | Method and apparatus for coding machine vision data using prediction |
| EP4272441A1 (en) * | 2021-01-04 | 2023-11-08 | Guangdong Oppo Mobile Telecommunications Corp., Ltd. | Video coding based on feature extraction and picture synthesis |
| AU2021202142A1 (en) * | 2021-04-07 | 2022-10-27 | Canon Kabushiki Kaisha | Tool selection for feature map encoding vs regular video encoding |
-
2022
- 2022-06-01 CN CN202280055653.2A patent/CN117897954A/zh active Pending
- 2022-06-01 BR BR112023025493A patent/BR112023025493A2/pt unknown
- 2022-06-01 JP JP2023574429A patent/JP2024521572A/ja active Pending
- 2022-06-01 WO PCT/US2022/031726 patent/WO2022260900A1/en not_active Ceased
- 2022-06-01 EP EP22820781.7A patent/EP4352963A4/en active Pending
- 2022-06-01 KR KR1020247000360A patent/KR20240051104A/ko active Pending
-
2023
- 2023-12-01 US US18/526,539 patent/US20240114185A1/en active Pending
Cited By (2)
| Publication number | Priority date | Publication date | Assignee | Title |
|---|---|---|---|---|
| CN119484823A (zh) * | 2025-01-13 | 2025-02-18 | 中南大学 | 基于图像频域特征的vvc编码单元快速划分方法和系统 |
| CN119484823B (zh) * | 2025-01-13 | 2025-04-01 | 中南大学 | 基于图像频域特征的vvc编码单元快速划分方法和系统 |
Also Published As
| Publication number | Publication date |
|---|---|
| EP4352963A4 (en) | 2025-04-23 |
| EP4352963A1 (en) | 2024-04-17 |
| US20240114185A1 (en) | 2024-04-04 |
| BR112023025493A2 (pt) | 2024-02-27 |
| WO2022260900A1 (en) | 2022-12-15 |
| JP2024521572A (ja) | 2024-06-03 |
| KR20240051104A (ko) | 2024-04-19 |
Similar Documents
| Publication | Publication Date | Title |
|---|---|---|
| CN108028941B (zh) | 用于通过超像素编码和解码数字图像的方法和装置 | |
| KR102071764B1 (ko) | 영상 부호화, 복호화 방법 및 장치 | |
| CN117897954A (zh) | 用于组合式无损和有损编码的机器视频编码(vcm)编码器和解码器 | |
| CN107211131B (zh) | 对数字图像块进行基于掩码的处理的系统和方法 | |
| KR20240051076A (ko) | 기계용 비디오 코딩(vcm)을 위한 인코더 및 디코더 | |
| JP2022523309A (ja) | 指数関数的分割におけるインター予測 | |
| US20250280115A1 (en) | Methods, systems and decoder for combined lossless and lossy coding | |
| CN119487841A (zh) | 使用神经网络进行图像区域的并行处理-解码、后滤波和rdoq | |
| US20240414316A1 (en) | Systems, methods, and bitstream structure for video coding and decoding for machines with adaptive inference | |
| JP2024514681A (ja) | ハイブリッド特徴ビデオ・ビットストリーム用のシステム、方法、及びビットストリーム構造、及びデコーダ | |
| JP2023062136A (ja) | 文脈的区分化および処理のためのブロックベースのピクチャ融合 | |
| KR20240104130A (ko) | 객체 및 이벤트 검출 및 비디오 코딩을 위한 특징-기반 레이트-왜곡 최적화를 위한 시스템 및 방법 | |
| US12355947B1 (en) | Methods and systems for combined lossless and lossy coding | |
| CN118235408A (zh) | 用于可缩放的机器视频编码的系统和方法 | |
| CN118414833A (zh) | 用于优化机器视频编码的损失函数的系统和方法 | |
| CN118451709A (zh) | 特征编码/解码方法和设备以及其中存储比特流的记录介质 | |
| CN119522574A (zh) | 使用神经网络进行图像区域的并行处理-解码、后滤波和rdoq | |
| CN118020290A (zh) | 用存储器高效预测模式选择来编码和解码视频的系统和方法 | |
| JP7253053B2 (ja) | ピクチャのためのブロックベースの空間活性測度 | |
| WO2025059287A1 (en) | Systems and methods for content adaptive multi-scale feature layer filtering | |
| WO2022047144A1 (en) | Methods and systems for combined lossless and lossy coding | |
| CN118414829A (zh) | 用于对象和事件检测以及用于视频编码的基于特征的率失真优化的系统和方法 | |
| CN120153652A (zh) | 用于基于机器的应用的具有自适应量化的图像和视频编码 | |
| CN118119951A (zh) | 用于联合优化训练和编码器侧下采样的系统和方法 |
Legal Events
| Date | Code | Title | Description |
|---|---|---|---|
| PB01 | Publication | ||
| PB01 | Publication | ||
| SE01 | Entry into force of request for substantive examination | ||
| CB03 | Change of inventor or designer information |
Inventor after: Harry Cava Inventor after: Borivoye Fult Inventor after: Filippo Azke Inventor before: Harry Cava Inventor before: Borivoye Fult Inventor before: Filippo Avik |
|
| CB03 | Change of inventor or designer information |