CN117897736A - 用于机器视频编码(vcm)的编码器和解码器 - Google Patents
用于机器视频编码(vcm)的编码器和解码器 Download PDFInfo
- Publication number
- CN117897736A CN117897736A CN202280047141.1A CN202280047141A CN117897736A CN 117897736 A CN117897736 A CN 117897736A CN 202280047141 A CN202280047141 A CN 202280047141A CN 117897736 A CN117897736 A CN 117897736A
- Authority
- CN
- China
- Prior art keywords
- feature
- encoder
- video
- vcm
- decoder
- Prior art date
- Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
- Pending
Links
Classifications
-
- H—ELECTRICITY
- H04—ELECTRIC COMMUNICATION TECHNIQUE
- H04N—PICTORIAL COMMUNICATION, e.g. TELEVISION
- H04N21/00—Selective content distribution, e.g. interactive television or video on demand [VOD]
- H04N21/20—Servers specifically adapted for the distribution of content, e.g. VOD servers; Operations thereof
- H04N21/23—Processing of content or additional data; Elementary server operations; Server middleware
- H04N21/235—Processing of additional data, e.g. scrambling of additional data or processing content descriptors
- H04N21/2355—Processing of additional data, e.g. scrambling of additional data or processing content descriptors involving reformatting operations of additional data, e.g. HTML pages
-
- H—ELECTRICITY
- H04—ELECTRIC COMMUNICATION TECHNIQUE
- H04N—PICTORIAL COMMUNICATION, e.g. TELEVISION
- H04N19/00—Methods or arrangements for coding, decoding, compressing or decompressing digital video signals
- H04N19/44—Decoders specially adapted therefor, e.g. video decoders which are asymmetric with respect to the encoder
-
- G—PHYSICS
- G06—COMPUTING OR CALCULATING; COUNTING
- G06N—COMPUTING ARRANGEMENTS BASED ON SPECIFIC COMPUTATIONAL MODELS
- G06N3/00—Computing arrangements based on biological models
- G06N3/02—Neural networks
- G06N3/04—Architecture, e.g. interconnection topology
- G06N3/0464—Convolutional networks [CNN, ConvNet]
-
- G—PHYSICS
- G06—COMPUTING OR CALCULATING; COUNTING
- G06V—IMAGE OR VIDEO RECOGNITION OR UNDERSTANDING
- G06V10/00—Arrangements for image or video recognition or understanding
- G06V10/70—Arrangements for image or video recognition or understanding using pattern recognition or machine learning
- G06V10/77—Processing image or video features in feature spaces; using data integration or data reduction, e.g. principal component analysis [PCA] or independent component analysis [ICA] or self-organising maps [SOM]; Blind source separation
- G06V10/7715—Feature extraction, e.g. by transforming the feature space, e.g. multi-dimensional scaling [MDS]; Mappings, e.g. subspace methods
-
- G—PHYSICS
- G06—COMPUTING OR CALCULATING; COUNTING
- G06V—IMAGE OR VIDEO RECOGNITION OR UNDERSTANDING
- G06V10/00—Arrangements for image or video recognition or understanding
- G06V10/70—Arrangements for image or video recognition or understanding using pattern recognition or machine learning
- G06V10/82—Arrangements for image or video recognition or understanding using pattern recognition or machine learning using neural networks
-
- G—PHYSICS
- G06—COMPUTING OR CALCULATING; COUNTING
- G06V—IMAGE OR VIDEO RECOGNITION OR UNDERSTANDING
- G06V20/00—Scenes; Scene-specific elements
- G06V20/40—Scenes; Scene-specific elements in video content
- G06V20/46—Extracting features or characteristics from the video content, e.g. video fingerprints, representative shots or key frames
-
- H—ELECTRICITY
- H04—ELECTRIC COMMUNICATION TECHNIQUE
- H04N—PICTORIAL COMMUNICATION, e.g. TELEVISION
- H04N19/00—Methods or arrangements for coding, decoding, compressing or decompressing digital video signals
- H04N19/10—Methods or arrangements for coding, decoding, compressing or decompressing digital video signals using adaptive coding
- H04N19/189—Methods or arrangements for coding, decoding, compressing or decompressing digital video signals using adaptive coding characterised by the adaptation method, adaptation tool or adaptation type used for the adaptive coding
-
- H—ELECTRICITY
- H04—ELECTRIC COMMUNICATION TECHNIQUE
- H04N—PICTORIAL COMMUNICATION, e.g. TELEVISION
- H04N19/00—Methods or arrangements for coding, decoding, compressing or decompressing digital video signals
- H04N19/46—Embedding additional information in the video signal during the compression process
-
- H—ELECTRICITY
- H04—ELECTRIC COMMUNICATION TECHNIQUE
- H04N—PICTORIAL COMMUNICATION, e.g. TELEVISION
- H04N21/00—Selective content distribution, e.g. interactive television or video on demand [VOD]
- H04N21/20—Servers specifically adapted for the distribution of content, e.g. VOD servers; Operations thereof
- H04N21/23—Processing of content or additional data; Elementary server operations; Server middleware
- H04N21/234—Processing of video elementary streams, e.g. splicing of video streams or manipulating encoded video stream scene graphs
- H04N21/23418—Processing of video elementary streams, e.g. splicing of video streams or manipulating encoded video stream scene graphs involving operations for analysing video streams, e.g. detecting features or characteristics
-
- H—ELECTRICITY
- H04—ELECTRIC COMMUNICATION TECHNIQUE
- H04N—PICTORIAL COMMUNICATION, e.g. TELEVISION
- H04N21/00—Selective content distribution, e.g. interactive television or video on demand [VOD]
- H04N21/20—Servers specifically adapted for the distribution of content, e.g. VOD servers; Operations thereof
- H04N21/23—Processing of content or additional data; Elementary server operations; Server middleware
- H04N21/234—Processing of video elementary streams, e.g. splicing of video streams or manipulating encoded video stream scene graphs
- H04N21/2343—Processing of video elementary streams, e.g. splicing of video streams or manipulating encoded video stream scene graphs involving reformatting operations of video signals for distribution or compliance with end-user requests or end-user device requirements
-
- H—ELECTRICITY
- H04—ELECTRIC COMMUNICATION TECHNIQUE
- H04N—PICTORIAL COMMUNICATION, e.g. TELEVISION
- H04N21/00—Selective content distribution, e.g. interactive television or video on demand [VOD]
- H04N21/40—Client devices specifically adapted for the reception of or interaction with content, e.g. set-top-box [STB]; Operations thereof
- H04N21/43—Processing of content or additional data, e.g. demultiplexing additional data from a digital video stream; Elementary client operations, e.g. monitoring of home network or synchronising decoder's clock; Client middleware
- H04N21/435—Processing of additional data, e.g. decrypting of additional data, reconstructing software from modules extracted from the transport stream
- H04N21/4355—Processing of additional data, e.g. decrypting of additional data, reconstructing software from modules extracted from the transport stream involving reformatting operations of additional data, e.g. HTML pages on a television screen
-
- H—ELECTRICITY
- H04—ELECTRIC COMMUNICATION TECHNIQUE
- H04N—PICTORIAL COMMUNICATION, e.g. TELEVISION
- H04N21/00—Selective content distribution, e.g. interactive television or video on demand [VOD]
- H04N21/40—Client devices specifically adapted for the reception of or interaction with content, e.g. set-top-box [STB]; Operations thereof
- H04N21/43—Processing of content or additional data, e.g. demultiplexing additional data from a digital video stream; Elementary client operations, e.g. monitoring of home network or synchronising decoder's clock; Client middleware
- H04N21/44—Processing of video elementary streams, e.g. splicing a video clip retrieved from local storage with an incoming video stream or rendering scenes according to encoded video stream scene graphs
- H04N21/4402—Processing of video elementary streams, e.g. splicing a video clip retrieved from local storage with an incoming video stream or rendering scenes according to encoded video stream scene graphs involving reformatting operations of video signals for household redistribution, storage or real-time display
Landscapes
- Engineering & Computer Science (AREA)
- Multimedia (AREA)
- Theoretical Computer Science (AREA)
- Signal Processing (AREA)
- Physics & Mathematics (AREA)
- General Physics & Mathematics (AREA)
- Evolutionary Computation (AREA)
- Computer Vision & Pattern Recognition (AREA)
- General Health & Medical Sciences (AREA)
- Software Systems (AREA)
- Computing Systems (AREA)
- Artificial Intelligence (AREA)
- Health & Medical Sciences (AREA)
- Medical Informatics (AREA)
- Databases & Information Systems (AREA)
- Life Sciences & Earth Sciences (AREA)
- Biophysics (AREA)
- Computational Linguistics (AREA)
- Data Mining & Analysis (AREA)
- Molecular Biology (AREA)
- General Engineering & Computer Science (AREA)
- Mathematical Physics (AREA)
- Biomedical Technology (AREA)
- Compression Or Coding Systems Of Tv Signals (AREA)
Applications Claiming Priority (3)
| Application Number | Priority Date | Filing Date | Title |
|---|---|---|---|
| US202163197834P | 2021-06-07 | 2021-06-07 | |
| US63/197,834 | 2021-06-07 | ||
| PCT/US2022/032048 WO2022260934A1 (en) | 2021-06-07 | 2022-06-03 | Encoder and decoder for video coding for machines (vcm) |
Publications (1)
| Publication Number | Publication Date |
|---|---|
| CN117897736A true CN117897736A (zh) | 2024-04-16 |
Family
ID=84425308
Family Applications (1)
| Application Number | Title | Priority Date | Filing Date |
|---|---|---|---|
| CN202280047141.1A Pending CN117897736A (zh) | 2021-06-07 | 2022-06-03 | 用于机器视频编码(vcm)的编码器和解码器 |
Country Status (6)
| Country | Link |
|---|---|
| US (1) | US20240107088A1 (https=) |
| EP (1) | EP4352701A4 (https=) |
| JP (1) | JP2024520682A (https=) |
| KR (1) | KR20240051076A (https=) |
| CN (1) | CN117897736A (https=) |
| WO (1) | WO2022260934A1 (https=) |
Families Citing this family (6)
| Publication number | Priority date | Publication date | Assignee | Title |
|---|---|---|---|---|
| EP4427459A4 (en) * | 2021-11-04 | 2026-01-28 | Op Solutions Llc | SYSTEMS AND METHODS FOR TRANSFERRING MOTION INFORMATION FROM A VISUAL DOMAIN WITH CHARACTERISTICS AND FINE-FINING MOTION VECTOR CONTROL ON THE DECODER SIDE BASED ON CHARACTERISTICS |
| CN120419187A (zh) * | 2023-01-03 | 2025-08-01 | Lg 电子株式会社 | 编码/解码方法和装置以及存储比特流的记录介质 |
| US12309404B2 (en) * | 2023-03-07 | 2025-05-20 | Disney Enterprises, Inc. | Contextual video compression framework with spatial-temporal cross-covariance transformers |
| CN120917753A (zh) * | 2023-04-12 | 2025-11-07 | Lg 电子株式会社 | 编码/解码方法、设备和存储比特流的记录介质 |
| WO2025221719A1 (en) * | 2024-04-15 | 2025-10-23 | Op Solutions, Llc | Systems, methods and bitstreams for removing non-essential feature map information in machine-based applications using regions of interest |
| WO2025254466A1 (ko) * | 2024-06-05 | 2025-12-11 | 한화비전 주식회사 | 스케일링 파라미터를 이용하는 부호화/복호화 방법, 장치 및 기록 매체 |
Family Cites Families (8)
| Publication number | Priority date | Publication date | Assignee | Title |
|---|---|---|---|---|
| US20170249739A1 (en) * | 2016-02-26 | 2017-08-31 | Biomediq A/S | Computer analysis of mammograms |
| US11019355B2 (en) * | 2018-04-03 | 2021-05-25 | Electronics And Telecommunications Research Institute | Inter-prediction method and apparatus using reference frame generated based on deep learning |
| WO2020055279A1 (en) * | 2018-09-10 | 2020-03-19 | Huawei Technologies Co., Ltd. | Hybrid video and feature coding and decoding |
| US11410275B2 (en) * | 2019-09-23 | 2022-08-09 | Tencent America LLC | Video coding for machine (VCM) based system and method for video super resolution (SR) |
| WO2021095245A1 (ja) * | 2019-11-15 | 2021-05-20 | 日本電信電話株式会社 | 画像処理方法、データ処理方法、画像処理装置、およびプログラム |
| WO2021172956A1 (ko) * | 2020-02-28 | 2021-09-02 | 엘지전자 주식회사 | 영상 특징 정보 시그널링을 위한 영상 부호화/복호화 방법, 장치 및 비트스트림을 전송하는 방법 |
| EP4136848A4 (en) * | 2020-04-16 | 2024-04-03 | INTEL Corporation | PATCH-BASED VIDEO CODING FOR MACHINES |
| US11516478B2 (en) * | 2020-12-30 | 2022-11-29 | Hyundai Motor Company | Method and apparatus for coding machine vision data using prediction |
-
2022
- 2022-06-03 CN CN202280047141.1A patent/CN117897736A/zh active Pending
- 2022-06-03 EP EP22820800.5A patent/EP4352701A4/en active Pending
- 2022-06-03 WO PCT/US2022/032048 patent/WO2022260934A1/en not_active Ceased
- 2022-06-03 JP JP2023574502A patent/JP2024520682A/ja active Pending
- 2022-06-03 KR KR1020237044890A patent/KR20240051076A/ko active Pending
-
2023
- 2023-12-04 US US18/528,099 patent/US20240107088A1/en active Pending
Also Published As
| Publication number | Publication date |
|---|---|
| KR20240051076A (ko) | 2024-04-19 |
| WO2022260934A1 (en) | 2022-12-15 |
| JP2024520682A (ja) | 2024-05-24 |
| EP4352701A4 (en) | 2025-04-09 |
| EP4352701A1 (en) | 2024-04-17 |
| US20240107088A1 (en) | 2024-03-28 |
Similar Documents
| Publication | Publication Date | Title |
|---|---|---|
| US20240107088A1 (en) | Encoder and decoder for video coding for machines (vcm) | |
| US20240357142A1 (en) | Video and feature coding for multi-task machine learning | |
| US20240283942A1 (en) | Systems and methods for object and event detection and feature-based rate-distortion optimization for video coding | |
| US20240406424A1 (en) | Systems and methods for video coding for machines using an autoencoder | |
| CN119301609A (zh) | 用于使用生成式对抗模型对图像数据进行编码和解码的系统和方法 | |
| US20240185572A1 (en) | Systems and methods for joint optimization training and encoder side downsampling | |
| US20240340391A1 (en) | Intelligent multi-stream video coding for video surveillance | |
| US20240267531A1 (en) | Systems and methods for optimizing a loss function for video coding for machines | |
| US20240236342A1 (en) | Systems and methods for scalable video coding for machines | |
| US20240357107A1 (en) | Systems and methods for video coding of features using subpictures | |
| US20240291999A1 (en) | Systems and methods for motion information transfer from visual to feature domain and feature-based decoder-side motion vector refinement control | |
| CN118614062A (zh) | 用于从视觉到特征域的运动信息传递的系统和方法 | |
| CN118414829A (zh) | 用于对象和事件检测以及用于视频编码的基于特征的率失真优化的系统和方法 | |
| CN118119951A (zh) | 用于联合优化训练和编码器侧下采样的系统和方法 | |
| CN121713217A (zh) | 用于机器视频编码的解码后的帧增强的系统和方法 | |
| CN118020296A (zh) | 用于视频序列的解码器侧合成的系统和方法 | |
| CN118742904A (zh) | 用于多任务机器学习的视频和特征编码 |
Legal Events
| Date | Code | Title | Description |
|---|---|---|---|
| PB01 | Publication | ||
| PB01 | Publication | ||
| SE01 | Entry into force of request for substantive examination |