KR20240051076A - Encoder and decoder for video coding for machines (vcm) - Google Patents
Encoder and decoder for video coding for machines (vcm)
- Publication number
- KR20240051076A
- Authority
- KR
- South Korea
- Prior art keywords
- encoder
- feature
- video
- vcm
- decoder
- Prior art date
- Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
- Pending
Classifications
-
- H—ELECTRICITY
- H04—ELECTRIC COMMUNICATION TECHNIQUE
- H04N—PICTORIAL COMMUNICATION, e.g. TELEVISION
- H04N21/00—Selective content distribution, e.g. interactive television or video on demand [VOD]
- H04N21/20—Servers specifically adapted for the distribution of content, e.g. VOD servers; Operations thereof
- H04N21/23—Processing of content or additional data; Elementary server operations; Server middleware
- H04N21/235—Processing of additional data, e.g. scrambling of additional data or processing content descriptors
- H04N21/2355—Processing of additional data, e.g. scrambling of additional data or processing content descriptors involving reformatting operations of additional data, e.g. HTML pages
-
- H—ELECTRICITY
- H04—ELECTRIC COMMUNICATION TECHNIQUE
- H04N—PICTORIAL COMMUNICATION, e.g. TELEVISION
- H04N19/00—Methods or arrangements for coding, decoding, compressing or decompressing digital video signals
- H04N19/44—Decoders specially adapted therefor, e.g. video decoders which are asymmetric with respect to the encoder
-
- G—PHYSICS
- G06—COMPUTING OR CALCULATING; COUNTING
- G06N—COMPUTING ARRANGEMENTS BASED ON SPECIFIC COMPUTATIONAL MODELS
- G06N3/00—Computing arrangements based on biological models
- G06N3/02—Neural networks
- G06N3/04—Architecture, e.g. interconnection topology
- G06N3/0464—Convolutional networks [CNN, ConvNet]
-
- G—PHYSICS
- G06—COMPUTING OR CALCULATING; COUNTING
- G06V—IMAGE OR VIDEO RECOGNITION OR UNDERSTANDING
- G06V10/00—Arrangements for image or video recognition or understanding
- G06V10/70—Arrangements for image or video recognition or understanding using pattern recognition or machine learning
- G06V10/77—Processing image or video features in feature spaces; using data integration or data reduction, e.g. principal component analysis [PCA] or independent component analysis [ICA] or self-organising maps [SOM]; Blind source separation
- G06V10/7715—Feature extraction, e.g. by transforming the feature space, e.g. multi-dimensional scaling [MDS]; Mappings, e.g. subspace methods
-
- G—PHYSICS
- G06—COMPUTING OR CALCULATING; COUNTING
- G06V—IMAGE OR VIDEO RECOGNITION OR UNDERSTANDING
- G06V10/00—Arrangements for image or video recognition or understanding
- G06V10/70—Arrangements for image or video recognition or understanding using pattern recognition or machine learning
- G06V10/82—Arrangements for image or video recognition or understanding using pattern recognition or machine learning using neural networks
-
- G—PHYSICS
- G06—COMPUTING OR CALCULATING; COUNTING
- G06V—IMAGE OR VIDEO RECOGNITION OR UNDERSTANDING
- G06V20/00—Scenes; Scene-specific elements
- G06V20/40—Scenes; Scene-specific elements in video content
- G06V20/46—Extracting features or characteristics from the video content, e.g. video fingerprints, representative shots or key frames
-
- H—ELECTRICITY
- H04—ELECTRIC COMMUNICATION TECHNIQUE
- H04N—PICTORIAL COMMUNICATION, e.g. TELEVISION
- H04N19/00—Methods or arrangements for coding, decoding, compressing or decompressing digital video signals
- H04N19/10—Methods or arrangements for coding, decoding, compressing or decompressing digital video signals using adaptive coding
- H04N19/189—Methods or arrangements for coding, decoding, compressing or decompressing digital video signals using adaptive coding characterised by the adaptation method, adaptation tool or adaptation type used for the adaptive coding
-
- H—ELECTRICITY
- H04—ELECTRIC COMMUNICATION TECHNIQUE
- H04N—PICTORIAL COMMUNICATION, e.g. TELEVISION
- H04N19/00—Methods or arrangements for coding, decoding, compressing or decompressing digital video signals
- H04N19/46—Embedding additional information in the video signal during the compression process
-
- H—ELECTRICITY
- H04—ELECTRIC COMMUNICATION TECHNIQUE
- H04N—PICTORIAL COMMUNICATION, e.g. TELEVISION
- H04N21/00—Selective content distribution, e.g. interactive television or video on demand [VOD]
- H04N21/20—Servers specifically adapted for the distribution of content, e.g. VOD servers; Operations thereof
- H04N21/23—Processing of content or additional data; Elementary server operations; Server middleware
- H04N21/234—Processing of video elementary streams, e.g. splicing of video streams or manipulating encoded video stream scene graphs
- H04N21/23418—Processing of video elementary streams, e.g. splicing of video streams or manipulating encoded video stream scene graphs involving operations for analysing video streams, e.g. detecting features or characteristics
-
- H—ELECTRICITY
- H04—ELECTRIC COMMUNICATION TECHNIQUE
- H04N—PICTORIAL COMMUNICATION, e.g. TELEVISION
- H04N21/00—Selective content distribution, e.g. interactive television or video on demand [VOD]
- H04N21/20—Servers specifically adapted for the distribution of content, e.g. VOD servers; Operations thereof
- H04N21/23—Processing of content or additional data; Elementary server operations; Server middleware
- H04N21/234—Processing of video elementary streams, e.g. splicing of video streams or manipulating encoded video stream scene graphs
- H04N21/2343—Processing of video elementary streams, e.g. splicing of video streams or manipulating encoded video stream scene graphs involving reformatting operations of video signals for distribution or compliance with end-user requests or end-user device requirements
-
- H—ELECTRICITY
- H04—ELECTRIC COMMUNICATION TECHNIQUE
- H04N—PICTORIAL COMMUNICATION, e.g. TELEVISION
- H04N21/00—Selective content distribution, e.g. interactive television or video on demand [VOD]
- H04N21/40—Client devices specifically adapted for the reception of or interaction with content, e.g. set-top-box [STB]; Operations thereof
- H04N21/43—Processing of content or additional data, e.g. demultiplexing additional data from a digital video stream; Elementary client operations, e.g. monitoring of home network or synchronising decoder's clock; Client middleware
- H04N21/435—Processing of additional data, e.g. decrypting of additional data, reconstructing software from modules extracted from the transport stream
- H04N21/4355—Processing of additional data, e.g. decrypting of additional data, reconstructing software from modules extracted from the transport stream involving reformatting operations of additional data, e.g. HTML pages on a television screen
-
- H—ELECTRICITY
- H04—ELECTRIC COMMUNICATION TECHNIQUE
- H04N—PICTORIAL COMMUNICATION, e.g. TELEVISION
- H04N21/00—Selective content distribution, e.g. interactive television or video on demand [VOD]
- H04N21/40—Client devices specifically adapted for the reception of or interaction with content, e.g. set-top-box [STB]; Operations thereof
- H04N21/43—Processing of content or additional data, e.g. demultiplexing additional data from a digital video stream; Elementary client operations, e.g. monitoring of home network or synchronising decoder's clock; Client middleware
- H04N21/44—Processing of video elementary streams, e.g. splicing a video clip retrieved from local storage with an incoming video stream or rendering scenes according to encoded video stream scene graphs
- H04N21/4402—Processing of video elementary streams, e.g. splicing a video clip retrieved from local storage with an incoming video stream or rendering scenes according to encoded video stream scene graphs involving reformatting operations of video signals for household redistribution, storage or real-time display
Landscapes
- Engineering & Computer Science (AREA)
- Multimedia (AREA)
- Theoretical Computer Science (AREA)
- Signal Processing (AREA)
- Physics & Mathematics (AREA)
- General Physics & Mathematics (AREA)
- Evolutionary Computation (AREA)
- Computer Vision & Pattern Recognition (AREA)
- General Health & Medical Sciences (AREA)
- Software Systems (AREA)
- Computing Systems (AREA)
- Artificial Intelligence (AREA)
- Health & Medical Sciences (AREA)
- Medical Informatics (AREA)
- Databases & Information Systems (AREA)
- Life Sciences & Earth Sciences (AREA)
- Biophysics (AREA)
- Computational Linguistics (AREA)
- Data Mining & Analysis (AREA)
- Molecular Biology (AREA)
- General Engineering & Computer Science (AREA)
- Mathematical Physics (AREA)
- Biomedical Technology (AREA)
- Compression Or Coding Systems Of Tv Signals (AREA)
Applications Claiming Priority (3)
| Application Number | Priority Date | Filing Date | Title |
|---|---|---|---|
| US202163197834P | 2021-06-07 | 2021-06-07 | |
| US63/197,834 | 2021-06-07 | | |
| PCT/US2022/032048 WO2022260934A1 (en) | 2021-06-07 | 2022-06-03 | Encoder and decoder for video coding for machines (vcm) |
Publications (1)
| Publication Number | Publication Date |
|---|---|
| KR20240051076A (ko) | 2024-04-19 |
Family
ID=84425308
Family Applications (1)
| Application Number | Priority Date | Filing Date | Title |
|---|---|---|---|
| KR1020237044890A Pending KR20240051076A (ko) | 2021-06-07 | 2022-06-03 | Encoder and decoder for video coding for machines (vcm) |
Country Status (6)
| Country | Link |
|---|---|
| US (1) | US20240107088A1 |
| EP (1) | EP4352701A4 |
| JP (1) | JP2024520682A |
| KR (1) | KR20240051076A |
| CN (1) | CN117897736A |
| WO (1) | WO2022260934A1 |
Cited By (1)
| Publication number | Priority date | Publication date | Assignee | Title |
|---|---|---|---|---|
| WO2025254466A1 (ko) * | 2024-06-05 | 2025-12-11 | 한화비전 주식회사 (Hanwha Vision Co., Ltd.) | Encoding/decoding method, apparatus, and recording medium using a scaling parameter |
Families Citing this family (5)
| Publication number | Priority date | Publication date | Assignee | Title |
|---|---|---|---|---|
| EP4427459A4 (en) * | 2021-11-04 | 2026-01-28 | Op Solutions Llc | SYSTEMS AND METHODS FOR TRANSFERRING MOTION INFORMATION FROM A VISUAL DOMAIN WITH CHARACTERISTICS AND FINE-FINING MOTION VECTOR CONTROL ON THE DECODER SIDE BASED ON CHARACTERISTICS |
| CN120419187A (zh) * | 2023-01-03 | 2025-08-01 | LG Electronics Inc. | Encoding/decoding method and apparatus, and recording medium storing a bitstream |
| US12309404B2 (en) * | 2023-03-07 | 2025-05-20 | Disney Enterprises, Inc. | Contextual video compression framework with spatial-temporal cross-covariance transformers |
| CN120917753A (zh) * | 2023-04-12 | 2025-11-07 | LG Electronics Inc. | Encoding/decoding method, device, and recording medium storing a bitstream |
| WO2025221719A1 (en) * | 2024-04-15 | 2025-10-23 | Op Solutions, Llc | Systems, methods and bitstreams for removing non-essential feature map information in machine-based applications using regions of interest |
Family Cites Families (8)
| Publication number | Priority date | Publication date | Assignee | Title |
|---|---|---|---|---|
| US20170249739A1 (en) * | 2016-02-26 | 2017-08-31 | Biomediq A/S | Computer analysis of mammograms |
| US11019355B2 (en) * | 2018-04-03 | 2021-05-25 | Electronics And Telecommunications Research Institute | Inter-prediction method and apparatus using reference frame generated based on deep learning |
| WO2020055279A1 (en) * | 2018-09-10 | 2020-03-19 | Huawei Technologies Co., Ltd. | Hybrid video and feature coding and decoding |
| US11410275B2 (en) * | 2019-09-23 | 2022-08-09 | Tencent America LLC | Video coding for machine (VCM) based system and method for video super resolution (SR) |
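| WO2021095245A1 (ja) * | 2019-11-15 | 2021-05-20 | Nippon Telegraph and Telephone Corporation | Image processing method, data processing method, image processing device, and program |
| WO2021172956A1 (ko) * | 2020-02-28 | 2021-09-02 | LG Electronics Inc. | Image encoding/decoding method and apparatus for signaling image feature information, and method for transmitting a bitstream |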
| EP4136848A4 (en) * | 2020-04-16 | 2024-04-03 | INTEL Corporation | PATCH-BASED VIDEO CODING FOR MACHINES |
| US11516478B2 (en) * | 2020-12-30 | 2022-11-29 | Hyundai Motor Company | Method and apparatus for coding machine vision data using prediction |
-
2022
- 2022-06-03 CN CN202280047141.1A patent/CN117897736A/zh active Pending
- 2022-06-03 EP EP22820800.5A patent/EP4352701A4/en active Pending
- 2022-06-03 WO PCT/US2022/032048 patent/WO2022260934A1/en not_active Ceased
- 2022-06-03 JP JP2023574502A patent/JP2024520682A/ja active Pending
- 2022-06-03 KR KR1020237044890A patent/KR20240051076A/ko active Pending
-
2023
- 2023-12-04 US US18/528,099 patent/US20240107088A1/en active Pending
Also Published As
| Publication number | Publication date |
|---|---|
| CN117897736A (zh) | 2024-04-16 |
| WO2022260934A1 (en) | 2022-12-15 |
| JP2024520682A (ja) | 2024-05-24 |
| EP4352701A4 (en) | 2025-04-09 |
| EP4352701A1 (en) | 2024-04-17 |
| US20240107088A1 (en) | 2024-03-28 |
Similar Documents
| Publication | Publication Date | Title |
|---|---|---|
| US20240107088A1 (en) | | Encoder and decoder for video coding for machines (vcm) |
| US20240357142A1 (en) | | Video and feature coding for multi-task machine learning |
| US20240338486A1 (en) | | Systems and methods for privacy protection in video communication systems |
| US20240283942A1 (en) | | Systems and methods for object and event detection and feature-based rate-distortion optimization for video coding |
| EP4388456A1 (en) | | Systems and methods for joint optimization training and encoder side downsampling |
| US20240267531A1 (en) | | Systems and methods for optimizing a loss function for video coding for machines |
| US20240340391A1 (en) | | Intelligent multi-stream video coding for video surveillance |
| US20240236342A1 (en) | | Systems and methods for scalable video coding for machines |
| US20240357107A1 (en) | | Systems and methods for video coding of features using subpictures |
| US20240291999A1 (en) | | Systems and methods for motion information transfer from visual to feature domain and feature-based decoder-side motion vector refinement control |
| CN118414829A (zh) | | Systems and methods for object and event detection and feature-based rate-distortion optimization for video coding |
| CN118614062A (zh) | | Systems and methods for motion information transfer from visual to feature domain |
| CN118119951A (zh) | | Systems and methods for joint optimization training and encoder-side downsampling |
| WO2025007083A1 (en) | | Systems and method for decoded frame augmentation for video coding for machines |
| WO2025015116A2 (en) | | Systems and method for decoded frame augmentation for video coding for machines with dct filtering for map |
| CN118742904A (zh) | | Video and feature coding for multi-task machine learning |
Legal Events
| Date | Code | Title | Description |
|---|---|---|---|
| | PA0105 | International application | Patent event date: 20231226; Patent event code: PA01051R01D; Comment text: International Patent Application |
| | PG1501 | Laying open of application | |
| | A201 | Request for examination | |
| | PA0201 | Request for examination | Patent event code: PA02012R01D; Patent event date: 20250515; Comment text: Request for Examination of Application |