CN117897954A - Video coding for machines (VCM) encoder and decoder for combined lossless and lossy encoding - Google Patents

Video coding for machines (VCM) encoder and decoder for combined lossless and lossy encoding

Publication number
CN117897954A
Authority
CN
China
Prior art keywords
encoder
video
vcm
decoder
sub
Prior art date
Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
Pending
Application number
CN202280055653.2A
Other languages
English (en)
Chinese (zh)
Inventor
哈利·卡瓦
博里沃耶·福尔特
菲力博·阿维克
Current Assignee (The listed assignees may be inaccurate. Google has not performed a legal analysis and makes no representation or warranty as to the accuracy of the list.)
Op Solutions Co
Original Assignee
Op Solutions Co
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
2021-06-08
Filing date
2022-06-01
Publication date
2024-04-16
Application filed by Op Solutions Co
Publication of CN117897954A

Classifications

    • H ELECTRICITY
    • H04 ELECTRIC COMMUNICATION TECHNIQUE
    • H04N PICTORIAL COMMUNICATION, e.g. TELEVISION
    • H04N21/00 Selective content distribution, e.g. interactive television or video on demand [VOD]
    • H04N21/20 Servers specifically adapted for the distribution of content, e.g. VOD servers; Operations thereof
    • H04N21/23 Processing of content or additional data; Elementary server operations; Server middleware
    • H04N21/235 Processing of additional data, e.g. scrambling of additional data or processing content descriptors
    • H04N21/2355 Processing of additional data, e.g. scrambling of additional data or processing content descriptors involving reformatting operations of additional data, e.g. HTML pages
    • H ELECTRICITY
    • H04 ELECTRIC COMMUNICATION TECHNIQUE
    • H04N PICTORIAL COMMUNICATION, e.g. TELEVISION
    • H04N19/00 Methods or arrangements for coding, decoding, compressing or decompressing digital video signals
    • H04N19/44 Decoders specially adapted therefor, e.g. video decoders which are asymmetric with respect to the encoder
    • G PHYSICS
    • G06 COMPUTING OR CALCULATING; COUNTING
    • G06N COMPUTING ARRANGEMENTS BASED ON SPECIFIC COMPUTATIONAL MODELS
    • G06N3/00 Computing arrangements based on biological models
    • G06N3/02 Neural networks
    • G06N3/08 Learning methods
    • G PHYSICS
    • G06 COMPUTING OR CALCULATING; COUNTING
    • G06V IMAGE OR VIDEO RECOGNITION OR UNDERSTANDING
    • G06V10/00 Arrangements for image or video recognition or understanding
    • G06V10/70 Arrangements for image or video recognition or understanding using pattern recognition or machine learning
    • G06V10/77 Processing image or video features in feature spaces; using data integration or data reduction, e.g. principal component analysis [PCA] or independent component analysis [ICA] or self-organising maps [SOM]; Blind source separation
    • G06V10/7715 Feature extraction, e.g. by transforming the feature space, e.g. multi-dimensional scaling [MDS]; Mappings, e.g. subspace methods
    • G PHYSICS
    • G06 COMPUTING OR CALCULATING; COUNTING
    • G06V IMAGE OR VIDEO RECOGNITION OR UNDERSTANDING
    • G06V10/00 Arrangements for image or video recognition or understanding
    • G06V10/70 Arrangements for image or video recognition or understanding using pattern recognition or machine learning
    • G06V10/82 Arrangements for image or video recognition or understanding using pattern recognition or machine learning using neural networks
    • G PHYSICS
    • G06 COMPUTING OR CALCULATING; COUNTING
    • G06V IMAGE OR VIDEO RECOGNITION OR UNDERSTANDING
    • G06V20/00 Scenes; Scene-specific elements
    • G06V20/40 Scenes; Scene-specific elements in video content
    • G06V20/46 Extracting features or characteristics from the video content, e.g. video fingerprints, representative shots or key frames
    • H ELECTRICITY
    • H04 ELECTRIC COMMUNICATION TECHNIQUE
    • H04N PICTORIAL COMMUNICATION, e.g. TELEVISION
    • H04N19/00 Methods or arrangements for coding, decoding, compressing or decompressing digital video signals
    • H04N19/10 Methods or arrangements for coding, decoding, compressing or decompressing digital video signals using adaptive coding
    • H04N19/189 Methods or arrangements for coding, decoding, compressing or decompressing digital video signals using adaptive coding characterised by the adaptation method, adaptation tool or adaptation type used for the adaptive coding
    • H ELECTRICITY
    • H04 ELECTRIC COMMUNICATION TECHNIQUE
    • H04N PICTORIAL COMMUNICATION, e.g. TELEVISION
    • H04N19/00 Methods or arrangements for coding, decoding, compressing or decompressing digital video signals
    • H04N19/46 Embedding additional information in the video signal during the compression process
    • H ELECTRICITY
    • H04 ELECTRIC COMMUNICATION TECHNIQUE
    • H04N PICTORIAL COMMUNICATION, e.g. TELEVISION
    • H04N19/00 Methods or arrangements for coding, decoding, compressing or decompressing digital video signals
    • H04N19/60 Methods or arrangements for coding, decoding, compressing or decompressing digital video signals using transform coding
    • H04N19/625 Methods or arrangements for coding, decoding, compressing or decompressing digital video signals using transform coding using discrete cosine transform [DCT]
    • H ELECTRICITY
    • H04 ELECTRIC COMMUNICATION TECHNIQUE
    • H04N PICTORIAL COMMUNICATION, e.g. TELEVISION
    • H04N19/00 Methods or arrangements for coding, decoding, compressing or decompressing digital video signals
    • H04N19/70 Methods or arrangements for coding, decoding, compressing or decompressing digital video signals characterised by syntax aspects related to video coding, e.g. related to compression standards
    • H ELECTRICITY
    • H04 ELECTRIC COMMUNICATION TECHNIQUE
    • H04N PICTORIAL COMMUNICATION, e.g. TELEVISION
    • H04N19/00 Methods or arrangements for coding, decoding, compressing or decompressing digital video signals
    • H04N19/85 Methods or arrangements for coding, decoding, compressing or decompressing digital video signals using pre-processing or post-processing specially adapted for video compression
    • H ELECTRICITY
    • H04 ELECTRIC COMMUNICATION TECHNIQUE
    • H04N PICTORIAL COMMUNICATION, e.g. TELEVISION
    • H04N19/00 Methods or arrangements for coding, decoding, compressing or decompressing digital video signals
    • H04N19/90 Methods or arrangements for coding, decoding, compressing or decompressing digital video signals using coding techniques not provided for in groups H04N19/10-H04N19/85, e.g. fractals
    • H ELECTRICITY
    • H04 ELECTRIC COMMUNICATION TECHNIQUE
    • H04N PICTORIAL COMMUNICATION, e.g. TELEVISION
    • H04N21/00 Selective content distribution, e.g. interactive television or video on demand [VOD]
    • H04N21/20 Servers specifically adapted for the distribution of content, e.g. VOD servers; Operations thereof
    • H04N21/23 Processing of content or additional data; Elementary server operations; Server middleware
    • H04N21/234 Processing of video elementary streams, e.g. splicing of video streams or manipulating encoded video stream scene graphs
    • H04N21/2343 Processing of video elementary streams, e.g. splicing of video streams or manipulating encoded video stream scene graphs involving reformatting operations of video signals for distribution or compliance with end-user requests or end-user device requirements
    • H ELECTRICITY
    • H04 ELECTRIC COMMUNICATION TECHNIQUE
    • H04N PICTORIAL COMMUNICATION, e.g. TELEVISION
    • H04N21/00 Selective content distribution, e.g. interactive television or video on demand [VOD]
    • H04N21/20 Servers specifically adapted for the distribution of content, e.g. VOD servers; Operations thereof
    • H04N21/23 Processing of content or additional data; Elementary server operations; Server middleware
    • H04N21/236 Assembling of a multiplex stream, e.g. transport stream, by combining a video stream with other content or additional data, e.g. inserting a URL [Uniform Resource Locator] into a video stream, multiplexing software data into a video stream; Remultiplexing of multiplex streams; Insertion of stuffing bits into the multiplex stream, e.g. to obtain a constant bit-rate; Assembling of a packetised elementary stream
    • H04N21/23614 Multiplexing of additional data and video streams
    • H ELECTRICITY
    • H04 ELECTRIC COMMUNICATION TECHNIQUE
    • H04N PICTORIAL COMMUNICATION, e.g. TELEVISION
    • H04N21/00 Selective content distribution, e.g. interactive television or video on demand [VOD]
    • H04N21/40 Client devices specifically adapted for the reception of or interaction with content, e.g. set-top-box [STB]; Operations thereof
    • H04N21/43 Processing of content or additional data, e.g. demultiplexing additional data from a digital video stream; Elementary client operations, e.g. monitoring of home network or synchronising decoder's clock; Client middleware
    • H04N21/435 Processing of additional data, e.g. decrypting of additional data, reconstructing software from modules extracted from the transport stream
    • H04N21/4355 Processing of additional data, e.g. decrypting of additional data, reconstructing software from modules extracted from the transport stream involving reformatting operations of additional data, e.g. HTML pages on a television screen
    • H ELECTRICITY
    • H04 ELECTRIC COMMUNICATION TECHNIQUE
    • H04N PICTORIAL COMMUNICATION, e.g. TELEVISION
    • H04N21/00 Selective content distribution, e.g. interactive television or video on demand [VOD]
    • H04N21/40 Client devices specifically adapted for the reception of or interaction with content, e.g. set-top-box [STB]; Operations thereof
    • H04N21/43 Processing of content or additional data, e.g. demultiplexing additional data from a digital video stream; Elementary client operations, e.g. monitoring of home network or synchronising decoder's clock; Client middleware
    • H04N21/44 Processing of video elementary streams, e.g. splicing a video clip retrieved from local storage with an incoming video stream or rendering scenes according to encoded video stream scene graphs
    • H04N21/4402 Processing of video elementary streams, e.g. splicing a video clip retrieved from local storage with an incoming video stream or rendering scenes according to encoded video stream scene graphs involving reformatting operations of video signals for household redistribution, storage or real-time display

Landscapes

  • Engineering & Computer Science (AREA)
  • Multimedia (AREA)
  • Signal Processing (AREA)
  • Theoretical Computer Science (AREA)
  • Physics & Mathematics (AREA)
  • General Physics & Mathematics (AREA)
  • Evolutionary Computation (AREA)
  • General Health & Medical Sciences (AREA)
  • Computer Vision & Pattern Recognition (AREA)
  • Health & Medical Sciences (AREA)
  • Software Systems (AREA)
  • Computing Systems (AREA)
  • Artificial Intelligence (AREA)
  • Databases & Information Systems (AREA)
  • Medical Informatics (AREA)
  • Life Sciences & Earth Sciences (AREA)
  • Molecular Biology (AREA)
  • Mathematical Physics (AREA)
  • Data Mining & Analysis (AREA)
  • Computational Linguistics (AREA)
  • Biophysics (AREA)
  • General Engineering & Computer Science (AREA)
  • Biomedical Technology (AREA)
  • Discrete Mathematics (AREA)
  • Compression Or Coding Systems Of Tv Signals (AREA)
  • Compression Of Band Width Or Redundancy In Fax (AREA)
CN202280055653.2A 2021-06-08 2022-06-01 Video coding for machines (VCM) encoder and decoder for combined lossless and lossy encoding Pending CN117897954A (zh)

Applications Claiming Priority (3)

Application Number Priority Date Filing Date Title
US202163208241P 2021-06-08 2021-06-08
US63/208,241 2021-06-08
PCT/US2022/031726 WO2022260900A1 (en) 2021-06-08 2022-06-01 Video coding for machines (vcm) encoder and decoder for combined lossless and lossy encoding

Publications (1)

Publication Number Publication Date
CN117897954A 2024-04-16

Family

ID=84425305

Family Applications (1)

Application Number Title Priority Date Filing Date
CN202280055653.2A Pending CN117897954A (zh) 2021-06-08 2022-06-01 Video coding for machines (VCM) encoder and decoder for combined lossless and lossy encoding

Country Status (7)

Country Link
US (1) US20240114185A1
EP (1) EP4352963A4
JP (1) JP2024521572A
KR (1) KR20240051104A
CN (1) CN117897954A
BR (1) BR112023025493A2
WO (1) WO2022260900A1

Cited By (1)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN119484823A (zh) * 2025-01-13 2025-02-18 Central South University Fast VVC coding unit partitioning method and system based on image frequency-domain features

Families Citing this family (3)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN116366841B (zh) * 2021-12-28 2025-11-11 Vivo Mobile Communication Co., Ltd. Loop filtering method and terminal
US12549772B2 (en) 2023-04-11 2026-02-10 Alibaba Innovation Private Limited Object mask information for supplemental enhancement information message
WO2025154982A1 (ko) * 2024-01-17 2025-07-24 Samsung Electronics Co., Ltd. Image decoding apparatus and method, and image encoding apparatus and method

Family Cites Families (19)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
US5659634A (en) * 1994-09-29 1997-08-19 Xerox Corporation Apparatus and method for encoding and reconstructing image data
US8848802B2 (en) * 2009-09-04 2014-09-30 Stmicroelectronics International N.V. System and method for object based parametric video coding
US10244246B2 (en) * 2012-02-02 2019-03-26 Texas Instruments Incorporated Sub-pictures for pixel rate balancing on multi-core platforms
US10248663B1 (en) * 2017-03-03 2019-04-02 Descartes Labs, Inc. Geo-visual search
WO2019110149A1 (en) * 2017-12-08 2019-06-13 Huawei Technologies Co., Ltd. Cluster refinement for texture synthesis in video coding
US10397518B1 (en) * 2018-01-16 2019-08-27 Amazon Technologies, Inc. Combining encoded video streams
CN112673625A (zh) * 2018-09-10 2021-04-16 Huawei Technologies Co., Ltd. Hybrid video and feature coding and decoding
EP3939318A1 (en) * 2019-03-11 2022-01-19 VID SCALE, Inc. Sub-picture bitstream extraction and reposition
US11410275B2 (en) * 2019-09-23 2022-08-09 Tencent America LLC Video coding for machine (VCM) based system and method for video super resolution (SR)
CN115210715A (zh) * 2020-01-07 2022-10-18 Nokia Technologies Oy High-level syntax for compressed representation of neural networks
EP3863022A1 (en) * 2020-02-06 2021-08-11 Siemens Healthcare GmbH Method and system for automatically characterizing liver tissue of a patient, computer program and electronically readable storage medium
US11375204B2 (en) * 2020-04-07 2022-06-28 Nokia Technologies Oy Feature-domain residual for video coding for machines
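TWI798714B (zh) * 2020-06-09 2023-04-11 Fraunhofer-Gesellschaft Video coding techniques for temporal motion vector prediction, inter-layer reference, and temporal sublayer indication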
US11451790B2 (en) * 2020-10-09 2022-09-20 Tencent America LLC Method and apparatus in video coding for machines
US20250090036A1 (en) * 2020-10-21 2025-03-20 Bruce Hopenfeld Multichannel Heartbeat Detection by Temporal Pattern Search
US12003719B2 (en) * 2020-11-26 2024-06-04 Electronics And Telecommunications Research Institute Method, apparatus and storage medium for image encoding/decoding using segmentation map
US11516478B2 (en) * 2020-12-30 2022-11-29 Hyundai Motor Company Method and apparatus for coding machine vision data using prediction
EP4272441A1 (en) * 2021-01-04 2023-11-08 Guangdong Oppo Mobile Telecommunications Corp., Ltd. Video coding based on feature extraction and picture synthesis
AU2021202142A1 (en) * 2021-04-07 2022-10-27 Canon Kabushiki Kaisha Tool selection for feature map encoding vs regular video encoding

Cited By (2)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
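CN119484823A (zh) * 2025-01-13 2025-02-18 Central South University Fast VVC coding unit partitioning method and system based on image frequency-domain features
CN119484823B (zh) * 2025-01-13 2025-04-01 Central South University Fast VVC coding unit partitioning method and system based on image frequency-domain features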

Also Published As

Publication number Publication date
EP4352963A4 (en) 2025-04-23
EP4352963A1 (en) 2024-04-17
US20240114185A1 (en) 2024-04-04
BR112023025493A2 (pt) 2024-02-27
WO2022260900A1 (en) 2022-12-15
JP2024521572A (ja) 2024-06-03
KR20240051104A (ko) 2024-04-19

Similar Documents

Publication Publication Date Title
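CN108028941B (zh) Method and apparatus for encoding and decoding digital images by superpixels
KR102071764B1 (ko) Image encoding and decoding method and apparatus
CN117897954A (zh) Video coding for machines (VCM) encoder and decoder for combined lossless and lossy encoding
CN107211131B (zh) System and method for mask-based processing of digital image blocks
KR20240051076A (ko) Encoder and decoder for video coding for machines (VCM)
JP2022523309A (ja) Inter prediction in exponential partitioning
US20250280115A1 (en) Methods, systems and decoder for combined lossless and lossy coding
CN119487841A (zh) Parallel processing of image regions using neural networks: decoding, post-filtering, and RDOQ
US20240414316A1 (en) Systems, methods, and bitstream structure for video coding and decoding for machines with adaptive inference
JP2024514681A (ja) Systems, methods, and bitstream structure for hybrid feature video bitstream, and decoder
JP2023062136A (ja) Block-based picture fusion for contextual segmentation and processing
KR20240104130A (ko) Systems and methods for object and event detection and feature-based rate-distortion optimization for video coding
US12355947B1 (en) Methods and systems for combined lossless and lossy coding
CN118235408A (zh) Systems and methods for scalable video coding for machines
CN118414833A (zh) Systems and methods for optimizing a loss function for video coding for machines
CN118451709A (zh) Feature encoding/decoding method and apparatus, and recording medium storing a bitstream
CN119522574A (zh) Parallel processing of image regions using neural networks: decoding, post-filtering, and RDOQ
CN118020290A (zh) Systems and methods for encoding and decoding video with memory-efficient prediction mode selection
JP7253053B2 (ja) Block-based spatial activity measures for pictures
WO2025059287A1 (en) Systems and methods for content adaptive multi-scale feature layer filtering
WO2022047144A1 (en) Methods and systems for combined lossless and lossy coding
CN118414829A (zh) Systems and methods for object and event detection and for feature-based rate-distortion optimization for video coding
CN120153652A (zh) Image and video coding with adaptive quantization for machine-based applications
CN118119951A (zh) Systems and methods for joint optimization of training and encoder-side downsampling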

Legal Events

Date Code Title Description
PB01 Publication
SE01 Entry into force of request for substantive examination
CB03 Change of inventor or designer information

Inventor after: Harry Cava

Inventor after: Borivoye Fult

Inventor after: Filippo Azke

Inventor before: Harry Cava

Inventor before: Borivoye Fult

Inventor before: Filippo Avik