JP2024521572A - Video coding for machines (VCM) encoder and decoder for combined lossless and lossy encoding - Google Patents

Video coding for machines (VCM) encoder and decoder for combined lossless and lossy encoding

Info

Publication number
JP2024521572A
Authority
JP
Japan
Prior art keywords
encoder
video
vcm
decoder
feature
Prior art date
Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
Pending
Application number
JP2023574429A
Other languages
English (en)
Japanese (ja)
Other versions
JPWO2022260900A5
JP2024521572A5
Inventor
Kalva, Hari
Furht, Borivoje
Adzic, Velibor
Original Assignee
OP Solutions, LLC
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
Filing date
Publication date
Application filed by OP Solutions, LLC
Publication of JP2024521572A
Publication of JPWO2022260900A5
Publication of JP2024521572A5

Classifications

    • H ELECTRICITY
    • H04 ELECTRIC COMMUNICATION TECHNIQUE
    • H04N PICTORIAL COMMUNICATION, e.g. TELEVISION
    • H04N21/00 Selective content distribution, e.g. interactive television or video on demand [VOD]
    • H04N21/20 Servers specifically adapted for the distribution of content, e.g. VOD servers; Operations thereof
    • H04N21/23 Processing of content or additional data; Elementary server operations; Server middleware
    • H04N21/235 Processing of additional data, e.g. scrambling of additional data or processing content descriptors
    • H04N21/2355 Processing of additional data, e.g. scrambling of additional data or processing content descriptors involving reformatting operations of additional data, e.g. HTML pages
    • H ELECTRICITY
    • H04 ELECTRIC COMMUNICATION TECHNIQUE
    • H04N PICTORIAL COMMUNICATION, e.g. TELEVISION
    • H04N19/00 Methods or arrangements for coding, decoding, compressing or decompressing digital video signals
    • H04N19/44 Decoders specially adapted therefor, e.g. video decoders which are asymmetric with respect to the encoder
    • G PHYSICS
    • G06 COMPUTING OR CALCULATING; COUNTING
    • G06N COMPUTING ARRANGEMENTS BASED ON SPECIFIC COMPUTATIONAL MODELS
    • G06N3/00 Computing arrangements based on biological models
    • G06N3/02 Neural networks
    • G06N3/08 Learning methods
    • G PHYSICS
    • G06 COMPUTING OR CALCULATING; COUNTING
    • G06V IMAGE OR VIDEO RECOGNITION OR UNDERSTANDING
    • G06V10/00 Arrangements for image or video recognition or understanding
    • G06V10/70 Arrangements for image or video recognition or understanding using pattern recognition or machine learning
    • G06V10/77 Processing image or video features in feature spaces; using data integration or data reduction, e.g. principal component analysis [PCA] or independent component analysis [ICA] or self-organising maps [SOM]; Blind source separation
    • G06V10/7715 Feature extraction, e.g. by transforming the feature space, e.g. multi-dimensional scaling [MDS]; Mappings, e.g. subspace methods
    • G PHYSICS
    • G06 COMPUTING OR CALCULATING; COUNTING
    • G06V IMAGE OR VIDEO RECOGNITION OR UNDERSTANDING
    • G06V10/00 Arrangements for image or video recognition or understanding
    • G06V10/70 Arrangements for image or video recognition or understanding using pattern recognition or machine learning
    • G06V10/82 Arrangements for image or video recognition or understanding using pattern recognition or machine learning using neural networks
    • G PHYSICS
    • G06 COMPUTING OR CALCULATING; COUNTING
    • G06V IMAGE OR VIDEO RECOGNITION OR UNDERSTANDING
    • G06V20/00 Scenes; Scene-specific elements
    • G06V20/40 Scenes; Scene-specific elements in video content
    • G06V20/46 Extracting features or characteristics from the video content, e.g. video fingerprints, representative shots or key frames
    • H ELECTRICITY
    • H04 ELECTRIC COMMUNICATION TECHNIQUE
    • H04N PICTORIAL COMMUNICATION, e.g. TELEVISION
    • H04N19/00 Methods or arrangements for coding, decoding, compressing or decompressing digital video signals
    • H04N19/10 Methods or arrangements for coding, decoding, compressing or decompressing digital video signals using adaptive coding
    • H04N19/189 Methods or arrangements for coding, decoding, compressing or decompressing digital video signals using adaptive coding characterised by the adaptation method, adaptation tool or adaptation type used for the adaptive coding
    • H ELECTRICITY
    • H04 ELECTRIC COMMUNICATION TECHNIQUE
    • H04N PICTORIAL COMMUNICATION, e.g. TELEVISION
    • H04N19/00 Methods or arrangements for coding, decoding, compressing or decompressing digital video signals
    • H04N19/46 Embedding additional information in the video signal during the compression process
    • H ELECTRICITY
    • H04 ELECTRIC COMMUNICATION TECHNIQUE
    • H04N PICTORIAL COMMUNICATION, e.g. TELEVISION
    • H04N19/00 Methods or arrangements for coding, decoding, compressing or decompressing digital video signals
    • H04N19/60 Methods or arrangements for coding, decoding, compressing or decompressing digital video signals using transform coding
    • H04N19/625 Methods or arrangements for coding, decoding, compressing or decompressing digital video signals using transform coding using discrete cosine transform [DCT]
    • H ELECTRICITY
    • H04 ELECTRIC COMMUNICATION TECHNIQUE
    • H04N PICTORIAL COMMUNICATION, e.g. TELEVISION
    • H04N19/00 Methods or arrangements for coding, decoding, compressing or decompressing digital video signals
    • H04N19/70 Methods or arrangements for coding, decoding, compressing or decompressing digital video signals characterised by syntax aspects related to video coding, e.g. related to compression standards
    • H ELECTRICITY
    • H04 ELECTRIC COMMUNICATION TECHNIQUE
    • H04N PICTORIAL COMMUNICATION, e.g. TELEVISION
    • H04N19/00 Methods or arrangements for coding, decoding, compressing or decompressing digital video signals
    • H04N19/85 Methods or arrangements for coding, decoding, compressing or decompressing digital video signals using pre-processing or post-processing specially adapted for video compression
    • H ELECTRICITY
    • H04 ELECTRIC COMMUNICATION TECHNIQUE
    • H04N PICTORIAL COMMUNICATION, e.g. TELEVISION
    • H04N19/00 Methods or arrangements for coding, decoding, compressing or decompressing digital video signals
    • H04N19/90 Methods or arrangements for coding, decoding, compressing or decompressing digital video signals using coding techniques not provided for in groups H04N19/10-H04N19/85, e.g. fractals
    • H ELECTRICITY
    • H04 ELECTRIC COMMUNICATION TECHNIQUE
    • H04N PICTORIAL COMMUNICATION, e.g. TELEVISION
    • H04N21/00 Selective content distribution, e.g. interactive television or video on demand [VOD]
    • H04N21/20 Servers specifically adapted for the distribution of content, e.g. VOD servers; Operations thereof
    • H04N21/23 Processing of content or additional data; Elementary server operations; Server middleware
    • H04N21/234 Processing of video elementary streams, e.g. splicing of video streams or manipulating encoded video stream scene graphs
    • H04N21/2343 Processing of video elementary streams, e.g. splicing of video streams or manipulating encoded video stream scene graphs involving reformatting operations of video signals for distribution or compliance with end-user requests or end-user device requirements
    • H ELECTRICITY
    • H04 ELECTRIC COMMUNICATION TECHNIQUE
    • H04N PICTORIAL COMMUNICATION, e.g. TELEVISION
    • H04N21/00 Selective content distribution, e.g. interactive television or video on demand [VOD]
    • H04N21/20 Servers specifically adapted for the distribution of content, e.g. VOD servers; Operations thereof
    • H04N21/23 Processing of content or additional data; Elementary server operations; Server middleware
    • H04N21/236 Assembling of a multiplex stream, e.g. transport stream, by combining a video stream with other content or additional data, e.g. inserting a URL [Uniform Resource Locator] into a video stream, multiplexing software data into a video stream; Remultiplexing of multiplex streams; Insertion of stuffing bits into the multiplex stream, e.g. to obtain a constant bit-rate; Assembling of a packetised elementary stream
    • H04N21/23614 Multiplexing of additional data and video streams
    • H ELECTRICITY
    • H04 ELECTRIC COMMUNICATION TECHNIQUE
    • H04N PICTORIAL COMMUNICATION, e.g. TELEVISION
    • H04N21/00 Selective content distribution, e.g. interactive television or video on demand [VOD]
    • H04N21/40 Client devices specifically adapted for the reception of or interaction with content, e.g. set-top-box [STB]; Operations thereof
    • H04N21/43 Processing of content or additional data, e.g. demultiplexing additional data from a digital video stream; Elementary client operations, e.g. monitoring of home network or synchronising decoder's clock; Client middleware
    • H04N21/435 Processing of additional data, e.g. decrypting of additional data, reconstructing software from modules extracted from the transport stream
    • H04N21/4355 Processing of additional data, e.g. decrypting of additional data, reconstructing software from modules extracted from the transport stream involving reformatting operations of additional data, e.g. HTML pages on a television screen
    • H ELECTRICITY
    • H04 ELECTRIC COMMUNICATION TECHNIQUE
    • H04N PICTORIAL COMMUNICATION, e.g. TELEVISION
    • H04N21/00 Selective content distribution, e.g. interactive television or video on demand [VOD]
    • H04N21/40 Client devices specifically adapted for the reception of or interaction with content, e.g. set-top-box [STB]; Operations thereof
    • H04N21/43 Processing of content or additional data, e.g. demultiplexing additional data from a digital video stream; Elementary client operations, e.g. monitoring of home network or synchronising decoder's clock; Client middleware
    • H04N21/44 Processing of video elementary streams, e.g. splicing a video clip retrieved from local storage with an incoming video stream or rendering scenes according to encoded video stream scene graphs
    • H04N21/4402 Processing of video elementary streams, e.g. splicing a video clip retrieved from local storage with an incoming video stream or rendering scenes according to encoded video stream scene graphs involving reformatting operations of video signals for household redistribution, storage or real-time display

Landscapes

  • Engineering & Computer Science (AREA)
  • Multimedia (AREA)
  • Signal Processing (AREA)
  • Theoretical Computer Science (AREA)
  • Physics & Mathematics (AREA)
  • General Physics & Mathematics (AREA)
  • Evolutionary Computation (AREA)
  • General Health & Medical Sciences (AREA)
  • Computer Vision & Pattern Recognition (AREA)
  • Health & Medical Sciences (AREA)
  • Software Systems (AREA)
  • Computing Systems (AREA)
  • Artificial Intelligence (AREA)
  • Databases & Information Systems (AREA)
  • Medical Informatics (AREA)
  • Life Sciences & Earth Sciences (AREA)
  • Molecular Biology (AREA)
  • Mathematical Physics (AREA)
  • Data Mining & Analysis (AREA)
  • Computational Linguistics (AREA)
  • Biophysics (AREA)
  • General Engineering & Computer Science (AREA)
  • Biomedical Technology (AREA)
  • Discrete Mathematics (AREA)
  • Compression Or Coding Systems Of Tv Signals (AREA)
  • Compression Of Band Width Or Redundancy In Fax (AREA)
JP2023574429A 2021-06-08 2022-06-01 Video coding for machines (VCM) encoder and decoder for combined lossless and lossy encoding Pending JP2024521572A (ja)

Applications Claiming Priority (3)

Application Number Priority Date Filing Date Title
US202163208241P 2021-06-08 2021-06-08
US63/208,241 2021-06-08
PCT/US2022/031726 WO2022260900A1 (en) 2021-06-08 2022-06-01 Video coding for machines (vcm) encoder and decoder for combined lossless and lossy encoding

Publications (3)

Publication Number Publication Date
JP2024521572A (ja) 2024-06-03
JPWO2022260900A5 2025-05-27
JP2024521572A5 2025-05-27

Family

ID=84425305

Family Applications (1)

Application Number Title Priority Date Filing Date
JP2023574429A Pending JP2024521572A (ja) 2021-06-08 2022-06-01 Video coding for machines (VCM) encoder and decoder for combined lossless and lossy encoding

Country Status (7)

Country Link
US (1) US20240114185A1
EP (1) EP4352963A4
JP (1) JP2024521572A
KR (1) KR20240051104A
CN (1) CN117897954A
BR (1) BR112023025493A2
WO (1) WO2022260900A1

Families Citing this family (4)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
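CN116366841B (zh) * 2021-12-28 2025-11-11 Vivo Mobile Communication Co., Ltd. Loop filtering method and terminal
US12549772B2 (en) 2023-04-11 2026-02-10 Alibaba Innovation Private Limited Object mask information for supplemental enhancement information message
WO2025154982A1 (ko) * 2024-01-17 2025-07-24 Samsung Electronics Co., Ltd. Image decoding apparatus and method, and image encoding apparatus and method
CN119484823B (zh) * 2025-01-13 2025-04-01 Central South University Fast VVC coding unit partitioning method and system based on image frequency-domain features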

Family Cites Families (19)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
US5659634A (en) * 1994-09-29 1997-08-19 Xerox Corporation Apparatus and method for encoding and reconstructing image data
US8848802B2 (en) * 2009-09-04 2014-09-30 Stmicroelectronics International N.V. System and method for object based parametric video coding
US10244246B2 (en) * 2012-02-02 2019-03-26 Texas Instruments Incorporated Sub-pictures for pixel rate balancing on multi-core platforms
US10248663B1 (en) * 2017-03-03 2019-04-02 Descartes Labs, Inc. Geo-visual search
WO2019110149A1 (en) * 2017-12-08 2019-06-13 Huawei Technologies Co., Ltd. Cluster refinement for texture synthesis in video coding
US10397518B1 (en) * 2018-01-16 2019-08-27 Amazon Technologies, Inc. Combining encoded video streams
CN112673625A (zh) * 2018-09-10 2021-04-16 Huawei Technologies Co., Ltd. Hybrid video and feature coding and decoding
EP3939318A1 (en) * 2019-03-11 2022-01-19 VID SCALE, Inc. Sub-picture bitstream extraction and reposition
US11410275B2 (en) * 2019-09-23 2022-08-09 Tencent America LLC Video coding for machine (VCM) based system and method for video super resolution (SR)
CN115210715A (zh) * 2020-01-07 2022-10-18 Nokia Technologies Oy High-level syntax for compressed representation of neural networks
EP3863022A1 (en) * 2020-02-06 2021-08-11 Siemens Healthcare GmbH Method and system for automatically characterizing liver tissue of a patient, computer program and electronically readable storage medium
US11375204B2 (en) * 2020-04-07 2022-06-28 Nokia Technologies Oy Feature-domain residual for video coding for machines
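TWI798714B (zh) * 2020-06-09 2023-04-11 Fraunhofer-Gesellschaft Video coding techniques for temporal motion vector prediction, inter-layer reference, and temporal sublayer indication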
US11451790B2 (en) * 2020-10-09 2022-09-20 Tencent America LLC Method and apparatus in video coding for machines
US20250090036A1 (en) * 2020-10-21 2025-03-20 Bruce Hopenfeld Multichannel Heartbeat Detection by Temporal Pattern Search
US12003719B2 (en) * 2020-11-26 2024-06-04 Electronics And Telecommunications Research Institute Method, apparatus and storage medium for image encoding/decoding using segmentation map
US11516478B2 (en) * 2020-12-30 2022-11-29 Hyundai Motor Company Method and apparatus for coding machine vision data using prediction
EP4272441A1 (en) * 2021-01-04 2023-11-08 Guangdong Oppo Mobile Telecommunications Corp., Ltd. Video coding based on feature extraction and picture synthesis
AU2021202142A1 (en) * 2021-04-07 2022-10-27 Canon Kabushiki Kaisha Tool selection for feature map encoding vs regular video encoding

Non-Patent Citations (4)

* Cited by examiner, † Cited by third party
Title
T. NGUYEN, ET AL.: "Description of Core Experiment 3 (CE3): Lossless Coding", JOINT VIDEO EXPERTS TEAM (JVET) OF ITU-T SG 16 WP 3 AND ISO/IEC JTC 1/SC 29/WG 11, vol. [JVET-P2023-v1] (version 1), JPN6025049744, 11 October 2019 (2019-10-11), pages 7 - 9, ISSN: 0005744201 *
THOMAS SIKORA, ET AL.: "Shape-adaptive DCT for generic coding of video", IEEE TRANSACTIONS ON CIRCUITS AND SYSTEMS FOR VIDEO TECHNOLOGY, vol. 5, no. 1, JPN6025049745, 28 February 1995 (1995-02-28), ISSN: 0005744202 *
WEN GAO, ET AL.: "Recent Standard Development Activities on Video Coding for Machines", ARXIV:2105.12653V1, JPN6025049748, 26 May 2021 (2021-05-26), pages 1 - 4, ISSN: 0005744200 *
YUAN ZHANG, ET AL.: "Use cases and requirements for Video Coding for Machines", INTERNATIONAL ORGANISATION FOR STANDARDISATION ORGANISATION INTERNATIONALE DE NORMALISATION ISO/IEC, vol. ISO/IEC JTC 1/SC 29/WG 2 N43, JPN6025049746, 6 February 2021 (2021-02-06), pages 2 - 4, ISSN: 0005744199 *

Also Published As

Publication number Publication date
EP4352963A4 (en) 2025-04-23
CN117897954A (zh) 2024-04-16
EP4352963A1 (en) 2024-04-17
US20240114185A1 (en) 2024-04-04
BR112023025493A2 (pt) 2024-02-27
WO2022260900A1 (en) 2022-12-15
KR20240051104A (ko) 2024-04-19

Similar Documents

Publication Publication Date Title
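JP2024521572A (ja) Video coding for machines (VCM) encoder and decoder for combined lossless and lossy encoding
CN107211131B (zh) System and method for mask-based processing of digital image blocks
JP2022523309A (ja) Inter prediction in exponential partitioning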
US20250280115A1 (en) Methods, systems and decoder for combined lossless and lossy coding
US20240283930A1 (en) Systems and methods for video encoding using image segmentation
JP2024514681A (ja) Systems, methods, and bitstream structure for a hybrid feature video bitstream, and decoder
US20250330583A1 (en) Methods and systems for combined lossless and lossy coding
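JP2023062136A (ja) Block-based picture fusion for contextual segmentation and processing
JP2026027303A (ja) Methods and systems for video coding using reference regions
JP7253053B2 (ja) Block-based spatial activity measure for pictures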
US20240137502A1 (en) Systems and methods for encoding and decoding video with memory-efficient prediction mode selection
WO2022047144A1 (en) Methods and systems for combined lossless and lossy coding
WO2022047129A1 (en) Methods and systems for combined lossless and lossy coding

Legal Events

Date Code Title Description
A521 Request for written amendment filed

Free format text: JAPANESE INTERMEDIATE CODE: A523

Effective date: 20250519

A621 Written request for application examination

Free format text: JAPANESE INTERMEDIATE CODE: A621

Effective date: 20250519

A977 Report on retrieval

Free format text: JAPANESE INTERMEDIATE CODE: A971007

Effective date: 20251114

A131 Notification of reasons for refusal

Free format text: JAPANESE INTERMEDIATE CODE: A131

Effective date: 20251202

A601 Written request for extension of time

Free format text: JAPANESE INTERMEDIATE CODE: A601

Effective date: 20260302