KR20240051076A - 기계용 비디오 코딩(vcm)을 위한 인코더 및 디코더 - Google Patents

기계용 비디오 코딩(vcm)을 위한 인코더 및 디코더 Download PDF

Info

Publication number
KR20240051076A
KR20240051076A KR1020237044890A KR20237044890A KR20240051076A KR 20240051076 A KR20240051076 A KR 20240051076A KR 1020237044890 A KR1020237044890 A KR 1020237044890A KR 20237044890 A KR20237044890 A KR 20237044890A KR 20240051076 A KR20240051076 A KR 20240051076A
Authority
KR
South Korea
Prior art keywords
encoder
feature
video
vcm
decoder
Prior art date
Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
Pending
Application number
KR1020237044890A
Other languages
English (en)
Korean (ko)
Inventor
하리 칼바
보리보예 푸르트
벨리보르 아지치
Original Assignee
오피 솔루션즈, 엘엘씨
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
Filing date
Publication date
Application filed by 오피 솔루션즈, 엘엘씨 filed Critical 오피 솔루션즈, 엘엘씨
Publication of KR20240051076A publication Critical patent/KR20240051076A/ko
Pending legal-status Critical Current

Links

Classifications

    • HELECTRICITY
    • H04ELECTRIC COMMUNICATION TECHNIQUE
    • H04NPICTORIAL COMMUNICATION, e.g. TELEVISION
    • H04N21/00Selective content distribution, e.g. interactive television or video on demand [VOD]
    • H04N21/20Servers specifically adapted for the distribution of content, e.g. VOD servers; Operations thereof
    • H04N21/23Processing of content or additional data; Elementary server operations; Server middleware
    • H04N21/235Processing of additional data, e.g. scrambling of additional data or processing content descriptors
    • H04N21/2355Processing of additional data, e.g. scrambling of additional data or processing content descriptors involving reformatting operations of additional data, e.g. HTML pages
    • HELECTRICITY
    • H04ELECTRIC COMMUNICATION TECHNIQUE
    • H04NPICTORIAL COMMUNICATION, e.g. TELEVISION
    • H04N19/00Methods or arrangements for coding, decoding, compressing or decompressing digital video signals
    • H04N19/44Decoders specially adapted therefor, e.g. video decoders which are asymmetric with respect to the encoder
    • GPHYSICS
    • G06COMPUTING OR CALCULATING; COUNTING
    • G06NCOMPUTING ARRANGEMENTS BASED ON SPECIFIC COMPUTATIONAL MODELS
    • G06N3/00Computing arrangements based on biological models
    • G06N3/02Neural networks
    • G06N3/04Architecture, e.g. interconnection topology
    • G06N3/0464Convolutional networks [CNN, ConvNet]
    • GPHYSICS
    • G06COMPUTING OR CALCULATING; COUNTING
    • G06VIMAGE OR VIDEO RECOGNITION OR UNDERSTANDING
    • G06V10/00Arrangements for image or video recognition or understanding
    • G06V10/70Arrangements for image or video recognition or understanding using pattern recognition or machine learning
    • G06V10/77Processing image or video features in feature spaces; using data integration or data reduction, e.g. principal component analysis [PCA] or independent component analysis [ICA] or self-organising maps [SOM]; Blind source separation
    • G06V10/7715Feature extraction, e.g. by transforming the feature space, e.g. multi-dimensional scaling [MDS]; Mappings, e.g. subspace methods
    • GPHYSICS
    • G06COMPUTING OR CALCULATING; COUNTING
    • G06VIMAGE OR VIDEO RECOGNITION OR UNDERSTANDING
    • G06V10/00Arrangements for image or video recognition or understanding
    • G06V10/70Arrangements for image or video recognition or understanding using pattern recognition or machine learning
    • G06V10/82Arrangements for image or video recognition or understanding using pattern recognition or machine learning using neural networks
    • GPHYSICS
    • G06COMPUTING OR CALCULATING; COUNTING
    • G06VIMAGE OR VIDEO RECOGNITION OR UNDERSTANDING
    • G06V20/00Scenes; Scene-specific elements
    • G06V20/40Scenes; Scene-specific elements in video content
    • G06V20/46Extracting features or characteristics from the video content, e.g. video fingerprints, representative shots or key frames
    • HELECTRICITY
    • H04ELECTRIC COMMUNICATION TECHNIQUE
    • H04NPICTORIAL COMMUNICATION, e.g. TELEVISION
    • H04N19/00Methods or arrangements for coding, decoding, compressing or decompressing digital video signals
    • H04N19/10Methods or arrangements for coding, decoding, compressing or decompressing digital video signals using adaptive coding
    • H04N19/189Methods or arrangements for coding, decoding, compressing or decompressing digital video signals using adaptive coding characterised by the adaptation method, adaptation tool or adaptation type used for the adaptive coding
    • HELECTRICITY
    • H04ELECTRIC COMMUNICATION TECHNIQUE
    • H04NPICTORIAL COMMUNICATION, e.g. TELEVISION
    • H04N19/00Methods or arrangements for coding, decoding, compressing or decompressing digital video signals
    • H04N19/46Embedding additional information in the video signal during the compression process
    • HELECTRICITY
    • H04ELECTRIC COMMUNICATION TECHNIQUE
    • H04NPICTORIAL COMMUNICATION, e.g. TELEVISION
    • H04N21/00Selective content distribution, e.g. interactive television or video on demand [VOD]
    • H04N21/20Servers specifically adapted for the distribution of content, e.g. VOD servers; Operations thereof
    • H04N21/23Processing of content or additional data; Elementary server operations; Server middleware
    • H04N21/234Processing of video elementary streams, e.g. splicing of video streams or manipulating encoded video stream scene graphs
    • H04N21/23418Processing of video elementary streams, e.g. splicing of video streams or manipulating encoded video stream scene graphs involving operations for analysing video streams, e.g. detecting features or characteristics
    • HELECTRICITY
    • H04ELECTRIC COMMUNICATION TECHNIQUE
    • H04NPICTORIAL COMMUNICATION, e.g. TELEVISION
    • H04N21/00Selective content distribution, e.g. interactive television or video on demand [VOD]
    • H04N21/20Servers specifically adapted for the distribution of content, e.g. VOD servers; Operations thereof
    • H04N21/23Processing of content or additional data; Elementary server operations; Server middleware
    • H04N21/234Processing of video elementary streams, e.g. splicing of video streams or manipulating encoded video stream scene graphs
    • H04N21/2343Processing of video elementary streams, e.g. splicing of video streams or manipulating encoded video stream scene graphs involving reformatting operations of video signals for distribution or compliance with end-user requests or end-user device requirements
    • HELECTRICITY
    • H04ELECTRIC COMMUNICATION TECHNIQUE
    • H04NPICTORIAL COMMUNICATION, e.g. TELEVISION
    • H04N21/00Selective content distribution, e.g. interactive television or video on demand [VOD]
    • H04N21/40Client devices specifically adapted for the reception of or interaction with content, e.g. set-top-box [STB]; Operations thereof
    • H04N21/43Processing of content or additional data, e.g. demultiplexing additional data from a digital video stream; Elementary client operations, e.g. monitoring of home network or synchronising decoder's clock; Client middleware
    • H04N21/435Processing of additional data, e.g. decrypting of additional data, reconstructing software from modules extracted from the transport stream
    • H04N21/4355Processing of additional data, e.g. decrypting of additional data, reconstructing software from modules extracted from the transport stream involving reformatting operations of additional data, e.g. HTML pages on a television screen
    • HELECTRICITY
    • H04ELECTRIC COMMUNICATION TECHNIQUE
    • H04NPICTORIAL COMMUNICATION, e.g. TELEVISION
    • H04N21/00Selective content distribution, e.g. interactive television or video on demand [VOD]
    • H04N21/40Client devices specifically adapted for the reception of or interaction with content, e.g. set-top-box [STB]; Operations thereof
    • H04N21/43Processing of content or additional data, e.g. demultiplexing additional data from a digital video stream; Elementary client operations, e.g. monitoring of home network or synchronising decoder's clock; Client middleware
    • H04N21/44Processing of video elementary streams, e.g. splicing a video clip retrieved from local storage with an incoming video stream or rendering scenes according to encoded video stream scene graphs
    • H04N21/4402Processing of video elementary streams, e.g. splicing a video clip retrieved from local storage with an incoming video stream or rendering scenes according to encoded video stream scene graphs involving reformatting operations of video signals for household redistribution, storage or real-time display

Landscapes

  • Engineering & Computer Science (AREA)
  • Multimedia (AREA)
  • Theoretical Computer Science (AREA)
  • Signal Processing (AREA)
  • Physics & Mathematics (AREA)
  • General Physics & Mathematics (AREA)
  • Evolutionary Computation (AREA)
  • Computer Vision & Pattern Recognition (AREA)
  • General Health & Medical Sciences (AREA)
  • Software Systems (AREA)
  • Computing Systems (AREA)
  • Artificial Intelligence (AREA)
  • Health & Medical Sciences (AREA)
  • Medical Informatics (AREA)
  • Databases & Information Systems (AREA)
  • Life Sciences & Earth Sciences (AREA)
  • Biophysics (AREA)
  • Computational Linguistics (AREA)
  • Data Mining & Analysis (AREA)
  • Molecular Biology (AREA)
  • General Engineering & Computer Science (AREA)
  • Mathematical Physics (AREA)
  • Biomedical Technology (AREA)
  • Compression Or Coding Systems Of Tv Signals (AREA)
KR1020237044890A 2021-06-07 2022-06-03 기계용 비디오 코딩(vcm)을 위한 인코더 및 디코더 Pending KR20240051076A (ko)

Applications Claiming Priority (3)

Application Number Priority Date Filing Date Title
US202163197834P 2021-06-07 2021-06-07
US63/197,834 2021-06-07
PCT/US2022/032048 WO2022260934A1 (en) 2021-06-07 2022-06-03 Encoder and decoder for video coding for machines (vcm)

Publications (1)

Publication Number Publication Date
KR20240051076A true KR20240051076A (ko) 2024-04-19

Family

ID=84425308

Family Applications (1)

Application Number Title Priority Date Filing Date
KR1020237044890A Pending KR20240051076A (ko) 2021-06-07 2022-06-03 기계용 비디오 코딩(vcm)을 위한 인코더 및 디코더

Country Status (6)

Country Link
US (1) US20240107088A1 (https=)
EP (1) EP4352701A4 (https=)
JP (1) JP2024520682A (https=)
KR (1) KR20240051076A (https=)
CN (1) CN117897736A (https=)
WO (1) WO2022260934A1 (https=)

Cited By (1)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
WO2025254466A1 (ko) * 2024-06-05 2025-12-11 한화비전 주식회사 스케일링 파라미터를 이용하는 부호화/복호화 방법, 장치 및 기록 매체

Families Citing this family (5)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
EP4427459A4 (en) * 2021-11-04 2026-01-28 Op Solutions Llc SYSTEMS AND METHODS FOR TRANSFERRING MOTION INFORMATION FROM A VISUAL DOMAIN WITH CHARACTERISTICS AND FINE-FINING MOTION VECTOR CONTROL ON THE DECODER SIDE BASED ON CHARACTERISTICS
CN120419187A (zh) * 2023-01-03 2025-08-01 Lg 电子株式会社 编码/解码方法和装置以及存储比特流的记录介质
US12309404B2 (en) * 2023-03-07 2025-05-20 Disney Enterprises, Inc. Contextual video compression framework with spatial-temporal cross-covariance transformers
CN120917753A (zh) * 2023-04-12 2025-11-07 Lg 电子株式会社 编码/解码方法、设备和存储比特流的记录介质
WO2025221719A1 (en) * 2024-04-15 2025-10-23 Op Solutions, Llc Systems, methods and bitstreams for removing non-essential feature map information in machine-based applications using regions of interest

Family Cites Families (8)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
US20170249739A1 (en) * 2016-02-26 2017-08-31 Biomediq A/S Computer analysis of mammograms
US11019355B2 (en) * 2018-04-03 2021-05-25 Electronics And Telecommunications Research Institute Inter-prediction method and apparatus using reference frame generated based on deep learning
WO2020055279A1 (en) * 2018-09-10 2020-03-19 Huawei Technologies Co., Ltd. Hybrid video and feature coding and decoding
US11410275B2 (en) * 2019-09-23 2022-08-09 Tencent America LLC Video coding for machine (VCM) based system and method for video super resolution (SR)
WO2021095245A1 (ja) * 2019-11-15 2021-05-20 日本電信電話株式会社 画像処理方法、データ処理方法、画像処理装置、およびプログラム
WO2021172956A1 (ko) * 2020-02-28 2021-09-02 엘지전자 주식회사 영상 특징 정보 시그널링을 위한 영상 부호화/복호화 방법, 장치 및 비트스트림을 전송하는 방법
EP4136848A4 (en) * 2020-04-16 2024-04-03 INTEL Corporation PATCH-BASED VIDEO CODING FOR MACHINES
US11516478B2 (en) * 2020-12-30 2022-11-29 Hyundai Motor Company Method and apparatus for coding machine vision data using prediction

Cited By (1)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
WO2025254466A1 (ko) * 2024-06-05 2025-12-11 한화비전 주식회사 스케일링 파라미터를 이용하는 부호화/복호화 방법, 장치 및 기록 매체

Also Published As

Publication number Publication date
CN117897736A (zh) 2024-04-16
WO2022260934A1 (en) 2022-12-15
JP2024520682A (ja) 2024-05-24
EP4352701A4 (en) 2025-04-09
EP4352701A1 (en) 2024-04-17
US20240107088A1 (en) 2024-03-28

Similar Documents

Publication Publication Date Title
US20240107088A1 (en) Encoder and decoder for video coding for machines (vcm)
US20240357142A1 (en) Video and feature coding for multi-task machine learning
US20240338486A1 (en) Systems and methods for privacy protection in video communication systems
US20240283942A1 (en) Systems and methods for object and event detection and feature-based rate-distortion optimization for video coding
EP4388456A1 (en) Systems and methods for joint optimization training and encoder side downsampling
US20240267531A1 (en) Systems and methods for optimizing a loss function for video coding for machines
US20240340391A1 (en) Intelligent multi-stream video coding for video surveillance
US20240236342A1 (en) Systems and methods for scalable video coding for machines
US20240357107A1 (en) Systems and methods for video coding of features using subpictures
US20240291999A1 (en) Systems and methods for motion information transfer from visual to feature domain and feature-based decoder-side motion vector refinement control
CN118414829A (zh) 用于对象和事件检测以及用于视频编码的基于特征的率失真优化的系统和方法
CN118614062A (zh) 用于从视觉到特征域的运动信息传递的系统和方法
CN118119951A (zh) 用于联合优化训练和编码器侧下采样的系统和方法
WO2025007083A1 (en) Systems and method for decoded frame augmentation for video coding for machines
WO2025015116A2 (en) Systems and method for decoded frame augmentation for video coding for machines with dct filtering for map
CN118742904A (zh) 用于多任务机器学习的视频和特征编码

Legal Events

Date Code Title Description
PA0105 International application

Patent event date: 20231226

Patent event code: PA01051R01D

Comment text: International Patent Application

PG1501 Laying open of application
A201 Request for examination
PA0201 Request for examination

Patent event code: PA02012R01D

Patent event date: 20250515

Comment text: Request for Examination of Application