CN118872268A - 用于对张量进行编码和解码的方法、设备和系统 - Google Patents

用于对张量进行编码和解码的方法、设备和系统 Download PDF

Info

Publication number
CN118872268A
CN118872268A CN202380027301.0A CN202380027301A CN118872268A CN 118872268 A CN118872268 A CN 118872268A CN 202380027301 A CN202380027301 A CN 202380027301A CN 118872268 A CN118872268 A CN 118872268A
Authority
CN
China
Prior art keywords
tensor
bitstream
quantized
mean feature
coefficients
Prior art date
Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
Pending
Application number
CN202380027301.0A
Other languages
English (en)
Chinese (zh)
Inventor
克里斯托弗·詹姆斯·罗斯沃恩
陈红绒·阮
Current Assignee (The listed assignees may be inaccurate. Google has not performed a legal analysis and makes no representation or warranty as to the accuracy of the list.)
Canon Inc
Original Assignee
Canon Inc
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
Filing date
Publication date
Application filed by Canon Inc filed Critical Canon Inc
Publication of CN118872268A publication Critical patent/CN118872268A/zh
Pending legal-status Critical Current

Links

Classifications

    • HELECTRICITY
    • H04ELECTRIC COMMUNICATION TECHNIQUE
    • H04NPICTORIAL COMMUNICATION, e.g. TELEVISION
    • H04N19/00Methods or arrangements for coding, decoding, compressing or decompressing digital video signals
    • H04N19/10Methods or arrangements for coding, decoding, compressing or decompressing digital video signals using adaptive coding
    • H04N19/169Methods or arrangements for coding, decoding, compressing or decompressing digital video signals using adaptive coding characterised by the coding unit, i.e. the structural portion or semantic portion of the video signal being the object or the subject of the adaptive coding
    • HELECTRICITY
    • H04ELECTRIC COMMUNICATION TECHNIQUE
    • H04NPICTORIAL COMMUNICATION, e.g. TELEVISION
    • H04N19/00Methods or arrangements for coding, decoding, compressing or decompressing digital video signals
    • H04N19/10Methods or arrangements for coding, decoding, compressing or decompressing digital video signals using adaptive coding
    • H04N19/102Methods or arrangements for coding, decoding, compressing or decompressing digital video signals using adaptive coding characterised by the element, parameter or selection affected or controlled by the adaptive coding
    • H04N19/124Quantisation
    • GPHYSICS
    • G06COMPUTING OR CALCULATING; COUNTING
    • G06NCOMPUTING ARRANGEMENTS BASED ON SPECIFIC COMPUTATIONAL MODELS
    • G06N20/00Machine learning
    • G06N20/10Machine learning using kernel methods, e.g. support vector machines [SVM]
    • GPHYSICS
    • G06COMPUTING OR CALCULATING; COUNTING
    • G06NCOMPUTING ARRANGEMENTS BASED ON SPECIFIC COMPUTATIONAL MODELS
    • G06N3/00Computing arrangements based on biological models
    • G06N3/02Neural networks
    • G06N3/04Architecture, e.g. interconnection topology
    • G06N3/0464Convolutional networks [CNN, ConvNet]
    • HELECTRICITY
    • H04ELECTRIC COMMUNICATION TECHNIQUE
    • H04NPICTORIAL COMMUNICATION, e.g. TELEVISION
    • H04N19/00Methods or arrangements for coding, decoding, compressing or decompressing digital video signals
    • H04N19/10Methods or arrangements for coding, decoding, compressing or decompressing digital video signals using adaptive coding
    • H04N19/169Methods or arrangements for coding, decoding, compressing or decompressing digital video signals using adaptive coding characterised by the coding unit, i.e. the structural portion or semantic portion of the video signal being the object or the subject of the adaptive coding
    • H04N19/17Methods or arrangements for coding, decoding, compressing or decompressing digital video signals using adaptive coding characterised by the coding unit, i.e. the structural portion or semantic portion of the video signal being the object or the subject of the adaptive coding the unit being an image region, e.g. an object
    • H04N19/172Methods or arrangements for coding, decoding, compressing or decompressing digital video signals using adaptive coding characterised by the coding unit, i.e. the structural portion or semantic portion of the video signal being the object or the subject of the adaptive coding the unit being an image region, e.g. an object the region being a picture, frame or field
    • HELECTRICITY
    • H04ELECTRIC COMMUNICATION TECHNIQUE
    • H04NPICTORIAL COMMUNICATION, e.g. TELEVISION
    • H04N19/00Methods or arrangements for coding, decoding, compressing or decompressing digital video signals
    • H04N19/10Methods or arrangements for coding, decoding, compressing or decompressing digital video signals using adaptive coding
    • H04N19/169Methods or arrangements for coding, decoding, compressing or decompressing digital video signals using adaptive coding characterised by the coding unit, i.e. the structural portion or semantic portion of the video signal being the object or the subject of the adaptive coding
    • H04N19/18Methods or arrangements for coding, decoding, compressing or decompressing digital video signals using adaptive coding characterised by the coding unit, i.e. the structural portion or semantic portion of the video signal being the object or the subject of the adaptive coding the unit being a set of transform coefficients
    • HELECTRICITY
    • H04ELECTRIC COMMUNICATION TECHNIQUE
    • H04NPICTORIAL COMMUNICATION, e.g. TELEVISION
    • H04N19/00Methods or arrangements for coding, decoding, compressing or decompressing digital video signals
    • H04N19/10Methods or arrangements for coding, decoding, compressing or decompressing digital video signals using adaptive coding
    • H04N19/169Methods or arrangements for coding, decoding, compressing or decompressing digital video signals using adaptive coding characterised by the coding unit, i.e. the structural portion or semantic portion of the video signal being the object or the subject of the adaptive coding
    • H04N19/184Methods or arrangements for coding, decoding, compressing or decompressing digital video signals using adaptive coding characterised by the coding unit, i.e. the structural portion or semantic portion of the video signal being the object or the subject of the adaptive coding the unit being bits, e.g. of the compressed video stream
    • HELECTRICITY
    • H04ELECTRIC COMMUNICATION TECHNIQUE
    • H04NPICTORIAL COMMUNICATION, e.g. TELEVISION
    • H04N19/00Methods or arrangements for coding, decoding, compressing or decompressing digital video signals
    • H04N19/46Embedding additional information in the video signal during the compression process
    • HELECTRICITY
    • H04ELECTRIC COMMUNICATION TECHNIQUE
    • H04NPICTORIAL COMMUNICATION, e.g. TELEVISION
    • H04N19/00Methods or arrangements for coding, decoding, compressing or decompressing digital video signals
    • H04N19/50Methods or arrangements for coding, decoding, compressing or decompressing digital video signals using predictive coding
    • HELECTRICITY
    • H04ELECTRIC COMMUNICATION TECHNIQUE
    • H04NPICTORIAL COMMUNICATION, e.g. TELEVISION
    • H04N19/00Methods or arrangements for coding, decoding, compressing or decompressing digital video signals
    • H04N19/90Methods or arrangements for coding, decoding, compressing or decompressing digital video signals using coding techniques not provided for in groups H04N19/10-H04N19/85, e.g. fractals
    • H04N19/91Entropy coding, e.g. variable length coding [VLC] or arithmetic coding
    • HELECTRICITY
    • H04ELECTRIC COMMUNICATION TECHNIQUE
    • H04NPICTORIAL COMMUNICATION, e.g. TELEVISION
    • H04N19/00Methods or arrangements for coding, decoding, compressing or decompressing digital video signals
    • H04N19/90Methods or arrangements for coding, decoding, compressing or decompressing digital video signals using coding techniques not provided for in groups H04N19/10-H04N19/85, e.g. fractals
    • H04N19/96Tree coding, e.g. quad-tree coding
    • GPHYSICS
    • G06COMPUTING OR CALCULATING; COUNTING
    • G06NCOMPUTING ARRANGEMENTS BASED ON SPECIFIC COMPUTATIONAL MODELS
    • G06N3/00Computing arrangements based on biological models
    • G06N3/02Neural networks
    • G06N3/08Learning methods
    • G06N3/09Supervised learning
    • HELECTRICITY
    • H04ELECTRIC COMMUNICATION TECHNIQUE
    • H04NPICTORIAL COMMUNICATION, e.g. TELEVISION
    • H04N19/00Methods or arrangements for coding, decoding, compressing or decompressing digital video signals
    • H04N19/10Methods or arrangements for coding, decoding, compressing or decompressing digital video signals using adaptive coding
    • H04N19/134Methods or arrangements for coding, decoding, compressing or decompressing digital video signals using adaptive coding characterised by the element, parameter or criterion affecting or controlling the adaptive coding
    • H04N19/157Assigned coding mode, i.e. the coding mode being predefined or preselected to be further used for selection of another element or parameter
    • H04N19/159Prediction type, e.g. intra-frame, inter-frame or bidirectional frame prediction

Landscapes

  • Engineering & Computer Science (AREA)
  • Multimedia (AREA)
  • Signal Processing (AREA)
  • Theoretical Computer Science (AREA)
  • Physics & Mathematics (AREA)
  • Software Systems (AREA)
  • General Physics & Mathematics (AREA)
  • Mathematical Physics (AREA)
  • Evolutionary Computation (AREA)
  • Data Mining & Analysis (AREA)
  • Computing Systems (AREA)
  • General Engineering & Computer Science (AREA)
  • Artificial Intelligence (AREA)
  • Health & Medical Sciences (AREA)
  • Life Sciences & Earth Sciences (AREA)
  • Biomedical Technology (AREA)
  • Biophysics (AREA)
  • Computational Linguistics (AREA)
  • General Health & Medical Sciences (AREA)
  • Molecular Biology (AREA)
  • Computer Vision & Pattern Recognition (AREA)
  • Medical Informatics (AREA)
  • Compression Or Coding Systems Of Tv Signals (AREA)
  • Compression, Expansion, Code Conversion, And Decoders (AREA)
  • Reduction Or Emphasis Of Bandwidth Of Signals (AREA)
CN202380027301.0A 2022-04-13 2023-04-11 用于对张量进行编码和解码的方法、设备和系统 Pending CN118872268A (zh)

Applications Claiming Priority (3)

Application Number Priority Date Filing Date Title
AU2022202471A AU2022202471A1 (en) 2022-04-13 2022-04-13 Method, apparatus and system for encoding and decoding a tensor
AU2022202471 2022-04-13
PCT/AU2023/050291 WO2023197031A1 (en) 2022-04-13 2023-04-11 Method, apparatus and system for encoding and decoding a tensor

Publications (1)

Publication Number Publication Date
CN118872268A true CN118872268A (zh) 2024-10-29

Family

ID=88328498

Family Applications (1)

Application Number Title Priority Date Filing Date
CN202380027301.0A Pending CN118872268A (zh) 2022-04-13 2023-04-11 用于对张量进行编码和解码的方法、设备和系统

Country Status (5)

Country Link
US (1) US12556703B2 (https=)
JP (1) JP2025511538A (https=)
CN (1) CN118872268A (https=)
AU (1) AU2022202471A1 (https=)
WO (1) WO2023197031A1 (https=)

Families Citing this family (3)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
US20240169708A1 (en) * 2022-11-10 2024-05-23 Qualcomm Incorporated Processing video data using delta quantization
WO2025146073A1 (en) * 2024-01-03 2025-07-10 Douyin Vision Co., Ltd. Method, apparatus, and medium for visual data processing
US20260006206A1 (en) * 2024-07-01 2026-01-01 Interdigital Vc Holdings, Inc. Refinement of reconstructed feature tensors

Family Cites Families (8)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
JP3413720B2 (ja) * 1998-06-26 2003-06-09 ソニー株式会社 画像符号化方法及び装置、並びに画像復号方法及び装置
US20220261616A1 (en) 2019-07-02 2022-08-18 Vid Scale, Inc. Clustering-based quantization for neural network compression
US11113584B2 (en) * 2020-02-04 2021-09-07 Nio Usa, Inc. Single frame 4D detection using deep fusion of camera image, imaging RADAR and LiDAR point cloud
WO2021254855A1 (en) 2020-06-17 2021-12-23 Interdigital Vc Holdings France, Sas Systems and methods for encoding/decoding a deep neural network
US11367271B2 (en) * 2020-06-19 2022-06-21 Adobe Inc. Similarity propagation for one-shot and few-shot image segmentation
US12190548B2 (en) * 2020-09-23 2025-01-07 Electronics And Telecommunications Research Institute Method, apparatus, system and computer-readable recording medium for feature information
US11665363B2 (en) * 2020-11-26 2023-05-30 Electronics And Telecommunications Research Institute Method, apparatus, system and computer-readable recording medium for feature map information
JP2025016813A (ja) * 2021-12-17 2025-02-05 シャープ株式会社 動画像符号化装置および動画像復号装置

Also Published As

Publication number Publication date
JP2025511538A (ja) 2025-04-16
US20250254307A1 (en) 2025-08-07
WO2023197031A1 (en) 2023-10-19
US12556703B2 (en) 2026-02-17
AU2022202471A1 (en) 2023-11-02

Similar Documents

Publication Publication Date Title
CN117546176A (zh) 特征图编码与常规视频编码的工具选择
AU2022202473B2 (en) Method, apparatus and system for encoding and decoding a tensor
US12556703B2 (en) Method, apparatus and system for encoding and decoding a tensor
CN118476215A (zh) 对视频样本块进行编码和解码的方法、设备和系统
AU2025201260A1 (en) Method, apparatus and system for encoding and decoding a tensor
JP2025533723A (ja) テンソルを符号化および復号するための方法、装置、およびシステム
CN118872269A (zh) 用于对张量进行编码和解码的方法、设备和系统
US20260105643A1 (en) Method, apparatus and system for encoding and decoding a tensor
CN120917743A (zh) 用于对张量进行编码和解码的方法、设备和系统
WO2024211956A1 (en) Method, apparatus and system for encoding and decoding a tensor
JP2025533727A (ja) テンソルを符号化および復号するための方法、装置、およびシステム
AU2022202474A1 (en) Method, apparatus and system for encoding and decoding a tensor
AU2022202472A1 (en) Method, apparatus and system for encoding and decoding a tensor
CN120476602A (zh) 用于对张量进行编码和解码的方法、设备和系统
CN120476603A (zh) 用于对张量进行编码和解码的方法、设备和系统
WO2025208169A1 (en) Method, apparatus and system for encoding and decoding a plurality of tensors
CN121040074A (zh) 用于对位流的内容的张量进行解码的方法、设备和系统
WO2025213210A1 (en) Method, apparatus and system for encoding and decoding a plurality of tensors

Legal Events

Date Code Title Description
PB01 Publication
PB01 Publication
SE01 Entry into force of request for substantive examination
SE01 Entry into force of request for substantive examination