BR112023023427A2 - Implicit image and video compression using machine learning systems - Google Patents

Implicit image and video compression using machine learning systems

Info

Publication number
BR112023023427A2
Authority
BR
Brazil
Prior art keywords
machine learning
learning systems
video compression
image
implied image
Prior art date
Application number
BR112023023427A
Other languages
English (en)
Inventor
Hinrich BREHMER Johann
Markus Nagel
Sebastiaan Cohen Taco
Jehan VAN ROZENDAAL Ties
Yunfan Zhang
Original Assignee
Qualcomm Inc
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
2021-05-21
Filing date
2022-03-31
Publication date
2024-01-30
Application filed by Qualcomm Inc
Publication of BR112023023427A2

Classifications

    • H ELECTRICITY
    • H04 ELECTRIC COMMUNICATION TECHNIQUE
    • H04N PICTORIAL COMMUNICATION, e.g. TELEVISION
    • H04N19/00 Methods or arrangements for coding, decoding, compressing or decompressing digital video signals
    • H04N19/50 Methods or arrangements for coding, decoding, compressing or decompressing digital video signals using predictive coding
    • H04N19/503 Methods or arrangements for coding, decoding, compressing or decompressing digital video signals using predictive coding involving temporal prediction
    • H04N19/51 Motion estimation or motion compensation
    • H ELECTRICITY
    • H04 ELECTRIC COMMUNICATION TECHNIQUE
    • H04N PICTORIAL COMMUNICATION, e.g. TELEVISION
    • H04N19/00 Methods or arrangements for coding, decoding, compressing or decompressing digital video signals
    • H04N19/10 Methods or arrangements for coding, decoding, compressing or decompressing digital video signals using adaptive coding
    • H04N19/102 Methods or arrangements for coding, decoding, compressing or decompressing digital video signals using adaptive coding characterised by the element, parameter or selection affected or controlled by the adaptive coding
    • H04N19/124 Quantisation
    • H04N19/126 Details of normalisation or weighting functions, e.g. normalisation matrices or variable uniform quantisers
    • H ELECTRICITY
    • H04 ELECTRIC COMMUNICATION TECHNIQUE
    • H04N PICTORIAL COMMUNICATION, e.g. TELEVISION
    • H04N19/00 Methods or arrangements for coding, decoding, compressing or decompressing digital video signals
    • H04N19/10 Methods or arrangements for coding, decoding, compressing or decompressing digital video signals using adaptive coding
    • H04N19/102 Methods or arrangements for coding, decoding, compressing or decompressing digital video signals using adaptive coding characterised by the element, parameter or selection affected or controlled by the adaptive coding
    • H04N19/124 Quantisation
    • G PHYSICS
    • G06 COMPUTING; CALCULATING OR COUNTING
    • G06N COMPUTING ARRANGEMENTS BASED ON SPECIFIC COMPUTATIONAL MODELS
    • G06N3/00 Computing arrangements based on biological models
    • G06N3/02 Neural networks
    • G06N3/04 Architecture, e.g. interconnection topology
    • G06N3/045 Combinations of networks
    • G PHYSICS
    • G06 COMPUTING; CALCULATING OR COUNTING
    • G06N COMPUTING ARRANGEMENTS BASED ON SPECIFIC COMPUTATIONAL MODELS
    • G06N3/00 Computing arrangements based on biological models
    • G06N3/02 Neural networks
    • G06N3/04 Architecture, e.g. interconnection topology
    • G06N3/0464 Convolutional networks [CNN, ConvNet]
    • G PHYSICS
    • G06 COMPUTING; CALCULATING OR COUNTING
    • G06N COMPUTING ARRANGEMENTS BASED ON SPECIFIC COMPUTATIONAL MODELS
    • G06N3/00 Computing arrangements based on biological models
    • G06N3/02 Neural networks
    • G06N3/08 Learning methods
    • G PHYSICS
    • G06 COMPUTING; CALCULATING OR COUNTING
    • G06N COMPUTING ARRANGEMENTS BASED ON SPECIFIC COMPUTATIONAL MODELS
    • G06N3/00 Computing arrangements based on biological models
    • G06N3/02 Neural networks
    • G06N3/08 Learning methods
    • G06N3/084 Backpropagation, e.g. using gradient descent
    • G PHYSICS
    • G06 COMPUTING; CALCULATING OR COUNTING
    • G06T IMAGE DATA PROCESSING OR GENERATION, IN GENERAL
    • G06T9/00 Image coding
    • G06T9/002 Image coding using neural networks
    • H ELECTRICITY
    • H04 ELECTRIC COMMUNICATION TECHNIQUE
    • H04N PICTORIAL COMMUNICATION, e.g. TELEVISION
    • H04N19/00 Methods or arrangements for coding, decoding, compressing or decompressing digital video signals
    • H04N19/10 Methods or arrangements for coding, decoding, compressing or decompressing digital video signals using adaptive coding
    • H04N19/102 Methods or arrangements for coding, decoding, compressing or decompressing digital video signals using adaptive coding characterised by the element, parameter or selection affected or controlled by the adaptive coding
    • H04N19/103 Selection of coding mode or of prediction mode
    • H ELECTRICITY
    • H04 ELECTRIC COMMUNICATION TECHNIQUE
    • H04N PICTORIAL COMMUNICATION, e.g. TELEVISION
    • H04N19/00 Methods or arrangements for coding, decoding, compressing or decompressing digital video signals
    • H04N19/10 Methods or arrangements for coding, decoding, compressing or decompressing digital video signals using adaptive coding
    • H04N19/134 Methods or arrangements for coding, decoding, compressing or decompressing digital video signals using adaptive coding characterised by the element, parameter or criterion affecting or controlling the adaptive coding
    • H04N19/146 Data rate or code amount at the encoder output
    • H ELECTRICITY
    • H04 ELECTRIC COMMUNICATION TECHNIQUE
    • H04N PICTORIAL COMMUNICATION, e.g. TELEVISION
    • H04N19/00 Methods or arrangements for coding, decoding, compressing or decompressing digital video signals
    • H04N19/10 Methods or arrangements for coding, decoding, compressing or decompressing digital video signals using adaptive coding
    • H04N19/134 Methods or arrangements for coding, decoding, compressing or decompressing digital video signals using adaptive coding characterised by the element, parameter or criterion affecting or controlling the adaptive coding
    • H04N19/146 Data rate or code amount at the encoder output
    • H04N19/147 Data rate or code amount at the encoder output according to rate distortion criteria
    • H ELECTRICITY
    • H04 ELECTRIC COMMUNICATION TECHNIQUE
    • H04N PICTORIAL COMMUNICATION, e.g. TELEVISION
    • H04N19/00 Methods or arrangements for coding, decoding, compressing or decompressing digital video signals
    • H04N19/10 Methods or arrangements for coding, decoding, compressing or decompressing digital video signals using adaptive coding
    • H04N19/169 Methods or arrangements for coding, decoding, compressing or decompressing digital video signals using adaptive coding characterised by the coding unit, i.e. the structural portion or semantic portion of the video signal being the object or the subject of the adaptive coding
    • H04N19/17 Methods or arrangements for coding, decoding, compressing or decompressing digital video signals using adaptive coding characterised by the coding unit, i.e. the structural portion or semantic portion of the video signal being the object or the subject of the adaptive coding the unit being an image region, e.g. an object
    • H04N19/172 Methods or arrangements for coding, decoding, compressing or decompressing digital video signals using adaptive coding characterised by the coding unit, i.e. the structural portion or semantic portion of the video signal being the object or the subject of the adaptive coding the unit being an image region, e.g. an object the region being a picture, frame or field
    • H ELECTRICITY
    • H04 ELECTRIC COMMUNICATION TECHNIQUE
    • H04N PICTORIAL COMMUNICATION, e.g. TELEVISION
    • H04N19/00 Methods or arrangements for coding, decoding, compressing or decompressing digital video signals
    • H04N19/46 Embedding additional information in the video signal during the compression process
    • H04N19/463 Embedding additional information in the video signal during the compression process by compressing encoding parameters before transmission
    • H ELECTRICITY
    • H04 ELECTRIC COMMUNICATION TECHNIQUE
    • H04N PICTORIAL COMMUNICATION, e.g. TELEVISION
    • H04N19/00 Methods or arrangements for coding, decoding, compressing or decompressing digital video signals
    • H04N19/50 Methods or arrangements for coding, decoding, compressing or decompressing digital video signals using predictive coding
    • H04N19/503 Methods or arrangements for coding, decoding, compressing or decompressing digital video signals using predictive coding involving temporal prediction
    • H04N19/51 Motion estimation or motion compensation
    • H04N19/537 Motion estimation other than block-based
    • H ELECTRICITY
    • H04 ELECTRIC COMMUNICATION TECHNIQUE
    • H04N PICTORIAL COMMUNICATION, e.g. TELEVISION
    • H04N19/00 Methods or arrangements for coding, decoding, compressing or decompressing digital video signals
    • H04N19/90 Methods or arrangements for coding, decoding, compressing or decompressing digital video signals using coding techniques not provided for in groups H04N19/10-H04N19/85, e.g. fractals
    • H04N19/91 Entropy coding, e.g. variable length coding [VLC] or arithmetic coding
    • H ELECTRICITY
    • H04 ELECTRIC COMMUNICATION TECHNIQUE
    • H04N PICTORIAL COMMUNICATION, e.g. TELEVISION
    • H04N19/00 Methods or arrangements for coding, decoding, compressing or decompressing digital video signals
    • H04N19/10 Methods or arrangements for coding, decoding, compressing or decompressing digital video signals using adaptive coding
    • H04N19/169 Methods or arrangements for coding, decoding, compressing or decompressing digital video signals using adaptive coding characterised by the coding unit, i.e. the structural portion or semantic portion of the video signal being the object or the subject of the adaptive coding
    • H04N19/177 Methods or arrangements for coding, decoding, compressing or decompressing digital video signals using adaptive coding characterised by the coding unit, i.e. the structural portion or semantic portion of the video signal being the object or the subject of the adaptive coding the unit being a group of pictures [GOP]

Landscapes

  • Engineering & Computer Science (AREA)
  • Multimedia (AREA)
  • Signal Processing (AREA)
  • Physics & Mathematics (AREA)
  • Theoretical Computer Science (AREA)
  • Evolutionary Computation (AREA)
  • General Physics & Mathematics (AREA)
  • Artificial Intelligence (AREA)
  • General Health & Medical Sciences (AREA)
  • Mathematical Physics (AREA)
  • Data Mining & Analysis (AREA)
  • Biophysics (AREA)
  • Biomedical Technology (AREA)
  • Molecular Biology (AREA)
  • Computing Systems (AREA)
  • General Engineering & Computer Science (AREA)
  • Life Sciences & Earth Sciences (AREA)
  • Computational Linguistics (AREA)
  • Software Systems (AREA)
  • Health & Medical Sciences (AREA)
  • Compression Or Coding Systems Of Tv Signals (AREA)
  • Image Analysis (AREA)
  • Compression, Expansion, Code Conversion, And Decoders (AREA)
  • Image Processing (AREA)
  • Compression Of Band Width Or Redundancy In Fax (AREA)

Abstract

Implicit image and video compression using machine learning systems. Techniques are described for compressing and decompressing data using machine learning systems. An example process can include receiving a plurality of images for compression by a neural network compression system. The process can include determining, based on a first image from the plurality of images, a first plurality of weight values associated with a first model of the neural network compression system. The process can include generating a first bitstream comprising a compressed version of the first plurality of weight values. The process can include outputting the first bitstream for transmission to a receiver.
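The process in the abstract (fit model weights to one image, compress the weights, send them as the bitstream) can be illustrated with a toy sketch. Everything in it is an assumption made for illustration and is not taken from the patent: a linear model over fixed cosine features stands in for the neural network, plain gradient descent stands in for training, 8-bit uniform quantization plus `zlib` stands in for the weight compression and entropy coding, and the names `features`, `fit_weights`, `encode`, and `decode` are hypothetical.

```python
import math
import struct
import zlib

N = 8      # toy image is N x N pixels (assumption)
FREQS = 4  # the stand-in model has FREQS * FREQS weights (assumption)


def features(x, y):
    """Fixed cosine features of a pixel coordinate; the model input."""
    return [
        math.cos(math.pi * a * (x + 0.5) / N) * math.cos(math.pi * b * (y + 0.5) / N)
        for a in range(FREQS)
        for b in range(FREQS)
    ]


def predict(weights, x, y):
    """Model output (pixel intensity) at coordinate (x, y)."""
    return sum(w * f for w, f in zip(weights, features(x, y)))


def fit_weights(image, steps=200, lr=0.5):
    """Overfit the model to a single image with full-batch gradient descent on MSE."""
    coords = [(x, y) for y in range(N) for x in range(N)]
    feats = [features(x, y) for x, y in coords]
    weights = [0.0] * (FREQS * FREQS)
    for _ in range(steps):
        grad = [0.0] * len(weights)
        for (x, y), f in zip(coords, feats):
            err = sum(w * v for w, v in zip(weights, f)) - image[y][x]
            for k, v in enumerate(f):
                grad[k] += 2.0 * err * v / len(coords)
        weights = [w - lr * g for w, g in zip(weights, grad)]
    return weights


def encode(image):
    """Image -> bitstream holding a compressed version of the fitted weights."""
    weights = fit_weights(image)
    scale = max(abs(w) for w in weights) / 127.0 or 1.0  # avoid a zero scale
    quantized = [round(w / scale) for w in weights]      # signed 8-bit integers
    payload = struct.pack(f"<f{len(quantized)}b", scale, *quantized)
    return zlib.compress(payload, 9)  # generic compressor as an entropy-coding stand-in


def decode(bitstream):
    """Receiver side: recover the weights, then evaluate the model on the pixel grid."""
    payload = zlib.decompress(bitstream)
    scale, *quantized = struct.unpack(f"<f{FREQS * FREQS}b", payload)
    weights = [scale * q for q in quantized]
    return [[predict(weights, x, y) for x in range(N)] for y in range(N)]


if __name__ == "__main__":
    image = [[(x + y) / 14.0 for x in range(N)] for y in range(N)]  # smooth ramp
    bitstream = encode(image)
    recon = decode(bitstream)
    worst = max(abs(recon[y][x] - image[y][x]) for y in range(N) for x in range(N))
    print(len(bitstream), "bytes sent; max reconstruction error", worst)
```

The receiver never sees pixel data in this sketch; it rebuilds the image by evaluating the model at each coordinate, which is the sense in which the representation is implicit. How the actual system chooses its model, quantizer, and entropy coder is not stated in this record.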
BR112023023427A 2021-05-21 2022-03-31 Implicit image and video compression using machine learning systems BR112023023427A2 (pt)

Applications Claiming Priority (3)

Application Number Priority Date Filing Date Title
US202163191606P 2021-05-21 2021-05-21
US17/645,018 US20220385907A1 (en) 2021-05-21 2021-12-17 Implicit image and video compression using machine learning systems
PCT/US2022/022881 WO2022245434A1 (en) 2021-05-21 2022-03-31 Implicit image and video compression using machine learning systems

Publications (1)

Publication Number Publication Date
BR112023023427A2 (pt) 2024-01-30

Family

ID=81392695

Family Applications (1)

Application Number Title Priority Date Filing Date
BR112023023427A BR112023023427A2 (pt) 2021-05-21 2022-03-31 Implicit image and video compression using machine learning systems

Country Status (7)

Country Link
US (1) US20220385907A1 (pt)
EP (1) EP4342178A1 (pt)
JP (1) JP2024519791A (pt)
KR (1) KR20240012374A (pt)
BR (1) BR112023023427A2 (pt)
TW (1) TW202247650A (pt)
WO (1) WO2022245434A1 (pt)

Families Citing this family (13)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
US11922314B1 (en) * 2018-11-30 2024-03-05 Ansys, Inc. Systems and methods for building dynamic reduced order physical models
US11810225B2 (en) * 2021-03-30 2023-11-07 Zoox, Inc. Top-down scene generation
US11858514B2 (en) 2021-03-30 2024-01-02 Zoox, Inc. Top-down scene discrimination
US20220335655A1 (en) * 2021-04-19 2022-10-20 Tencent America LLC Substitutional input optimization for adaptive neural image compression with smooth quality control
US20230013421A1 (en) * 2021-07-14 2023-01-19 Sony Group Corporation Point cloud compression using occupancy networks
WO2023069699A1 (en) * 2021-10-21 2023-04-27 Visa International Service Association Method, system, and computer program product for embedding compression and regularization
US11743552B1 (en) * 2022-06-03 2023-08-29 International Business Machines Corporation Computer technology for enhancing images with a generative adversarial network
US11689601B1 (en) * 2022-06-17 2023-06-27 International Business Machines Corporation Stream quality enhancement
EP4390774A1 (en) * 2022-12-21 2024-06-26 Fondation B-COM Method and device for decoding a bitstream
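CN115834890B (zh) * 2023-02-08 2023-04-28 哈尔滨工业大学(深圳)(哈尔滨工业大学深圳科技创新研究院) Image compression method, apparatus, device, and storage medium
CN116862019B (zh) * 2023-07-06 2024-03-19 清华大学 Model training method and apparatus based on the data-parallel paradigm
CN117541791B (zh) * 2023-11-23 2024-05-28 北京师范大学 Eye structure segmentation method, system, and device based on multi-domain deformable convolution
CN117495741B (zh) * 2023-12-29 2024-04-12 成都货安计量技术中心有限公司 Distortion restoration method based on large-convolution contrastive learning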

Family Cites Families (10)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
GB201603144D0 (en) * 2016-02-23 2016-04-06 Magic Pony Technology Ltd Training end-to-end video processes
WO2017036370A1 (en) * 2015-09-03 2017-03-09 Mediatek Inc. Method and apparatus of neural network based processing in video coding
EP3451293A1 (en) * 2017-08-28 2019-03-06 Thomson Licensing Method and apparatus for filtering with multi-branch deep learning
TWI744827B (zh) * 2019-03-18 2021-11-01 弗勞恩霍夫爾協會 Method and apparatus for compressing neural network parameters
EP3716158A3 (en) * 2019-03-25 2020-11-25 Nokia Technologies Oy Compressing weight updates for decoder-side neural networks
EP3792821A1 (en) * 2019-09-11 2021-03-17 Naver Corporation Action recognition using implicit pose representations
US20220261616A1 (en) * 2019-07-02 2022-08-18 Vid Scale, Inc. Clustering-based quantization for neural network compression
EP4144087A1 (en) * 2020-04-29 2023-03-08 Deep Render Ltd Image compression and decoding, video compression and decoding: methods and systems
WO2022017848A1 (en) * 2020-07-21 2022-01-27 Interdigital Vc Holdings France, Sas A method and an apparatus for updating a deep neural network-based image or video decoder
KR20230072487A (ko) * 2020-12-24 2023-05-24 후아웨이 테크놀러지 컴퍼니 리미티드 Decoding with signaling of segmentation information

Also Published As

Publication number Publication date
WO2022245434A1 (en) 2022-11-24
US20220385907A1 (en) 2022-12-01
JP2024519791A (ja) 2024-05-21
KR20240012374A (ko) 2024-01-29
EP4342178A1 (en) 2024-03-27
TW202247650A (zh) 2022-12-01

Similar Documents

Publication Publication Date Title
BR112023023427A2 (pt) Implicit image and video compression using machine learning systems
BR112019006979A2 (pt) Sequence-to-sequence transformations for speech synthesis via recurrent neural networks
WO2018155986A3 (ko) Video signal processing method and apparatus
BR112023014810A2 (pt) Machine learning based rate-distortion optimizer for video compression
EP4319172A3 (en) Trusted contextual content
BR112022016793A2 (pt) Video compression using recurrent-based machine learning systems
BR112017024275A2 (pt) Search region determination for inter coding within a particular picture of video data
GB2600327A (en) Techniques for classification with neural networks
BR112021018450A8 (pt) Rate control for a video encoder
US20150326632A1 (en) Methods and systems to facilitate synchronization of multiple media streams
BR112021026284A2 (pt) Apparatus for decoding or encoding a predetermined block of a picture, methods, and data stream
BR112022023315A2 (pt) Apparatus and method for receiving a video data stream, video data stream, video encoder and decoder, and method for encoding a video into a video data stream
BR112022013557A2 (pt) System and method for multicast/broadcast service data
BR112022012807A2 (pt) Video processing method, apparatus for processing video data, and non-transitory computer-readable media
MY197897A (en) Apparatus for splitting portion of picture into coding units, apparatus for encoding image of video sequence, apparatus for decoding image of video sequence, method for splitting portion of image into coding units and computer readable medium
US10848840B2 (en) Communication apparatus and signal relay method
BR112022001279A2 (pt) Video processing method, apparatus in a video system, and computer program product
MX2019003123A (es) System and method for high-fidelity motion data compression for transmission over a limited-bandwidth network
WO2018224839A3 (en) METHODS AND SYSTEMS FOR REACTION VIDEO GENERATION
SE1850827A1 (en) Method and apparatus for training a neural network classifier to classify an image depicting one or more objects of a biological sample
BR112022019663A2 (pt) Signaling number of subblock merge candidates in video coding
BR112022011316A2 (pt) Methods for improved operative surgical report generation using machine learning and associated devices
BR112022002204A2 (pt) Adaptive resolution management prediction rescaling
US10943168B2 (en) System and method for determining an artificial intelligence model in a decentralized network
US9491494B2 (en) Distribution and use of video statistics for cloud-based video encoding