BR112023023427A2 - Imagem implícita e compressão de vídeo usando sistemas de aprendizado de máquina - Google Patents
Imagem implícita e compressão de vídeo usando sistemas de aprendizado de máquinaInfo
- Publication number
- BR112023023427A2 BR112023023427A2 BR112023023427A BR112023023427A BR112023023427A2 BR 112023023427 A2 BR112023023427 A2 BR 112023023427A2 BR 112023023427 A BR112023023427 A BR 112023023427A BR 112023023427 A BR112023023427 A BR 112023023427A BR 112023023427 A2 BR112023023427 A2 BR 112023023427A2
- Authority
- BR
- Brazil
- Prior art keywords
- machine learning
- learning systems
- video compression
- image
- implied image
- Prior art date
Links
- 230000006835 compression Effects 0.000 title abstract 5
- 238000007906 compression Methods 0.000 title abstract 5
- 238000010801 machine learning Methods 0.000 title abstract 3
- 238000000034 method Methods 0.000 abstract 5
- 238000013528 artificial neural network Methods 0.000 abstract 2
- 230000005540 biological transmission Effects 0.000 abstract 1
Classifications
-
- H—ELECTRICITY
- H04—ELECTRIC COMMUNICATION TECHNIQUE
- H04N—PICTORIAL COMMUNICATION, e.g. TELEVISION
- H04N19/00—Methods or arrangements for coding, decoding, compressing or decompressing digital video signals
- H04N19/50—Methods or arrangements for coding, decoding, compressing or decompressing digital video signals using predictive coding
- H04N19/503—Methods or arrangements for coding, decoding, compressing or decompressing digital video signals using predictive coding involving temporal prediction
- H04N19/51—Motion estimation or motion compensation
-
- H—ELECTRICITY
- H04—ELECTRIC COMMUNICATION TECHNIQUE
- H04N—PICTORIAL COMMUNICATION, e.g. TELEVISION
- H04N19/00—Methods or arrangements for coding, decoding, compressing or decompressing digital video signals
- H04N19/10—Methods or arrangements for coding, decoding, compressing or decompressing digital video signals using adaptive coding
- H04N19/102—Methods or arrangements for coding, decoding, compressing or decompressing digital video signals using adaptive coding characterised by the element, parameter or selection affected or controlled by the adaptive coding
- H04N19/124—Quantisation
- H04N19/126—Details of normalisation or weighting functions, e.g. normalisation matrices or variable uniform quantisers
-
- H—ELECTRICITY
- H04—ELECTRIC COMMUNICATION TECHNIQUE
- H04N—PICTORIAL COMMUNICATION, e.g. TELEVISION
- H04N19/00—Methods or arrangements for coding, decoding, compressing or decompressing digital video signals
- H04N19/10—Methods or arrangements for coding, decoding, compressing or decompressing digital video signals using adaptive coding
- H04N19/102—Methods or arrangements for coding, decoding, compressing or decompressing digital video signals using adaptive coding characterised by the element, parameter or selection affected or controlled by the adaptive coding
- H04N19/124—Quantisation
-
- G—PHYSICS
- G06—COMPUTING; CALCULATING OR COUNTING
- G06N—COMPUTING ARRANGEMENTS BASED ON SPECIFIC COMPUTATIONAL MODELS
- G06N3/00—Computing arrangements based on biological models
- G06N3/02—Neural networks
- G06N3/04—Architecture, e.g. interconnection topology
- G06N3/045—Combinations of networks
-
- G—PHYSICS
- G06—COMPUTING; CALCULATING OR COUNTING
- G06N—COMPUTING ARRANGEMENTS BASED ON SPECIFIC COMPUTATIONAL MODELS
- G06N3/00—Computing arrangements based on biological models
- G06N3/02—Neural networks
- G06N3/04—Architecture, e.g. interconnection topology
- G06N3/0464—Convolutional networks [CNN, ConvNet]
-
- G—PHYSICS
- G06—COMPUTING; CALCULATING OR COUNTING
- G06N—COMPUTING ARRANGEMENTS BASED ON SPECIFIC COMPUTATIONAL MODELS
- G06N3/00—Computing arrangements based on biological models
- G06N3/02—Neural networks
- G06N3/08—Learning methods
-
- G—PHYSICS
- G06—COMPUTING; CALCULATING OR COUNTING
- G06N—COMPUTING ARRANGEMENTS BASED ON SPECIFIC COMPUTATIONAL MODELS
- G06N3/00—Computing arrangements based on biological models
- G06N3/02—Neural networks
- G06N3/08—Learning methods
- G06N3/084—Backpropagation, e.g. using gradient descent
-
- G—PHYSICS
- G06—COMPUTING; CALCULATING OR COUNTING
- G06T—IMAGE DATA PROCESSING OR GENERATION, IN GENERAL
- G06T9/00—Image coding
- G06T9/002—Image coding using neural networks
-
- H—ELECTRICITY
- H04—ELECTRIC COMMUNICATION TECHNIQUE
- H04N—PICTORIAL COMMUNICATION, e.g. TELEVISION
- H04N19/00—Methods or arrangements for coding, decoding, compressing or decompressing digital video signals
- H04N19/10—Methods or arrangements for coding, decoding, compressing or decompressing digital video signals using adaptive coding
- H04N19/102—Methods or arrangements for coding, decoding, compressing or decompressing digital video signals using adaptive coding characterised by the element, parameter or selection affected or controlled by the adaptive coding
- H04N19/103—Selection of coding mode or of prediction mode
-
- H—ELECTRICITY
- H04—ELECTRIC COMMUNICATION TECHNIQUE
- H04N—PICTORIAL COMMUNICATION, e.g. TELEVISION
- H04N19/00—Methods or arrangements for coding, decoding, compressing or decompressing digital video signals
- H04N19/10—Methods or arrangements for coding, decoding, compressing or decompressing digital video signals using adaptive coding
- H04N19/134—Methods or arrangements for coding, decoding, compressing or decompressing digital video signals using adaptive coding characterised by the element, parameter or criterion affecting or controlling the adaptive coding
- H04N19/146—Data rate or code amount at the encoder output
-
- H—ELECTRICITY
- H04—ELECTRIC COMMUNICATION TECHNIQUE
- H04N—PICTORIAL COMMUNICATION, e.g. TELEVISION
- H04N19/00—Methods or arrangements for coding, decoding, compressing or decompressing digital video signals
- H04N19/10—Methods or arrangements for coding, decoding, compressing or decompressing digital video signals using adaptive coding
- H04N19/134—Methods or arrangements for coding, decoding, compressing or decompressing digital video signals using adaptive coding characterised by the element, parameter or criterion affecting or controlling the adaptive coding
- H04N19/146—Data rate or code amount at the encoder output
- H04N19/147—Data rate or code amount at the encoder output according to rate distortion criteria
-
- H—ELECTRICITY
- H04—ELECTRIC COMMUNICATION TECHNIQUE
- H04N—PICTORIAL COMMUNICATION, e.g. TELEVISION
- H04N19/00—Methods or arrangements for coding, decoding, compressing or decompressing digital video signals
- H04N19/10—Methods or arrangements for coding, decoding, compressing or decompressing digital video signals using adaptive coding
- H04N19/169—Methods or arrangements for coding, decoding, compressing or decompressing digital video signals using adaptive coding characterised by the coding unit, i.e. the structural portion or semantic portion of the video signal being the object or the subject of the adaptive coding
- H04N19/17—Methods or arrangements for coding, decoding, compressing or decompressing digital video signals using adaptive coding characterised by the coding unit, i.e. the structural portion or semantic portion of the video signal being the object or the subject of the adaptive coding the unit being an image region, e.g. an object
- H04N19/172—Methods or arrangements for coding, decoding, compressing or decompressing digital video signals using adaptive coding characterised by the coding unit, i.e. the structural portion or semantic portion of the video signal being the object or the subject of the adaptive coding the unit being an image region, e.g. an object the region being a picture, frame or field
-
- H—ELECTRICITY
- H04—ELECTRIC COMMUNICATION TECHNIQUE
- H04N—PICTORIAL COMMUNICATION, e.g. TELEVISION
- H04N19/00—Methods or arrangements for coding, decoding, compressing or decompressing digital video signals
- H04N19/46—Embedding additional information in the video signal during the compression process
- H04N19/463—Embedding additional information in the video signal during the compression process by compressing encoding parameters before transmission
-
- H—ELECTRICITY
- H04—ELECTRIC COMMUNICATION TECHNIQUE
- H04N—PICTORIAL COMMUNICATION, e.g. TELEVISION
- H04N19/00—Methods or arrangements for coding, decoding, compressing or decompressing digital video signals
- H04N19/50—Methods or arrangements for coding, decoding, compressing or decompressing digital video signals using predictive coding
- H04N19/503—Methods or arrangements for coding, decoding, compressing or decompressing digital video signals using predictive coding involving temporal prediction
- H04N19/51—Motion estimation or motion compensation
- H04N19/537—Motion estimation other than block-based
-
- H—ELECTRICITY
- H04—ELECTRIC COMMUNICATION TECHNIQUE
- H04N—PICTORIAL COMMUNICATION, e.g. TELEVISION
- H04N19/00—Methods or arrangements for coding, decoding, compressing or decompressing digital video signals
- H04N19/90—Methods or arrangements for coding, decoding, compressing or decompressing digital video signals using coding techniques not provided for in groups H04N19/10-H04N19/85, e.g. fractals
- H04N19/91—Entropy coding, e.g. variable length coding [VLC] or arithmetic coding
-
- H—ELECTRICITY
- H04—ELECTRIC COMMUNICATION TECHNIQUE
- H04N—PICTORIAL COMMUNICATION, e.g. TELEVISION
- H04N19/00—Methods or arrangements for coding, decoding, compressing or decompressing digital video signals
- H04N19/10—Methods or arrangements for coding, decoding, compressing or decompressing digital video signals using adaptive coding
- H04N19/169—Methods or arrangements for coding, decoding, compressing or decompressing digital video signals using adaptive coding characterised by the coding unit, i.e. the structural portion or semantic portion of the video signal being the object or the subject of the adaptive coding
- H04N19/177—Methods or arrangements for coding, decoding, compressing or decompressing digital video signals using adaptive coding characterised by the coding unit, i.e. the structural portion or semantic portion of the video signal being the object or the subject of the adaptive coding the unit being a group of pictures [GOP]
Landscapes
- Engineering & Computer Science (AREA)
- Multimedia (AREA)
- Signal Processing (AREA)
- Physics & Mathematics (AREA)
- Theoretical Computer Science (AREA)
- Evolutionary Computation (AREA)
- General Physics & Mathematics (AREA)
- Artificial Intelligence (AREA)
- General Health & Medical Sciences (AREA)
- Mathematical Physics (AREA)
- Data Mining & Analysis (AREA)
- Biophysics (AREA)
- Biomedical Technology (AREA)
- Molecular Biology (AREA)
- Computing Systems (AREA)
- General Engineering & Computer Science (AREA)
- Life Sciences & Earth Sciences (AREA)
- Computational Linguistics (AREA)
- Software Systems (AREA)
- Health & Medical Sciences (AREA)
- Compression Or Coding Systems Of Tv Signals (AREA)
- Image Analysis (AREA)
- Compression, Expansion, Code Conversion, And Decoders (AREA)
- Image Processing (AREA)
- Compression Of Band Width Or Redundancy In Fax (AREA)
Abstract
imagem implícita e compressão de vídeo usando sistemas de aprendizado de máquina. são descritas técnicas para comprimir e descomprimir dados usando sistemas de aprendizado de máquina. um exemplo de processo pode incluir o recebimento de uma pluralidade de imagens para compressão por um sistema de compressão de rede neural. o processo pode incluir determinar, com base em uma primeira imagem da pluralidade de imagens, uma primeira pluralidade de valores de peso associados a um primeiro modelo do sistema de compressão de rede neural. o processo pode incluir a geração de um primeiro fluxo de bits compreendendo uma versão comprimida da primeira pluralidade de valores de peso. o processo pode incluir a saída do primeiro fluxo de bits para transmissão a um receptor.
Applications Claiming Priority (3)
Application Number | Priority Date | Filing Date | Title |
---|---|---|---|
US202163191606P | 2021-05-21 | 2021-05-21 | |
US17/645,018 US20220385907A1 (en) | 2021-05-21 | 2021-12-17 | Implicit image and video compression using machine learning systems |
PCT/US2022/022881 WO2022245434A1 (en) | 2021-05-21 | 2022-03-31 | Implicit image and video compression using machine learning systems |
Publications (1)
Publication Number | Publication Date |
---|---|
BR112023023427A2 true BR112023023427A2 (pt) | 2024-01-30 |
Family
ID=81392695
Family Applications (1)
Application Number | Title | Priority Date | Filing Date |
---|---|---|---|
BR112023023427A BR112023023427A2 (pt) | 2021-05-21 | 2022-03-31 | Imagem implícita e compressão de vídeo usando sistemas de aprendizado de máquina |
Country Status (7)
Country | Link |
---|---|
US (1) | US20220385907A1 (pt) |
EP (1) | EP4342178A1 (pt) |
JP (1) | JP2024519791A (pt) |
KR (1) | KR20240012374A (pt) |
BR (1) | BR112023023427A2 (pt) |
TW (1) | TW202247650A (pt) |
WO (1) | WO2022245434A1 (pt) |
Families Citing this family (13)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
US11922314B1 (en) * | 2018-11-30 | 2024-03-05 | Ansys, Inc. | Systems and methods for building dynamic reduced order physical models |
US11810225B2 (en) * | 2021-03-30 | 2023-11-07 | Zoox, Inc. | Top-down scene generation |
US11858514B2 (en) | 2021-03-30 | 2024-01-02 | Zoox, Inc. | Top-down scene discrimination |
US20220335655A1 (en) * | 2021-04-19 | 2022-10-20 | Tencent America LLC | Substitutional input optimization for adaptive neural image compression with smooth quality control |
US20230013421A1 (en) * | 2021-07-14 | 2023-01-19 | Sony Group Corporation | Point cloud compression using occupancy networks |
WO2023069699A1 (en) * | 2021-10-21 | 2023-04-27 | Visa International Service Association | Method, system, and computer program product for embedding compression and regularization |
US11743552B1 (en) * | 2022-06-03 | 2023-08-29 | International Business Machines Corporation | Computer technology for enhancing images with a generative adversarial network |
US11689601B1 (en) * | 2022-06-17 | 2023-06-27 | International Business Machines Corporation | Stream quality enhancement |
EP4390774A1 (en) * | 2022-12-21 | 2024-06-26 | Fondation B-COM | Method and device for decoding a bitstream |
CN115834890B (zh) * | 2023-02-08 | 2023-04-28 | 哈尔滨工业大学(深圳)(哈尔滨工业大学深圳科技创新研究院) | 一种图像压缩方法、装置、设备及存储介质 |
CN116862019B (zh) * | 2023-07-06 | 2024-03-19 | 清华大学 | 基于数据并行范式的模型训练方法及装置 |
CN117541791B (zh) * | 2023-11-23 | 2024-05-28 | 北京师范大学 | 基于多域可变形卷积的眼部结构分割方法、系统及设备 |
CN117495741B (zh) * | 2023-12-29 | 2024-04-12 | 成都货安计量技术中心有限公司 | 一种基于大卷积对比学习的畸变还原方法 |
Family Cites Families (10)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
GB201603144D0 (en) * | 2016-02-23 | 2016-04-06 | Magic Pony Technology Ltd | Training end-to-end video processes |
WO2017036370A1 (en) * | 2015-09-03 | 2017-03-09 | Mediatek Inc. | Method and apparatus of neural network based processing in video coding |
EP3451293A1 (en) * | 2017-08-28 | 2019-03-06 | Thomson Licensing | Method and apparatus for filtering with multi-branch deep learning |
TWI744827B (zh) * | 2019-03-18 | 2021-11-01 | 弗勞恩霍夫爾協會 | 用以壓縮類神經網路參數之方法與裝置 |
EP3716158A3 (en) * | 2019-03-25 | 2020-11-25 | Nokia Technologies Oy | Compressing weight updates for decoder-side neural networks |
EP3792821A1 (en) * | 2019-09-11 | 2021-03-17 | Naver Corporation | Action recognition using implicit pose representations |
US20220261616A1 (en) * | 2019-07-02 | 2022-08-18 | Vid Scale, Inc. | Clustering-based quantization for neural network compression |
EP4144087A1 (en) * | 2020-04-29 | 2023-03-08 | Deep Render Ltd | Image compression and decoding, video compression and decoding: methods and systems |
WO2022017848A1 (en) * | 2020-07-21 | 2022-01-27 | Interdigital Vc Holdings France, Sas | A method and an apparatus for updating a deep neural network-based image or video decoder |
KR20230072487A (ko) * | 2020-12-24 | 2023-05-24 | 후아웨이 테크놀러지 컴퍼니 리미티드 | 분할 정보의 시그널링으로 디코딩 |
-
2021
- 2021-12-17 US US17/645,018 patent/US20220385907A1/en active Pending
-
2022
- 2022-03-31 EP EP22719679.7A patent/EP4342178A1/en active Pending
- 2022-03-31 JP JP2023570426A patent/JP2024519791A/ja active Pending
- 2022-03-31 WO PCT/US2022/022881 patent/WO2022245434A1/en active Application Filing
- 2022-03-31 BR BR112023023427A patent/BR112023023427A2/pt unknown
- 2022-03-31 KR KR1020237039057A patent/KR20240012374A/ko unknown
- 2022-04-01 TW TW111112832A patent/TW202247650A/zh unknown
Also Published As
Publication number | Publication date |
---|---|
WO2022245434A1 (en) | 2022-11-24 |
US20220385907A1 (en) | 2022-12-01 |
JP2024519791A (ja) | 2024-05-21 |
KR20240012374A (ko) | 2024-01-29 |
EP4342178A1 (en) | 2024-03-27 |
TW202247650A (zh) | 2022-12-01 |
Similar Documents
Publication | Publication Date | Title |
---|---|---|
BR112023023427A2 (pt) | Imagem implícita e compressão de vídeo usando sistemas de aprendizado de máquina | |
BR112019006979A2 (pt) | sequência para sequenciar transformações para síntese de fala via redes neurais recorrentes | |
WO2018155986A3 (ko) | 비디오 신호 처리 방법 및 장치 | |
BR112023014810A2 (pt) | Otimizador de distorção de taxa com base em aprendizagem de máquina para compactação de vídeo | |
EP4319172A3 (en) | Trusted contextual content | |
BR112022016793A2 (pt) | Compressão de vídeo usando sistemas de aprendizado de máquina de base recorrente | |
BR112017024275A2 (pt) | determinação de região de busca para inter-codificação dentro de imagem específica de dados de vídeo | |
GB2600327A (en) | Techniques for classification with neural networks | |
BR112021018450A8 (pt) | Controle de taxa para um codificador de vídeo | |
US20150326632A1 (en) | Methods and systems to facilitate synchronization of multiple media streams | |
BR112021026284A2 (pt) | Aparelho para decodificar ou codificar um bloco predeterminado de uma imagem, métodos e fluxo de dados | |
BR112022023315A2 (pt) | Aparelho e método para receber fluxo de dados de vídeo, fluxo de dados de vídeo, codificador e decodificador de vídeo, e método para codificar um vídeo em um fluxo de dados de vídeo | |
BR112022013557A2 (pt) | Sistema e método para dados de serviço de multidifusão/difusão | |
BR112022012807A2 (pt) | Método de processamento de vídeo, aparelho para processar dados de vídeo e meios não transitórios legíveis por computador | |
MY197897A (en) | Apparatus for splitting portion of picture into coding units, apparatus for encoding image of video sequence, apparatus for decoding image of video sequence, method for splitting portion of image into coding units and computer readable medium | |
US10848840B2 (en) | Communication apparatus and signal relay method | |
BR112022001279A2 (pt) | Método de processamento de vídeo, aparelho em um sistema de vídeo, e, produto de programa de computador | |
MX2019003123A (es) | Sistema y metodo para la compresion de datos de movimiento, de alta fidelidad, para la transmision a traves de una red de ancho de banda limitado. | |
WO2018224839A3 (en) | METHODS AND SYSTEMS FOR REACTION VIDEO GENERATION | |
SE1850827A1 (en) | Method and apparatus for training a neural network classifier to classify an image depicting one or more objects of a biological sample | |
BR112022019663A2 (pt) | Número de sinalização de candidatos de fusão de sub-bloco em codificação de vídeo | |
BR112022011316A2 (pt) | Métodos para geração de relatório cirúrgico operacional aperfeiçoado que utilizam aprendizado por máquina e dispositivos associados | |
BR112022002204A2 (pt) | Redimensionamento de previsão de gerenciamento de resolução adaptativa | |
US10943168B2 (en) | System and method for determining an artificial intelligence model in a decentralized network | |
US9491494B2 (en) | Distribution and use of video statistics for cloud-based video encoding |