MX2023007191A - A method and apparatus for encoding or decoding a picture using a neural network - Google Patents
A method and apparatus for encoding or decoding a picture using a neural network
- Publication number
- MX2023007191A
- Authority
- MX
- Mexico
- Prior art keywords
- decoding
- encoding
- picture
- neural network
- bitstream
- Prior art date
Classifications
-
- G—PHYSICS
- G06—COMPUTING; CALCULATING OR COUNTING
- G06T—IMAGE DATA PROCESSING OR GENERATION, IN GENERAL
- G06T9/00—Image coding
- G06T9/002—Image coding using neural networks
-
- H—ELECTRICITY
- H04—ELECTRIC COMMUNICATION TECHNIQUE
- H04N—PICTORIAL COMMUNICATION, e.g. TELEVISION
- H04N19/00—Methods or arrangements for coding, decoding, compressing or decompressing digital video signals
- H04N19/30—Methods or arrangements for coding, decoding, compressing or decompressing digital video signals using hierarchical techniques, e.g. scalability
-
- G—PHYSICS
- G06—COMPUTING; CALCULATING OR COUNTING
- G06N—COMPUTING ARRANGEMENTS BASED ON SPECIFIC COMPUTATIONAL MODELS
- G06N3/00—Computing arrangements based on biological models
- G06N3/02—Neural networks
- G06N3/04—Architecture, e.g. interconnection topology
- G06N3/045—Combinations of networks
-
- G—PHYSICS
- G06—COMPUTING; CALCULATING OR COUNTING
- G06N—COMPUTING ARRANGEMENTS BASED ON SPECIFIC COMPUTATIONAL MODELS
- G06N3/00—Computing arrangements based on biological models
- G06N3/02—Neural networks
- G06N3/04—Architecture, e.g. interconnection topology
- G06N3/045—Combinations of networks
- G06N3/0455—Auto-encoder networks; Encoder-decoder networks
-
- G—PHYSICS
- G06—COMPUTING; CALCULATING OR COUNTING
- G06N—COMPUTING ARRANGEMENTS BASED ON SPECIFIC COMPUTATIONAL MODELS
- G06N3/00—Computing arrangements based on biological models
- G06N3/02—Neural networks
- G06N3/04—Architecture, e.g. interconnection topology
- G06N3/0464—Convolutional networks [CNN, ConvNet]
-
- G—PHYSICS
- G06—COMPUTING; CALCULATING OR COUNTING
- G06N—COMPUTING ARRANGEMENTS BASED ON SPECIFIC COMPUTATIONAL MODELS
- G06N3/00—Computing arrangements based on biological models
- G06N3/02—Neural networks
- G06N3/04—Architecture, e.g. interconnection topology
- G06N3/047—Probabilistic or stochastic networks
-
- G—PHYSICS
- G06—COMPUTING; CALCULATING OR COUNTING
- G06V—IMAGE OR VIDEO RECOGNITION OR UNDERSTANDING
- G06V10/00—Arrangements for image or video recognition or understanding
- G06V10/40—Extraction of image or video features
- G06V10/44—Local feature extraction by analysis of parts of the pattern, e.g. by detecting edges, contours, loops, corners, strokes or intersections; Connectivity analysis, e.g. of connected components
- G06V10/443—Local feature extraction by analysis of parts of the pattern, e.g. by detecting edges, contours, loops, corners, strokes or intersections; Connectivity analysis, e.g. of connected components by matching or filtering
- G06V10/449—Biologically inspired filters, e.g. difference of Gaussians [DoG] or Gabor filters
- G06V10/451—Biologically inspired filters, e.g. difference of Gaussians [DoG] or Gabor filters with interaction between the filter responses, e.g. cortical complex cells
- G06V10/454—Integrating the filters into a hierarchical structure, e.g. convolutional neural networks [CNN]
-
- G—PHYSICS
- G06—COMPUTING; CALCULATING OR COUNTING
- G06V—IMAGE OR VIDEO RECOGNITION OR UNDERSTANDING
- G06V20/00—Scenes; Scene-specific elements
- G06V20/40—Scenes; Scene-specific elements in video content
- G06V20/46—Extracting features or characteristics from the video content, e.g. video fingerprints, representative shots or key frames
-
- H—ELECTRICITY
- H04—ELECTRIC COMMUNICATION TECHNIQUE
- H04N—PICTORIAL COMMUNICATION, e.g. TELEVISION
- H04N19/00—Methods or arrangements for coding, decoding, compressing or decompressing digital video signals
- H04N19/10—Methods or arrangements for coding, decoding, compressing or decompressing digital video signals using adaptive coding
- H04N19/102—Methods or arrangements for coding, decoding, compressing or decompressing digital video signals using adaptive coding characterised by the element, parameter or selection affected or controlled by the adaptive coding
- H04N19/132—Sampling, masking or truncation of coding units, e.g. adaptive resampling, frame skipping, frame interpolation or high-frequency transform coefficient masking
-
- H—ELECTRICITY
- H04—ELECTRIC COMMUNICATION TECHNIQUE
- H04N—PICTORIAL COMMUNICATION, e.g. TELEVISION
- H04N19/00—Methods or arrangements for coding, decoding, compressing or decompressing digital video signals
- H04N19/10—Methods or arrangements for coding, decoding, compressing or decompressing digital video signals using adaptive coding
- H04N19/169—Methods or arrangements for coding, decoding, compressing or decompressing digital video signals using adaptive coding characterised by the coding unit, i.e. the structural portion or semantic portion of the video signal being the object or the subject of the adaptive coding
- H04N19/17—Methods or arrangements for coding, decoding, compressing or decompressing digital video signals using adaptive coding characterised by the coding unit, i.e. the structural portion or semantic portion of the video signal being the object or the subject of the adaptive coding the unit being an image region, e.g. an object
- H04N19/172—Methods or arrangements for coding, decoding, compressing or decompressing digital video signals using adaptive coding characterised by the coding unit, i.e. the structural portion or semantic portion of the video signal being the object or the subject of the adaptive coding the unit being an image region, e.g. an object the region being a picture, frame or field
-
- H—ELECTRICITY
- H04—ELECTRIC COMMUNICATION TECHNIQUE
- H04N—PICTORIAL COMMUNICATION, e.g. TELEVISION
- H04N19/00—Methods or arrangements for coding, decoding, compressing or decompressing digital video signals
- H04N19/10—Methods or arrangements for coding, decoding, compressing or decompressing digital video signals using adaptive coding
- H04N19/169—Methods or arrangements for coding, decoding, compressing or decompressing digital video signals using adaptive coding characterised by the coding unit, i.e. the structural portion or semantic portion of the video signal being the object or the subject of the adaptive coding
- H04N19/182—Methods or arrangements for coding, decoding, compressing or decompressing digital video signals using adaptive coding characterised by the coding unit, i.e. the structural portion or semantic portion of the video signal being the object or the subject of the adaptive coding the unit being a pixel
-
- H—ELECTRICITY
- H04—ELECTRIC COMMUNICATION TECHNIQUE
- H04N—PICTORIAL COMMUNICATION, e.g. TELEVISION
- H04N19/00—Methods or arrangements for coding, decoding, compressing or decompressing digital video signals
- H04N19/42—Methods or arrangements for coding, decoding, compressing or decompressing digital video signals characterised by implementation details or hardware specially adapted for video compression or decompression, e.g. dedicated software implementation
-
- H—ELECTRICITY
- H04—ELECTRIC COMMUNICATION TECHNIQUE
- H04N—PICTORIAL COMMUNICATION, e.g. TELEVISION
- H04N19/00—Methods or arrangements for coding, decoding, compressing or decompressing digital video signals
- H04N19/85—Methods or arrangements for coding, decoding, compressing or decompressing digital video signals using pre-processing or post-processing specially adapted for video compression
-
- G—PHYSICS
- G06—COMPUTING; CALCULATING OR COUNTING
- G06N—COMPUTING ARRANGEMENTS BASED ON SPECIFIC COMPUTATIONAL MODELS
- G06N3/00—Computing arrangements based on biological models
- G06N3/02—Neural networks
- G06N3/04—Architecture, e.g. interconnection topology
- G06N3/048—Activation functions
-
- G—PHYSICS
- G06—COMPUTING; CALCULATING OR COUNTING
- G06N—COMPUTING ARRANGEMENTS BASED ON SPECIFIC COMPUTATIONAL MODELS
- G06N3/00—Computing arrangements based on biological models
- G06N3/02—Neural networks
- G06N3/08—Learning methods
-
- G—PHYSICS
- G06—COMPUTING; CALCULATING OR COUNTING
- G06T—IMAGE DATA PROCESSING OR GENERATION, IN GENERAL
- G06T2207/00—Indexing scheme for image analysis or image enhancement
- G06T2207/20—Special algorithmic details
- G06T2207/20084—Artificial neural networks [ANN]
Landscapes
- Engineering & Computer Science (AREA)
- Physics & Mathematics (AREA)
- Theoretical Computer Science (AREA)
- Multimedia (AREA)
- General Physics & Mathematics (AREA)
- Signal Processing (AREA)
- Artificial Intelligence (AREA)
- Evolutionary Computation (AREA)
- General Health & Medical Sciences (AREA)
- Molecular Biology (AREA)
- Life Sciences & Earth Sciences (AREA)
- Biomedical Technology (AREA)
- Health & Medical Sciences (AREA)
- Computational Linguistics (AREA)
- Biophysics (AREA)
- Data Mining & Analysis (AREA)
- Computing Systems (AREA)
- General Engineering & Computer Science (AREA)
- Mathematical Physics (AREA)
- Software Systems (AREA)
- Probability & Statistics with Applications (AREA)
- Computer Vision & Pattern Recognition (AREA)
- Biodiversity & Conservation Biology (AREA)
- Compression Or Coding Systems Of Tv Signals (AREA)
- Image Processing (AREA)
Abstract
Methods and systems are disclosed herein for encoding a picture and for decoding a bitstream that may represent an encoded picture. During encoding and decoding, rescaling operations are applied to rescale an input to a size that can be processed by a layer of a neural network. Embodiments disclosed herein provide rescaling methods that achieve a reduced bitstream size, thereby improving compression.
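The rescaling idea in the abstract can be illustrated with a minimal sketch. The abstract does not say how the target size is chosen, so the code below rests on assumptions: each layer is taken to downsample by an integer ratio, an input is taken to be processable by a layer when its height and width are integer multiples of that ratio, and rescaling is taken to mean increasing the size to the nearest such multiple. The names `round_up_to_multiple` and `plan_rescaling` are hypothetical and do not come from the patent.

```python
from typing import List, Tuple

Size = Tuple[int, int]  # (height, width)


def round_up_to_multiple(size: int, ratio: int) -> int:
    """Return the smallest integer multiple of `ratio` that is >= `size`."""
    if size <= 0 or ratio <= 0:
        raise ValueError("size and ratio must be positive integers")
    return -(-size // ratio) * ratio  # ceiling division, then scale back up


def plan_rescaling(height: int, width: int, ratios: List[int]) -> List[Tuple[Size, Size]]:
    """For each layer, return (rescaled input size, layer output size).

    Assumption: a layer with downsampling ratio r needs an input whose
    height and width are integer multiples of r, so the input is rescaled
    immediately before that layer and then reduced by the factor r.
    """
    plan = []
    for r in ratios:
        rescaled = (round_up_to_multiple(height, r), round_up_to_multiple(width, r))
        output = (rescaled[0] // r, rescaled[1] // r)
        plan.append((rescaled, output))
        height, width = output  # the next layer sees this layer's output
    return plan


if __name__ == "__main__":
    # Hypothetical 125 x 200 picture passed through three layers of ratio 2.
    for rescaled, output in plan_rescaling(125, 200, [2, 2, 2]):
        print(f"rescale to {rescaled}, layer output {output}")
```

In this sketch only the size bookkeeping is modeled; how the extra samples are produced (padding, interpolation, or another method) is left open, since the abstract does not specify it.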
Applications Claiming Priority (1)
Application Number | Priority Date | Filing Date | Title |
---|---|---|---|
PCT/EP2020/087333 WO2022128138A1 (en) | 2020-12-18 | 2020-12-18 | A method and apparatus for encoding or decoding a picture using a neural network |
Publications (1)
Publication Number | Publication Date |
---|---|
MX2023007191A (es) | 2023-07-03 |
Family
ID=74141531
Family Applications (1)
Application Number | Title | Priority Date | Filing Date |
---|---|---|---|
MX2023007191A (es) | A method and apparatus for encoding or decoding a picture using a neural network | 2020-12-18 | 2020-12-18 |
Country Status (7)
Country | Link |
---|---|
US (1) | US20240015314A1 (es) |
EP (1) | EP4226325A1 (es) |
JP (1) | JP2024500744A (es) |
KR (1) | KR20230072491A (es) |
CN (1) | CN116648909A (es) |
MX (1) | MX2023007191A (es) |
WO (1) | WO2022128138A1 (es) |
Family Cites Families (2)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
WO2016054779A1 (en) * | 2014-10-09 | 2016-04-14 | Microsoft Technology Licensing, Llc | Spatial pyramid pooling networks for image processing |
WO2018120013A1 (en) * | 2016-12-30 | 2018-07-05 | Nokia Technologies Oy | Artificial neural network |
-
2020
- 2020-12-18 MX MX2023007191A patent/MX2023007191A/es unknown
- 2020-12-18 WO PCT/EP2020/087333 patent/WO2022128138A1/en active Application Filing
- 2020-12-18 KR KR1020237013550A patent/KR20230072491A/ko unknown
- 2020-12-18 JP JP2023536909A patent/JP2024500744A/ja active Pending
- 2020-12-18 EP EP20838490.9A patent/EP4226325A1/en active Pending
- 2020-12-18 CN CN202080108045.4A patent/CN116648909A/zh active Pending
-
2023
- 2023-06-20 US US18/338,092 patent/US20240015314A1/en active Pending
Also Published As
Publication number | Publication date |
---|---|
JP2024500744A (ja) | 2024-01-10 |
WO2022128138A1 (en) | 2022-06-23 |
US20240015314A1 (en) | 2024-01-11 |
EP4226325A1 (en) | 2023-08-16 |
KR20230072491A (ko) | 2023-05-24 |
CN116648909A (zh) | 2023-08-25 |
Similar Documents
Publication | Publication Date | Title |
---|---|---|
ZA202206622B (en) | Image reshaping in video coding using rate distortion optimization | |
PH12018500454A1 (en) | Method and apparatus of neural network based processing in video coding | |
PH12017501638A1 (en) | Video encoding method with bit depth adjustment for fixed-point conversion and apparatus therefor, and video decoding method and apparatus therefor | |
ZA202107888B (en) | Context coding for transform skip mode | |
MX2018003654A (es) | An apparatus, a method and a computer program for video coding and decoding | |
MY173976A (en) | Method, apparatus, and system for processing audio data | |
MY187124A (en) | Non-uniform parameter quantization for advanced coupling | |
ZA202200957B (en) | Quantization process for palette mode | |
CN104065976B (zh) | A video-based image compression and secure transmission method | |
EP4113997A4 (en) | VIDEO CODING METHOD, VIDEO CODING METHOD AND ASSOCIATED APPARATUS | |
MX2021014277A (es) | Video coding method and apparatus using an adaptive parameter set | |
MX2022007280A (es) | Method and apparatus for layered video coding with constraints | |
MX2022008502A (es) | Video decoding method and apparatus for obtaining a quantization parameter, and video encoding method and apparatus for transmitting a quantization parameter | |
EP4054192A4 (en) | VIDEO DECODING METHOD AND APPARATUS, AND VIDEO CODING METHOD AND APPARATUS FOR PERFORMING INTER PREDICTION ACCORDING TO AN AFFINE MODEL | |
MX2022015674A (es) | Signaling non-scalable nested hypothetical reference video decoder information | |
PH12021550614A1 (en) | A method and an apparatus for encoding and decoding of digital image/video | |
MX2023007191A (es) | A method and apparatus for encoding or decoding a picture using a neural network | |
MX2022005905A (es) | Method and apparatus for signaling horizontal wraparound motion compensation in VR360 video coding | |
EP4250729A4 (en) | AI-BASED IMAGE ENCODING AND DECODING APPARATUS AND RELATED METHOD | |
EP4250742A4 (en) | VIDEO DECODING METHOD, VIDEO ENCODING METHOD AND RELATED APPARATUS | |
EP4261824A4 (en) | AUDIO CODING METHOD AND DEVICE AND AUDIO DECODING METHOD AND DEVICE | |
EP4202921A4 (en) | AUDIO ENCODING APPARATUS AND METHOD AND AUDIO DECODING APPARATUS AND METHOD | |
MX2023007993A (es) | Video coding based on feature extraction and picture synthesis | |
EP4131968A4 (en) | RESIDUAL ENCODING IMAGE DECODING METHOD AND ASSOCIATED DEVICE | |
EP4030756A4 (en) | VIDEO ENCODING METHOD, VIDEO DECODING METHOD AND RELATED DEVICE |