MX2023007191A - Un metodo y aparato para codificar o decodificar una imagen usando una red neuronal. - Google Patents

Un metodo y aparato para codificar o decodificar una imagen usando una red neuronal.

Info

Publication number
MX2023007191A
MX2023007191A MX2023007191A MX2023007191A MX2023007191A MX 2023007191 A MX2023007191 A MX 2023007191A MX 2023007191 A MX2023007191 A MX 2023007191A MX 2023007191 A MX2023007191 A MX 2023007191A MX 2023007191 A MX2023007191 A MX 2023007191A
Authority
MX
Mexico
Prior art keywords
decoding
encoding
picture
neural network
bitstream
Prior art date
Application number
MX2023007191A
Other languages
English (en)
Inventor
Semih Esenlik
Han Gao
Elena Alexandrovna Alshina
Original Assignee
Huawei Tech Co Ltd
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
Filing date
Publication date
Application filed by Huawei Tech Co Ltd filed Critical Huawei Tech Co Ltd
Publication of MX2023007191A publication Critical patent/MX2023007191A/es

Links

Classifications

    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06TIMAGE DATA PROCESSING OR GENERATION, IN GENERAL
    • G06T9/00Image coding
    • G06T9/002Image coding using neural networks
    • HELECTRICITY
    • H04ELECTRIC COMMUNICATION TECHNIQUE
    • H04NPICTORIAL COMMUNICATION, e.g. TELEVISION
    • H04N19/00Methods or arrangements for coding, decoding, compressing or decompressing digital video signals
    • H04N19/30Methods or arrangements for coding, decoding, compressing or decompressing digital video signals using hierarchical techniques, e.g. scalability
    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06NCOMPUTING ARRANGEMENTS BASED ON SPECIFIC COMPUTATIONAL MODELS
    • G06N3/00Computing arrangements based on biological models
    • G06N3/02Neural networks
    • G06N3/04Architecture, e.g. interconnection topology
    • G06N3/045Combinations of networks
    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06NCOMPUTING ARRANGEMENTS BASED ON SPECIFIC COMPUTATIONAL MODELS
    • G06N3/00Computing arrangements based on biological models
    • G06N3/02Neural networks
    • G06N3/04Architecture, e.g. interconnection topology
    • G06N3/045Combinations of networks
    • G06N3/0455Auto-encoder networks; Encoder-decoder networks
    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06NCOMPUTING ARRANGEMENTS BASED ON SPECIFIC COMPUTATIONAL MODELS
    • G06N3/00Computing arrangements based on biological models
    • G06N3/02Neural networks
    • G06N3/04Architecture, e.g. interconnection topology
    • G06N3/0464Convolutional networks [CNN, ConvNet]
    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06NCOMPUTING ARRANGEMENTS BASED ON SPECIFIC COMPUTATIONAL MODELS
    • G06N3/00Computing arrangements based on biological models
    • G06N3/02Neural networks
    • G06N3/04Architecture, e.g. interconnection topology
    • G06N3/047Probabilistic or stochastic networks
    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06VIMAGE OR VIDEO RECOGNITION OR UNDERSTANDING
    • G06V10/00Arrangements for image or video recognition or understanding
    • G06V10/40Extraction of image or video features
    • G06V10/44Local feature extraction by analysis of parts of the pattern, e.g. by detecting edges, contours, loops, corners, strokes or intersections; Connectivity analysis, e.g. of connected components
    • G06V10/443Local feature extraction by analysis of parts of the pattern, e.g. by detecting edges, contours, loops, corners, strokes or intersections; Connectivity analysis, e.g. of connected components by matching or filtering
    • G06V10/449Biologically inspired filters, e.g. difference of Gaussians [DoG] or Gabor filters
    • G06V10/451Biologically inspired filters, e.g. difference of Gaussians [DoG] or Gabor filters with interaction between the filter responses, e.g. cortical complex cells
    • G06V10/454Integrating the filters into a hierarchical structure, e.g. convolutional neural networks [CNN]
    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06VIMAGE OR VIDEO RECOGNITION OR UNDERSTANDING
    • G06V20/00Scenes; Scene-specific elements
    • G06V20/40Scenes; Scene-specific elements in video content
    • G06V20/46Extracting features or characteristics from the video content, e.g. video fingerprints, representative shots or key frames
    • HELECTRICITY
    • H04ELECTRIC COMMUNICATION TECHNIQUE
    • H04NPICTORIAL COMMUNICATION, e.g. TELEVISION
    • H04N19/00Methods or arrangements for coding, decoding, compressing or decompressing digital video signals
    • H04N19/10Methods or arrangements for coding, decoding, compressing or decompressing digital video signals using adaptive coding
    • H04N19/102Methods or arrangements for coding, decoding, compressing or decompressing digital video signals using adaptive coding characterised by the element, parameter or selection affected or controlled by the adaptive coding
    • H04N19/132Sampling, masking or truncation of coding units, e.g. adaptive resampling, frame skipping, frame interpolation or high-frequency transform coefficient masking
    • HELECTRICITY
    • H04ELECTRIC COMMUNICATION TECHNIQUE
    • H04NPICTORIAL COMMUNICATION, e.g. TELEVISION
    • H04N19/00Methods or arrangements for coding, decoding, compressing or decompressing digital video signals
    • H04N19/10Methods or arrangements for coding, decoding, compressing or decompressing digital video signals using adaptive coding
    • H04N19/169Methods or arrangements for coding, decoding, compressing or decompressing digital video signals using adaptive coding characterised by the coding unit, i.e. the structural portion or semantic portion of the video signal being the object or the subject of the adaptive coding
    • H04N19/17Methods or arrangements for coding, decoding, compressing or decompressing digital video signals using adaptive coding characterised by the coding unit, i.e. the structural portion or semantic portion of the video signal being the object or the subject of the adaptive coding the unit being an image region, e.g. an object
    • H04N19/172Methods or arrangements for coding, decoding, compressing or decompressing digital video signals using adaptive coding characterised by the coding unit, i.e. the structural portion or semantic portion of the video signal being the object or the subject of the adaptive coding the unit being an image region, e.g. an object the region being a picture, frame or field
    • HELECTRICITY
    • H04ELECTRIC COMMUNICATION TECHNIQUE
    • H04NPICTORIAL COMMUNICATION, e.g. TELEVISION
    • H04N19/00Methods or arrangements for coding, decoding, compressing or decompressing digital video signals
    • H04N19/10Methods or arrangements for coding, decoding, compressing or decompressing digital video signals using adaptive coding
    • H04N19/169Methods or arrangements for coding, decoding, compressing or decompressing digital video signals using adaptive coding characterised by the coding unit, i.e. the structural portion or semantic portion of the video signal being the object or the subject of the adaptive coding
    • H04N19/182Methods or arrangements for coding, decoding, compressing or decompressing digital video signals using adaptive coding characterised by the coding unit, i.e. the structural portion or semantic portion of the video signal being the object or the subject of the adaptive coding the unit being a pixel
    • HELECTRICITY
    • H04ELECTRIC COMMUNICATION TECHNIQUE
    • H04NPICTORIAL COMMUNICATION, e.g. TELEVISION
    • H04N19/00Methods or arrangements for coding, decoding, compressing or decompressing digital video signals
    • H04N19/42Methods or arrangements for coding, decoding, compressing or decompressing digital video signals characterised by implementation details or hardware specially adapted for video compression or decompression, e.g. dedicated software implementation
    • HELECTRICITY
    • H04ELECTRIC COMMUNICATION TECHNIQUE
    • H04NPICTORIAL COMMUNICATION, e.g. TELEVISION
    • H04N19/00Methods or arrangements for coding, decoding, compressing or decompressing digital video signals
    • H04N19/85Methods or arrangements for coding, decoding, compressing or decompressing digital video signals using pre-processing or post-processing specially adapted for video compression
    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06NCOMPUTING ARRANGEMENTS BASED ON SPECIFIC COMPUTATIONAL MODELS
    • G06N3/00Computing arrangements based on biological models
    • G06N3/02Neural networks
    • G06N3/04Architecture, e.g. interconnection topology
    • G06N3/048Activation functions
    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06NCOMPUTING ARRANGEMENTS BASED ON SPECIFIC COMPUTATIONAL MODELS
    • G06N3/00Computing arrangements based on biological models
    • G06N3/02Neural networks
    • G06N3/08Learning methods
    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06TIMAGE DATA PROCESSING OR GENERATION, IN GENERAL
    • G06T2207/00Indexing scheme for image analysis or image enhancement
    • G06T2207/20Special algorithmic details
    • G06T2207/20084Artificial neural networks [ANN]

Landscapes

  • Engineering & Computer Science (AREA)
  • Physics & Mathematics (AREA)
  • Theoretical Computer Science (AREA)
  • Multimedia (AREA)
  • General Physics & Mathematics (AREA)
  • Signal Processing (AREA)
  • Artificial Intelligence (AREA)
  • Evolutionary Computation (AREA)
  • General Health & Medical Sciences (AREA)
  • Molecular Biology (AREA)
  • Life Sciences & Earth Sciences (AREA)
  • Biomedical Technology (AREA)
  • Health & Medical Sciences (AREA)
  • Computational Linguistics (AREA)
  • Biophysics (AREA)
  • Data Mining & Analysis (AREA)
  • Computing Systems (AREA)
  • General Engineering & Computer Science (AREA)
  • Mathematical Physics (AREA)
  • Software Systems (AREA)
  • Probability & Statistics with Applications (AREA)
  • Computer Vision & Pattern Recognition (AREA)
  • Biodiversity & Conservation Biology (AREA)
  • Compression Or Coding Systems Of Tv Signals (AREA)
  • Image Processing (AREA)

Abstract

En la presente se divulgan métodos y sistemas para codificar una imagen y decodificar un flujo de bits que puede representar una imagen codificada. Durante la codificación y decodificación, se aplican operaciones de cambio de escala para cambiar la escala de una entrada a un tamaño que se puede procesar por una capa de una red neuronal. Modalidades divulgadas en la presente proporcionan métodos para cambio de escala que logra un tamaño reducido del flujo de bits, mejorando así la compresión.
MX2023007191A 2020-12-18 2020-12-18 Un metodo y aparato para codificar o decodificar una imagen usando una red neuronal. MX2023007191A (es)

Applications Claiming Priority (1)

Application Number Priority Date Filing Date Title
PCT/EP2020/087333 WO2022128138A1 (en) 2020-12-18 2020-12-18 A method and apparatus for encoding or decoding a picture using a neural network

Publications (1)

Publication Number Publication Date
MX2023007191A true MX2023007191A (es) 2023-07-03

Family

ID=74141531

Family Applications (1)

Application Number Title Priority Date Filing Date
MX2023007191A MX2023007191A (es) 2020-12-18 2020-12-18 Un metodo y aparato para codificar o decodificar una imagen usando una red neuronal.

Country Status (7)

Country Link
US (1) US20240015314A1 (es)
EP (1) EP4226325A1 (es)
JP (1) JP2024500744A (es)
KR (1) KR20230072491A (es)
CN (1) CN116648909A (es)
MX (1) MX2023007191A (es)
WO (1) WO2022128138A1 (es)

Family Cites Families (2)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
WO2016054779A1 (en) * 2014-10-09 2016-04-14 Microsoft Technology Licensing, Llc Spatial pyramid pooling networks for image processing
WO2018120013A1 (en) * 2016-12-30 2018-07-05 Nokia Technologies Oy Artificial neural network

Also Published As

Publication number Publication date
JP2024500744A (ja) 2024-01-10
WO2022128138A1 (en) 2022-06-23
US20240015314A1 (en) 2024-01-11
EP4226325A1 (en) 2023-08-16
KR20230072491A (ko) 2023-05-24
CN116648909A (zh) 2023-08-25

Similar Documents

Publication Publication Date Title
ZA202206622B (en) Image reshaping in video coding using rate distortion optimization
PH12018500454A1 (en) Method and apparatus of nueral network based processing in video coding
PH12017501638A1 (en) Video encoding method with bit depth adjustment for fixed-point conversion and apparatus therefor, and video decoding method and apparatus therefor
ZA202107888B (en) Context coding for transform skip mode
MX2018003654A (es) Un aparato, un método y un programa informático para codificación y decodificación de video.
MY173976A (en) Method, apparatus, and system for processing audio data
MY187124A (en) Non-uniform parameter quantization for advanced coupling
ZA202200957B (en) Quantization process for palette mode
CN104065976B (zh) 一种基于视频的图像压缩及保密传输方法
EP4113997A4 (en) VIDEO CODING METHOD, VIDEO CODING METHOD AND ASSOCIATED APPARATUS
MX2021014277A (es) Metodo y aparato de codificacion de video que utilizan conjunto de parametros adaptativos.
MX2022007280A (es) Metodo y aparato de codificacion de video por capas con restricciones.
MX2022008502A (es) Metodo de decodificacion de video y aparato para obtener el parametro de cuantizacion, y metodo de codificacion de video y aparato para transmitir el parametro de cuantizacion.
EP4054192A4 (en) VIDEO DECODING METHOD AND APPARATUS, AND VIDEO CODING METHOD AND APPARATUS FOR PERFORMING INTER PREDICTION ACCORDING TO AN AFFINE MODEL
MX2022015674A (es) Informacion de decodificador de video de referencia hipotetica anidada no escalable de se?alizacion.
PH12021550614A1 (en) A method and an apparatus for encoding and decoding of digital image/video
MX2023007191A (es) Un metodo y aparato para codificar o decodificar una imagen usando una red neuronal.
MX2022005905A (es) Metodo y aparato para se?alizacion de compensacion de movimiento envolvente horizontal en la codificacion de video vr360.
EP4250729A4 (en) AI-BASED IMAGE ENCODING AND DECODING APPARATUS AND RELATED METHOD
EP4250742A4 (en) VIDEO DECODING METHOD, VIDEO ENCODING METHOD AND RELATED APPARATUS
EP4261824A4 (en) AUDIO CODING METHOD AND DEVICE AND AUDIO DECODING METHOD AND DEVICE
EP4202921A4 (en) AUDIO ENCODING APPARATUS AND METHOD AND AUDIO DECODING APPARATUS AND METHOD
MX2023007993A (es) Codificacion de video basado en la extraccion de caracteristica y sintesis de foto.
EP4131968A4 (en) RESIDUAL ENCODING IMAGE DECODING METHOD AND ASSOCIATED DEVICE
EP4030756A4 (en) VIDEO ENCODING METHOD, VIDEO DECODING METHOD AND RELATED DEVICE