MX2023003531A - Processing images using self-attention based neural networks - Google Patents

Processing images using self-attention based neural networks

Info

Publication number
MX2023003531A
Authority
MX
Mexico
Prior art keywords
image
images
self
input
processing
Prior art date
Application number
MX2023003531A
Other languages
English (en)
Inventor
Neil Matthew Tinmouth Houlsby
Sylvain Gelly
Jakob D Uszkoreit
Xiaohua Zhai
Georg Heigold
Lucas Klaus Beyer
Alexander Kolesnikov
Matthias Johannes Lorenz Minderer
Dirk Weissenborn
Mostafa Deghani
Alexey Dosovitskiy
Thomas Unterthiner
Original Assignee
Google Llc
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
Filing date
Publication date
Application filed by Google Llc
Publication of MX2023003531A

Classifications

    • G PHYSICS
    • G06 COMPUTING; CALCULATING OR COUNTING
    • G06T IMAGE DATA PROCESSING OR GENERATION, IN GENERAL
    • G06T7/00 Image analysis
    • G06T7/97 Determining parameters from multiple pictures
    • G PHYSICS
    • G06 COMPUTING; CALCULATING OR COUNTING
    • G06N COMPUTING ARRANGEMENTS BASED ON SPECIFIC COMPUTATIONAL MODELS
    • G06N3/00 Computing arrangements based on biological models
    • G06N3/02 Neural networks
    • G06N3/04 Architecture, e.g. interconnection topology
    • G06N3/045 Combinations of networks
    • G PHYSICS
    • G06 COMPUTING; CALCULATING OR COUNTING
    • G06N COMPUTING ARRANGEMENTS BASED ON SPECIFIC COMPUTATIONAL MODELS
    • G06N3/00 Computing arrangements based on biological models
    • G06N3/02 Neural networks
    • G06N3/08 Learning methods
    • G06N3/084 Backpropagation, e.g. using gradient descent
    • G PHYSICS
    • G06 COMPUTING; CALCULATING OR COUNTING
    • G06F ELECTRIC DIGITAL DATA PROCESSING
    • G06F18/00 Pattern recognition
    • G06F18/20 Analysing
    • G06F18/24 Classification techniques
    • G PHYSICS
    • G06 COMPUTING; CALCULATING OR COUNTING
    • G06N COMPUTING ARRANGEMENTS BASED ON SPECIFIC COMPUTATIONAL MODELS
    • G06N3/00 Computing arrangements based on biological models
    • G06N3/02 Neural networks
    • G06N3/08 Learning methods
    • G PHYSICS
    • G06 COMPUTING; CALCULATING OR COUNTING
    • G06V IMAGE OR VIDEO RECOGNITION OR UNDERSTANDING
    • G06V10/00 Arrangements for image or video recognition or understanding
    • G06V10/70 Arrangements for image or video recognition or understanding using pattern recognition or machine learning
    • G06V10/764 Arrangements for image or video recognition or understanding using pattern recognition or machine learning using classification, e.g. of video objects
    • G PHYSICS
    • G06 COMPUTING; CALCULATING OR COUNTING
    • G06V IMAGE OR VIDEO RECOGNITION OR UNDERSTANDING
    • G06V10/00 Arrangements for image or video recognition or understanding
    • G06V10/70 Arrangements for image or video recognition or understanding using pattern recognition or machine learning
    • G06V10/82 Arrangements for image or video recognition or understanding using pattern recognition or machine learning using neural networks
    • G PHYSICS
    • G06 COMPUTING; CALCULATING OR COUNTING
    • G06T IMAGE DATA PROCESSING OR GENERATION, IN GENERAL
    • G06T2207/00 Indexing scheme for image analysis or image enhancement
    • G06T2207/20 Special algorithmic details
    • G06T2207/20081 Training; Learning
    • G PHYSICS
    • G06 COMPUTING; CALCULATING OR COUNTING
    • G06T IMAGE DATA PROCESSING OR GENERATION, IN GENERAL
    • G06T2207/00 Indexing scheme for image analysis or image enhancement
    • G06T2207/20 Special algorithmic details
    • G06T2207/20084 Artificial neural networks [ANN]

Landscapes

  • Engineering & Computer Science (AREA)
  • Theoretical Computer Science (AREA)
  • Physics & Mathematics (AREA)
  • General Physics & Mathematics (AREA)
  • Evolutionary Computation (AREA)
  • Artificial Intelligence (AREA)
  • Data Mining & Analysis (AREA)
  • Health & Medical Sciences (AREA)
  • Software Systems (AREA)
  • General Health & Medical Sciences (AREA)
  • Computing Systems (AREA)
  • Life Sciences & Earth Sciences (AREA)
  • General Engineering & Computer Science (AREA)
  • Biophysics (AREA)
  • Molecular Biology (AREA)
  • Mathematical Physics (AREA)
  • Computational Linguistics (AREA)
  • Biomedical Technology (AREA)
  • Computer Vision & Pattern Recognition (AREA)
  • Medical Informatics (AREA)
  • Databases & Information Systems (AREA)
  • Multimedia (AREA)
  • Bioinformatics & Cheminformatics (AREA)
  • Bioinformatics & Computational Biology (AREA)
  • Evolutionary Biology (AREA)
  • Image Analysis (AREA)
  • Image Processing (AREA)

Abstract

Methods, systems, and apparatus, including computer programs encoded on a computer storage medium, for processing images using self-attention based neural networks. One of the methods includes obtaining one or more images comprising a plurality of pixels; determining, for each image of the one or more images, a plurality of image patches of the image, wherein each image patch comprises a different subset of the pixels of the image; processing, for each image of the one or more images, the corresponding plurality of image patches to generate an input sequence comprising a respective input element at each of a plurality of input positions, wherein a plurality of the input elements correspond to respective different image patches; and processing the input sequences using a neural network to generate a network output that characterizes the one or more images, wherein the neural network comprises one or more self-attention neural network layers.
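
The steps named in the abstract (split an image into patches, turn the patches into an input sequence, process the sequence with a self-attention layer, and produce an output that characterizes the image) can be illustrated with a minimal NumPy sketch. This is not the claimed implementation: the non-overlapping square patches, the linear patch embedding, the learned position term, the single attention head, the mean pooling, and every name below (`image_to_patches`, `init_params`, `characterize`, the `params` keys) are assumptions chosen only for illustration, and the weights are random rather than trained.

```python
import numpy as np


def image_to_patches(image, patch_size):
    """Split an (H, W, C) image into non-overlapping, flattened square patches."""
    h, w, c = image.shape
    assert h % patch_size == 0 and w % patch_size == 0
    rows, cols = h // patch_size, w // patch_size
    # (rows, p, cols, p, C) -> (rows, cols, p, p, C) -> (rows * cols, p * p * C)
    patches = image.reshape(rows, patch_size, cols, patch_size, c)
    patches = patches.transpose(0, 2, 1, 3, 4)
    return patches.reshape(rows * cols, patch_size * patch_size * c)


def softmax(x, axis=-1):
    """Numerically stable softmax along the given axis."""
    x = x - x.max(axis=axis, keepdims=True)
    e = np.exp(x)
    return e / e.sum(axis=axis, keepdims=True)


def self_attention(seq, w_q, w_k, w_v):
    """Single-head scaled dot-product self-attention over a (N, D) sequence."""
    q, k, v = seq @ w_q, seq @ w_k, seq @ w_v
    scores = q @ k.T / np.sqrt(k.shape[-1])
    return softmax(scores, axis=-1) @ v


def init_params(rng, num_patches, patch_dim, d_model, num_classes):
    """Random (untrained) weights; the key names are hypothetical."""
    def w(*shape):
        return rng.normal(0.0, 0.02, size=shape)
    return {
        "embed": w(patch_dim, d_model),   # patch -> input element
        "pos": w(num_patches, d_model),   # one term per input position
        "w_q": w(d_model, d_model),
        "w_k": w(d_model, d_model),
        "w_v": w(d_model, d_model),
        "head": w(d_model, num_classes),  # assumed classification output
    }


def characterize(image, params, patch_size=4):
    """Image -> patches -> input sequence -> self-attention -> class scores."""
    patches = image_to_patches(image, patch_size)
    seq = patches @ params["embed"] + params["pos"]
    seq = seq + self_attention(seq, params["w_q"], params["w_k"], params["w_v"])
    pooled = seq.mean(axis=0)  # assumed pooling over input positions
    return softmax(pooled @ params["head"])


rng = np.random.default_rng(0)
image = rng.random((16, 16, 3))
params = init_params(rng, num_patches=16, patch_dim=48, d_model=32, num_classes=10)
probs = characterize(image, params)  # probability vector over 10 assumed classes
```

Here a 16x16x3 image with 4x4 patches yields a sequence of 16 input elements, one per patch. The abstract leaves the form of the network output open, so the classification head is only one possible choice.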
MX2023003531A 2020-10-02 2021-10-04 Processing images using self-attention based neural networks. MX2023003531A (es)

Applications Claiming Priority (2)

Application Number Priority Date Filing Date Title
US202063087135P 2020-10-02 2020-10-02
PCT/US2021/053424 WO2022072940A1 (en) 2020-10-02 2021-10-04 Processing images using self-attention based neural networks

Publications (1)

Publication Number Publication Date
MX2023003531A (es) 2023-04-19

Family

ID=78414760

Family Applications (1)

Application Number Title Priority Date Filing Date
MX2023003531A MX2023003531A (es) 2020-10-02 2021-10-04 Processing images using self-attention based neural networks.

Country Status (11)

Country Link
US (2) US20220108478A1 (es)
EP (1) EP4196917A1 (es)
JP (1) JP2023533907A (es)
KR (1) KR20230004710A (es)
CN (1) CN115605878A (es)
AU (2) AU2021354030B2 (es)
BR (1) BR112023005490A2 (es)
CA (1) CA3193958A1 (es)
MX (1) MX2023003531A (es)
TW (1) TW202215303A (es)
WO (1) WO2022072940A1 (es)

Families Citing this family (7)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN112287978B (zh) * 2020-10-07 2022-04-15 Wuhan University Hyperspectral remote sensing image classification method based on self-attention context network
US11983920B2 (en) * 2021-12-20 2024-05-14 International Business Machines Corporation Unified framework for multigrid neural network architecture
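WO2023229094A1 (ko) * 2022-05-27 2023-11-30 NCSoft Corporation Method and apparatus for predicting actions
CN114972897A (zh) * 2022-06-06 2022-08-30 Jingdong Technology Holding Co., Ltd. Image feature processing method, apparatus, product, medium, and device
CN114862881A (zh) * 2022-07-11 2022-08-05 Sichuan University PET-CT-based cross-modal attention tumor segmentation method and system
KR102663467B1 (ko) * 2022-11-09 2024-05-09 Kookmin University Industry-Academic Cooperation Foundation Apparatus and method for increasing the resolution of point clouds
CN115457042B (zh) * 2022-11-14 2023-03-24 Sichuan Road and Bridge East China Construction Co., Ltd. Method and system for detecting surface defects of threaded sleeves based on distillation learning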

Family Cites Families (16)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
US6850252B1 (en) * 1999-10-05 2005-02-01 Steven M. Hoffberg Intelligent electronic appliance system and method
US7006881B1 (en) * 1991-12-23 2006-02-28 Steven Hoffberg Media recording device with remote graphic user interface
US6058190A (en) * 1997-05-27 2000-05-02 Pitney Bowes Inc. Method and system for automatic recognition of digital indicia images deliberately distorted to be non readable
US7966078B2 (en) * 1999-02-01 2011-06-21 Steven Hoffberg Network media appliance system and method
JP4098021B2 (ja) 2002-07-30 2008-06-11 Fujifilm Corporation Scene identification method, apparatus, and program
JP5258694B2 (ja) * 2009-07-27 2013-08-07 Fujifilm Corporation Medical image processing apparatus, method, and program
ITRM20130022A1 (it) * 2013-01-11 2014-07-12 Natural Intelligent Technologies S R L Method and apparatus for handwriting recognition
US9536293B2 (en) * 2014-07-30 2017-01-03 Adobe Systems Incorporated Image assessment using deep convolutional neural networks
US10803143B2 (en) * 2015-07-30 2020-10-13 Siemens Healthcare Gmbh Virtual biopsy techniques for analyzing diseases
EP3267368B1 (en) * 2016-07-06 2020-06-03 Accenture Global Solutions Limited Machine learning image processing
KR102559202B1 (ko) * 2018-03-27 2023-07-25 Samsung Electronics Co., Ltd. Method and apparatus for 3D rendering
US10853725B2 (en) 2018-05-18 2020-12-01 Deepmind Technologies Limited Neural networks with relational memory
CA3109571A1 (en) * 2018-07-16 2020-01-23 Accel Robotics Corporation Autonomous store tracking system
EP3932318A4 (en) 2019-02-28 2022-04-20 FUJIFILM Corporation LEARNING METHOD, LEARNING SYSTEM, LEARNED MODEL, PROGRAM AND DEVICE FOR GENERATION OF SUPER RESOLUTION IMAGES
US10825221B1 (en) * 2019-04-23 2020-11-03 Adobe Inc. Music driven human dancing video synthesis
JP7444235B2 (ja) 2020-03-03 2024-03-06 NEC Corporation Attention mechanism, image recognition system, feature conversion method, and program

Also Published As

Publication number Publication date
US11983903B2 (en) 2024-05-14
AU2021354030B2 (en) 2023-11-30
WO2022072940A1 (en) 2022-04-07
CN115605878A (zh) 2023-01-13
EP4196917A1 (en) 2023-06-21
JP2023533907A (ja) 2023-08-07
AU2021354030A1 (en) 2022-11-24
AU2024201361A1 (en) 2024-03-21
BR112023005490A2 (pt) 2023-04-25
US20240062426A1 (en) 2024-02-22
CA3193958A1 (en) 2022-04-07
US20220108478A1 (en) 2022-04-07
KR20230004710A (ko) 2023-01-06
TW202215303A (zh) 2022-04-16

Similar Documents

Publication Publication Date Title
MX2023003531A (es) Processing images using self-attention based neural networks.
US11080809B2 (en) Hiding information and images via deep learning
MX2020014293A (es) Artificial intelligence-based generation of sequencing metadata.
Zhou et al. Coverless image steganography using partial-duplicate image retrieval
KR101967089B1 (ko) 컨볼루션 신경망 기반의 완전 기준 이미지 품질 평가
MX2020013580A (es) Apparatus and method for image processing, and system for training a neural network.
SG10201804213UA (en) Projection neural networks
EP3333771A1 (en) Method, program, and apparatus for comparing data hypergraphs
Mathon et al. Optimal transport for secure spread-spectrum watermarking of still images
MX2017009879A (es) Batch normalization layers.
WO2018154092A1 (en) Multiscale image generation
CN106464772A (zh) 用于嵌入水印、视频帧的系统和方法以及用于检测嵌入的水印的系统和方法
PH12021550290A1 (en) Video image component prediction method and device, and computer storage medium
Bondzulic et al. Performance of peak signal‐to‐noise ratio quality assessment in video streaming with packet losses
US20220215104A1 (en) Methods of providing data privacy for neural network based inference
US20160150235A1 (en) Layer-based video encoding
US20130251190A1 (en) Device and method for embedding watermark into image
Chen et al. Universal stego post-processing for enhancing image steganography
Nilizadeh et al. Information Hiding in RGB Images Using an Improved Matrix Pattern Approach.
Bhuiyan et al. An improved image steganography algorithm based on PVD
Yousfi et al. JPEG steganalysis detectors scalable with respect to compression quality
WO2020174458A3 (en) Partial activation of multiple pathways in neural networks
Manu et al. Tamper detection of social media images using quality artifacts and texture features
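CN109859092B (zh) Information hiding method, apparatus, device, and computer-readable storage medium
CN116228010A (zh) Information adjustment method, apparatus, electronic device, and computer-readable medium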