BR112023018094A2 - Keypoint-based sampling for pose estimation - Google Patents

Keypoint-based sampling for pose estimation

Info

Publication number
BR112023018094A2
Authority
BR
Brazil
Prior art keywords
keypoint
pose estimation
based sampling
include determining
image
Prior art date
Application number
BR112023018094A
Other languages
English (en)
Inventor
Clemens Arth
Shreyas Hampali
Vincent Lepetit
Original Assignee
Qualcomm Technologies Inc
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
2021-03-17
Filing date
2022-01-25
Publication date
2023-10-03
Application filed by Qualcomm Technologies Inc
Priority claimed from PCT/US2022/013754 (WO2022197367A1)
Publication of BR112023018094A2

Classifications

    • G PHYSICS
    • G06 COMPUTING; CALCULATING OR COUNTING
    • G06T IMAGE DATA PROCESSING OR GENERATION, IN GENERAL
    • G06T7/00 Image analysis
    • G06T7/70 Determining position or orientation of objects or cameras
    • G06T7/73 Determining position or orientation of objects or cameras using feature-based methods
    • G PHYSICS
    • G06 COMPUTING; CALCULATING OR COUNTING
    • G06V IMAGE OR VIDEO RECOGNITION OR UNDERSTANDING
    • G06V20/00 Scenes; Scene-specific elements
    • G06V20/20 Scenes; Scene-specific elements in augmented reality scenes
    • G PHYSICS
    • G06 COMPUTING; CALCULATING OR COUNTING
    • G06V IMAGE OR VIDEO RECOGNITION OR UNDERSTANDING
    • G06V40/00 Recognition of biometric, human-related or animal-related patterns in image or video data
    • G06V40/20 Movements or behaviour, e.g. gesture recognition
    • G06V40/28 Recognition of hand or arm movements, e.g. recognition of deaf sign language
    • G PHYSICS
    • G06 COMPUTING; CALCULATING OR COUNTING
    • G06F ELECTRIC DIGITAL DATA PROCESSING
    • G06F18/00 Pattern recognition
    • G06F18/20 Analysing
    • G06F18/24 Classification techniques
    • G06F18/243 Classification techniques relating to the number of classes
    • G06F18/2431 Multiple classes
    • G PHYSICS
    • G06 COMPUTING; CALCULATING OR COUNTING
    • G06N COMPUTING ARRANGEMENTS BASED ON SPECIFIC COMPUTATIONAL MODELS
    • G06N3/00 Computing arrangements based on biological models
    • G06N3/02 Neural networks
    • G06N3/04 Architecture, e.g. interconnection topology
    • G PHYSICS
    • G06 COMPUTING; CALCULATING OR COUNTING
    • G06N COMPUTING ARRANGEMENTS BASED ON SPECIFIC COMPUTATIONAL MODELS
    • G06N3/00 Computing arrangements based on biological models
    • G06N3/02 Neural networks
    • G06N3/04 Architecture, e.g. interconnection topology
    • G06N3/045 Combinations of networks
    • G PHYSICS
    • G06 COMPUTING; CALCULATING OR COUNTING
    • G06N COMPUTING ARRANGEMENTS BASED ON SPECIFIC COMPUTATIONAL MODELS
    • G06N3/00 Computing arrangements based on biological models
    • G06N3/02 Neural networks
    • G06N3/04 Architecture, e.g. interconnection topology
    • G06N3/0464 Convolutional networks [CNN, ConvNet]
    • G PHYSICS
    • G06 COMPUTING; CALCULATING OR COUNTING
    • G06N COMPUTING ARRANGEMENTS BASED ON SPECIFIC COMPUTATIONAL MODELS
    • G06N3/00 Computing arrangements based on biological models
    • G06N3/02 Neural networks
    • G06N3/08 Learning methods
    • G06N3/084 Backpropagation, e.g. using gradient descent
    • G PHYSICS
    • G06 COMPUTING; CALCULATING OR COUNTING
    • G06N COMPUTING ARRANGEMENTS BASED ON SPECIFIC COMPUTATIONAL MODELS
    • G06N3/00 Computing arrangements based on biological models
    • G06N3/02 Neural networks
    • G06N3/08 Learning methods
    • G06N3/088 Non-supervised learning, e.g. competitive learning
    • G PHYSICS
    • G06 COMPUTING; CALCULATING OR COUNTING
    • G06N COMPUTING ARRANGEMENTS BASED ON SPECIFIC COMPUTATIONAL MODELS
    • G06N3/00 Computing arrangements based on biological models
    • G06N3/02 Neural networks
    • G06N3/08 Learning methods
    • G06N3/09 Supervised learning
    • G PHYSICS
    • G06 COMPUTING; CALCULATING OR COUNTING
    • G06T IMAGE DATA PROCESSING OR GENERATION, IN GENERAL
    • G06T15/00 3D [Three Dimensional] image rendering
    • G06T15/005 General purpose rendering architectures
    • G PHYSICS
    • G06 COMPUTING; CALCULATING OR COUNTING
    • G06T IMAGE DATA PROCESSING OR GENERATION, IN GENERAL
    • G06T19/00 Manipulating 3D models or images for computer graphics
    • G06T19/006 Mixed reality
    • G PHYSICS
    • G06 COMPUTING; CALCULATING OR COUNTING
    • G06T IMAGE DATA PROCESSING OR GENERATION, IN GENERAL
    • G06T7/00 Image analysis
    • G06T7/50 Depth or shape recovery
    • G PHYSICS
    • G06 COMPUTING; CALCULATING OR COUNTING
    • G06T IMAGE DATA PROCESSING OR GENERATION, IN GENERAL
    • G06T7/00 Image analysis
    • G06T7/80 Analysis of captured images to determine intrinsic or extrinsic camera parameters, i.e. camera calibration
    • G PHYSICS
    • G06 COMPUTING; CALCULATING OR COUNTING
    • G06T IMAGE DATA PROCESSING OR GENERATION, IN GENERAL
    • G06T9/00 Image coding
    • G06T9/002 Image coding using neural networks
    • G PHYSICS
    • G06 COMPUTING; CALCULATING OR COUNTING
    • G06V IMAGE OR VIDEO RECOGNITION OR UNDERSTANDING
    • G06V10/00 Arrangements for image or video recognition or understanding
    • G06V10/40 Extraction of image or video features
    • G PHYSICS
    • G06 COMPUTING; CALCULATING OR COUNTING
    • G06V IMAGE OR VIDEO RECOGNITION OR UNDERSTANDING
    • G06V10/00 Arrangements for image or video recognition or understanding
    • G06V10/40 Extraction of image or video features
    • G06V10/44 Local feature extraction by analysis of parts of the pattern, e.g. by detecting edges, contours, loops, corners, strokes or intersections; Connectivity analysis, e.g. of connected components
    • G06V10/443 Local feature extraction by analysis of parts of the pattern, e.g. by detecting edges, contours, loops, corners, strokes or intersections; Connectivity analysis, e.g. of connected components by matching or filtering
    • G06V10/449 Biologically inspired filters, e.g. difference of Gaussians [DoG] or Gabor filters
    • G06V10/451 Biologically inspired filters, e.g. difference of Gaussians [DoG] or Gabor filters with interaction between the filter responses, e.g. cortical complex cells
    • G06V10/454 Integrating the filters into a hierarchical structure, e.g. convolutional neural networks [CNN]
    • G PHYSICS
    • G06 COMPUTING; CALCULATING OR COUNTING
    • G06V IMAGE OR VIDEO RECOGNITION OR UNDERSTANDING
    • G06V10/00 Arrangements for image or video recognition or understanding
    • G06V10/40 Extraction of image or video features
    • G06V10/46 Descriptors for shape, contour or point-related descriptors, e.g. scale invariant feature transform [SIFT] or bags of words [BoW]; Salient regional features
    • G06V10/469 Contour-based spatial representations, e.g. vector-coding
    • G PHYSICS
    • G06 COMPUTING; CALCULATING OR COUNTING
    • G06V IMAGE OR VIDEO RECOGNITION OR UNDERSTANDING
    • G06V10/00 Arrangements for image or video recognition or understanding
    • G06V10/70 Arrangements for image or video recognition or understanding using pattern recognition or machine learning
    • G06V10/764 Arrangements for image or video recognition or understanding using pattern recognition or machine learning using classification, e.g. of video objects
    • G PHYSICS
    • G06 COMPUTING; CALCULATING OR COUNTING
    • G06V IMAGE OR VIDEO RECOGNITION OR UNDERSTANDING
    • G06V10/00 Arrangements for image or video recognition or understanding
    • G06V10/70 Arrangements for image or video recognition or understanding using pattern recognition or machine learning
    • G06V10/82 Arrangements for image or video recognition or understanding using pattern recognition or machine learning using neural networks
    • G PHYSICS
    • G06 COMPUTING; CALCULATING OR COUNTING
    • G06V IMAGE OR VIDEO RECOGNITION OR UNDERSTANDING
    • G06V40/00 Recognition of biometric, human-related or animal-related patterns in image or video data
    • G06V40/10 Human or animal bodies, e.g. vehicle occupants or pedestrians; Body parts, e.g. hands
    • G06V40/107 Static hand or arm
    • G06V40/11 Hand-related biometrics; Hand pose recognition
    • G PHYSICS
    • G06 COMPUTING; CALCULATING OR COUNTING
    • G06T IMAGE DATA PROCESSING OR GENERATION, IN GENERAL
    • G06T2200/00 Indexing scheme for image data processing or generation, in general
    • G06T2200/08 Indexing scheme for image data processing or generation, in general involving all processing steps from image acquisition to 3D model generation
    • G PHYSICS
    • G06 COMPUTING; CALCULATING OR COUNTING
    • G06T IMAGE DATA PROCESSING OR GENERATION, IN GENERAL
    • G06T2207/00 Indexing scheme for image analysis or image enhancement
    • G06T2207/20 Special algorithmic details
    • G06T2207/20081 Training; Learning
    • G PHYSICS
    • G06 COMPUTING; CALCULATING OR COUNTING
    • G06T IMAGE DATA PROCESSING OR GENERATION, IN GENERAL
    • G06T2207/00 Indexing scheme for image analysis or image enhancement
    • G06T2207/20 Special algorithmic details
    • G06T2207/20084 Artificial neural networks [ANN]
    • G PHYSICS
    • G06 COMPUTING; CALCULATING OR COUNTING
    • G06T IMAGE DATA PROCESSING OR GENERATION, IN GENERAL
    • G06T2207/00 Indexing scheme for image analysis or image enhancement
    • G06T2207/30 Subject of image; Context of image processing
    • G06T2207/30196 Human being; Person
    • G PHYSICS
    • G06 COMPUTING; CALCULATING OR COUNTING
    • G06T IMAGE DATA PROCESSING OR GENERATION, IN GENERAL
    • G06T2207/00 Indexing scheme for image analysis or image enhancement
    • G06T2207/30 Subject of image; Context of image processing
    • G06T2207/30244 Camera pose

Landscapes

  • Engineering & Computer Science (AREA)
  • Theoretical Computer Science (AREA)
  • Physics & Mathematics (AREA)
  • General Physics & Mathematics (AREA)
  • Evolutionary Computation (AREA)
  • Artificial Intelligence (AREA)
  • Computer Vision & Pattern Recognition (AREA)
  • General Health & Medical Sciences (AREA)
  • Multimedia (AREA)
  • Health & Medical Sciences (AREA)
  • Software Systems (AREA)
  • Computing Systems (AREA)
  • Life Sciences & Earth Sciences (AREA)
  • General Engineering & Computer Science (AREA)
  • Data Mining & Analysis (AREA)
  • Biomedical Technology (AREA)
  • Molecular Biology (AREA)
  • Mathematical Physics (AREA)
  • Biophysics (AREA)
  • Computational Linguistics (AREA)
  • Databases & Information Systems (AREA)
  • Medical Informatics (AREA)
  • Human Computer Interaction (AREA)
  • Computer Graphics (AREA)
  • Psychiatry (AREA)
  • Biodiversity & Conservation Biology (AREA)
  • Social Psychology (AREA)
  • Computer Hardware Design (AREA)
  • Bioinformatics & Cheminformatics (AREA)
  • Bioinformatics & Computational Biology (AREA)
  • Evolutionary Biology (AREA)
  • Image Analysis (AREA)
  • Image Processing (AREA)
  • Manipulator (AREA)

Abstract

Keypoint-based sampling for pose estimation. Systems and techniques are provided for determining one or more poses of one or more objects. For example, a process can include determining, using a machine learning system, a plurality of keypoints from an image. The plurality of keypoints is associated with at least one object in the image. The process can include determining a plurality of features from the machine learning system based on the plurality of keypoints. The process can include classifying the plurality of features into a plurality of joint types. The process can include determining pose parameters for the at least one object based on the plurality of joint types.
BR112023018094A 2021-03-17 2022-01-25 Keypoint-based sampling for pose estimation BR112023018094A2 (pt)
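
The four steps named in the abstract (keypoints, features sampled at the keypoints, classification into joint types, pose parameters) can be illustrated with a toy NumPy sketch. This is only an illustration under stated assumptions, not the method claimed in the patent: the heatmap, the feature map, the linear classifier weights, and all function names (`detect_keypoints`, `sample_features`, `classify_joint_types`, `estimate_pose`) are hypothetical, and the "pose parameters" here are simply the mean 2D location of the keypoints assigned to each joint type.

```python
import numpy as np


def detect_keypoints(heatmap, threshold=0.5):
    """Return (row, col) coordinates of strict 3x3 local maxima above threshold.

    Assumption: keypoints are read off a 2D float heatmap produced by some network.
    """
    h, w = heatmap.shape
    padded = np.pad(heatmap, 1, mode="constant", constant_values=-np.inf)
    # Stack the 8 shifted copies so each pixel can be compared with its neighbors.
    neighbors = np.stack([
        padded[1 + dr:1 + dr + h, 1 + dc:1 + dc + w]
        for dr in (-1, 0, 1)
        for dc in (-1, 0, 1)
        if (dr, dc) != (0, 0)
    ])
    is_peak = (heatmap > threshold) & (heatmap > neighbors.max(axis=0))
    return np.argwhere(is_peak)  # shape (N, 2), row-major order


def sample_features(feature_map, keypoints):
    """Gather one feature vector per keypoint from an (H, W, C) feature map."""
    return feature_map[keypoints[:, 0], keypoints[:, 1]]


def classify_joint_types(features, weights):
    """Assign each feature vector a joint type with a linear classifier.

    Assumption: a plain linear layer stands in for whatever learned model is used.
    """
    return (features @ weights).argmax(axis=1)


def estimate_pose(keypoints, joint_types, num_joint_types):
    """Toy pose parameters: mean 2D location per joint type (NaN if unobserved)."""
    pose = np.full((num_joint_types, 2), np.nan)
    for j in range(num_joint_types):
        members = keypoints[joint_types == j]
        if len(members) > 0:
            pose[j] = members.mean(axis=0)
    return pose


# Usage on synthetic data: two heatmap peaks, three hypothetical joint types.
heatmap = np.zeros((8, 8))
heatmap[2, 2] = 1.0
heatmap[5, 6] = 0.9
feature_map = np.zeros((8, 8, 3))
feature_map[2, 2] = [0.1, 0.9, 0.0]
feature_map[5, 6] = [0.8, 0.1, 0.1]

keypoints = detect_keypoints(heatmap)
features = sample_features(feature_map, keypoints)
joint_types = classify_joint_types(features, np.eye(3))
pose = estimate_pose(keypoints, joint_types, num_joint_types=3)
print(pose)
```

Sampling features only at detected keypoints keeps the classifier input small compared with processing the full feature map, which is the intuition behind the "keypoint-based sampling" in the title.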

Applications Claiming Priority (3)

Application Number Priority Date Filing Date Title
US202163162305P 2021-03-17 2021-03-17
US17/457,408 US11804040B2 (en) 2021-03-17 2021-12-02 Keypoint-based sampling for pose estimation
PCT/US2022/013754 WO2022197367A1 (en) 2021-03-17 2022-01-25 Keypoint-based sampling for pose estimation

Publications (1)

Publication Number Publication Date
BR112023018094A2 (pt) 2023-10-03

Family

ID=83283894

Family Applications (1)

Application Number Title Priority Date Filing Date
BR112023018094A BR112023018094A2 (pt) 2021-03-17 2022-01-25 Keypoint-based sampling for pose estimation

Country Status (5)

Country Link
US (1) US11804040B2 (pt)
EP (1) EP4309151A1 (pt)
KR (1) KR20230156056A (pt)
CN (1) CN116997941A (pt)
BR (1) BR112023018094A2 (pt)

Families Citing this family (8)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
US20220189195A1 (en) * 2020-12-15 2022-06-16 Digitrack Llc Methods and apparatus for automatic hand pose estimation using machine learning
US11804040B2 (en) * 2021-03-17 2023-10-31 Qualcomm Incorporated Keypoint-based sampling for pose estimation
US20230021408A1 (en) * 2021-07-21 2023-01-26 The Open University Object tracking by event camera
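KR20230079618A (ko) * 2021-11-29 2023-06-07 삼성전자주식회사 Method and apparatus for three-dimensional modeling of the human body
CN115984384B (zh) * 2023-03-20 2023-07-21 乐歌人体工学科技股份有限公司 Desktop lifting control method based on facial pose image estimation
CN116129228B (zh) * 2023-04-19 2023-07-18 中国科学技术大学 Training method for an image matching model, image matching method, and apparatus thereof
CN116486489B (zh) * 2023-06-26 2023-08-29 江西农业大学 Three-dimensional hand-object pose estimation method and system based on semantic-aware graph convolution
CN117292407B (zh) * 2023-11-27 2024-03-26 安徽炬视科技有限公司 3D human body pose estimation method and system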

Family Cites Families (7)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
US9383895B1 (en) * 2012-05-05 2016-07-05 F. Vinayak Methods and systems for interactively producing shapes in three-dimensional space
EP3862852A1 (en) * 2015-10-20 2021-08-11 Magic Leap, Inc. Selecting virtual objects in a three-dimensional space
US10096125B1 (en) * 2017-04-07 2018-10-09 Adobe Systems Incorporated Forecasting multiple poses based on a graphical image
EP3467707B1 (en) 2017-10-07 2024-03-13 Tata Consultancy Services Limited System and method for deep learning based hand gesture recognition in first person view
US10796452B2 (en) * 2017-12-03 2020-10-06 Facebook, Inc. Optimizations for structure mapping and up-sampling
US11494932B2 (en) * 2020-06-02 2022-11-08 Naver Corporation Distillation of part experts for whole-body pose estimation
US11804040B2 (en) * 2021-03-17 2023-10-31 Qualcomm Incorporated Keypoint-based sampling for pose estimation

Also Published As

Publication number Publication date
US11804040B2 (en) 2023-10-31
KR20230156056A (ko) 2023-11-13
EP4309151A1 (en) 2024-01-24
US20220301304A1 (en) 2022-09-22
CN116997941A (zh) 2023-11-03

Similar Documents

Publication Publication Date Title
BR112023018094A2 (pt) Keypoint-based sampling for pose estimation
BR112021019996A2 (pt) Deep learning-based instance segmentation training via regression layers
BR112022012980A2 (pt) Object size estimation using camera map and/or radar information
EP4319054A3 (en) Identifying legitimate websites to remove false positives from domain discovery analysis
BR112018077322A2 (pt) Systems and methods for identifying matching content
EP4250140A3 (en) Intelligent digital assistant in a multi-tasking environment
GB2580805A (en) Training data update
ATE528724T1 (de) Hierarchical component-based object recognition
DE60231005D1 (de) Systems, methods and software for classifying documents
BR112018074017A2 (pt) Image validation automation
BR112016024885A2 (pt) Search intent identification
BR112021006327A2 (pt) Automatic endoscope video enhancement
BR102019021121A2 (pt) System and method for performing automated defect detection
BR112022004552A2 (pt) History-based motion vector prediction
BR112021001778A8 (pt) Systems and methods for preventing counterfeiting
BR0006894A (pt) Methods for representing an object and a plurality of objects appearing in a still or video image by processing signals corresponding to the image, for searching for an object in a still or video image by processing signals corresponding to images, for representing objects in still or video images, and for searching for objects in still or video images; apparatus, computer program, computer system, and computer-readable storage medium
BR112016024471A2 (pt) System and method for coding in block prediction mode for display stream compression (DSC)
BR112019000310A2 (pt) Memory request arbitration
EP4300501A3 (en) Methods of sequencing data read realignment
BR112022024406A2 (pt) Machine learning model for analyzing pathology data from metastatic sites
BR112023025190A2 (pt) Objection detection using images and message information
BR112023018923A2 (pt) Computer-implemented method and computer-readable storage device
BR112022019869A2 (pt) Conditional measurement reporting mode for positioning
BR112022024803A2 (pt) Low-power visual tracking systems
BR112023019163A2 (pt) Adaptive use of video models for holistic video understanding