BR112023018094A2 - Keypoint-based sampling for pose estimation - Google Patents
Keypoint-based sampling for pose estimation
Info
- Publication number
- BR112023018094A2
- Authority
- BR
- Brazil
- Prior art keywords
- keypoint
- pose estimation
- based sampling
- include determining
- image
- Prior art date
Classifications
-
- G—PHYSICS
- G06—COMPUTING; CALCULATING OR COUNTING
- G06T—IMAGE DATA PROCESSING OR GENERATION, IN GENERAL
- G06T7/00—Image analysis
- G06T7/70—Determining position or orientation of objects or cameras
- G06T7/73—Determining position or orientation of objects or cameras using feature-based methods
-
- G—PHYSICS
- G06—COMPUTING; CALCULATING OR COUNTING
- G06V—IMAGE OR VIDEO RECOGNITION OR UNDERSTANDING
- G06V20/00—Scenes; Scene-specific elements
- G06V20/20—Scenes; Scene-specific elements in augmented reality scenes
-
- G—PHYSICS
- G06—COMPUTING; CALCULATING OR COUNTING
- G06V—IMAGE OR VIDEO RECOGNITION OR UNDERSTANDING
- G06V40/00—Recognition of biometric, human-related or animal-related patterns in image or video data
- G06V40/20—Movements or behaviour, e.g. gesture recognition
- G06V40/28—Recognition of hand or arm movements, e.g. recognition of deaf sign language
-
- G—PHYSICS
- G06—COMPUTING; CALCULATING OR COUNTING
- G06F—ELECTRIC DIGITAL DATA PROCESSING
- G06F18/00—Pattern recognition
- G06F18/20—Analysing
- G06F18/24—Classification techniques
- G06F18/243—Classification techniques relating to the number of classes
- G06F18/2431—Multiple classes
-
- G—PHYSICS
- G06—COMPUTING; CALCULATING OR COUNTING
- G06N—COMPUTING ARRANGEMENTS BASED ON SPECIFIC COMPUTATIONAL MODELS
- G06N3/00—Computing arrangements based on biological models
- G06N3/02—Neural networks
- G06N3/04—Architecture, e.g. interconnection topology
-
- G—PHYSICS
- G06—COMPUTING; CALCULATING OR COUNTING
- G06N—COMPUTING ARRANGEMENTS BASED ON SPECIFIC COMPUTATIONAL MODELS
- G06N3/00—Computing arrangements based on biological models
- G06N3/02—Neural networks
- G06N3/04—Architecture, e.g. interconnection topology
- G06N3/045—Combinations of networks
-
- G—PHYSICS
- G06—COMPUTING; CALCULATING OR COUNTING
- G06N—COMPUTING ARRANGEMENTS BASED ON SPECIFIC COMPUTATIONAL MODELS
- G06N3/00—Computing arrangements based on biological models
- G06N3/02—Neural networks
- G06N3/04—Architecture, e.g. interconnection topology
- G06N3/0464—Convolutional networks [CNN, ConvNet]
-
- G—PHYSICS
- G06—COMPUTING; CALCULATING OR COUNTING
- G06N—COMPUTING ARRANGEMENTS BASED ON SPECIFIC COMPUTATIONAL MODELS
- G06N3/00—Computing arrangements based on biological models
- G06N3/02—Neural networks
- G06N3/08—Learning methods
- G06N3/084—Backpropagation, e.g. using gradient descent
-
- G—PHYSICS
- G06—COMPUTING; CALCULATING OR COUNTING
- G06N—COMPUTING ARRANGEMENTS BASED ON SPECIFIC COMPUTATIONAL MODELS
- G06N3/00—Computing arrangements based on biological models
- G06N3/02—Neural networks
- G06N3/08—Learning methods
- G06N3/088—Non-supervised learning, e.g. competitive learning
-
- G—PHYSICS
- G06—COMPUTING; CALCULATING OR COUNTING
- G06N—COMPUTING ARRANGEMENTS BASED ON SPECIFIC COMPUTATIONAL MODELS
- G06N3/00—Computing arrangements based on biological models
- G06N3/02—Neural networks
- G06N3/08—Learning methods
- G06N3/09—Supervised learning
-
- G—PHYSICS
- G06—COMPUTING; CALCULATING OR COUNTING
- G06T—IMAGE DATA PROCESSING OR GENERATION, IN GENERAL
- G06T15/00—3D [Three Dimensional] image rendering
- G06T15/005—General purpose rendering architectures
-
- G—PHYSICS
- G06—COMPUTING; CALCULATING OR COUNTING
- G06T—IMAGE DATA PROCESSING OR GENERATION, IN GENERAL
- G06T19/00—Manipulating 3D models or images for computer graphics
- G06T19/006—Mixed reality
-
- G—PHYSICS
- G06—COMPUTING; CALCULATING OR COUNTING
- G06T—IMAGE DATA PROCESSING OR GENERATION, IN GENERAL
- G06T7/00—Image analysis
- G06T7/50—Depth or shape recovery
-
- G—PHYSICS
- G06—COMPUTING; CALCULATING OR COUNTING
- G06T—IMAGE DATA PROCESSING OR GENERATION, IN GENERAL
- G06T7/00—Image analysis
- G06T7/80—Analysis of captured images to determine intrinsic or extrinsic camera parameters, i.e. camera calibration
-
- G—PHYSICS
- G06—COMPUTING; CALCULATING OR COUNTING
- G06T—IMAGE DATA PROCESSING OR GENERATION, IN GENERAL
- G06T9/00—Image coding
- G06T9/002—Image coding using neural networks
-
- G—PHYSICS
- G06—COMPUTING; CALCULATING OR COUNTING
- G06V—IMAGE OR VIDEO RECOGNITION OR UNDERSTANDING
- G06V10/00—Arrangements for image or video recognition or understanding
- G06V10/40—Extraction of image or video features
-
- G—PHYSICS
- G06—COMPUTING; CALCULATING OR COUNTING
- G06V—IMAGE OR VIDEO RECOGNITION OR UNDERSTANDING
- G06V10/00—Arrangements for image or video recognition or understanding
- G06V10/40—Extraction of image or video features
- G06V10/44—Local feature extraction by analysis of parts of the pattern, e.g. by detecting edges, contours, loops, corners, strokes or intersections; Connectivity analysis, e.g. of connected components
- G06V10/443—Local feature extraction by analysis of parts of the pattern, e.g. by detecting edges, contours, loops, corners, strokes or intersections; Connectivity analysis, e.g. of connected components by matching or filtering
- G06V10/449—Biologically inspired filters, e.g. difference of Gaussians [DoG] or Gabor filters
- G06V10/451—Biologically inspired filters, e.g. difference of Gaussians [DoG] or Gabor filters with interaction between the filter responses, e.g. cortical complex cells
- G06V10/454—Integrating the filters into a hierarchical structure, e.g. convolutional neural networks [CNN]
-
- G—PHYSICS
- G06—COMPUTING; CALCULATING OR COUNTING
- G06V—IMAGE OR VIDEO RECOGNITION OR UNDERSTANDING
- G06V10/00—Arrangements for image or video recognition or understanding
- G06V10/40—Extraction of image or video features
- G06V10/46—Descriptors for shape, contour or point-related descriptors, e.g. scale invariant feature transform [SIFT] or bags of words [BoW]; Salient regional features
- G06V10/469—Contour-based spatial representations, e.g. vector-coding
-
- G—PHYSICS
- G06—COMPUTING; CALCULATING OR COUNTING
- G06V—IMAGE OR VIDEO RECOGNITION OR UNDERSTANDING
- G06V10/00—Arrangements for image or video recognition or understanding
- G06V10/70—Arrangements for image or video recognition or understanding using pattern recognition or machine learning
- G06V10/764—Arrangements for image or video recognition or understanding using pattern recognition or machine learning using classification, e.g. of video objects
-
- G—PHYSICS
- G06—COMPUTING; CALCULATING OR COUNTING
- G06V—IMAGE OR VIDEO RECOGNITION OR UNDERSTANDING
- G06V10/00—Arrangements for image or video recognition or understanding
- G06V10/70—Arrangements for image or video recognition or understanding using pattern recognition or machine learning
- G06V10/82—Arrangements for image or video recognition or understanding using pattern recognition or machine learning using neural networks
-
- G—PHYSICS
- G06—COMPUTING; CALCULATING OR COUNTING
- G06V—IMAGE OR VIDEO RECOGNITION OR UNDERSTANDING
- G06V40/00—Recognition of biometric, human-related or animal-related patterns in image or video data
- G06V40/10—Human or animal bodies, e.g. vehicle occupants or pedestrians; Body parts, e.g. hands
- G06V40/107—Static hand or arm
- G06V40/11—Hand-related biometrics; Hand pose recognition
-
- G—PHYSICS
- G06—COMPUTING; CALCULATING OR COUNTING
- G06T—IMAGE DATA PROCESSING OR GENERATION, IN GENERAL
- G06T2200/00—Indexing scheme for image data processing or generation, in general
- G06T2200/08—Indexing scheme for image data processing or generation, in general involving all processing steps from image acquisition to 3D model generation
-
- G—PHYSICS
- G06—COMPUTING; CALCULATING OR COUNTING
- G06T—IMAGE DATA PROCESSING OR GENERATION, IN GENERAL
- G06T2207/00—Indexing scheme for image analysis or image enhancement
- G06T2207/20—Special algorithmic details
- G06T2207/20081—Training; Learning
-
- G—PHYSICS
- G06—COMPUTING; CALCULATING OR COUNTING
- G06T—IMAGE DATA PROCESSING OR GENERATION, IN GENERAL
- G06T2207/00—Indexing scheme for image analysis or image enhancement
- G06T2207/20—Special algorithmic details
- G06T2207/20084—Artificial neural networks [ANN]
-
- G—PHYSICS
- G06—COMPUTING; CALCULATING OR COUNTING
- G06T—IMAGE DATA PROCESSING OR GENERATION, IN GENERAL
- G06T2207/00—Indexing scheme for image analysis or image enhancement
- G06T2207/30—Subject of image; Context of image processing
- G06T2207/30196—Human being; Person
-
- G—PHYSICS
- G06—COMPUTING; CALCULATING OR COUNTING
- G06T—IMAGE DATA PROCESSING OR GENERATION, IN GENERAL
- G06T2207/00—Indexing scheme for image analysis or image enhancement
- G06T2207/30—Subject of image; Context of image processing
- G06T2207/30244—Camera pose
Landscapes
- Engineering & Computer Science (AREA)
- Theoretical Computer Science (AREA)
- Physics & Mathematics (AREA)
- General Physics & Mathematics (AREA)
- Evolutionary Computation (AREA)
- Artificial Intelligence (AREA)
- Computer Vision & Pattern Recognition (AREA)
- General Health & Medical Sciences (AREA)
- Multimedia (AREA)
- Health & Medical Sciences (AREA)
- Software Systems (AREA)
- Computing Systems (AREA)
- Life Sciences & Earth Sciences (AREA)
- General Engineering & Computer Science (AREA)
- Data Mining & Analysis (AREA)
- Biomedical Technology (AREA)
- Molecular Biology (AREA)
- Mathematical Physics (AREA)
- Biophysics (AREA)
- Computational Linguistics (AREA)
- Databases & Information Systems (AREA)
- Medical Informatics (AREA)
- Human Computer Interaction (AREA)
- Computer Graphics (AREA)
- Psychiatry (AREA)
- Biodiversity & Conservation Biology (AREA)
- Social Psychology (AREA)
- Computer Hardware Design (AREA)
- Bioinformatics & Cheminformatics (AREA)
- Bioinformatics & Computational Biology (AREA)
- Evolutionary Biology (AREA)
- Image Analysis (AREA)
- Image Processing (AREA)
- Manipulator (AREA)
Abstract
Keypoint-based sampling for pose estimation. Systems and techniques are provided for determining one or more poses of one or more objects. For example, a process can include determining, using a machine learning system, a plurality of keypoints from an image. The plurality of keypoints is associated with at least one object in the image. The process can include determining a plurality of features from the machine learning system based on the plurality of keypoints. The process can include classifying the plurality of features into a plurality of joint types. The process can include determining pose parameters for the at least one object based on the plurality of joint types.
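The four steps named in the abstract (keypoints, features sampled at the keypoints, joint-type classification, pose parameters) can be illustrated with a minimal NumPy sketch. Everything concrete here is an assumption made for illustration and is not taken from the patent: the heatmap and feature map stand in for outputs of the machine learning system, keypoints are chosen as the top-k heatmap scores, the classifier is a plain linear layer, and the "pose parameters" are simply the mean keypoint location per joint type. All function names are hypothetical.

```python
import numpy as np

def top_k_keypoints(heatmap, k):
    """Return (k, 2) integer (row, col) coordinates of the k highest heatmap scores."""
    flat = np.argsort(heatmap.ravel())[::-1][:k]
    rows, cols = np.unravel_index(flat, heatmap.shape)
    return np.stack([rows, cols], axis=1)

def sample_features(feature_map, keypoints):
    """Pick the C-dimensional feature vector at each keypoint from an (H, W, C) map."""
    return feature_map[keypoints[:, 0], keypoints[:, 1]]

def classify_joint_types(features, weights):
    """Assign each feature to a joint type (assumed linear classifier, argmax of logits)."""
    return np.argmax(features @ weights, axis=1)

def pose_parameters(keypoints, joint_types, num_joint_types):
    """Toy pose: mean (row, col) per joint type; NaN where a type received no keypoints."""
    pose = np.full((num_joint_types, 2), np.nan)
    for j in range(num_joint_types):
        mask = joint_types == j
        if mask.any():
            pose[j] = keypoints[mask].mean(axis=0)
    return pose

def estimate_pose(heatmap, feature_map, weights, k):
    """Chain the four steps: keypoints -> features -> joint types -> pose parameters."""
    keypoints = top_k_keypoints(heatmap, k)
    features = sample_features(feature_map, keypoints)
    joint_types = classify_joint_types(features, weights)
    return pose_parameters(keypoints, joint_types, weights.shape[1])

if __name__ == "__main__":
    rng = np.random.default_rng(0)
    heatmap = rng.random((32, 32))          # stand-in for a predicted keypoint heatmap
    feature_map = rng.random((32, 32, 16))  # stand-in for intermediate network features
    weights = rng.standard_normal((16, 5))  # 5 hypothetical joint types
    print(estimate_pose(heatmap, feature_map, weights, k=20))
```

Sampling features only at keypoint locations, rather than over the whole image, is what keeps the later classification step small in this sketch; a real system would likely use learned components at each stage.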
Applications Claiming Priority (3)
Application Number | Priority Date | Filing Date | Title |
---|---|---|---|
US202163162305P | 2021-03-17 | 2021-03-17 | |
US17/457,408 US11804040B2 (en) | 2021-03-17 | 2021-12-02 | Keypoint-based sampling for pose estimation |
PCT/US2022/013754 WO2022197367A1 (en) | 2021-03-17 | 2022-01-25 | Keypoint-based sampling for pose estimation |
Publications (1)
Publication Number | Publication Date |
---|---|
BR112023018094A2 (pt) | 2023-10-03 |
Family
ID=83283894
Family Applications (1)
Application Number | Priority Date | Filing Date | Title |
---|---|---|---|
BR112023018094A BR112023018094A2 (pt) | 2021-03-17 | 2022-01-25 | Keypoint-based sampling for pose estimation |
Country Status (5)
Country | Link |
---|---|
US (1) | US11804040B2 (pt) |
EP (1) | EP4309151A1 (pt) |
KR (1) | KR20230156056A (pt) |
CN (1) | CN116997941A (pt) |
BR (1) | BR112023018094A2 (pt) |
Families Citing this family (8)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
US20220189195A1 (en) * | 2020-12-15 | 2022-06-16 | Digitrack Llc | Methods and apparatus for automatic hand pose estimation using machine learning |
US11804040B2 (en) * | 2021-03-17 | 2023-10-31 | Qualcomm Incorporated | Keypoint-based sampling for pose estimation |
US20230021408A1 (en) * | 2021-07-21 | 2023-01-26 | The Open University | Object tracking by event camera |
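KR20230079618A (ko) * | 2021-11-29 | 2023-06-07 | 삼성전자주식회사 | Method and apparatus for three-dimensional modeling of a human body |
CN115984384B (zh) * | 2023-03-20 | 2023-07-21 | 乐歌人体工学科技股份有限公司 | Desktop lifting control method based on facial pose image estimation |
CN116129228B (zh) * | 2023-04-19 | 2023-07-18 | 中国科学技术大学 | Training method for an image matching model, image matching method, and apparatus thereof |
CN116486489B (zh) * | 2023-06-26 | 2023-08-29 | 江西农业大学 | Three-dimensional hand-object pose estimation method and system based on semantic-aware graph convolution |
CN117292407B (zh) * | 2023-11-27 | 2024-03-26 | 安徽炬视科技有限公司 | 3D human pose estimation method and system |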
Family Cites Families (7)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
US9383895B1 (en) * | 2012-05-05 | 2016-07-05 | F. Vinayak | Methods and systems for interactively producing shapes in three-dimensional space |
EP3862852A1 (en) * | 2015-10-20 | 2021-08-11 | Magic Leap, Inc. | Selecting virtual objects in a three-dimensional space |
US10096125B1 (en) * | 2017-04-07 | 2018-10-09 | Adobe Systems Incorporated | Forecasting multiple poses based on a graphical image |
EP3467707B1 (en) | 2017-10-07 | 2024-03-13 | Tata Consultancy Services Limited | System and method for deep learning based hand gesture recognition in first person view |
US10796452B2 (en) * | 2017-12-03 | 2020-10-06 | Facebook, Inc. | Optimizations for structure mapping and up-sampling |
US11494932B2 (en) * | 2020-06-02 | 2022-11-08 | Naver Corporation | Distillation of part experts for whole-body pose estimation |
US11804040B2 (en) * | 2021-03-17 | 2023-10-31 | Qualcomm Incorporated | Keypoint-based sampling for pose estimation |
2021
- 2021-12-02: US application US17/457,408, published as US11804040B2 (en), status Active
2022
- 2022-01-25: EP application EP22704155.5A, published as EP4309151A1 (en), status Pending
- 2022-01-25: KR application KR1020237030455A, published as KR20230156056A (ko), status unknown
- 2022-01-25: BR application BR112023018094A, published as BR112023018094A2 (pt), status unknown
- 2022-01-25: CN application CN202280019720.5A, published as CN116997941A (zh), status Pending
Also Published As
Publication number | Publication date |
---|---|
US11804040B2 (en) | 2023-10-31 |
KR20230156056A (ko) | 2023-11-13 |
EP4309151A1 (en) | 2024-01-24 |
US20220301304A1 (en) | 2022-09-22 |
CN116997941A (zh) | 2023-11-03 |
Similar Documents
Publication | Publication Date | Title |
---|---|---|
BR112023018094A2 (pt) | | Keypoint-based sampling for pose estimation |
BR112021019996A2 (pt) | | Deep learning based training of instance segmentation via regression layers |
BR112022012980A2 (pt) | | Object size estimation using camera map and/or radar information |
EP4319054A3 (en) | | Identifying legitimate websites to remove false positives from domain discovery analysis |
BR112018077322A2 (pt) | | Systems and methods for identifying matching content |
EP4250140A3 (en) | | Intelligent digital assistant in a multi-tasking environment |
GB2580805A (en) | | Training data update |
ATE528724T1 (de) | | Hierarchical component-based object recognition |
DE60231005D1 (de) | | Systems, methods, and software for classifying documents |
BR112018074017A2 (pt) | | Image validation automation |
BR112016024885A2 (pt) | | Search intent identification |
BR112021006327A2 (pt) | | Automatic endoscope video enhancement |
BR102019021121A2 (pt) | | System and method for performing automated defect detection |
BR112022004552A2 (pt) | | History-based motion vector prediction |
BR112021001778A8 (pt) | | Systems and methods for preventing counterfeiting |
BR0006894A (pt) | | Methods for representing an object and a plurality of objects appearing in a still or video image by processing signals corresponding to the image, for searching for an object in a still or video image by processing signals corresponding to images, for representing objects in still or video images, and for searching for objects in still or video images; apparatus, computer program, computer system, and computer-readable storage medium |
BR112016024471A2 (pt) | | System and method for coding in block prediction mode for display stream compression (DSC) |
BR112019000310A2 (pt) | | Memory request arbitration |
EP4300501A3 (en) | | Methods of sequencing data read realignment |
BR112022024406A2 (pt) | | Machine learning model for analyzing pathology data from metastatic sites |
BR112023025190A2 (pt) | | Objection detection using images and message information |
BR112023018923A2 (pt) | | Computer-implemented method and computer-readable storage device |
BR112022019869A2 (pt) | | Conditional measurement reporting mode for positioning |
BR112022024803A2 (pt) | | Low-power visual tracking systems |
BR112023019163A2 (pt) | | Adaptive use of video models for holistic video understanding |