BR112019021201A8 - Machine learning image search - Google Patents

Machine learning image search

Info

Publication number
BR112019021201A8
Authority
BR
Brazil
Prior art keywords
machine learning
image search
learning image
image
representable
Prior art date
Application number
BR112019021201A
Other languages
English (en)
Other versions
BR112019021201A2 (pt)
Inventor
Samuel Perone Christian
da Silva Paula Thomas
Pereira Silveira Roberto
Original Assignee
Hewlett Packard Development Co
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
Filing date
Publication date
Application filed by Hewlett Packard Development Co
Publication of BR112019021201A2
Publication of BR112019021201A8

Classifications

    • G PHYSICS
    • G06 COMPUTING; CALCULATING OR COUNTING
    • G06F ELECTRIC DIGITAL DATA PROCESSING
    • G06F16/00 Information retrieval; Database structures therefor; File system structures therefor
    • G06F16/50 Information retrieval; Database structures therefor; File system structures therefor of still image data
    • G06F16/58 Retrieval characterised by using metadata, e.g. metadata not derived from the content or metadata generated manually
    • G06F16/583 Retrieval characterised by using metadata, e.g. metadata not derived from the content or metadata generated manually using metadata automatically derived from the content
    • G06F16/5846 Retrieval characterised by using metadata, e.g. metadata not derived from the content or metadata generated manually using metadata automatically derived from the content using extracted text
    • G PHYSICS
    • G06 COMPUTING; CALCULATING OR COUNTING
    • G06F ELECTRIC DIGITAL DATA PROCESSING
    • G06F16/00 Information retrieval; Database structures therefor; File system structures therefor
    • G06F16/50 Information retrieval; Database structures therefor; File system structures therefor of still image data
    • G06F16/51 Indexing; Data structures therefor; Storage structures
    • G PHYSICS
    • G06 COMPUTING; CALCULATING OR COUNTING
    • G06F ELECTRIC DIGITAL DATA PROCESSING
    • G06F16/00 Information retrieval; Database structures therefor; File system structures therefor
    • G06F16/50 Information retrieval; Database structures therefor; File system structures therefor of still image data
    • G06F16/56 Information retrieval; Database structures therefor; File system structures therefor of still image data having vectorial format
    • G PHYSICS
    • G06 COMPUTING; CALCULATING OR COUNTING
    • G06F ELECTRIC DIGITAL DATA PROCESSING
    • G06F18/00 Pattern recognition
    • G06F18/20 Analysing
    • G06F18/22 Matching criteria, e.g. proximity measures
    • G PHYSICS
    • G06 COMPUTING; CALCULATING OR COUNTING
    • G06F ELECTRIC DIGITAL DATA PROCESSING
    • G06F18/00 Pattern recognition
    • G06F18/20 Analysing
    • G06F18/25 Fusion techniques
    • G06F18/251 Fusion techniques of input or preprocessed data
    • G PHYSICS
    • G06 COMPUTING; CALCULATING OR COUNTING
    • G06F ELECTRIC DIGITAL DATA PROCESSING
    • G06F40/00 Handling natural language data
    • G06F40/20 Natural language analysis
    • G06F40/205 Parsing
    • G06F40/216 Parsing using statistical methods
    • G PHYSICS
    • G06 COMPUTING; CALCULATING OR COUNTING
    • G06F ELECTRIC DIGITAL DATA PROCESSING
    • G06F40/00 Handling natural language data
    • G06F40/20 Natural language analysis
    • G06F40/279 Recognition of textual entities
    • G06F40/284 Lexical analysis, e.g. tokenisation or collocates
    • G PHYSICS
    • G06 COMPUTING; CALCULATING OR COUNTING
    • G06F ELECTRIC DIGITAL DATA PROCESSING
    • G06F40/00 Handling natural language data
    • G06F40/20 Natural language analysis
    • G06F40/279 Recognition of textual entities
    • G06F40/289 Phrasal analysis, e.g. finite state techniques or chunking
    • G PHYSICS
    • G06 COMPUTING; CALCULATING OR COUNTING
    • G06F ELECTRIC DIGITAL DATA PROCESSING
    • G06F40/00 Handling natural language data
    • G06F40/30 Semantic analysis
    • G PHYSICS
    • G06 COMPUTING; CALCULATING OR COUNTING
    • G06F ELECTRIC DIGITAL DATA PROCESSING
    • G06F40/00 Handling natural language data
    • G06F40/40 Processing or translation of natural language
    • G PHYSICS
    • G06 COMPUTING; CALCULATING OR COUNTING
    • G06N COMPUTING ARRANGEMENTS BASED ON SPECIFIC COMPUTATIONAL MODELS
    • G06N3/00 Computing arrangements based on biological models
    • G06N3/02 Neural networks
    • G06N3/04 Architecture, e.g. interconnection topology
    • G06N3/044 Recurrent networks, e.g. Hopfield networks
    • G PHYSICS
    • G06 COMPUTING; CALCULATING OR COUNTING
    • G06N COMPUTING ARRANGEMENTS BASED ON SPECIFIC COMPUTATIONAL MODELS
    • G06N3/00 Computing arrangements based on biological models
    • G06N3/02 Neural networks
    • G06N3/04 Architecture, e.g. interconnection topology
    • G06N3/045 Combinations of networks
    • G PHYSICS
    • G06 COMPUTING; CALCULATING OR COUNTING
    • G06N COMPUTING ARRANGEMENTS BASED ON SPECIFIC COMPUTATIONAL MODELS
    • G06N3/00 Computing arrangements based on biological models
    • G06N3/02 Neural networks
    • G06N3/08 Learning methods
    • G PHYSICS
    • G06 COMPUTING; CALCULATING OR COUNTING
    • G06V IMAGE OR VIDEO RECOGNITION OR UNDERSTANDING
    • G06V10/00 Arrangements for image or video recognition or understanding
    • G06V10/40 Extraction of image or video features
    • G06V10/44 Local feature extraction by analysis of parts of the pattern, e.g. by detecting edges, contours, loops, corners, strokes or intersections; Connectivity analysis, e.g. of connected components
    • G06V10/443 Local feature extraction by analysis of parts of the pattern, e.g. by detecting edges, contours, loops, corners, strokes or intersections; Connectivity analysis, e.g. of connected components by matching or filtering
    • G06V10/449 Biologically inspired filters, e.g. difference of Gaussians [DoG] or Gabor filters
    • G06V10/451 Biologically inspired filters, e.g. difference of Gaussians [DoG] or Gabor filters with interaction between the filter responses, e.g. cortical complex cells
    • G06V10/454 Integrating the filters into a hierarchical structure, e.g. convolutional neural networks [CNN]
    • G PHYSICS
    • G06 COMPUTING; CALCULATING OR COUNTING
    • G06V IMAGE OR VIDEO RECOGNITION OR UNDERSTANDING
    • G06V10/00 Arrangements for image or video recognition or understanding
    • G06V10/70 Arrangements for image or video recognition or understanding using pattern recognition or machine learning
    • G06V10/74 Image or video pattern matching; Proximity measures in feature spaces
    • G06V10/761 Proximity, similarity or dissimilarity measures
    • G PHYSICS
    • G06 COMPUTING; CALCULATING OR COUNTING
    • G06V IMAGE OR VIDEO RECOGNITION OR UNDERSTANDING
    • G06V10/00 Arrangements for image or video recognition or understanding
    • G06V10/70 Arrangements for image or video recognition or understanding using pattern recognition or machine learning
    • G06V10/77 Processing image or video features in feature spaces; using data integration or data reduction, e.g. principal component analysis [PCA] or independent component analysis [ICA] or self-organising maps [SOM]; Blind source separation
    • G06V10/774 Generating sets of training patterns; Bootstrap methods, e.g. bagging or boosting
    • G PHYSICS
    • G06 COMPUTING; CALCULATING OR COUNTING
    • G06V IMAGE OR VIDEO RECOGNITION OR UNDERSTANDING
    • G06V10/00 Arrangements for image or video recognition or understanding
    • G06V10/70 Arrangements for image or video recognition or understanding using pattern recognition or machine learning
    • G06V10/77 Processing image or video features in feature spaces; using data integration or data reduction, e.g. principal component analysis [PCA] or independent component analysis [ICA] or self-organising maps [SOM]; Blind source separation
    • G06V10/80 Fusion, i.e. combining data from various sources at the sensor level, preprocessing level, feature extraction level or classification level
    • G06V10/803 Fusion, i.e. combining data from various sources at the sensor level, preprocessing level, feature extraction level or classification level of input or preprocessed data
    • G PHYSICS
    • G06 COMPUTING; CALCULATING OR COUNTING
    • G06V IMAGE OR VIDEO RECOGNITION OR UNDERSTANDING
    • G06V10/00 Arrangements for image or video recognition or understanding
    • G06V10/70 Arrangements for image or video recognition or understanding using pattern recognition or machine learning
    • G06V10/82 Arrangements for image or video recognition or understanding using pattern recognition or machine learning using neural networks
    • G PHYSICS
    • G10 MUSICAL INSTRUMENTS; ACOUSTICS
    • G10L SPEECH ANALYSIS TECHNIQUES OR SPEECH SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING TECHNIQUES; SPEECH OR AUDIO CODING OR DECODING
    • G10L15/00 Speech recognition
    • G10L15/26 Speech to text systems

Landscapes

  • Engineering & Computer Science (AREA)
  • Theoretical Computer Science (AREA)
  • Physics & Mathematics (AREA)
  • General Physics & Mathematics (AREA)
  • Artificial Intelligence (AREA)
  • Health & Medical Sciences (AREA)
  • General Health & Medical Sciences (AREA)
  • Evolutionary Computation (AREA)
  • Computer Vision & Pattern Recognition (AREA)
  • General Engineering & Computer Science (AREA)
  • Software Systems (AREA)
  • Data Mining & Analysis (AREA)
  • Databases & Information Systems (AREA)
  • Computing Systems (AREA)
  • Multimedia (AREA)
  • Computational Linguistics (AREA)
  • Life Sciences & Earth Sciences (AREA)
  • Medical Informatics (AREA)
  • Biomedical Technology (AREA)
  • Molecular Biology (AREA)
  • Audiology, Speech & Language Pathology (AREA)
  • Library & Information Science (AREA)
  • Mathematical Physics (AREA)
  • Biophysics (AREA)
  • Biodiversity & Conservation Biology (AREA)
  • Bioinformatics & Cheminformatics (AREA)
  • Bioinformatics & Computational Biology (AREA)
  • Evolutionary Biology (AREA)
  • Probability & Statistics with Applications (AREA)
  • Information Retrieval, Db Structures And Fs Structures Therefor (AREA)
  • Image Analysis (AREA)

Abstract

A machine learning encoder encodes images into image feature vectors representable in a multimodal space. The encoder also encodes a query into a textual feature vector representable in the multimodal space. The image feature vectors are compared to the textual feature vector in the multimodal space to identify an image matching the query based on the comparison.
BR112019021201A 2017-04-10 2017-04-10 Machine learning image search BR112019021201A8 (pt)
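
The flow described in the abstract can be illustrated with a minimal sketch. Everything concrete here is an assumption made for illustration and is not taken from the patent: the three-dimensional space, the hand-written vectors in WORD_VECTORS and IMAGE_VECTORS (stand-ins for the outputs of trained text and image encoders), the word-averaging query encoder, and the choice of cosine similarity as the comparison.

```python
import math

# Hypothetical word vectors standing in for a trained text encoder.
# A real system would learn these jointly with the image encoder.
WORD_VECTORS = {
    "dog":   [1.0, 0.0, 0.0],
    "beach": [0.0, 1.0, 0.0],
    "car":   [0.0, 0.0, 1.0],
}

# Hypothetical image feature vectors, as if an image encoder had already
# mapped each image into the same 3-dimensional multimodal space.
IMAGE_VECTORS = {
    "img_dog_on_beach.jpg": [0.7, 0.7, 0.0],
    "img_car.jpg":          [0.0, 0.1, 0.9],
    "img_dog.jpg":          [0.9, 0.1, 0.1],
}


def encode_query(query):
    """Average the vectors of known words into one textual feature vector."""
    vectors = [WORD_VECTORS[w] for w in query.lower().split() if w in WORD_VECTORS]
    if not vectors:
        raise ValueError("query contains no known words")
    dim = len(vectors[0])
    return [sum(v[i] for v in vectors) / len(vectors) for i in range(dim)]


def cosine_similarity(a, b):
    """Cosine of the angle between two non-zero vectors."""
    dot = sum(x * y for x, y in zip(a, b))
    norm_a = math.sqrt(sum(x * x for x in a))
    norm_b = math.sqrt(sum(y * y for y in b))
    return dot / (norm_a * norm_b)


def search(query, image_vectors=IMAGE_VECTORS):
    """Return image names ranked by similarity to the query, best first."""
    q = encode_query(query)
    scored = [(cosine_similarity(q, v), name) for name, v in image_vectors.items()]
    scored.sort(reverse=True)
    return [name for _, name in scored]
```

Calling search("dog beach") ranks the images by how close their vectors lie to the query vector in the shared space. The point of the shared space is that text and images become directly comparable without any text labels attached to the images.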

Applications Claiming Priority (1)

Application Number Priority Date Filing Date Title
PCT/US2017/026829 WO2018190792A1 (en) 2017-04-10 2017-04-10 Machine learning image search

Publications (2)

Publication Number Publication Date
BR112019021201A2 (pt) 2020-04-28
BR112019021201A8 (pt) 2023-04-04

Family

ID=63792678

Family Applications (1)

Application Number Title Priority Date Filing Date
BR112019021201A BR112019021201A8 (pt) 2017-04-10 2017-04-10 Machine learning image search

Country Status (5)

Country Link
US (1) US20210089571A1 (pt)
EP (1) EP3610414A4 (pt)
CN (1) CN110352419A (pt)
BR (1) BR112019021201A8 (pt)
WO (1) WO2018190792A1 (pt)

Families Citing this family (16)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN111033521A (zh) * 2017-05-16 2020-04-17 雅腾帝卡(私人)有限公司 Digital data detail processing for analysis of cultural artifacts
US11120334B1 (en) * 2017-09-08 2021-09-14 Snap Inc. Multimodal named entity recognition
US11308133B2 (en) * 2018-09-28 2022-04-19 International Business Machines Corporation Entity matching using visual information
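CN109871736B (zh) * 2018-11-23 2023-01-31 腾讯科技(深圳)有限公司 Method and apparatus for generating natural language description information
CN114375448A (zh) * 2019-06-07 2022-04-19 徕卡显微系统Cms有限公司 Systems and methods for processing biology-related data, systems and methods for controlling a microscope, and microscope
DE102020120479A1 (de) * 2019-08-07 2021-02-11 Harman Becker Automotive Systems Gmbh Fusion of road maps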
US11163760B2 (en) * 2019-12-17 2021-11-02 Mastercard International Incorporated Providing a data query service to a user based on natural language request data
US11321382B2 (en) * 2020-02-11 2022-05-03 International Business Machines Corporation Secure matching and identification of patterns
CN113282779A (zh) * 2020-02-19 2021-08-20 阿里巴巴集团控股有限公司 Image search method, apparatus, and device
CN111460231A (zh) * 2020-03-10 2020-07-28 华为技术有限公司 Electronic device, and search method and medium for electronic device
US11132514B1 (en) * 2020-03-16 2021-09-28 Hong Kong Applied Science and Technology Research Institute Company Limited Apparatus and method for applying image encoding recognition in natural language processing
US11501071B2 (en) 2020-07-08 2022-11-15 International Business Machines Corporation Word and image relationships in combined vector space
US11394929B2 (en) * 2020-09-11 2022-07-19 Samsung Electronics Co., Ltd. System and method for language-guided video analytics at the edge
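CN113076433B (zh) * 2021-04-26 2022-05-17 支付宝(杭州)信息技术有限公司 Retrieval method and apparatus for retrieval objects with multimodal information
CN113627508B (zh) * 2021-08-03 2022-09-02 北京百度网讯科技有限公司 Display scene recognition method, apparatus, device, and storage medium
CN114003758B (zh) * 2021-12-30 2022-03-08 航天宏康智能科技(北京)有限公司 Training method and apparatus for image retrieval model, and retrieval method and apparatus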

Family Cites Families (9)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
US6774917B1 (en) * 1999-03-11 2004-08-10 Fuji Xerox Co., Ltd. Methods and apparatuses for interactive similarity searching, retrieval, and browsing of video
US20080059452A1 (en) * 2006-08-04 2008-03-06 Metacarta, Inc. Systems and methods for obtaining and using information from map images
WO2008067191A2 (en) * 2006-11-27 2008-06-05 Designin Corporation Systems, methods, and computer program products for home and landscape design
CN102422319B (zh) * 2009-03-04 2014-04-30 公立大学法人大阪府立大学 Image retrieval method and image storage method
US9049117B1 (en) * 2009-10-21 2015-06-02 Narus, Inc. System and method for collecting and processing information of an internet user via IP-web correlation
US20120117051A1 (en) * 2010-11-05 2012-05-10 Microsoft Corporation Multi-modal approach to search query input
WO2012103191A2 (en) * 2011-01-26 2012-08-02 Veveo, Inc. Method of and system for error correction in multiple input modality search engines
IL226219A (en) * 2013-05-07 2016-10-31 Picscout (Israel) Ltd Efficient comparison of images for large groups of images
US9836671B2 (en) * 2015-08-28 2017-12-05 Microsoft Technology Licensing, Llc Discovery of semantic similarities between images and text

Also Published As

Publication number Publication date
WO2018190792A1 (en) 2018-10-18
EP3610414A4 (en) 2020-11-18
EP3610414A1 (en) 2020-02-19
US20210089571A1 (en) 2021-03-25
CN110352419A (zh) 2019-10-18
BR112019021201A2 (pt) 2020-04-28

Similar Documents

Publication Publication Date Title
BR112019021201A8 (pt) Machine learning image search
CO2017009675A2 (es) Motion vector derivation in video coding
MX2019015871A (es) Motion vector refinement for multi-reference prediction
WO2018014109A8 (en) System and method for analyzing and searching for features associated with objects
WO2019147851A3 (en) Systems and methods for generating machine learning applications
CL2019003677A1 (es) Motion vector prediction
BR112018000502A2 (pt) Context-based priors for object detection in images
WO2015200110A3 (en) Techniques for machine language translation of text from an image based on non-textual context information from the image
WO2018031112A9 (en) Systems and methods for determining feature point motion
MX2016014986A (es) Natural language image search
MX2016016289A (es) Learning and using contextual content retrieval rules for query disambiguation
GB2544660A (en) Visual interactive search
EP3799693A4 (en) FULL PIXEL RESOLUTION MOTION VECTOR REFINEMENT SEARCH
MY194555A (en) Method and Apparatus for Image Coding and Decoding Through Inter Prediction
BR112016014522A2 (pt) System and method for stabilizing the display of an object tracking box
MX2020009560A (es) Ranking and presenting search engine results based on category-specific ranking models
WO2014107485A3 (en) Adjacent search results exploration
WO2015170191A3 (en) Method and apparatus for screening promotion keywords
PH12019501920A1 (en) Image processing method and apparatus
CL2021000671A1 (es) Image signal encoding/decoding method and device therefor
EP3977739A4 (en) REORGANIZATION OF MERGER CANDIDATES ACCORDING TO A GLOBAL MOTION VECTOR
MX346698B (es) Clustering method and device
GB2564785A (en) Search navigation element
MX2020006142A (es) Processing an image
BR112021024670A2 (pt) Method and apparatus for cosmetic product recommendation

Legal Events

Date Code Title Description
B350 Update of information on the portal [chapter 15.35 patent gazette]
B06W Patent application suspended after preliminary examination (for patents with searches from other patent authorities) [chapter 6.23 patent gazette]
B15K Others concerning applications: alteration of classification

Free format text: THE PREVIOUS CLASSIFICATIONS WERE: G06K 9/62, G06F 17/30, G06F 15/18

Ipc: G06V 10/44 (2022.01), G06V 10/74 (2022.01), G06V 1

B11B Dismissal acc. art. 36, par. 1 of IPL - no reply within 90 days to fulfill the necessary requirements