BR112019021201A8 - Machine learning image search - Google Patents
Machine learning image search
Info
- Publication number
- BR112019021201A8
- Authority
- BR
- Brazil
- Prior art keywords
- machine learning
- image search
- learning image
- image
- representable
- Prior art date
Classifications
-
- G—PHYSICS
- G06—COMPUTING; CALCULATING OR COUNTING
- G06F—ELECTRIC DIGITAL DATA PROCESSING
- G06F16/00—Information retrieval; Database structures therefor; File system structures therefor
- G06F16/50—Information retrieval; Database structures therefor; File system structures therefor of still image data
- G06F16/58—Retrieval characterised by using metadata, e.g. metadata not derived from the content or metadata generated manually
- G06F16/583—Retrieval characterised by using metadata, e.g. metadata not derived from the content or metadata generated manually using metadata automatically derived from the content
- G06F16/5846—Retrieval characterised by using metadata, e.g. metadata not derived from the content or metadata generated manually using metadata automatically derived from the content using extracted text
-
- G—PHYSICS
- G06—COMPUTING; CALCULATING OR COUNTING
- G06F—ELECTRIC DIGITAL DATA PROCESSING
- G06F16/00—Information retrieval; Database structures therefor; File system structures therefor
- G06F16/50—Information retrieval; Database structures therefor; File system structures therefor of still image data
- G06F16/51—Indexing; Data structures therefor; Storage structures
-
- G—PHYSICS
- G06—COMPUTING; CALCULATING OR COUNTING
- G06F—ELECTRIC DIGITAL DATA PROCESSING
- G06F16/00—Information retrieval; Database structures therefor; File system structures therefor
- G06F16/50—Information retrieval; Database structures therefor; File system structures therefor of still image data
- G06F16/56—Information retrieval; Database structures therefor; File system structures therefor of still image data having vectorial format
-
- G—PHYSICS
- G06—COMPUTING; CALCULATING OR COUNTING
- G06F—ELECTRIC DIGITAL DATA PROCESSING
- G06F18/00—Pattern recognition
- G06F18/20—Analysing
- G06F18/22—Matching criteria, e.g. proximity measures
-
- G—PHYSICS
- G06—COMPUTING; CALCULATING OR COUNTING
- G06F—ELECTRIC DIGITAL DATA PROCESSING
- G06F18/00—Pattern recognition
- G06F18/20—Analysing
- G06F18/25—Fusion techniques
- G06F18/251—Fusion techniques of input or preprocessed data
-
- G—PHYSICS
- G06—COMPUTING; CALCULATING OR COUNTING
- G06F—ELECTRIC DIGITAL DATA PROCESSING
- G06F40/00—Handling natural language data
- G06F40/20—Natural language analysis
- G06F40/205—Parsing
- G06F40/216—Parsing using statistical methods
-
- G—PHYSICS
- G06—COMPUTING; CALCULATING OR COUNTING
- G06F—ELECTRIC DIGITAL DATA PROCESSING
- G06F40/00—Handling natural language data
- G06F40/20—Natural language analysis
- G06F40/279—Recognition of textual entities
- G06F40/284—Lexical analysis, e.g. tokenisation or collocates
-
- G—PHYSICS
- G06—COMPUTING; CALCULATING OR COUNTING
- G06F—ELECTRIC DIGITAL DATA PROCESSING
- G06F40/00—Handling natural language data
- G06F40/20—Natural language analysis
- G06F40/279—Recognition of textual entities
- G06F40/289—Phrasal analysis, e.g. finite state techniques or chunking
-
- G—PHYSICS
- G06—COMPUTING; CALCULATING OR COUNTING
- G06F—ELECTRIC DIGITAL DATA PROCESSING
- G06F40/00—Handling natural language data
- G06F40/30—Semantic analysis
-
- G—PHYSICS
- G06—COMPUTING; CALCULATING OR COUNTING
- G06F—ELECTRIC DIGITAL DATA PROCESSING
- G06F40/00—Handling natural language data
- G06F40/40—Processing or translation of natural language
-
- G—PHYSICS
- G06—COMPUTING; CALCULATING OR COUNTING
- G06N—COMPUTING ARRANGEMENTS BASED ON SPECIFIC COMPUTATIONAL MODELS
- G06N3/00—Computing arrangements based on biological models
- G06N3/02—Neural networks
- G06N3/04—Architecture, e.g. interconnection topology
- G06N3/044—Recurrent networks, e.g. Hopfield networks
-
- G—PHYSICS
- G06—COMPUTING; CALCULATING OR COUNTING
- G06N—COMPUTING ARRANGEMENTS BASED ON SPECIFIC COMPUTATIONAL MODELS
- G06N3/00—Computing arrangements based on biological models
- G06N3/02—Neural networks
- G06N3/04—Architecture, e.g. interconnection topology
- G06N3/045—Combinations of networks
-
- G—PHYSICS
- G06—COMPUTING; CALCULATING OR COUNTING
- G06N—COMPUTING ARRANGEMENTS BASED ON SPECIFIC COMPUTATIONAL MODELS
- G06N3/00—Computing arrangements based on biological models
- G06N3/02—Neural networks
- G06N3/08—Learning methods
-
- G—PHYSICS
- G06—COMPUTING; CALCULATING OR COUNTING
- G06V—IMAGE OR VIDEO RECOGNITION OR UNDERSTANDING
- G06V10/00—Arrangements for image or video recognition or understanding
- G06V10/40—Extraction of image or video features
- G06V10/44—Local feature extraction by analysis of parts of the pattern, e.g. by detecting edges, contours, loops, corners, strokes or intersections; Connectivity analysis, e.g. of connected components
- G06V10/443—Local feature extraction by analysis of parts of the pattern, e.g. by detecting edges, contours, loops, corners, strokes or intersections; Connectivity analysis, e.g. of connected components by matching or filtering
- G06V10/449—Biologically inspired filters, e.g. difference of Gaussians [DoG] or Gabor filters
- G06V10/451—Biologically inspired filters, e.g. difference of Gaussians [DoG] or Gabor filters with interaction between the filter responses, e.g. cortical complex cells
- G06V10/454—Integrating the filters into a hierarchical structure, e.g. convolutional neural networks [CNN]
-
- G—PHYSICS
- G06—COMPUTING; CALCULATING OR COUNTING
- G06V—IMAGE OR VIDEO RECOGNITION OR UNDERSTANDING
- G06V10/00—Arrangements for image or video recognition or understanding
- G06V10/70—Arrangements for image or video recognition or understanding using pattern recognition or machine learning
- G06V10/74—Image or video pattern matching; Proximity measures in feature spaces
- G06V10/761—Proximity, similarity or dissimilarity measures
-
- G—PHYSICS
- G06—COMPUTING; CALCULATING OR COUNTING
- G06V—IMAGE OR VIDEO RECOGNITION OR UNDERSTANDING
- G06V10/00—Arrangements for image or video recognition or understanding
- G06V10/70—Arrangements for image or video recognition or understanding using pattern recognition or machine learning
- G06V10/77—Processing image or video features in feature spaces; using data integration or data reduction, e.g. principal component analysis [PCA] or independent component analysis [ICA] or self-organising maps [SOM]; Blind source separation
- G06V10/774—Generating sets of training patterns; Bootstrap methods, e.g. bagging or boosting
-
- G—PHYSICS
- G06—COMPUTING; CALCULATING OR COUNTING
- G06V—IMAGE OR VIDEO RECOGNITION OR UNDERSTANDING
- G06V10/00—Arrangements for image or video recognition or understanding
- G06V10/70—Arrangements for image or video recognition or understanding using pattern recognition or machine learning
- G06V10/77—Processing image or video features in feature spaces; using data integration or data reduction, e.g. principal component analysis [PCA] or independent component analysis [ICA] or self-organising maps [SOM]; Blind source separation
- G06V10/80—Fusion, i.e. combining data from various sources at the sensor level, preprocessing level, feature extraction level or classification level
- G06V10/803—Fusion, i.e. combining data from various sources at the sensor level, preprocessing level, feature extraction level or classification level of input or preprocessed data
-
- G—PHYSICS
- G06—COMPUTING; CALCULATING OR COUNTING
- G06V—IMAGE OR VIDEO RECOGNITION OR UNDERSTANDING
- G06V10/00—Arrangements for image or video recognition or understanding
- G06V10/70—Arrangements for image or video recognition or understanding using pattern recognition or machine learning
- G06V10/82—Arrangements for image or video recognition or understanding using pattern recognition or machine learning using neural networks
-
- G—PHYSICS
- G10—MUSICAL INSTRUMENTS; ACOUSTICS
- G10L—SPEECH ANALYSIS TECHNIQUES OR SPEECH SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING TECHNIQUES; SPEECH OR AUDIO CODING OR DECODING
- G10L15/00—Speech recognition
- G10L15/26—Speech to text systems
Landscapes
- Engineering & Computer Science (AREA)
- Theoretical Computer Science (AREA)
- Physics & Mathematics (AREA)
- General Physics & Mathematics (AREA)
- Artificial Intelligence (AREA)
- Health & Medical Sciences (AREA)
- General Health & Medical Sciences (AREA)
- Evolutionary Computation (AREA)
- Computer Vision & Pattern Recognition (AREA)
- General Engineering & Computer Science (AREA)
- Software Systems (AREA)
- Data Mining & Analysis (AREA)
- Databases & Information Systems (AREA)
- Computing Systems (AREA)
- Multimedia (AREA)
- Computational Linguistics (AREA)
- Life Sciences & Earth Sciences (AREA)
- Medical Informatics (AREA)
- Biomedical Technology (AREA)
- Molecular Biology (AREA)
- Audiology, Speech & Language Pathology (AREA)
- Library & Information Science (AREA)
- Mathematical Physics (AREA)
- Biophysics (AREA)
- Biodiversity & Conservation Biology (AREA)
- Bioinformatics & Cheminformatics (AREA)
- Bioinformatics & Computational Biology (AREA)
- Evolutionary Biology (AREA)
- Probability & Statistics with Applications (AREA)
- Information Retrieval, Db Structures And Fs Structures Therefor (AREA)
- Image Analysis (AREA)
Abstract
A machine learning encoder encodes images into image feature vectors representable in a multimodal space. The encoder also encodes a query into a textual feature vector representable in the multimodal space. The image feature vectors are compared to the textual feature vector in the multimodal space to identify an image matching the query based on the comparison.
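The comparison step described in the abstract can be illustrated with a minimal sketch. The abstract does not name a similarity measure, so cosine similarity is an assumption here, and the three-dimensional vectors, file names, and the helper names `cosine_similarity` and `rank_images` are hypothetical stand-ins for whatever a trained encoder would produce.

```python
import math


def cosine_similarity(a, b):
    """Cosine of the angle between two equal-length vectors."""
    if len(a) != len(b):
        raise ValueError("vectors must have the same dimension")
    dot = sum(x * y for x, y in zip(a, b))
    norm_a = math.sqrt(sum(x * x for x in a))
    norm_b = math.sqrt(sum(y * y for y in b))
    if norm_a == 0.0 or norm_b == 0.0:
        return 0.0  # treat a zero vector as matching nothing
    return dot / (norm_a * norm_b)


def rank_images(query_vector, image_vectors):
    """Return (image_id, score) pairs sorted from best to worst match."""
    scored = [
        (image_id, cosine_similarity(query_vector, vector))
        for image_id, vector in image_vectors.items()
    ]
    return sorted(scored, key=lambda pair: pair[1], reverse=True)


# Hypothetical embeddings standing in for encoder output (assumption:
# images and the query already live in the same multimodal space).
IMAGE_VECTORS = {
    "beach.jpg": [0.9, 0.1, 0.0],
    "forest.jpg": [0.1, 0.9, 0.2],
    "city.jpg": [0.0, 0.2, 0.9],
}
QUERY_VECTOR = [0.8, 0.2, 0.1]  # stand-in for an encoded text query

ranking = rank_images(QUERY_VECTOR, IMAGE_VECTORS)
best_image = ranking[0][0]  # the image identified as matching the query
```

With these made-up vectors the query lies closest to `beach.jpg`. A real system would replace the hand-written lists with encoder outputs and would probably use an approximate nearest-neighbour index rather than a full scan.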
Applications Claiming Priority (1)
Application Number | Priority Date | Filing Date | Title |
---|---|---|---|
PCT/US2017/026829 WO2018190792A1 (en) | 2017-04-10 | 2017-04-10 | Machine learning image search |
Publications (2)
Publication Number | Publication Date |
---|---|
BR112019021201A2 (pt) | 2020-04-28 |
BR112019021201A8 (pt) | 2023-04-04 |
Family
ID=63792678
Family Applications (1)
Application Number | Priority Date | Filing Date | Title |
---|---|---|---|
BR112019021201A BR112019021201A8 (pt) | 2017-04-10 | 2017-04-10 | Machine learning image search |
Country Status (5)
Country | Link |
---|---|
US (1) | US20210089571A1 (pt) |
EP (1) | EP3610414A4 (pt) |
CN (1) | CN110352419A (pt) |
BR (1) | BR112019021201A8 (pt) |
WO (1) | WO2018190792A1 (pt) |
Families Citing this family (16)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
CN111033521A (zh) * | 2017-05-16 | 2020-04-17 | 雅腾帝卡(私人)有限公司 | Digital data detail processing for analysis of cultural artifacts |
US11120334B1 (en) * | 2017-09-08 | 2021-09-14 | Snap Inc. | Multimodal named entity recognition |
US11308133B2 (en) * | 2018-09-28 | 2022-04-19 | International Business Machines Corporation | Entity matching using visual information |
CN109871736B (zh) * | 2018-11-23 | 2023-01-31 | 腾讯科技(深圳)有限公司 | Method and apparatus for generating natural language description information |
CN114375448A (zh) * | 2019-06-07 | 2022-04-19 | 徕卡显微系统Cms有限公司 | System and method for processing biology-related data, system and method for controlling a microscope, and microscope |
DE102020120479A1 (de) * | 2019-08-07 | 2021-02-11 | Harman Becker Automotive Systems Gmbh | Fusion of road maps |
US11163760B2 (en) * | 2019-12-17 | 2021-11-02 | Mastercard International Incorporated | Providing a data query service to a user based on natural language request data |
US11321382B2 (en) * | 2020-02-11 | 2022-05-03 | International Business Machines Corporation | Secure matching and identification of patterns |
CN113282779A (zh) * | 2020-02-19 | 2021-08-20 | 阿里巴巴集团控股有限公司 | Image search method, apparatus, and device |
CN111460231A (zh) * | 2020-03-10 | 2020-07-28 | 华为技术有限公司 | Electronic device, and search method and medium for an electronic device |
US11132514B1 (en) * | 2020-03-16 | 2021-09-28 | Hong Kong Applied Science and Technology Research Institute Company Limited | Apparatus and method for applying image encoding recognition in natural language processing |
US11501071B2 (en) | 2020-07-08 | 2022-11-15 | International Business Machines Corporation | Word and image relationships in combined vector space |
US11394929B2 (en) * | 2020-09-11 | 2022-07-19 | Samsung Electronics Co., Ltd. | System and method for language-guided video analytics at the edge |
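CN113076433B (zh) * | 2021-04-26 | 2022-05-17 | 支付宝(杭州)信息技术有限公司 | Method and apparatus for retrieving retrieval objects having multimodal information |
CN113627508B (zh) * | 2021-08-03 | 2022-09-02 | 北京百度网讯科技有限公司 | Display scene recognition method, apparatus, device, and storage medium |
CN114003758B (zh) * | 2021-12-30 | 2022-03-08 | 航天宏康智能科技(北京)有限公司 | Training method and apparatus for an image retrieval model, and retrieval method and apparatus |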
Family Cites Families (9)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
US6774917B1 (en) * | 1999-03-11 | 2004-08-10 | Fuji Xerox Co., Ltd. | Methods and apparatuses for interactive similarity searching, retrieval, and browsing of video |
US20080059452A1 (en) * | 2006-08-04 | 2008-03-06 | Metacarta, Inc. | Systems and methods for obtaining and using information from map images |
WO2008067191A2 (en) * | 2006-11-27 | 2008-06-05 | Designin Corporation | Systems, methods, and computer program products for home and landscape design |
CN102422319B (zh) * | 2009-03-04 | 2014-04-30 | 公立大学法人大阪府立大学 | Image retrieval method and image storage method |
US9049117B1 (en) * | 2009-10-21 | 2015-06-02 | Narus, Inc. | System and method for collecting and processing information of an internet user via IP-web correlation |
US20120117051A1 (en) * | 2010-11-05 | 2012-05-10 | Microsoft Corporation | Multi-modal approach to search query input |
WO2012103191A2 (en) * | 2011-01-26 | 2012-08-02 | Veveo, Inc. | Method of and system for error correction in multiple input modality search engines |
IL226219A (en) * | 2013-05-07 | 2016-10-31 | Picscout (Israel) Ltd | Efficient comparison of images for large groups of images |
US9836671B2 (en) * | 2015-08-28 | 2017-12-05 | Microsoft Technology Licensing, Llc | Discovery of semantic similarities between images and text |
2017
- 2017-04-10 US US16/498,952 (US20210089571A1): not active, Abandoned
- 2017-04-10 BR BR112019021201A (BR112019021201A8): not active, Application Discontinuation
- 2017-04-10 WO PCT/US2017/026829 (WO2018190792A1): status unknown
- 2017-04-10 EP EP17905693.2A (EP3610414A4): not active, Withdrawn
- 2017-04-10 CN CN201780087676.0A (CN110352419A): active, Pending
Also Published As
Publication number | Publication date |
---|---|
WO2018190792A1 (en) | 2018-10-18 |
EP3610414A4 (en) | 2020-11-18 |
EP3610414A1 (en) | 2020-02-19 |
US20210089571A1 (en) | 2021-03-25 |
CN110352419A (zh) | 2019-10-18 |
BR112019021201A2 (pt) | 2020-04-28 |
Similar Documents
Publication | Title | Publication Date |
---|---|---|
BR112019021201A8 (pt) | Machine learning image search | |
CO2017009675A2 (es) | Motion vector derivation in video coding | |
MX2019015871A (es) | Motion vector refinement for multi-reference prediction | |
WO2018014109A8 (en) | System and method for analyzing and searching for features associated with objects | |
WO2019147851A3 (en) | Systems and methods for generating machine learning applications | |
CL2019003677A1 (es) | Motion vector prediction | |
BR112018000502A2 (pt) | Context-based priors for object detection in images | |
WO2015200110A3 (en) | Techniques for machine language translation of text from an image based on non-textual context information from the image | |
WO2018031112A9 (en) | Systems and methods for determining feature point motion | |
MX2016014986A (es) | Natural language image search | |
MX2016016289A (es) | Learning and using contextual content retrieval rules for query disambiguation | |
GB2544660A (en) | Visual interactive search | |
EP3799693A4 (en) | FULL PIXEL RESOLUTION MOTION VECTOR REFINEMENT SEARCH | |
MY194555A (en) | Method and Apparatus for Image Coding and Decoding Through Inter Prediction | |
BR112016014522A2 (pt) | System and method for stabilizing the display of an object tracking box | |
MX2020009560A (es) | Ranking and presentation of search engine results based on category-specific ranking models | |
WO2014107485A3 (en) | Adjacent search results exploration | |
WO2015170191A3 (en) | Method and apparatus for screening promotion keywords | |
PH12019501920A1 (en) | Image processing method and apparatus | |
CL2021000671A1 (es) | Image signal encoding/decoding method and device for the same | |
EP3977739A4 (en) | REORGANIZATION OF MERGER CANDIDATES ACCORDING TO A GLOBAL MOTION VECTOR CROSS-REFERENCE TO RELATED APPLICATIONS | |
MX346698B (es) | Clustering method and device | |
GB2564785A (en) | Search navigation element | |
MX2020006142A (es) | Processing an image | |
BR112021024670A2 (pt) | Method and apparatus for cosmetic product recommendation | |
Legal Events
Date | Code | Title | Description |
---|---|---|---|
| B350 | Update of information on the portal [chapter 15.35 patent gazette] | |
| B06W | Patent application suspended after preliminary examination (for patents with searches from other patent authorities) [chapter 6.23 patent gazette] | |
| B15K | Others concerning applications: alteration of classification | Free format text: THE PREVIOUS CLASSIFICATIONS WERE: G06K 9/62, G06F 17/30, G06F 15/18 Ipc: G06V 10/44 (2022.01), G06V 10/74 (2022.01), G06V 1 |
| B11B | Dismissal acc. art. 36, par 1 of ipl - no reply within 90 days to fulfil the necessary requirements | |