CN103930903A - Methods and apparatuses for mobile visual search - Google Patents

Methods and apparatuses for mobile visual search

Info

Publication number
CN103930903A
Authority
CN
China
Prior art keywords
word
vector
aggregated
visual
processor
Prior art date
Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
Pending
Application number
CN201280054713.5A
Other languages
English (en)
Chinese (zh)
Inventor
R. Vedantham
R. Grzeszczuk
D. M. Chen
S-H. Tsai
B. Girod
Current Assignee (The listed assignees may be inaccurate. Google has not performed a legal analysis and makes no representation or warranty as to the accuracy of the list.)
Nokia Inc
Leland Stanford Junior University
Original Assignee
Nokia Inc
Leland Stanford Junior University
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
2011-11-07
Filing date
2012-11-01
Publication date
2014-07-16
Application filed by Nokia Inc, Leland Stanford Junior University
Publication of CN103930903A

Classifications

    • G PHYSICS
    • G06 COMPUTING OR CALCULATING; COUNTING
    • G06V IMAGE OR VIDEO RECOGNITION OR UNDERSTANDING
    • G06V10/00 Arrangements for image or video recognition or understanding
    • G06V10/10 Image acquisition
    • G06V10/17 Image acquisition using hand-held instruments
    • G PHYSICS
    • G06 COMPUTING OR CALCULATING; COUNTING
    • G06F ELECTRIC DIGITAL DATA PROCESSING
    • G06F16/00 Information retrieval; Database structures therefor; File system structures therefor
    • G06F16/50 Information retrieval; Database structures therefor; File system structures therefor of still image data
    • G06F16/58 Retrieval characterised by using metadata, e.g. metadata not derived from the content or metadata generated manually
    • G06F16/583 Retrieval characterised by using metadata, e.g. metadata not derived from the content or metadata generated manually using metadata automatically derived from the content
    • G PHYSICS
    • G06 COMPUTING OR CALCULATING; COUNTING
    • G06F ELECTRIC DIGITAL DATA PROCESSING
    • G06F18/00 Pattern recognition
    • G06F18/20 Analysing
    • G06F18/21 Design or setup of recognition systems or techniques; Extraction of features in feature space; Blind source separation
    • G06F18/213 Feature extraction, e.g. by transforming the feature space; Summarisation; Mappings, e.g. subspace methods
    • G06F18/2132 Feature extraction, e.g. by transforming the feature space; Summarisation; Mappings, e.g. subspace methods based on discrimination criteria, e.g. discriminant analysis
    • G PHYSICS
    • G06 COMPUTING OR CALCULATING; COUNTING
    • G06F ELECTRIC DIGITAL DATA PROCESSING
    • G06F18/00 Pattern recognition
    • G06F18/20 Analysing
    • G06F18/28 Determining representative reference patterns, e.g. by averaging or distorting; Generating dictionaries
    • G PHYSICS
    • G06 COMPUTING OR CALCULATING; COUNTING
    • G06V IMAGE OR VIDEO RECOGNITION OR UNDERSTANDING
    • G06V10/00 Arrangements for image or video recognition or understanding
    • G06V10/40 Extraction of image or video features
    • G06V10/46 Descriptors for shape, contour or point-related descriptors, e.g. scale invariant feature transform [SIFT] or bags of words [BoW]; Salient regional features
    • G06V10/462 Salient features, e.g. scale invariant feature transforms [SIFT]
    • G06V10/464 Salient features, e.g. scale invariant feature transforms [SIFT] using a plurality of salient features, e.g. bag-of-words [BoW] representations
    • G PHYSICS
    • G06 COMPUTING OR CALCULATING; COUNTING
    • G06V IMAGE OR VIDEO RECOGNITION OR UNDERSTANDING
    • G06V10/00 Arrangements for image or video recognition or understanding
    • G06V10/70 Arrangements for image or video recognition or understanding using pattern recognition or machine learning
    • G06V10/77 Processing image or video features in feature spaces; using data integration or data reduction, e.g. principal component analysis [PCA] or independent component analysis [ICA] or self-organising maps [SOM]; Blind source separation
    • G06V10/7715 Feature extraction, e.g. by transforming the feature space, e.g. multi-dimensional scaling [MDS]; Mappings, e.g. subspace methods
    • G PHYSICS
    • G06 COMPUTING OR CALCULATING; COUNTING
    • G06V IMAGE OR VIDEO RECOGNITION OR UNDERSTANDING
    • G06V10/00 Arrangements for image or video recognition or understanding
    • G06V10/70 Arrangements for image or video recognition or understanding using pattern recognition or machine learning
    • G06V10/77 Processing image or video features in feature spaces; using data integration or data reduction, e.g. principal component analysis [PCA] or independent component analysis [ICA] or self-organising maps [SOM]; Blind source separation
    • G06V10/772 Determining representative reference patterns, e.g. averaging or distorting patterns; Generating dictionaries
    • G PHYSICS
    • G06 COMPUTING OR CALCULATING; COUNTING
    • G06F ELECTRIC DIGITAL DATA PROCESSING
    • G06F2218/00 Aspects of pattern recognition specially adapted for signal processing
    • G PHYSICS
    • G06 COMPUTING OR CALCULATING; COUNTING
    • G06F ELECTRIC DIGITAL DATA PROCESSING
    • G06F2218/00 Aspects of pattern recognition specially adapted for signal processing
    • G06F2218/08 Feature extraction
    • G PHYSICS
    • G06 COMPUTING OR CALCULATING; COUNTING
    • G06F ELECTRIC DIGITAL DATA PROCESSING
    • G06F2218/00 Aspects of pattern recognition specially adapted for signal processing
    • G06F2218/12 Classification; Matching

Landscapes

  • Engineering & Computer Science (AREA)
  • Theoretical Computer Science (AREA)
  • Computer Vision & Pattern Recognition (AREA)
  • General Physics & Mathematics (AREA)
  • Physics & Mathematics (AREA)
  • Data Mining & Analysis (AREA)
  • Multimedia (AREA)
  • Evolutionary Computation (AREA)
  • Artificial Intelligence (AREA)
  • Databases & Information Systems (AREA)
  • General Engineering & Computer Science (AREA)
  • Library & Information Science (AREA)
  • Health & Medical Sciences (AREA)
  • Computing Systems (AREA)
  • General Health & Medical Sciences (AREA)
  • Software Systems (AREA)
  • Medical Informatics (AREA)
  • Life Sciences & Earth Sciences (AREA)
  • Bioinformatics & Cheminformatics (AREA)
  • Bioinformatics & Computational Biology (AREA)
  • Evolutionary Biology (AREA)
  • Information Retrieval, Db Structures And Fs Structures Therefor (AREA)
  • Collating Specific Patterns (AREA)
  • Image Analysis (AREA)
CN201280054713.5A 2011-11-07 2012-11-01 Methods and apparatuses for mobile visual search Pending CN103930903A (zh)

Applications Claiming Priority (3)

Application Number Priority Date Filing Date Title
US13/290,658 2011-11-07
US13/290,658 US20130114900A1 (en) 2011-11-07 2011-11-07 Methods and apparatuses for mobile visual search
PCT/FI2012/051062 WO2013068638A2 (en) 2011-11-07 2012-11-01 Methods and apparatuses for mobile visual search

Publications (1)

Publication Number Publication Date
CN103930903A (zh) 2014-07-16

Family

ID=48223750

Family Applications (1)

Application Number Title Priority Date Filing Date
CN201280054713.5A Pending CN103930903A (zh) Methods and apparatuses for mobile visual search

Country Status (5)

Country Link
US (1) US20130114900A1 (en)
EP (1) EP2776981A4 (en)
CN (1) CN103930903A (en)
IN (1) IN2014CN04188A (en)
WO (1) WO2013068638A2 (en)

Families Citing this family (11)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
KR20140093957A (ko) * 2011-11-24 2014-07-29 Microsoft Corporation Interactive multi-modal image search technique
CN103164713B (zh) 2011-12-12 2016-04-06 Alibaba Group Holding Limited Image classification method and device
KR20140102038A (ko) * 2013-02-13 2014-08-21 Samsung Electronics Co., Ltd. Image matching apparatus and image matching method
US9922271B2 (en) 2015-03-20 2018-03-20 Netra, Inc. Object detection and classification
US9760792B2 (en) 2015-03-20 2017-09-12 Netra, Inc. Object detection and classification
US11004131B2 (en) * 2016-10-16 2021-05-11 Ebay Inc. Intelligent online personal assistant with multi-turn dialog based on visual search
US11748978B2 (en) 2016-10-16 2023-09-05 Ebay Inc. Intelligent online personal assistant with offline visual search database
US10970768B2 (en) 2016-11-11 2021-04-06 Ebay Inc. Method, medium, and system for image text localization and comparison
US11120070B2 (en) * 2018-05-21 2021-09-14 Microsoft Technology Licensing, Llc System and method for attribute-based visual search over a computer communication network
US10997459B2 (en) * 2019-05-23 2021-05-04 Webkontrol, Inc. Video content indexing and searching
CN111323037B (zh) * 2020-02-28 2022-07-05 海博(苏州)机器人科技有限公司 Voronoi path planning algorithm with novel skeleton extraction for mobile robots

Family Cites Families (8)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
US7657126B2 (en) * 2005-05-09 2010-02-02 Like.Com System and method for search portions of objects in images and features thereof
US20070122041A1 (en) * 2005-11-29 2007-05-31 Baback Moghaddam Spectral method for sparse linear discriminant analysis
US7860317B2 (en) * 2006-04-04 2010-12-28 Microsoft Corporation Generating search results based on duplicate image detection
US9202140B2 (en) * 2008-09-05 2015-12-01 Siemens Medical Solutions Usa, Inc. Quotient appearance manifold mapping for image classification
WO2010071617A1 (en) * 2008-12-15 2010-06-24 Thomson Licensing Method and apparatus for performing image processing
US8538102B2 (en) * 2008-12-17 2013-09-17 Synarc Inc Optimised region of interest selection
KR101640077B1 (ko) * 2009-06-05 2016-07-15 Samsung Electronics Co., Ltd. Video sensor-based apparatus and method for modeling and recognizing human motion and facial expressions
US8571306B2 (en) * 2011-08-10 2013-10-29 Qualcomm Incorporated Coding of feature location information

Non-Patent Citations (2)

* Cited by examiner, † Cited by third party
Title
HERVE JEGOU ET AL.: "Aggregating local descriptors into a compact image representation", 《IEEE CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION》 *
HERVE JEGOU ET AL.: "Improving bag-of-features for large scale image search", 《INTERNATIONAL JOURNAL OF COMPUTER VISION》 *

Also Published As

Publication number Publication date
EP2776981A2 (en) 2014-09-17
US20130114900A1 (en) 2013-05-09
EP2776981A4 (en) 2016-09-28
WO2013068638A3 (en) 2013-09-19
WO2013068638A2 (en) 2013-05-16
IN2014CN04188A (en) 2015-07-17

Similar Documents

Publication Publication Date Title
CN103930903A (zh) Methods and apparatuses for mobile visual search
Duan et al. Overview of the MPEG-CDVS standard
CN105917359B (zh) Mobile video search
US8571306B2 (en) Coding of feature location information
Chen et al. Residual enhanced visual vector as a compact signature for mobile visual search
JP5926291B2 (ja) Method and apparatus for identifying similar images
CN110263220A (zh) Method and device for recognizing video highlight segments
US20140310314A1 (en) Matching performance and compression efficiency with descriptor code segment collision probability optimization
US20190005043A1 (en) Automated Digital Asset Tagging using Multiple Vocabulary Sets
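CN103649955A (zh) Image topological coding for visual search
CN104169946A (zh) Scalable query for visual search
JP6042778B2 (ja) Retrieval device, system, program, and method using image-based binary local feature vectors
JP2015201042A (ja) Hash function generation method, hash value generation method, device, and program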
US8755605B2 (en) System and method for compact descriptor for visual search
US20130121598A1 (en) System and Method for Randomized Point Set Geometry Verification for Image Identification
US20140270541A1 (en) Apparatus and method for processing image based on feature point
US9202108B2 (en) Methods and apparatuses for facilitating face image analysis
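CN116595220A (zh) Method and device for image extraction model construction, image query, and video generation
CN119693634B (zh) Object detection method, device, and storage medium
CN117409207B (zh) Image segmentation method, system, and computer for embedded devices
JP2015079333A (ja) Hash function generation method, hash value generation method, hash function generation device, hash value generation device, hash function generation program, and hash value generation program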
Uchida et al. Binary feature-based image retrieval with effective indexing and scoring
Fornaciari et al. Lightweight sign recognition for mobile devices
Reznik et al. Fast quantization and matching of histogram-based image features
Zhang et al. Transmitting informative components of fisher codes for mobile visual search

Legal Events

Date Code Title Description
C06 Publication
PB01 Publication
C10 Entry into substantive examination
SE01 Entry into force of request for substantive examination
WD01 Invention patent application deemed withdrawn after publication

Application publication date: 20140716