TW201612779A - Image based search to identify objects in documents - Google Patents

Image based search to identify objects in documents

Info

Publication number
TW201612779A
TW201612779A TW104119442A TW104119442A TW201612779A TW 201612779 A TW201612779 A TW 201612779A TW 104119442 A TW104119442 A TW 104119442A TW 104119442 A TW104119442 A TW 104119442A TW 201612779 A TW201612779 A TW 201612779A
Authority
TW
Taiwan
Prior art keywords
documents
image based
based search
identify objects
image
Prior art date
Application number
TW104119442A
Other languages
English (en)
Inventor
Matthew Vogel
Original Assignee
Microsoft Technology Licensing Llc
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
Filing date
Publication date
Application filed by Microsoft Technology Licensing Llc filed Critical Microsoft Technology Licensing Llc
Publication of TW201612779A publication Critical patent/TW201612779A/zh

Links

Classifications

    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06FELECTRIC DIGITAL DATA PROCESSING
    • G06F16/00Information retrieval; Database structures therefor; File system structures therefor
    • G06F16/50Information retrieval; Database structures therefor; File system structures therefor of still image data
    • G06F16/51Indexing; Data structures therefor; Storage structures
    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06VIMAGE OR VIDEO RECOGNITION OR UNDERSTANDING
    • G06V30/00Character recognition; Recognising digital ink; Document-oriented image-based pattern recognition
    • G06V30/40Document-oriented image-based pattern recognition
    • G06V30/41Analysis of document content
    • G06V30/412Layout analysis of documents structured with printed lines or input boxes, e.g. business forms or tables
    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06FELECTRIC DIGITAL DATA PROCESSING
    • G06F16/00Information retrieval; Database structures therefor; File system structures therefor
    • G06F16/50Information retrieval; Database structures therefor; File system structures therefor of still image data
    • G06F16/58Retrieval characterised by using metadata, e.g. metadata not derived from the content or metadata generated manually
    • G06F16/583Retrieval characterised by using metadata, e.g. metadata not derived from the content or metadata generated manually using metadata automatically derived from the content
    • G06F16/5846Retrieval characterised by using metadata, e.g. metadata not derived from the content or metadata generated manually using metadata automatically derived from the content using extracted text
    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06FELECTRIC DIGITAL DATA PROCESSING
    • G06F16/00Information retrieval; Database structures therefor; File system structures therefor
    • G06F16/50Information retrieval; Database structures therefor; File system structures therefor of still image data
    • G06F16/58Retrieval characterised by using metadata, e.g. metadata not derived from the content or metadata generated manually
    • G06F16/583Retrieval characterised by using metadata, e.g. metadata not derived from the content or metadata generated manually using metadata automatically derived from the content
    • G06F16/5854Retrieval characterised by using metadata, e.g. metadata not derived from the content or metadata generated manually using metadata automatically derived from the content using shape and object relationship
    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06FELECTRIC DIGITAL DATA PROCESSING
    • G06F16/00Information retrieval; Database structures therefor; File system structures therefor
    • G06F16/70Information retrieval; Database structures therefor; File system structures therefor of video data
    • G06F16/78Retrieval characterised by using metadata, e.g. metadata not derived from the content or metadata generated manually
    • G06F16/783Retrieval characterised by using metadata, e.g. metadata not derived from the content or metadata generated manually using metadata automatically derived from the content
    • G06F16/7837Retrieval characterised by using metadata, e.g. metadata not derived from the content or metadata generated manually using metadata automatically derived from the content using objects detected or recognised in the video content
    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06FELECTRIC DIGITAL DATA PROCESSING
    • G06F16/00Information retrieval; Database structures therefor; File system structures therefor
    • G06F16/70Information retrieval; Database structures therefor; File system structures therefor of video data
    • G06F16/78Retrieval characterised by using metadata, e.g. metadata not derived from the content or metadata generated manually
    • G06F16/783Retrieval characterised by using metadata, e.g. metadata not derived from the content or metadata generated manually using metadata automatically derived from the content
    • G06F16/7844Retrieval characterised by using metadata, e.g. metadata not derived from the content or metadata generated manually using metadata automatically derived from the content using original textual content or text extracted from visual content or transcript of audio data
    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06VIMAGE OR VIDEO RECOGNITION OR UNDERSTANDING
    • G06V30/00Character recognition; Recognising digital ink; Document-oriented image-based pattern recognition
    • G06V30/40Document-oriented image-based pattern recognition
    • G06V30/41Analysis of document content
    • G06V30/416Extracting the logical structure, e.g. chapters, sections or page numbers; Identifying elements of the document, e.g. authors

Landscapes

  • Engineering & Computer Science (AREA)
  • Theoretical Computer Science (AREA)
  • Library & Information Science (AREA)
  • Physics & Mathematics (AREA)
  • General Physics & Mathematics (AREA)
  • Multimedia (AREA)
  • Data Mining & Analysis (AREA)
  • Databases & Information Systems (AREA)
  • General Engineering & Computer Science (AREA)
  • Computer Vision & Pattern Recognition (AREA)
  • Artificial Intelligence (AREA)
  • Software Systems (AREA)
  • Information Retrieval, Db Structures And Fs Structures Therefor (AREA)
  • Processing Or Creating Images (AREA)
TW104119442A 2014-07-28 2015-06-16 Image based search to identify objects in documents TW201612779A (en)

Applications Claiming Priority (1)

Application Number Priority Date Filing Date Title
US14/445,040 US20160026858A1 (en) 2014-07-28 2014-07-28 Image based search to identify objects in documents

Publications (1)

Publication Number Publication Date
TW201612779A true TW201612779A (en) 2016-04-01

Family

ID=53765589

Family Applications (1)

Application Number Title Priority Date Filing Date
TW104119442A TW201612779A (en) 2014-07-28 2015-06-16 Image based search to identify objects in documents

Country Status (5)

Country Link
US (1) US20160026858A1 (zh)
EP (1) EP3175375A1 (zh)
CN (1) CN106575300A (zh)
TW (1) TW201612779A (zh)
WO (1) WO2016018683A1 (zh)

Cited By (1)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
TWI709117B (zh) * 2019-06-05 2020-11-01 弘光科技大學 雲端智能物品影像辨識系統

Families Citing this family (11)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
EP2807604A1 (en) 2012-01-23 2014-12-03 Microsoft Corporation Vector graphics classification engine
EP2807608B1 (en) 2012-01-23 2024-04-10 Microsoft Technology Licensing, LLC Borderless table detection engine
US10354419B2 (en) * 2015-05-25 2019-07-16 Colin Frederick Ritchie Methods and systems for dynamic graph generating
US20170220858A1 (en) * 2016-02-01 2017-08-03 Microsoft Technology Licensing, Llc Optical recognition of tables
CN107291949B (zh) * 2017-07-17 2020-11-13 绿湾网络科技有限公司 信息搜索方法及装置
CN107679024B (zh) * 2017-09-11 2023-04-18 畅捷通信息技术股份有限公司 识别表格的方法、系统、计算机设备、可读存储介质
CN107742096A (zh) * 2017-09-26 2018-02-27 阿里巴巴集团控股有限公司 获取图表特征信息的方法及装置、电子设备、存储介质
CN110889310B (zh) * 2018-09-07 2023-05-09 深圳市赢时胜信息技术股份有限公司 金融文档信息智能提取系统及方法
CN112307265A (zh) * 2019-07-26 2021-02-02 珠海金山办公软件有限公司 一种在文档中查找图表的方法、系统、存储介质和终端
TW202207007A (zh) * 2020-08-14 2022-02-16 新穎數位文創股份有限公司 物件辨識裝置與物件辨識方法
CN115617957B (zh) * 2022-12-19 2023-04-07 铭台(北京)科技有限公司 基于大数据的文档智能检索方法

Family Cites Families (13)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
KR20010103394A (ko) * 2000-05-10 2001-11-23 박정관 신분증 인식 기술을 이용한 고객 정보 관리 시스템 및 방법
US6996268B2 (en) * 2001-12-28 2006-02-07 International Business Machines Corporation System and method for gathering, indexing, and supplying publicly available data charts
US7502033B1 (en) * 2002-09-30 2009-03-10 Dale Axelrod Artists' color display system
US8341152B1 (en) * 2006-09-12 2012-12-25 Creatier Interactive Llc System and method for enabling objects within video to be searched on the internet or intranet
US8631012B2 (en) * 2006-09-29 2014-01-14 A9.Com, Inc. Method and system for identifying and displaying images in response to search queries
CN101908136B (zh) * 2009-06-08 2013-02-13 比亚迪股份有限公司 一种表格识别处理方法及系统
JP5361574B2 (ja) * 2009-07-01 2013-12-04 キヤノン株式会社 画像処理装置、画像処理方法、及びプログラム
CN101639760A (zh) * 2009-08-27 2010-02-03 上海合合信息科技发展有限公司 联系信息输入方法及系统
US9198623B2 (en) * 2010-04-22 2015-12-01 Abbott Diabetes Care Inc. Devices, systems, and methods related to analyte monitoring and management
CN101923643B (zh) * 2010-08-11 2012-11-21 中科院成都信息技术有限公司 通用表格识别方法
US8723870B1 (en) * 2012-01-30 2014-05-13 Google Inc. Selection of object types with data transferability
US9275291B2 (en) * 2013-06-17 2016-03-01 Texifter, LLC System and method of classifier ranking for incorporation into enhanced machine learning
US9740995B2 (en) * 2013-10-28 2017-08-22 Morningstar, Inc. Coordinate-based document processing and data entry system and method

Cited By (1)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
TWI709117B (zh) * 2019-06-05 2020-11-01 弘光科技大學 雲端智能物品影像辨識系統

Also Published As

Publication number Publication date
EP3175375A1 (en) 2017-06-07
US20160026858A1 (en) 2016-01-28
WO2016018683A1 (en) 2016-02-04
CN106575300A (zh) 2017-04-19

Similar Documents

Publication Publication Date Title
TW201612779A (en) Image based search to identify objects in documents
MX2017013951A (es) Recopilación y reproducción de metadatos mejorados.
GB201717959D0 (en) Collection strategies that facilitate arranging portions of documents into content collections
PH12015000372B1 (en) Conversion of documents of different types to a uniform and an editable or a searchable format
AU201610993S (en) Sleeve for an intraoral scanner
GB201618163D0 (en) Improved method, system and software for searching, identifying, retrieving and presenting electronic documents
MX2016006229A (es) Busqueda con base en imagen.
EP3201833A4 (en) Schemes for retrieving and associating content items with real-world objects using augmented reality and object recognition
EP3180699A4 (en) Metadata index search in file system
AU359496S (en) Phone case
MX2016003315A (es) Segmentacion de contenido de video basado en contenido.
GB2528206A (en) Guided article authorship
AU366418S (en) Case for a tablet computer
ZA201807033B (en) Content based search and retrieval of trademark images
EP3461413A3 (en) Information processing apparatus, information processing method, and computer-readable storage medium
EP3216201A4 (en) System and method for sorting scanned documents to selected output trays
GB202011326D0 (en) Searching multilingual documents based on document structure extraction
GB202009248D0 (en) Semantic normalization in document digitization
TW201614507A (en) Methods and devices for finding settings to be used in relation to a sensor unit connected to a processing unit
IN2014DE00500A (zh)
AU201612518S (en) Sample transport pod for biological materials
MX2017000824A (es) Reconocimiento de entrada para el mejoramiento de la productividad de documentos.
MX2016009614A (es) Metadatos agregados proporcionados para contenido de programacion.
EP3195155A4 (en) A system and method of designating documents to associate with a search record
AU2014100038A4 (en) Under Bench Coffee Machine with exposed bench top group heads.