CN104885098B - 基于移动装置的文本检测及跟踪 - Google Patents

基于移动装置的文本检测及跟踪 Download PDF

Info

Publication number
CN104885098B
CN104885098B CN201380069165.8A CN201380069165A CN104885098B CN 104885098 B CN104885098 B CN 104885098B CN 201380069165 A CN201380069165 A CN 201380069165A CN 104885098 B CN104885098 B CN 104885098B
Authority
CN
China
Prior art keywords
frame
text block
subsequent image
text
camera
Prior art date
Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
Expired - Fee Related
Application number
CN201380069165.8A
Other languages
English (en)
Chinese (zh)
Other versions
CN104885098A (zh
Inventor
迈克尔·盖尔沃茨
杰优恩·金
佩尔·O·尼尔森
罗伊·劳伦斯·阿索克·伊妮果
潘琪
罗曼·塔罗尼优
Current Assignee (The listed assignees may be inaccurate. Google has not performed a legal analysis and makes no representation or warranty as to the accuracy of the list.)
Qualcomm Inc
Original Assignee
Qualcomm Inc
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
Filing date
Publication date
Application filed by Qualcomm Inc filed Critical Qualcomm Inc
Publication of CN104885098A publication Critical patent/CN104885098A/zh
Application granted granted Critical
Publication of CN104885098B publication Critical patent/CN104885098B/zh
Expired - Fee Related legal-status Critical Current
Anticipated expiration legal-status Critical

Links

Images

Classifications

    • GPHYSICS
    • G06COMPUTING OR CALCULATING; COUNTING
    • G06VIMAGE OR VIDEO RECOGNITION OR UNDERSTANDING
    • G06V30/00Character recognition; Recognising digital ink; Document-oriented image-based pattern recognition
    • G06V30/10Character recognition
    • G06V30/22Character recognition characterised by the type of writing
    • G06V30/224Character recognition characterised by the type of writing of printed characters having additional code marks or containing code marks
    • GPHYSICS
    • G06COMPUTING OR CALCULATING; COUNTING
    • G06FELECTRIC DIGITAL DATA PROCESSING
    • G06F3/00Input arrangements for transferring data to be processed into a form capable of being handled by the computer; Output arrangements for transferring data from processing unit to output unit, e.g. interface arrangements
    • G06F3/002Specific input/output arrangements not covered by G06F3/01 - G06F3/16
    • G06F3/005Input arrangements through a video camera
    • GPHYSICS
    • G06COMPUTING OR CALCULATING; COUNTING
    • G06TIMAGE DATA PROCESSING OR GENERATION, IN GENERAL
    • G06T7/00Image analysis
    • G06T7/20Analysis of motion
    • G06T7/246Analysis of motion using feature-based methods, e.g. the tracking of corners or segments
    • GPHYSICS
    • G06COMPUTING OR CALCULATING; COUNTING
    • G06TIMAGE DATA PROCESSING OR GENERATION, IN GENERAL
    • G06T7/00Image analysis
    • G06T7/70Determining position or orientation of objects or cameras
    • G06T7/73Determining position or orientation of objects or cameras using feature-based methods
    • G06T7/74Determining position or orientation of objects or cameras using feature-based methods involving reference images or patches
    • GPHYSICS
    • G06COMPUTING OR CALCULATING; COUNTING
    • G06VIMAGE OR VIDEO RECOGNITION OR UNDERSTANDING
    • G06V30/00Character recognition; Recognising digital ink; Document-oriented image-based pattern recognition
    • G06V30/10Character recognition
    • G06V30/14Image acquisition
    • G06V30/142Image acquisition using hand-held instruments; Constructional details of the instruments
    • GPHYSICS
    • G10MUSICAL INSTRUMENTS; ACOUSTICS
    • G10LSPEECH ANALYSIS TECHNIQUES OR SPEECH SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING TECHNIQUES; SPEECH OR AUDIO CODING OR DECODING
    • G10L13/00Speech synthesis; Text to speech systems
    • HELECTRICITY
    • H04ELECTRIC COMMUNICATION TECHNIQUE
    • H04NPICTORIAL COMMUNICATION, e.g. TELEVISION
    • H04N1/00Scanning, transmission or reproduction of documents or the like, e.g. facsimile transmission; Details thereof
    • H04N1/00127Connection or combination of a still picture apparatus with another apparatus, e.g. for storage, processing or transmission of still picture signals or of information associated with a still picture
    • H04N1/00204Connection or combination of a still picture apparatus with another apparatus, e.g. for storage, processing or transmission of still picture signals or of information associated with a still picture with a digital computer or a digital computer system, e.g. an internet server
    • GPHYSICS
    • G06COMPUTING OR CALCULATING; COUNTING
    • G06TIMAGE DATA PROCESSING OR GENERATION, IN GENERAL
    • G06T2207/00Indexing scheme for image analysis or image enhancement
    • G06T2207/30Subject of image; Context of image processing
    • G06T2207/30244Camera pose

Landscapes

  • Engineering & Computer Science (AREA)
  • Multimedia (AREA)
  • Physics & Mathematics (AREA)
  • Theoretical Computer Science (AREA)
  • General Physics & Mathematics (AREA)
  • Computer Vision & Pattern Recognition (AREA)
  • General Engineering & Computer Science (AREA)
  • Human Computer Interaction (AREA)
  • Computing Systems (AREA)
  • Signal Processing (AREA)
  • Computational Linguistics (AREA)
  • Health & Medical Sciences (AREA)
  • Audiology, Speech & Language Pathology (AREA)
  • Acoustics & Sound (AREA)
  • Studio Devices (AREA)
  • Character Input (AREA)
  • Character Discrimination (AREA)
  • Image Analysis (AREA)
CN201380069165.8A 2013-01-04 2013-11-22 基于移动装置的文本检测及跟踪 Expired - Fee Related CN104885098B (zh)

Applications Claiming Priority (5)

Application Number Priority Date Filing Date Title
US201361749248P 2013-01-04 2013-01-04
US61/749,248 2013-01-04
US14/021,337 US20140192210A1 (en) 2013-01-04 2013-09-09 Mobile device based text detection and tracking
US14/021,337 2013-09-09
PCT/US2013/071518 WO2014107246A1 (en) 2013-01-04 2013-11-22 Mobile device based text detection and tracking

Publications (2)

Publication Number Publication Date
CN104885098A CN104885098A (zh) 2015-09-02
CN104885098B true CN104885098B (zh) 2020-02-21

Family

ID=51060682

Family Applications (1)

Application Number Title Priority Date Filing Date
CN201380069165.8A Expired - Fee Related CN104885098B (zh) 2013-01-04 2013-11-22 基于移动装置的文本检测及跟踪

Country Status (6)

Country Link
US (1) US20140192210A1 (enExample)
EP (1) EP2941736B1 (enExample)
JP (1) JP6338595B2 (enExample)
KR (1) KR20150104126A (enExample)
CN (1) CN104885098B (enExample)
WO (1) WO2014107246A1 (enExample)

Families Citing this family (32)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN109040553B (zh) * 2013-06-13 2021-04-13 核心光电有限公司 双孔径变焦数字摄影机
US10474921B2 (en) * 2013-06-14 2019-11-12 Qualcomm Incorporated Tracker assisted image capture
US9710440B2 (en) * 2013-08-21 2017-07-18 Microsoft Technology Licensing, Llc Presenting fixed format documents in reflowed format
US20150123966A1 (en) * 2013-10-03 2015-05-07 Compedia - Software And Hardware Development Limited Interactive augmented virtual reality and perceptual computing platform
US9449239B2 (en) 2014-05-30 2016-09-20 Apple Inc. Credit card auto-fill
US9565370B2 (en) * 2014-05-30 2017-02-07 Apple Inc. System and method for assisting in computer interpretation of surfaces carrying symbols or characters
US20160092747A1 (en) * 2014-09-29 2016-03-31 Qualcomm Incorporated Devices and methods for facilitating digital imagery encoding based on detection of text and computer generated graphics
JP2016111633A (ja) * 2014-12-09 2016-06-20 キヤノン株式会社 回路情報に従って論理回路を構成可能な回路を持つデバイスと、複数の制御手段とを有する情報処理システム
US9613273B2 (en) * 2015-05-19 2017-04-04 Toyota Motor Engineering & Manufacturing North America, Inc. Apparatus and method for object tracking
RU2619712C1 (ru) * 2016-05-13 2017-05-17 Общество с ограниченной ответственностью "Аби Девелопмент" Оптическое распознавание символов серии изображений
US10108856B2 (en) 2016-05-13 2018-10-23 Abbyy Development Llc Data entry from series of images of a patterned document
RU2613849C1 (ru) 2016-05-13 2017-03-21 Общество с ограниченной ответственностью "Аби Девелопмент" Оптическое распознавание символов серии изображений
US10701261B2 (en) * 2016-08-01 2020-06-30 International Business Machines Corporation Method, system and computer program product for selective image capture
GB2557237B (en) * 2016-12-01 2022-05-11 Crane Payment Innovations Ltd Method and apparatus for money item processing
CN108629843B (zh) * 2017-03-24 2021-07-13 成都理想境界科技有限公司 一种实现增强现实的方法及设备
JPWO2018235219A1 (ja) * 2017-06-22 2020-03-19 日本電気株式会社 自己位置推定方法、自己位置推定装置および自己位置推定プログラム
WO2019009916A1 (en) 2017-07-07 2019-01-10 Hewlett-Packard Development Company, L.P. ALIGNMENTS OF IMAGES THROUGH OPTICAL RECOGNITION OF CHARACTERS
KR102402148B1 (ko) 2017-08-22 2022-05-26 삼성전자주식회사 전자 장치 및 그의 문자 인식 방법
RU2657181C1 (ru) 2017-09-01 2018-06-08 Общество с ограниченной ответственностью "Аби Продакшн" Способ улучшения качества распознавания отдельного кадра
CN107679135A (zh) * 2017-09-22 2018-02-09 深圳市易图资讯股份有限公司 面向网络文本大数据的话题检测与跟踪方法、装置
RU2673015C1 (ru) 2017-12-22 2018-11-21 Общество с ограниченной ответственностью "Аби Продакшн" Способы и системы оптического распознавания символов серии изображений
US10699145B1 (en) * 2018-11-14 2020-06-30 Omniscience Corp. Systems and methods for augmented reality assisted form data capture
CN109917644B (zh) * 2018-12-26 2022-06-14 达闼科技(北京)有限公司 一种提高视觉惯导系统鲁棒性的方法、装置和机器人设备
KR102744083B1 (ko) * 2019-08-15 2024-12-19 엘지전자 주식회사 지능형 진단 디바이스
US11461164B2 (en) 2020-05-01 2022-10-04 UiPath, Inc. Screen response validation of robot execution for robotic process automation
US11080548B1 (en) 2020-05-01 2021-08-03 UiPath, Inc. Text detection, caret tracking, and active element detection
US11200441B2 (en) 2020-05-01 2021-12-14 UiPath, Inc. Text detection, caret tracking, and active element detection
CN111931571B (zh) * 2020-07-07 2022-05-17 华中科技大学 基于在线增强检测的视频文字目标追踪方法与电子设备
TR202101347A1 (tr) * 2021-01-28 2022-08-22 Univ Yildiz Teknik Bir sesli okuma cihazı.
CN115797815B (zh) * 2021-09-08 2023-12-15 荣耀终端有限公司 Ar翻译的处理方法及电子设备
CN114064959B (zh) * 2021-09-29 2024-11-26 北京搜狗科技发展有限公司 信息提取方法、装置及介质
US12008829B2 (en) * 2022-02-16 2024-06-11 Vastec, Inc. System and method for improved OCR efficacy through image segmentation

Citations (2)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
US6137491A (en) * 1998-06-05 2000-10-24 Microsoft Corporation Method and apparatus for reconstructing geometry using geometrically constrained structure from motion with points on planes
CN1443339A (zh) * 2000-07-19 2003-09-17 雅各布·威特曼 移动捕捉、处理、存储和传输文本包含字符和图像的混合信息的方法和装置

Family Cites Families (7)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
JP2003331217A (ja) * 2002-03-08 2003-11-21 Nec Corp 文字入力装置、文字入力方法及び文字入力プログラム
US7659915B2 (en) * 2004-04-02 2010-02-09 K-Nfb Reading Technology, Inc. Portable reading device with mode processing
US8107721B2 (en) * 2008-05-29 2012-01-31 Mitsubishi Electric Research Laboratories, Inc. Method and system for determining poses of semi-specular objects
FR2947657B1 (fr) * 2009-07-06 2016-05-27 Valeo Vision Procede de detection d'un obstacle pour vehicule automobile
US20110090253A1 (en) * 2009-10-19 2011-04-21 Quest Visual, Inc. Augmented reality language translation system and method
US20120092329A1 (en) * 2010-10-13 2012-04-19 Qualcomm Incorporated Text-based 3d augmented reality
JP6061502B2 (ja) * 2012-06-04 2017-01-18 キヤノン株式会社 画像処理装置、画像処理方法及びプログラム

Patent Citations (2)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
US6137491A (en) * 1998-06-05 2000-10-24 Microsoft Corporation Method and apparatus for reconstructing geometry using geometrically constrained structure from motion with points on planes
CN1443339A (zh) * 2000-07-19 2003-09-17 雅各布·威特曼 移动捕捉、处理、存储和传输文本包含字符和图像的混合信息的方法和装置

Non-Patent Citations (1)

* Cited by examiner, † Cited by third party
Title
TranslatAR: a mobile augmented reality translator;Victor Fragoso 等;《APPLICATIONS OF COMPUTER VISION(WACV)》;20110105;第497-502页 *

Also Published As

Publication number Publication date
CN104885098A (zh) 2015-09-02
JP2016502218A (ja) 2016-01-21
EP2941736B1 (en) 2019-08-14
JP6338595B2 (ja) 2018-06-06
US20140192210A1 (en) 2014-07-10
EP2941736A1 (en) 2015-11-11
WO2014107246A1 (en) 2014-07-10
KR20150104126A (ko) 2015-09-14

Similar Documents

Publication Publication Date Title
CN104885098B (zh) 基于移动装置的文本检测及跟踪
CN105283905B (zh) 使用点和线特征的稳健跟踪
US9747516B2 (en) Keypoint detection with trackability measurements
EP3627445B1 (en) Head pose estimation using rgbd camera
CN103155001B (zh) 用于多用户扩增现实的在线基准生成和跟踪的方法、装置和系统
US9576183B2 (en) Fast initialization for monocular visual SLAM
CN109683699B (zh) 基于深度学习实现增强现实的方法、装置及移动终端
US7554575B2 (en) Fast imaging system calibration
US20140369557A1 (en) Systems and Methods for Feature-Based Tracking
CN110869974A (zh) 点云处理方法、设备及存储介质
WO2019042426A1 (zh) 增强现实场景的处理方法、设备及计算机存储介质
JP2018028899A (ja) 画像レジストレーションの方法及びシステム
CN107646109B (zh) 管理电子设备上的环境映射的特征数据
CN103492899A (zh) 用于扩增实境的在线参考补丁产生和姿态估计
TW202030697A (zh) 電子裝置及恢復深度圖的方法
WO2022237048A1 (zh) 位姿获取方法、装置、电子设备、存储介质及程序
CN110310325B (zh) 一种虚拟测量方法、电子设备及计算机可读存储介质
JP2011071746A (ja) 映像出力装置及び映像出力方法
JP2018198030A (ja) 情報処理装置及びプログラム
US20220230342A1 (en) Information processing apparatus that estimates object depth, method therefor, and storage medium holding program therefor
US9245192B2 (en) Ad collateral detection
JPWO2015141324A1 (ja) 画像処理装置,方法,およびそのプログラム

Legal Events

Date Code Title Description
PB01 Publication
C10 Entry into substantive examination
SE01 Entry into force of request for substantive examination
GR01 Patent grant
GR01 Patent grant
CF01 Termination of patent right due to non-payment of annual fee

Granted publication date: 20200221

Termination date: 20201122

CF01 Termination of patent right due to non-payment of annual fee