CN103765440B - Optical character recognition on a mobile device using context information - Google Patents

Optical character recognition on a mobile device using context information

Info

Publication number
CN103765440B
Authority
CN
China
Prior art keywords
ocr
background
image
drawing object
group
Prior art date
Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
Expired - Fee Related
Application number
CN201280041851.XA
Other languages
English (en)
Chinese (zh)
Other versions
CN103765440A (zh)
Inventor
黄奎雄
太元·李
金杜勋
延奇宣
真珉豪
金泰殊
朝玄默
Current Assignee (The listed assignees may be inaccurate. Google has not performed a legal analysis and makes no representation or warranty as to the accuracy of the list.)
Qualcomm Inc
Original Assignee
Qualcomm Inc
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
2011-08-29
Filing date
2012-08-06
Publication date
2018-04-03
Application filed by Qualcomm Inc
Publication of CN103765440A
Application granted
Publication of CN103765440B
Expired - Fee Related
Anticipated expiration

Classifications

    • G PHYSICS
    • G06 COMPUTING OR CALCULATING; COUNTING
    • G06V IMAGE OR VIDEO RECOGNITION OR UNDERSTANDING
    • G06V30/00 Character recognition; Recognising digital ink; Document-oriented image-based pattern recognition
    • G06V30/10 Character recognition
    • G06V30/12 Detection or correction of errors, e.g. by rescanning the pattern
    • G06V30/127 Detection or correction of errors, e.g. by rescanning the pattern with the intervention of an operator
    • G PHYSICS
    • G06 COMPUTING OR CALCULATING; COUNTING
    • G06V IMAGE OR VIDEO RECOGNITION OR UNDERSTANDING
    • G06V10/00 Arrangements for image or video recognition or understanding
    • G06V10/20 Image preprocessing
    • G06V10/22 Image preprocessing by selection of a specific region containing or referencing a pattern; Locating or processing of specific regions to guide the detection or recognition
    • G PHYSICS
    • G06 COMPUTING OR CALCULATING; COUNTING
    • G06V IMAGE OR VIDEO RECOGNITION OR UNDERSTANDING
    • G06V10/00 Arrangements for image or video recognition or understanding
    • G06V10/70 Arrangements for image or video recognition or understanding using pattern recognition or machine learning
    • G06V10/768 Arrangements for image or video recognition or understanding using pattern recognition or machine learning using context analysis, e.g. recognition aided by known co-occurring patterns
    • G PHYSICS
    • G06 COMPUTING OR CALCULATING; COUNTING
    • G06V IMAGE OR VIDEO RECOGNITION OR UNDERSTANDING
    • G06V10/00 Arrangements for image or video recognition or understanding
    • G06V10/98 Detection or correction of errors, e.g. by rescanning the pattern or by human intervention; Evaluation of the quality of the acquired patterns
    • G06V10/987 Detection or correction of errors, e.g. by rescanning the pattern or by human intervention; Evaluation of the quality of the acquired patterns with the intervention of an operator
    • G PHYSICS
    • G06 COMPUTING OR CALCULATING; COUNTING
    • G06V IMAGE OR VIDEO RECOGNITION OR UNDERSTANDING
    • G06V30/00 Character recognition; Recognising digital ink; Document-oriented image-based pattern recognition
    • G06V30/10 Character recognition
    • G06V30/26 Techniques for post-processing, e.g. correcting the recognition result
    • G06V30/262 Techniques for post-processing, e.g. correcting the recognition result using context analysis, e.g. lexical, syntactic or semantic context
    • G06V30/268 Lexical context
    • G PHYSICS
    • G06 COMPUTING OR CALCULATING; COUNTING
    • G06V IMAGE OR VIDEO RECOGNITION OR UNDERSTANDING
    • G06V30/00 Character recognition; Recognising digital ink; Document-oriented image-based pattern recognition
    • G06V30/10 Character recognition
    • G06V30/26 Techniques for post-processing, e.g. correcting the recognition result
    • G06V30/262 Techniques for post-processing, e.g. correcting the recognition result using context analysis, e.g. lexical, syntactic or semantic context
    • G06V30/274 Syntactic or semantic context, e.g. balancing
    • H ELECTRICITY
    • H04 ELECTRIC COMMUNICATION TECHNIQUE
    • H04M TELEPHONIC COMMUNICATION
    • H04M1/00 Substation equipment, e.g. for use by subscribers
    • H04M1/247 Telephone sets including user guidance or feature selection means facilitating their use
    • G PHYSICS
    • G06 COMPUTING OR CALCULATING; COUNTING
    • G06V IMAGE OR VIDEO RECOGNITION OR UNDERSTANDING
    • G06V2201/00 Indexing scheme relating to image or video recognition or understanding
    • G06V2201/10 Recognition assisted with metadata
    • G PHYSICS
    • G06 COMPUTING OR CALCULATING; COUNTING
    • G06V IMAGE OR VIDEO RECOGNITION OR UNDERSTANDING
    • G06V30/00 Character recognition; Recognising digital ink; Document-oriented image-based pattern recognition
    • G06V30/10 Character recognition
    • G PHYSICS
    • G06 COMPUTING OR CALCULATING; COUNTING
    • G06V IMAGE OR VIDEO RECOGNITION OR UNDERSTANDING
    • G06V30/00 Character recognition; Recognising digital ink; Document-oriented image-based pattern recognition
    • G06V30/10 Character recognition
    • G06V30/28 Character recognition specially adapted to the type of the alphabet, e.g. Latin alphabet
    • G06V30/287 Character recognition specially adapted to the type of the alphabet, e.g. Latin alphabet of Kanji, Hiragana or Katakana characters

Landscapes

  • Engineering & Computer Science (AREA)
  • Theoretical Computer Science (AREA)
  • Physics & Mathematics (AREA)
  • General Physics & Mathematics (AREA)
  • Multimedia (AREA)
  • Computer Vision & Pattern Recognition (AREA)
  • Computational Linguistics (AREA)
  • Health & Medical Sciences (AREA)
  • Evolutionary Computation (AREA)
  • Quality & Reliability (AREA)
  • Human Computer Interaction (AREA)
  • Artificial Intelligence (AREA)
  • Computing Systems (AREA)
  • Databases & Information Systems (AREA)
  • Signal Processing (AREA)
  • General Health & Medical Sciences (AREA)
  • Medical Informatics (AREA)
  • Software Systems (AREA)
  • User Interface Of Digital Computer (AREA)
  • Character Discrimination (AREA)
  • Character Input (AREA)
  • Information Retrieval, Db Structures And Fs Structures Therefor (AREA)
CN201280041851.XA 2011-08-29 2012-08-06 Optical character recognition on a mobile device using context information Expired - Fee Related CN103765440B (zh)

Applications Claiming Priority (5)

Application Number Priority Date Filing Date Title
US201161528741P 2011-08-29 2011-08-29
US61/528,741 2011-08-29
US13/450,016 2012-04-18
US13/450,016 US9082035B2 (en) 2011-08-29 2012-04-18 Camera OCR with context information
PCT/US2012/049786 WO2013032639A2 (en) 2011-08-29 2012-08-06 Camera ocr with context information

Publications (2)

Publication Number Publication Date
CN103765440A (zh) 2014-04-30
CN103765440B (zh) 2018-04-03

Family

ID=46642660

Family Applications (1)

Application Number Title Priority Date Filing Date
CN201280041851.XA Expired - Fee Related CN103765440B (zh) 2011-08-29 2012-08-06 Optical character recognition on a mobile device using context information

Country Status (6)

Country Link
US (1) US9082035B2
EP (1) EP2751741A2
JP (2) JP6148235B2
KR (1) KR101667463B1
CN (1) CN103765440B
WO (1) WO2013032639A2

Families Citing this family (29)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
US9292498B2 (en) * 2012-03-21 2016-03-22 Paypal, Inc. Device orientation based translation system
US9519641B2 (en) * 2012-09-18 2016-12-13 Abbyy Development Llc Photography recognition translation
JP5708689B2 (ja) * 2013-03-13 2015-04-30 株式会社デンソー Object detection device
US9367811B2 (en) * 2013-03-15 2016-06-14 Qualcomm Incorporated Context aware localization, mapping, and tracking
US9727535B2 (en) 2013-06-11 2017-08-08 Microsoft Technology Licensing, Llc Authoring presentations with ink
US10769362B2 (en) 2013-08-02 2020-09-08 Symbol Technologies, Llc Method and apparatus for capturing and extracting content from documents on a mobile device
US10140257B2 (en) 2013-08-02 2018-11-27 Symbol Technologies, Llc Method and apparatus for capturing and processing content from context sensitive documents on a mobile device
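KR101520389B1 (ko) * 2014-01-10 2015-05-14 윤창수 Method for acquiring search information, recording medium storing a program for executing the method, and portable terminal
JP2015153342A (ja) * 2014-02-19 2015-08-24 三菱電機株式会社 Equipment inspection device and equipment inspection management system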
US9355336B1 (en) * 2014-04-23 2016-05-31 Amazon Technologies, Inc. Recognizing text from frames of image data using contextual information
US9436682B2 (en) * 2014-06-24 2016-09-06 Google Inc. Techniques for machine language translation of text from an image based on non-textual context information from the image
JP6027580B2 (ja) * 2014-08-27 2016-11-16 京セラドキュメントソリューションズ株式会社 Information display system and information display program
JP2016076167A (ja) * 2014-10-08 2016-05-12 ソニー株式会社 Information processing device and information processing method
IL235565B (en) * 2014-11-06 2019-06-30 Kolton Achiav Position-based optical character recognition
US10530720B2 (en) * 2015-08-27 2020-01-07 Mcafee, Llc Contextual privacy engine for notifications
US10943398B2 (en) * 2016-07-15 2021-03-09 Samsung Electronics Co., Ltd. Augmented reality device and operation thereof
US10579741B2 (en) 2016-08-17 2020-03-03 International Business Machines Corporation Proactive input selection for improved machine translation
US10311330B2 (en) 2016-08-17 2019-06-04 International Business Machines Corporation Proactive input selection for improved image analysis and/or processing workflows
EP3522038A4 (en) * 2016-09-28 2020-06-03 Systran International Co. Ltd. METHOD FOR TRANSLATING CHARACTERS AND DEVICE THEREFOR
KR102721107B1 (ko) * 2017-01-02 2024-10-24 삼성전자주식회사 Method and terminal for recognizing text
US11263399B2 (en) * 2017-07-31 2022-03-01 Apple Inc. Correcting input based on user context
CN110532571B (zh) * 2017-09-12 2022-11-18 腾讯科技(深圳)有限公司 Text processing method and related device
KR102478396B1 (ko) * 2017-11-29 2022-12-19 삼성전자주식회사 Electronic device capable of recognizing text in an image
JP2023532669A (ja) * 2020-06-25 2023-07-31 プリオン インコーポレーテッド Document processing and response generation system
KR20220056004A (ko) 2020-10-27 2022-05-04 삼성전자주식회사 Electronic device and control method thereof
JP7600006B2 (ja) * 2021-03-22 2024-12-16 株式会社東芝 Information processing device and information input system
US20250369873A1 (en) * 2021-06-18 2025-12-04 Panasonic Intellectual Property Management Co., Ltd. Inspection system, inspection method, model generation system, determination system, model generation method, and program
CN115809672A (zh) * 2021-09-14 2023-03-17 北京小米移动软件有限公司 Translation method and device, AR glasses, storage medium, and computer program product
US12380720B2 (en) 2022-12-30 2025-08-05 Konica Minolta Business Solutions U.S.A., Inc. Method, apparatus, and system for character recognition using context

Citations (9)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
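JP2000348142A (ja) * 1999-06-08 2000-12-15 Nippon Telegr & Teleph Corp <Ntt> Character recognition device, character recognition method, and recording medium storing a program for executing the character recognition method
JP2003108551A (ja) * 2001-09-28 2003-04-11 Toshiba Corp Portable machine translation device, translation method, and translation program
CN1615478A (zh) * 2001-12-10 2005-05-11 三菱电机株式会社 Portable-terminal image processing system, portable terminal, and server
CN1737822A (zh) * 2004-05-20 2006-02-22 微软公司 Low-resolution optical character recognition for camera-acquired documents
CN1741034A (zh) * 2004-08-25 2006-03-01 富士施乐株式会社 Character recognition device and character recognition method
JP2009086349A (ja) * 2007-09-28 2009-04-23 Fujifilm Corp Imaging device and imaging control method
JP2009258871A (ja) * 2008-04-15 2009-11-05 Casio Comput Co Ltd Translation device and program
JP2011134144A (ja) * 2009-12-25 2011-07-07 Square Enix Co Ltd Real-time camera dictionary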
US20110176720A1 (en) * 2010-01-15 2011-07-21 Robert Michael Van Osten Digital Image Transitions

Family Cites Families (16)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
JPH0520300A (ja) * 1991-07-15 1993-01-29 Sharp Corp Document processing device
CA2155891A1 (en) * 1994-10-18 1996-04-19 Raymond Amand Lorie Optical character recognition system having context analyzer
JP2002209262A (ja) 2001-01-09 2002-07-26 Casio Comput Co Ltd Portable communication device
JP4269811B2 (ja) * 2003-07-09 2009-05-27 株式会社日立製作所 Mobile phone
EP1703445A4 (en) 2004-01-08 2011-07-27 Nec Corp CHARACTER DETECTION DEVICE, MOBILE COMMUNICATION SYSTEM, MOBILE TERMINAL, FIXING DEVICE, CHARACTER DETECTION METHOD AND CHARACTER RECOGNITION PROGRAM
US7565139B2 (en) 2004-02-20 2009-07-21 Google Inc. Image-based search engine for mobile phones with camera
US7840033B2 (en) * 2004-04-02 2010-11-23 K-Nfb Reading Technology, Inc. Text stitching from multiple images
US8036895B2 (en) 2004-04-02 2011-10-11 K-Nfb Reading Technology, Inc. Cooperative processing for portable reading machine
US7415171B2 (en) * 2005-03-28 2008-08-19 United States Postal Service Multigraph optical character reader enhancement systems and methods
US7826665B2 (en) * 2005-12-12 2010-11-02 Xerox Corporation Personal information retrieval using knowledge bases for optical character recognition correction
US7814040B1 (en) * 2006-01-31 2010-10-12 The Research Foundation Of State University Of New York System and method for image annotation and multi-modal image retrieval using probabilistic semantic models
US20070257934A1 (en) 2006-05-08 2007-11-08 David Doermann System and method for efficient enhancement to enable computer vision on mobile devices
US8041555B2 (en) 2007-08-15 2011-10-18 International Business Machines Corporation Language translation based on a location of a wireless device
US8000956B2 (en) 2008-02-08 2011-08-16 Xerox Corporation Semantic compatibility checking for automatic correction and discovery of named entities
JP2009199102A (ja) * 2008-02-19 2009-09-03 Fujitsu Ltd Character recognition program, character recognition device, and character recognition method
US8406531B2 (en) 2008-05-15 2013-03-26 Yahoo! Inc. Data access based on content of image recorded by a mobile device

Also Published As

Publication number Publication date
JP2014529822A (ja) 2014-11-13
WO2013032639A3 (en) 2013-07-18
JP6148235B2 (ja) 2017-06-14
CN103765440A (zh) 2014-04-30
KR20140059834A (ko) 2014-05-16
KR101667463B1 (ko) 2016-10-18
US20130108115A1 (en) 2013-05-02
EP2751741A2 (en) 2014-07-09
WO2013032639A2 (en) 2013-03-07
JP6138305B2 (ja) 2017-05-31
JP2016146187A (ja) 2016-08-12
US9082035B2 (en) 2015-07-14

Similar Documents

Publication Publication Date Title
CN103765440B (zh) Optical character recognition on a mobile device using context information
US10741167B2 (en) Document mode processing for portable reading machine enabling document navigation
US7840033B2 (en) Text stitching from multiple images
CN103946838B (zh) Interactive multimodal image search
WO2020232861A1 (zh) Named entity recognition method, electronic device, and storage medium
US20160344860A1 (en) Document and image processing
US20120083294A1 (en) Integrated image detection and contextual commands
US20170109615A1 (en) Systems and Methods for Automatically Classifying Businesses from Images
CN111465918B (zh) Method for displaying service information in a preview interface, and electronic device
US20060017752A1 (en) Image resizing for optical character recognition in portable reading machine
US7325735B2 (en) Directed reading mode for portable reading machine
CN109189879B (zh) E-book display method and device
US11550754B2 (en) Electronic apparatus and control method thereof
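JP2012185722A (ja) Character recognition device, character recognition method, character recognition system, and character recognition program
CN107608618B (zh) Interaction method and apparatus for a wearable device, and wearable device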
Pu et al. Framework based on mobile augmented reality for translating food menu in Thai language to Malay language
US11651280B2 (en) Recording medium, information processing system, and information processing method
US20110294522A1 (en) Character recognizing system and method for the same
Shilkrot et al. FingerReader: A finger-worn assistive augmentation
Bhargava et al. Speech enabled integrated AR-based multimodal language translation
Gaudissart et al. SYPOLE: a mobile assistant for the blind
US20190384795A1 (en) Information processing device, information processing terminal, and information processing method
CN111460134A (zh) Note excerpting method for handwriting traces, terminal, and computer storage medium
US12504875B1 (en) Content selection and action determination based on a gesture input
Charishma et al. Review of Android Based Portable Sign and Text Recognition System

Legal Events

Date Code Title Description
C06 Publication
PB01 Publication
C10 Entry into substantive examination
SE01 Entry into force of request for substantive examination
GR01 Patent grant
CF01 Termination of patent right due to non-payment of annual fee

Granted publication date: 20180403

Termination date: 20190806
