CN103765440B - 使用背景信息的移动装置上的光学字符辨识 - Google Patents

使用背景信息的移动装置上的光学字符辨识 Download PDF

Info

Publication number
CN103765440B
CN103765440B CN201280041851.XA CN201280041851A CN103765440B CN 103765440 B CN103765440 B CN 103765440B CN 201280041851 A CN201280041851 A CN 201280041851A CN 103765440 B CN103765440 B CN 103765440B
Authority
CN
China
Prior art keywords
ocr
background
image
drawing object
group
Prior art date
Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
Expired - Fee Related
Application number
CN201280041851.XA
Other languages
English (en)
Chinese (zh)
Other versions
CN103765440A (zh
Inventor
黄奎雄
太元·李
金杜勋
延奇宣
真珉豪
金泰殊
朝玄默
Current Assignee (The listed assignees may be inaccurate. Google has not performed a legal analysis and makes no representation or warranty as to the accuracy of the list.)
Qualcomm Inc
Original Assignee
Qualcomm Inc
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
Filing date
Publication date
Application filed by Qualcomm Inc filed Critical Qualcomm Inc
Publication of CN103765440A publication Critical patent/CN103765440A/zh
Application granted granted Critical
Publication of CN103765440B publication Critical patent/CN103765440B/zh
Expired - Fee Related legal-status Critical Current
Anticipated expiration legal-status Critical

Links

Classifications

    • GPHYSICS
    • G06COMPUTING OR CALCULATING; COUNTING
    • G06VIMAGE OR VIDEO RECOGNITION OR UNDERSTANDING
    • G06V30/00Character recognition; Recognising digital ink; Document-oriented image-based pattern recognition
    • G06V30/10Character recognition
    • G06V30/12Detection or correction of errors, e.g. by rescanning the pattern
    • G06V30/127Detection or correction of errors, e.g. by rescanning the pattern with the intervention of an operator
    • GPHYSICS
    • G06COMPUTING OR CALCULATING; COUNTING
    • G06VIMAGE OR VIDEO RECOGNITION OR UNDERSTANDING
    • G06V10/00Arrangements for image or video recognition or understanding
    • G06V10/20Image preprocessing
    • G06V10/22Image preprocessing by selection of a specific region containing or referencing a pattern; Locating or processing of specific regions to guide the detection or recognition
    • GPHYSICS
    • G06COMPUTING OR CALCULATING; COUNTING
    • G06VIMAGE OR VIDEO RECOGNITION OR UNDERSTANDING
    • G06V10/00Arrangements for image or video recognition or understanding
    • G06V10/70Arrangements for image or video recognition or understanding using pattern recognition or machine learning
    • G06V10/768Arrangements for image or video recognition or understanding using pattern recognition or machine learning using context analysis, e.g. recognition aided by known co-occurring patterns
    • GPHYSICS
    • G06COMPUTING OR CALCULATING; COUNTING
    • G06VIMAGE OR VIDEO RECOGNITION OR UNDERSTANDING
    • G06V10/00Arrangements for image or video recognition or understanding
    • G06V10/98Detection or correction of errors, e.g. by rescanning the pattern or by human intervention; Evaluation of the quality of the acquired patterns
    • G06V10/987Detection or correction of errors, e.g. by rescanning the pattern or by human intervention; Evaluation of the quality of the acquired patterns with the intervention of an operator
    • GPHYSICS
    • G06COMPUTING OR CALCULATING; COUNTING
    • G06VIMAGE OR VIDEO RECOGNITION OR UNDERSTANDING
    • G06V30/00Character recognition; Recognising digital ink; Document-oriented image-based pattern recognition
    • G06V30/10Character recognition
    • G06V30/26Techniques for post-processing, e.g. correcting the recognition result
    • G06V30/262Techniques for post-processing, e.g. correcting the recognition result using context analysis, e.g. lexical, syntactic or semantic context
    • G06V30/268Lexical context
    • GPHYSICS
    • G06COMPUTING OR CALCULATING; COUNTING
    • G06VIMAGE OR VIDEO RECOGNITION OR UNDERSTANDING
    • G06V30/00Character recognition; Recognising digital ink; Document-oriented image-based pattern recognition
    • G06V30/10Character recognition
    • G06V30/26Techniques for post-processing, e.g. correcting the recognition result
    • G06V30/262Techniques for post-processing, e.g. correcting the recognition result using context analysis, e.g. lexical, syntactic or semantic context
    • G06V30/274Syntactic or semantic context, e.g. balancing
    • HELECTRICITY
    • H04ELECTRIC COMMUNICATION TECHNIQUE
    • H04MTELEPHONIC COMMUNICATION
    • H04M1/00Substation equipment, e.g. for use by subscribers
    • H04M1/247Telephone sets including user guidance or feature selection means facilitating their use
    • GPHYSICS
    • G06COMPUTING OR CALCULATING; COUNTING
    • G06VIMAGE OR VIDEO RECOGNITION OR UNDERSTANDING
    • G06V2201/00Indexing scheme relating to image or video recognition or understanding
    • G06V2201/10Recognition assisted with metadata
    • GPHYSICS
    • G06COMPUTING OR CALCULATING; COUNTING
    • G06VIMAGE OR VIDEO RECOGNITION OR UNDERSTANDING
    • G06V30/00Character recognition; Recognising digital ink; Document-oriented image-based pattern recognition
    • G06V30/10Character recognition
    • GPHYSICS
    • G06COMPUTING OR CALCULATING; COUNTING
    • G06VIMAGE OR VIDEO RECOGNITION OR UNDERSTANDING
    • G06V30/00Character recognition; Recognising digital ink; Document-oriented image-based pattern recognition
    • G06V30/10Character recognition
    • G06V30/28Character recognition specially adapted to the type of the alphabet, e.g. Latin alphabet
    • G06V30/287Character recognition specially adapted to the type of the alphabet, e.g. Latin alphabet of Kanji, Hiragana or Katakana characters

Landscapes

  • Engineering & Computer Science (AREA)
  • Theoretical Computer Science (AREA)
  • Physics & Mathematics (AREA)
  • General Physics & Mathematics (AREA)
  • Multimedia (AREA)
  • Computer Vision & Pattern Recognition (AREA)
  • Computational Linguistics (AREA)
  • Health & Medical Sciences (AREA)
  • Evolutionary Computation (AREA)
  • Signal Processing (AREA)
  • Quality & Reliability (AREA)
  • Artificial Intelligence (AREA)
  • Computing Systems (AREA)
  • Databases & Information Systems (AREA)
  • Human Computer Interaction (AREA)
  • General Health & Medical Sciences (AREA)
  • Medical Informatics (AREA)
  • Software Systems (AREA)
  • User Interface Of Digital Computer (AREA)
  • Character Discrimination (AREA)
  • Character Input (AREA)
  • Information Retrieval, Db Structures And Fs Structures Therefor (AREA)
CN201280041851.XA 2011-08-29 2012-08-06 使用背景信息的移动装置上的光学字符辨识 Expired - Fee Related CN103765440B (zh)

Applications Claiming Priority (5)

Application Number Priority Date Filing Date Title
US201161528741P 2011-08-29 2011-08-29
US61/528,741 2011-08-29
US13/450,016 US9082035B2 (en) 2011-08-29 2012-04-18 Camera OCR with context information
US13/450,016 2012-04-18
PCT/US2012/049786 WO2013032639A2 (en) 2011-08-29 2012-08-06 Camera ocr with context information

Publications (2)

Publication Number Publication Date
CN103765440A CN103765440A (zh) 2014-04-30
CN103765440B true CN103765440B (zh) 2018-04-03

Family

ID=46642660

Family Applications (1)

Application Number Title Priority Date Filing Date
CN201280041851.XA Expired - Fee Related CN103765440B (zh) 2011-08-29 2012-08-06 使用背景信息的移动装置上的光学字符辨识

Country Status (6)

Country Link
US (1) US9082035B2 (enExample)
EP (1) EP2751741A2 (enExample)
JP (2) JP6148235B2 (enExample)
KR (1) KR101667463B1 (enExample)
CN (1) CN103765440B (enExample)
WO (1) WO2013032639A2 (enExample)

Families Citing this family (27)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
US9292498B2 (en) * 2012-03-21 2016-03-22 Paypal, Inc. Device orientation based translation system
US9519641B2 (en) * 2012-09-18 2016-12-13 Abbyy Development Llc Photography recognition translation
JP5708689B2 (ja) * 2013-03-13 2015-04-30 株式会社デンソー 物体検出装置
US9367811B2 (en) * 2013-03-15 2016-06-14 Qualcomm Incorporated Context aware localization, mapping, and tracking
US9727535B2 (en) 2013-06-11 2017-08-08 Microsoft Technology Licensing, Llc Authoring presentations with ink
US10140257B2 (en) 2013-08-02 2018-11-27 Symbol Technologies, Llc Method and apparatus for capturing and processing content from context sensitive documents on a mobile device
US10769362B2 (en) 2013-08-02 2020-09-08 Symbol Technologies, Llc Method and apparatus for capturing and extracting content from documents on a mobile device
KR101520389B1 (ko) * 2014-01-10 2015-05-14 윤창수 검색정보 획득방법, 이를 실행하기 위한 프로그램을 저장한 기록매체 및 휴대용 단말기
JP2015153342A (ja) * 2014-02-19 2015-08-24 三菱電機株式会社 設備点検装置および設備点検管理システム
US9355336B1 (en) * 2014-04-23 2016-05-31 Amazon Technologies, Inc. Recognizing text from frames of image data using contextual information
US9436682B2 (en) * 2014-06-24 2016-09-06 Google Inc. Techniques for machine language translation of text from an image based on non-textual context information from the image
JP6027580B2 (ja) * 2014-08-27 2016-11-16 京セラドキュメントソリューションズ株式会社 情報表示システムおよび情報表示プログラム
JP2016076167A (ja) * 2014-10-08 2016-05-12 ソニー株式会社 情報処理装置および情報処理方法
IL235565B (en) * 2014-11-06 2019-06-30 Kolton Achiav Position-based optical character recognition
US10530720B2 (en) * 2015-08-27 2020-01-07 Mcafee, Llc Contextual privacy engine for notifications
US10943398B2 (en) * 2016-07-15 2021-03-09 Samsung Electronics Co., Ltd. Augmented reality device and operation thereof
US10311330B2 (en) 2016-08-17 2019-06-04 International Business Machines Corporation Proactive input selection for improved image analysis and/or processing workflows
US10579741B2 (en) 2016-08-17 2020-03-03 International Business Machines Corporation Proactive input selection for improved machine translation
US20200026766A1 (en) * 2016-09-28 2020-01-23 Systran International Co., Ltd. Method for translating characters and apparatus therefor
KR102721107B1 (ko) * 2017-01-02 2024-10-24 삼성전자주식회사 텍스트를 인식하는 방법 및 단말기
US11263399B2 (en) * 2017-07-31 2022-03-01 Apple Inc. Correcting input based on user context
CN108288067B (zh) * 2017-09-12 2020-07-24 腾讯科技(深圳)有限公司 图像文本匹配模型的训练方法、双向搜索方法及相关装置
KR102478396B1 (ko) * 2017-11-29 2022-12-19 삼성전자주식회사 이미지에서 텍스트를 인식할 수 있는 전자 장치
EP4172805A4 (en) * 2020-06-25 2024-06-19 Pryon Incorporated DOCUMENT PROCESSING AND RESPONSE GENERATION SYSTEM
KR20220056004A (ko) 2020-10-27 2022-05-04 삼성전자주식회사 전자 장치 및 이의 제어 방법
CN115809672A (zh) * 2021-09-14 2023-03-17 北京小米移动软件有限公司 翻译方法、装置、ar眼镜、存储介质及计算机程序产品
US12380720B2 (en) 2022-12-30 2025-08-05 Konica Minolta Business Solutions U.S.A., Inc. Method, apparatus, and system for character recognition using context

Citations (9)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
JP2000348142A (ja) * 1999-06-08 2000-12-15 Nippon Telegr & Teleph Corp <Ntt> 文字認識装置,文字認識方法,および文字認識方法を実行するプログラムを記録した記録媒体
JP2003108551A (ja) * 2001-09-28 2003-04-11 Toshiba Corp 携帯型機械翻訳装置、翻訳方法及び翻訳プログラム
CN1615478A (zh) * 2001-12-10 2005-05-11 三菱电机株式会社 便携终端式图像处理系统、便携终端和服务器
CN1737822A (zh) * 2004-05-20 2006-02-22 微软公司 用于照相机获得的文件的低分辨率光学字符识别
CN1741034A (zh) * 2004-08-25 2006-03-01 富士施乐株式会社 字符识别装置和字符识别方法
JP2009086349A (ja) * 2007-09-28 2009-04-23 Fujifilm Corp 撮影装置及び撮影制御方法
JP2009258871A (ja) * 2008-04-15 2009-11-05 Casio Comput Co Ltd 翻訳装置及びプログラム
JP2011134144A (ja) * 2009-12-25 2011-07-07 Square Enix Co Ltd リアルタイムなカメラ辞書
US20110176720A1 (en) * 2010-01-15 2011-07-21 Robert Michael Van Osten Digital Image Transitions

Family Cites Families (16)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
JPH0520300A (ja) * 1991-07-15 1993-01-29 Sharp Corp 文書処理装置
CA2155891A1 (en) * 1994-10-18 1996-04-19 Raymond Amand Lorie Optical character recognition system having context analyzer
JP2002209262A (ja) 2001-01-09 2002-07-26 Casio Comput Co Ltd 携帯通信装置
JP4269811B2 (ja) * 2003-07-09 2009-05-27 株式会社日立製作所 携帯電話
WO2005066882A1 (ja) 2004-01-08 2005-07-21 Nec Corporation 文字認識装置、移動通信システム、移動端末装置、固定局装置、文字認識方法および文字認識プログラム
US7565139B2 (en) 2004-02-20 2009-07-21 Google Inc. Image-based search engine for mobile phones with camera
US7840033B2 (en) * 2004-04-02 2010-11-23 K-Nfb Reading Technology, Inc. Text stitching from multiple images
US8036895B2 (en) 2004-04-02 2011-10-11 K-Nfb Reading Technology, Inc. Cooperative processing for portable reading machine
WO2006105108A2 (en) * 2005-03-28 2006-10-05 United States Postal Service Multigraph optical character reader enhancement systems and methods
US7826665B2 (en) * 2005-12-12 2010-11-02 Xerox Corporation Personal information retrieval using knowledge bases for optical character recognition correction
US7814040B1 (en) * 2006-01-31 2010-10-12 The Research Foundation Of State University Of New York System and method for image annotation and multi-modal image retrieval using probabilistic semantic models
US20070257934A1 (en) 2006-05-08 2007-11-08 David Doermann System and method for efficient enhancement to enable computer vision on mobile devices
US8041555B2 (en) 2007-08-15 2011-10-18 International Business Machines Corporation Language translation based on a location of a wireless device
US8000956B2 (en) 2008-02-08 2011-08-16 Xerox Corporation Semantic compatibility checking for automatic correction and discovery of named entities
JP2009199102A (ja) * 2008-02-19 2009-09-03 Fujitsu Ltd 文字認識プログラム、文字認識装置及び文字認識方法
US8406531B2 (en) 2008-05-15 2013-03-26 Yahoo! Inc. Data access based on content of image recorded by a mobile device

Patent Citations (9)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
JP2000348142A (ja) * 1999-06-08 2000-12-15 Nippon Telegr & Teleph Corp <Ntt> 文字認識装置,文字認識方法,および文字認識方法を実行するプログラムを記録した記録媒体
JP2003108551A (ja) * 2001-09-28 2003-04-11 Toshiba Corp 携帯型機械翻訳装置、翻訳方法及び翻訳プログラム
CN1615478A (zh) * 2001-12-10 2005-05-11 三菱电机株式会社 便携终端式图像处理系统、便携终端和服务器
CN1737822A (zh) * 2004-05-20 2006-02-22 微软公司 用于照相机获得的文件的低分辨率光学字符识别
CN1741034A (zh) * 2004-08-25 2006-03-01 富士施乐株式会社 字符识别装置和字符识别方法
JP2009086349A (ja) * 2007-09-28 2009-04-23 Fujifilm Corp 撮影装置及び撮影制御方法
JP2009258871A (ja) * 2008-04-15 2009-11-05 Casio Comput Co Ltd 翻訳装置及びプログラム
JP2011134144A (ja) * 2009-12-25 2011-07-07 Square Enix Co Ltd リアルタイムなカメラ辞書
US20110176720A1 (en) * 2010-01-15 2011-07-21 Robert Michael Van Osten Digital Image Transitions

Also Published As

Publication number Publication date
US20130108115A1 (en) 2013-05-02
WO2013032639A2 (en) 2013-03-07
KR101667463B1 (ko) 2016-10-18
US9082035B2 (en) 2015-07-14
WO2013032639A3 (en) 2013-07-18
JP2014529822A (ja) 2014-11-13
JP2016146187A (ja) 2016-08-12
KR20140059834A (ko) 2014-05-16
JP6148235B2 (ja) 2017-06-14
CN103765440A (zh) 2014-04-30
JP6138305B2 (ja) 2017-05-31
EP2751741A2 (en) 2014-07-09

Similar Documents

Publication Publication Date Title
CN103765440B (zh) 使用背景信息的移动装置上的光学字符辨识
US7840033B2 (en) Text stitching from multiple images
WO2020232861A1 (zh) 命名实体识别方法、电子装置及存储介质
US20160344860A1 (en) Document and image processing
US20170109615A1 (en) Systems and Methods for Automatically Classifying Businesses from Images
US7325735B2 (en) Directed reading mode for portable reading machine
CN109189879B (zh) 电子书籍显示方法及装置
US11550754B2 (en) Electronic apparatus and control method thereof
CN111465918B (zh) 在预览界面中显示业务信息的方法及电子设备
US20060015342A1 (en) Document mode processing for portable reading machine enabling document navigation
JP2014102669A (ja) 情報処理装置、情報処理方法およびプログラム
JP2012185722A (ja) 文字認識装置、文字認識方法、文字認識システム、および文字認識プログラム
Ramiah et al. Detecting text based image with optical character recognition for English translation and speech using Android
EP2806336A1 (en) Text prediction in a text input associated with an image
Pu et al. Framework based on mobile augmented reality for translating food menu in Thai language to Malay language
Noorian et al. St-sem: A multimodal method for points-of-interest classification using street-level imagery
US11651280B2 (en) Recording medium, information processing system, and information processing method
JP2011165092A (ja) 文書画像関連情報提供装置、及び文書画像関連情報取得システム
Shilkrot et al. FingerReader: A finger-worn assistive augmentation
Rai et al. MyOcrTool: visualization system for generating associative images of Chinese characters in smart devices
Selvaraj et al. Enhanced portable text to speech converter for visually impaired
Gaudissart et al. SYPOLE: a mobile assistant for the blind
NS et al. Smart reader for visually impaired
CN112364700A (zh) 一种内容标记方法及终端设备
CN111460134A (zh) 手写轨迹的笔记摘录方法、终端及计算机存储介质

Legal Events

Date Code Title Description
C06 Publication
PB01 Publication
C10 Entry into substantive examination
SE01 Entry into force of request for substantive examination
GR01 Patent grant
GR01 Patent grant
CF01 Termination of patent right due to non-payment of annual fee

Granted publication date: 20180403

Termination date: 20190806

CF01 Termination of patent right due to non-payment of annual fee