CN103765440B - 使用背景信息的移动装置上的光学字符辨识 - Google Patents
使用背景信息的移动装置上的光学字符辨识 Download PDFInfo
- Publication number
- CN103765440B CN103765440B CN201280041851.XA CN201280041851A CN103765440B CN 103765440 B CN103765440 B CN 103765440B CN 201280041851 A CN201280041851 A CN 201280041851A CN 103765440 B CN103765440 B CN 103765440B
- Authority
- CN
- China
- Prior art keywords
- ocr
- background
- image
- drawing object
- group
- Prior art date
- Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
- Expired - Fee Related
Links
Classifications
-
- G—PHYSICS
- G06—COMPUTING OR CALCULATING; COUNTING
- G06V—IMAGE OR VIDEO RECOGNITION OR UNDERSTANDING
- G06V30/00—Character recognition; Recognising digital ink; Document-oriented image-based pattern recognition
- G06V30/10—Character recognition
- G06V30/12—Detection or correction of errors, e.g. by rescanning the pattern
- G06V30/127—Detection or correction of errors, e.g. by rescanning the pattern with the intervention of an operator
-
- G—PHYSICS
- G06—COMPUTING OR CALCULATING; COUNTING
- G06V—IMAGE OR VIDEO RECOGNITION OR UNDERSTANDING
- G06V10/00—Arrangements for image or video recognition or understanding
- G06V10/20—Image preprocessing
- G06V10/22—Image preprocessing by selection of a specific region containing or referencing a pattern; Locating or processing of specific regions to guide the detection or recognition
-
- G—PHYSICS
- G06—COMPUTING OR CALCULATING; COUNTING
- G06V—IMAGE OR VIDEO RECOGNITION OR UNDERSTANDING
- G06V10/00—Arrangements for image or video recognition or understanding
- G06V10/70—Arrangements for image or video recognition or understanding using pattern recognition or machine learning
- G06V10/768—Arrangements for image or video recognition or understanding using pattern recognition or machine learning using context analysis, e.g. recognition aided by known co-occurring patterns
-
- G—PHYSICS
- G06—COMPUTING OR CALCULATING; COUNTING
- G06V—IMAGE OR VIDEO RECOGNITION OR UNDERSTANDING
- G06V10/00—Arrangements for image or video recognition or understanding
- G06V10/98—Detection or correction of errors, e.g. by rescanning the pattern or by human intervention; Evaluation of the quality of the acquired patterns
- G06V10/987—Detection or correction of errors, e.g. by rescanning the pattern or by human intervention; Evaluation of the quality of the acquired patterns with the intervention of an operator
-
- G—PHYSICS
- G06—COMPUTING OR CALCULATING; COUNTING
- G06V—IMAGE OR VIDEO RECOGNITION OR UNDERSTANDING
- G06V30/00—Character recognition; Recognising digital ink; Document-oriented image-based pattern recognition
- G06V30/10—Character recognition
- G06V30/26—Techniques for post-processing, e.g. correcting the recognition result
- G06V30/262—Techniques for post-processing, e.g. correcting the recognition result using context analysis, e.g. lexical, syntactic or semantic context
- G06V30/268—Lexical context
-
- G—PHYSICS
- G06—COMPUTING OR CALCULATING; COUNTING
- G06V—IMAGE OR VIDEO RECOGNITION OR UNDERSTANDING
- G06V30/00—Character recognition; Recognising digital ink; Document-oriented image-based pattern recognition
- G06V30/10—Character recognition
- G06V30/26—Techniques for post-processing, e.g. correcting the recognition result
- G06V30/262—Techniques for post-processing, e.g. correcting the recognition result using context analysis, e.g. lexical, syntactic or semantic context
- G06V30/274—Syntactic or semantic context, e.g. balancing
-
- H—ELECTRICITY
- H04—ELECTRIC COMMUNICATION TECHNIQUE
- H04M—TELEPHONIC COMMUNICATION
- H04M1/00—Substation equipment, e.g. for use by subscribers
- H04M1/247—Telephone sets including user guidance or feature selection means facilitating their use
-
- G—PHYSICS
- G06—COMPUTING OR CALCULATING; COUNTING
- G06V—IMAGE OR VIDEO RECOGNITION OR UNDERSTANDING
- G06V2201/00—Indexing scheme relating to image or video recognition or understanding
- G06V2201/10—Recognition assisted with metadata
-
- G—PHYSICS
- G06—COMPUTING OR CALCULATING; COUNTING
- G06V—IMAGE OR VIDEO RECOGNITION OR UNDERSTANDING
- G06V30/00—Character recognition; Recognising digital ink; Document-oriented image-based pattern recognition
- G06V30/10—Character recognition
-
- G—PHYSICS
- G06—COMPUTING OR CALCULATING; COUNTING
- G06V—IMAGE OR VIDEO RECOGNITION OR UNDERSTANDING
- G06V30/00—Character recognition; Recognising digital ink; Document-oriented image-based pattern recognition
- G06V30/10—Character recognition
- G06V30/28—Character recognition specially adapted to the type of the alphabet, e.g. Latin alphabet
- G06V30/287—Character recognition specially adapted to the type of the alphabet, e.g. Latin alphabet of Kanji, Hiragana or Katakana characters
Landscapes
- Engineering & Computer Science (AREA)
- Theoretical Computer Science (AREA)
- Physics & Mathematics (AREA)
- General Physics & Mathematics (AREA)
- Multimedia (AREA)
- Computer Vision & Pattern Recognition (AREA)
- Computational Linguistics (AREA)
- Health & Medical Sciences (AREA)
- Evolutionary Computation (AREA)
- Signal Processing (AREA)
- Quality & Reliability (AREA)
- Artificial Intelligence (AREA)
- Computing Systems (AREA)
- Databases & Information Systems (AREA)
- Human Computer Interaction (AREA)
- General Health & Medical Sciences (AREA)
- Medical Informatics (AREA)
- Software Systems (AREA)
- User Interface Of Digital Computer (AREA)
- Character Discrimination (AREA)
- Character Input (AREA)
- Information Retrieval, Db Structures And Fs Structures Therefor (AREA)
Applications Claiming Priority (5)
| Application Number | Priority Date | Filing Date | Title |
|---|---|---|---|
| US201161528741P | 2011-08-29 | 2011-08-29 | |
| US61/528,741 | 2011-08-29 | ||
| US13/450,016 US9082035B2 (en) | 2011-08-29 | 2012-04-18 | Camera OCR with context information |
| US13/450,016 | 2012-04-18 | ||
| PCT/US2012/049786 WO2013032639A2 (en) | 2011-08-29 | 2012-08-06 | Camera ocr with context information |
Publications (2)
| Publication Number | Publication Date |
|---|---|
| CN103765440A CN103765440A (zh) | 2014-04-30 |
| CN103765440B true CN103765440B (zh) | 2018-04-03 |
Family
ID=46642660
Family Applications (1)
| Application Number | Title | Priority Date | Filing Date |
|---|---|---|---|
| CN201280041851.XA Expired - Fee Related CN103765440B (zh) | 2011-08-29 | 2012-08-06 | 使用背景信息的移动装置上的光学字符辨识 |
Country Status (6)
| Country | Link |
|---|---|
| US (1) | US9082035B2 (enExample) |
| EP (1) | EP2751741A2 (enExample) |
| JP (2) | JP6148235B2 (enExample) |
| KR (1) | KR101667463B1 (enExample) |
| CN (1) | CN103765440B (enExample) |
| WO (1) | WO2013032639A2 (enExample) |
Families Citing this family (27)
| Publication number | Priority date | Publication date | Assignee | Title |
|---|---|---|---|---|
| US9292498B2 (en) * | 2012-03-21 | 2016-03-22 | Paypal, Inc. | Device orientation based translation system |
| US9519641B2 (en) * | 2012-09-18 | 2016-12-13 | Abbyy Development Llc | Photography recognition translation |
| JP5708689B2 (ja) * | 2013-03-13 | 2015-04-30 | 株式会社デンソー | 物体検出装置 |
| US9367811B2 (en) * | 2013-03-15 | 2016-06-14 | Qualcomm Incorporated | Context aware localization, mapping, and tracking |
| US9727535B2 (en) | 2013-06-11 | 2017-08-08 | Microsoft Technology Licensing, Llc | Authoring presentations with ink |
| US10140257B2 (en) | 2013-08-02 | 2018-11-27 | Symbol Technologies, Llc | Method and apparatus for capturing and processing content from context sensitive documents on a mobile device |
| US10769362B2 (en) | 2013-08-02 | 2020-09-08 | Symbol Technologies, Llc | Method and apparatus for capturing and extracting content from documents on a mobile device |
| KR101520389B1 (ko) * | 2014-01-10 | 2015-05-14 | 윤창수 | 검색정보 획득방법, 이를 실행하기 위한 프로그램을 저장한 기록매체 및 휴대용 단말기 |
| JP2015153342A (ja) * | 2014-02-19 | 2015-08-24 | 三菱電機株式会社 | 設備点検装置および設備点検管理システム |
| US9355336B1 (en) * | 2014-04-23 | 2016-05-31 | Amazon Technologies, Inc. | Recognizing text from frames of image data using contextual information |
| US9436682B2 (en) * | 2014-06-24 | 2016-09-06 | Google Inc. | Techniques for machine language translation of text from an image based on non-textual context information from the image |
| JP6027580B2 (ja) * | 2014-08-27 | 2016-11-16 | 京セラドキュメントソリューションズ株式会社 | 情報表示システムおよび情報表示プログラム |
| JP2016076167A (ja) * | 2014-10-08 | 2016-05-12 | ソニー株式会社 | 情報処理装置および情報処理方法 |
| IL235565B (en) * | 2014-11-06 | 2019-06-30 | Kolton Achiav | Position-based optical character recognition |
| US10530720B2 (en) * | 2015-08-27 | 2020-01-07 | Mcafee, Llc | Contextual privacy engine for notifications |
| US10943398B2 (en) * | 2016-07-15 | 2021-03-09 | Samsung Electronics Co., Ltd. | Augmented reality device and operation thereof |
| US10311330B2 (en) | 2016-08-17 | 2019-06-04 | International Business Machines Corporation | Proactive input selection for improved image analysis and/or processing workflows |
| US10579741B2 (en) | 2016-08-17 | 2020-03-03 | International Business Machines Corporation | Proactive input selection for improved machine translation |
| US20200026766A1 (en) * | 2016-09-28 | 2020-01-23 | Systran International Co., Ltd. | Method for translating characters and apparatus therefor |
| KR102721107B1 (ko) * | 2017-01-02 | 2024-10-24 | 삼성전자주식회사 | 텍스트를 인식하는 방법 및 단말기 |
| US11263399B2 (en) * | 2017-07-31 | 2022-03-01 | Apple Inc. | Correcting input based on user context |
| CN108288067B (zh) * | 2017-09-12 | 2020-07-24 | 腾讯科技(深圳)有限公司 | 图像文本匹配模型的训练方法、双向搜索方法及相关装置 |
| KR102478396B1 (ko) * | 2017-11-29 | 2022-12-19 | 삼성전자주식회사 | 이미지에서 텍스트를 인식할 수 있는 전자 장치 |
| EP4172805A4 (en) * | 2020-06-25 | 2024-06-19 | Pryon Incorporated | DOCUMENT PROCESSING AND RESPONSE GENERATION SYSTEM |
| KR20220056004A (ko) | 2020-10-27 | 2022-05-04 | 삼성전자주식회사 | 전자 장치 및 이의 제어 방법 |
| CN115809672A (zh) * | 2021-09-14 | 2023-03-17 | 北京小米移动软件有限公司 | 翻译方法、装置、ar眼镜、存储介质及计算机程序产品 |
| US12380720B2 (en) | 2022-12-30 | 2025-08-05 | Konica Minolta Business Solutions U.S.A., Inc. | Method, apparatus, and system for character recognition using context |
Citations (9)
| Publication number | Priority date | Publication date | Assignee | Title |
|---|---|---|---|---|
| JP2000348142A (ja) * | 1999-06-08 | 2000-12-15 | Nippon Telegr & Teleph Corp <Ntt> | 文字認識装置,文字認識方法,および文字認識方法を実行するプログラムを記録した記録媒体 |
| JP2003108551A (ja) * | 2001-09-28 | 2003-04-11 | Toshiba Corp | 携帯型機械翻訳装置、翻訳方法及び翻訳プログラム |
| CN1615478A (zh) * | 2001-12-10 | 2005-05-11 | 三菱电机株式会社 | 便携终端式图像处理系统、便携终端和服务器 |
| CN1737822A (zh) * | 2004-05-20 | 2006-02-22 | 微软公司 | 用于照相机获得的文件的低分辨率光学字符识别 |
| CN1741034A (zh) * | 2004-08-25 | 2006-03-01 | 富士施乐株式会社 | 字符识别装置和字符识别方法 |
| JP2009086349A (ja) * | 2007-09-28 | 2009-04-23 | Fujifilm Corp | 撮影装置及び撮影制御方法 |
| JP2009258871A (ja) * | 2008-04-15 | 2009-11-05 | Casio Comput Co Ltd | 翻訳装置及びプログラム |
| JP2011134144A (ja) * | 2009-12-25 | 2011-07-07 | Square Enix Co Ltd | リアルタイムなカメラ辞書 |
| US20110176720A1 (en) * | 2010-01-15 | 2011-07-21 | Robert Michael Van Osten | Digital Image Transitions |
Family Cites Families (16)
| Publication number | Priority date | Publication date | Assignee | Title |
|---|---|---|---|---|
| JPH0520300A (ja) * | 1991-07-15 | 1993-01-29 | Sharp Corp | 文書処理装置 |
| CA2155891A1 (en) * | 1994-10-18 | 1996-04-19 | Raymond Amand Lorie | Optical character recognition system having context analyzer |
| JP2002209262A (ja) | 2001-01-09 | 2002-07-26 | Casio Comput Co Ltd | 携帯通信装置 |
| JP4269811B2 (ja) * | 2003-07-09 | 2009-05-27 | 株式会社日立製作所 | 携帯電話 |
| WO2005066882A1 (ja) | 2004-01-08 | 2005-07-21 | Nec Corporation | 文字認識装置、移動通信システム、移動端末装置、固定局装置、文字認識方法および文字認識プログラム |
| US7565139B2 (en) | 2004-02-20 | 2009-07-21 | Google Inc. | Image-based search engine for mobile phones with camera |
| US7840033B2 (en) * | 2004-04-02 | 2010-11-23 | K-Nfb Reading Technology, Inc. | Text stitching from multiple images |
| US8036895B2 (en) | 2004-04-02 | 2011-10-11 | K-Nfb Reading Technology, Inc. | Cooperative processing for portable reading machine |
| WO2006105108A2 (en) * | 2005-03-28 | 2006-10-05 | United States Postal Service | Multigraph optical character reader enhancement systems and methods |
| US7826665B2 (en) * | 2005-12-12 | 2010-11-02 | Xerox Corporation | Personal information retrieval using knowledge bases for optical character recognition correction |
| US7814040B1 (en) * | 2006-01-31 | 2010-10-12 | The Research Foundation Of State University Of New York | System and method for image annotation and multi-modal image retrieval using probabilistic semantic models |
| US20070257934A1 (en) | 2006-05-08 | 2007-11-08 | David Doermann | System and method for efficient enhancement to enable computer vision on mobile devices |
| US8041555B2 (en) | 2007-08-15 | 2011-10-18 | International Business Machines Corporation | Language translation based on a location of a wireless device |
| US8000956B2 (en) | 2008-02-08 | 2011-08-16 | Xerox Corporation | Semantic compatibility checking for automatic correction and discovery of named entities |
| JP2009199102A (ja) * | 2008-02-19 | 2009-09-03 | Fujitsu Ltd | 文字認識プログラム、文字認識装置及び文字認識方法 |
| US8406531B2 (en) | 2008-05-15 | 2013-03-26 | Yahoo! Inc. | Data access based on content of image recorded by a mobile device |
-
2012
- 2012-04-18 US US13/450,016 patent/US9082035B2/en not_active Expired - Fee Related
- 2012-08-06 JP JP2014528410A patent/JP6148235B2/ja not_active Expired - Fee Related
- 2012-08-06 CN CN201280041851.XA patent/CN103765440B/zh not_active Expired - Fee Related
- 2012-08-06 EP EP12745986.5A patent/EP2751741A2/en not_active Withdrawn
- 2012-08-06 WO PCT/US2012/049786 patent/WO2013032639A2/en not_active Ceased
- 2012-08-06 KR KR1020147008404A patent/KR101667463B1/ko not_active Expired - Fee Related
-
2016
- 2016-03-02 JP JP2016039844A patent/JP6138305B2/ja not_active Expired - Fee Related
Patent Citations (9)
| Publication number | Priority date | Publication date | Assignee | Title |
|---|---|---|---|---|
| JP2000348142A (ja) * | 1999-06-08 | 2000-12-15 | Nippon Telegr & Teleph Corp <Ntt> | 文字認識装置,文字認識方法,および文字認識方法を実行するプログラムを記録した記録媒体 |
| JP2003108551A (ja) * | 2001-09-28 | 2003-04-11 | Toshiba Corp | 携帯型機械翻訳装置、翻訳方法及び翻訳プログラム |
| CN1615478A (zh) * | 2001-12-10 | 2005-05-11 | 三菱电机株式会社 | 便携终端式图像处理系统、便携终端和服务器 |
| CN1737822A (zh) * | 2004-05-20 | 2006-02-22 | 微软公司 | 用于照相机获得的文件的低分辨率光学字符识别 |
| CN1741034A (zh) * | 2004-08-25 | 2006-03-01 | 富士施乐株式会社 | 字符识别装置和字符识别方法 |
| JP2009086349A (ja) * | 2007-09-28 | 2009-04-23 | Fujifilm Corp | 撮影装置及び撮影制御方法 |
| JP2009258871A (ja) * | 2008-04-15 | 2009-11-05 | Casio Comput Co Ltd | 翻訳装置及びプログラム |
| JP2011134144A (ja) * | 2009-12-25 | 2011-07-07 | Square Enix Co Ltd | リアルタイムなカメラ辞書 |
| US20110176720A1 (en) * | 2010-01-15 | 2011-07-21 | Robert Michael Van Osten | Digital Image Transitions |
Also Published As
| Publication number | Publication date |
|---|---|
| US20130108115A1 (en) | 2013-05-02 |
| WO2013032639A2 (en) | 2013-03-07 |
| KR101667463B1 (ko) | 2016-10-18 |
| US9082035B2 (en) | 2015-07-14 |
| WO2013032639A3 (en) | 2013-07-18 |
| JP2014529822A (ja) | 2014-11-13 |
| JP2016146187A (ja) | 2016-08-12 |
| KR20140059834A (ko) | 2014-05-16 |
| JP6148235B2 (ja) | 2017-06-14 |
| CN103765440A (zh) | 2014-04-30 |
| JP6138305B2 (ja) | 2017-05-31 |
| EP2751741A2 (en) | 2014-07-09 |
Similar Documents
| Publication | Publication Date | Title |
|---|---|---|
| CN103765440B (zh) | 使用背景信息的移动装置上的光学字符辨识 | |
| US7840033B2 (en) | Text stitching from multiple images | |
| WO2020232861A1 (zh) | 命名实体识别方法、电子装置及存储介质 | |
| US20160344860A1 (en) | Document and image processing | |
| US20170109615A1 (en) | Systems and Methods for Automatically Classifying Businesses from Images | |
| US7325735B2 (en) | Directed reading mode for portable reading machine | |
| CN109189879B (zh) | 电子书籍显示方法及装置 | |
| US11550754B2 (en) | Electronic apparatus and control method thereof | |
| CN111465918B (zh) | 在预览界面中显示业务信息的方法及电子设备 | |
| US20060015342A1 (en) | Document mode processing for portable reading machine enabling document navigation | |
| JP2014102669A (ja) | 情報処理装置、情報処理方法およびプログラム | |
| JP2012185722A (ja) | 文字認識装置、文字認識方法、文字認識システム、および文字認識プログラム | |
| Ramiah et al. | Detecting text based image with optical character recognition for English translation and speech using Android | |
| EP2806336A1 (en) | Text prediction in a text input associated with an image | |
| Pu et al. | Framework based on mobile augmented reality for translating food menu in Thai language to Malay language | |
| Noorian et al. | St-sem: A multimodal method for points-of-interest classification using street-level imagery | |
| US11651280B2 (en) | Recording medium, information processing system, and information processing method | |
| JP2011165092A (ja) | 文書画像関連情報提供装置、及び文書画像関連情報取得システム | |
| Shilkrot et al. | FingerReader: A finger-worn assistive augmentation | |
| Rai et al. | MyOcrTool: visualization system for generating associative images of Chinese characters in smart devices | |
| Selvaraj et al. | Enhanced portable text to speech converter for visually impaired | |
| Gaudissart et al. | SYPOLE: a mobile assistant for the blind | |
| NS et al. | Smart reader for visually impaired | |
| CN112364700A (zh) | 一种内容标记方法及终端设备 | |
| CN111460134A (zh) | 手写轨迹的笔记摘录方法、终端及计算机存储介质 |
Legal Events
| Date | Code | Title | Description |
|---|---|---|---|
| C06 | Publication | ||
| PB01 | Publication | ||
| C10 | Entry into substantive examination | ||
| SE01 | Entry into force of request for substantive examination | ||
| GR01 | Patent grant | ||
| GR01 | Patent grant | ||
| CF01 | Termination of patent right due to non-payment of annual fee |
Granted publication date: 20180403 Termination date: 20190806 |
|
| CF01 | Termination of patent right due to non-payment of annual fee |