CN104871180B - Text image quality based feedback for OCR - Google Patents

Text image quality based feedback for OCR

Info

Publication number
CN104871180B
Authority
CN
China
Prior art keywords
text
image
ocr
text field
zoom
Prior art date
Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
Expired - Fee Related
Application number
CN201380064784.8A
Other languages
English (en)
Chinese (zh)
Other versions
CN104871180A (zh)
Inventor
P·K·拜哈提
A·S·比塞恩
R·桑德拉拉简
D·A·戈尔
Current Assignee (The listed assignees may be inaccurate. Google has not performed a legal analysis and makes no representation or warranty as to the accuracy of the list.)
Qualcomm Inc
Original Assignee
Qualcomm Inc
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
2012-12-13
Filing date
2013-11-22
Publication date
2017-05-03
Application filed by Qualcomm Inc
Publication of CN104871180A
Application granted
Publication of CN104871180B
Expired - Fee Related
Anticipated expiration

Classifications

    • G PHYSICS
    • G06 COMPUTING OR CALCULATING; COUNTING
    • G06V IMAGE OR VIDEO RECOGNITION OR UNDERSTANDING
    • G06V30/00 Character recognition; Recognising digital ink; Document-oriented image-based pattern recognition
    • G06V30/10 Character recognition
    • G06V30/14 Image acquisition
    • G06V30/1444 Selective acquisition, locating or processing of specific regions, e.g. highlighted text, fiducial marks or predetermined fields
    • G06V30/1456 Selective acquisition, locating or processing of specific regions, e.g. highlighted text, fiducial marks or predetermined fields based on user interactions
    • G PHYSICS
    • G06 COMPUTING OR CALCULATING; COUNTING
    • G06V IMAGE OR VIDEO RECOGNITION OR UNDERSTANDING
    • G06V20/00 Scenes; Scene-specific elements
    • G06V20/60 Type of objects
    • G06V20/62 Text, e.g. of license plates, overlay texts or captions on TV images
    • G06V20/63 Scene text, e.g. street names
    • G PHYSICS
    • G06 COMPUTING OR CALCULATING; COUNTING
    • G06V IMAGE OR VIDEO RECOGNITION OR UNDERSTANDING
    • G06V30/00 Character recognition; Recognising digital ink; Document-oriented image-based pattern recognition
    • G06V30/10 Character recognition

Landscapes

  • Engineering & Computer Science (AREA)
  • Physics & Mathematics (AREA)
  • General Physics & Mathematics (AREA)
  • Multimedia (AREA)
  • Theoretical Computer Science (AREA)
  • Human Computer Interaction (AREA)
  • Computer Vision & Pattern Recognition (AREA)
  • Character Input (AREA)
  • Character Discrimination (AREA)
CN201380064784.8A 2012-12-13 2013-11-22 Text image quality based feedback for OCR Expired - Fee Related CN104871180B (zh)

Applications Claiming Priority (5)

Application Number Priority Date Filing Date Title
IN5200CH2012 2012-12-13
IN5200/CHE/2012 2012-12-13
US13/843,637 2013-03-15
US13/843,637 US9317764B2 (en) 2012-12-13 2013-03-15 Text image quality based feedback for improving OCR
PCT/US2013/071479 WO2014092978A1 (en) 2012-12-13 2013-11-22 Text image quality based feedback for ocr

Publications (2)

Publication Number Publication Date
CN104871180A (zh) 2015-08-26
CN104871180B (zh) 2017-05-03

Family

ID=50930450

Family Applications (1)

Application Number Title Priority Date Filing Date
CN201380064784.8A Expired - Fee Related CN104871180B (zh) 2012-12-13 2013-11-22 Text image quality based feedback for OCR

Country Status (5)

Country Link
US (1) US9317764B2 (en)
EP (1) EP2932437A1 (en)
JP (1) JP6129987B2 (en)
CN (1) CN104871180B (en)
WO (1) WO2014092978A1 (en)

Families Citing this family (67)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
US20130103306A1 (en) * 2010-06-15 2013-04-25 Navitime Japan Co., Ltd. Navigation system, terminal apparatus, navigation server, navigation apparatus, navigation method, and computer program product
US20130194448A1 (en) 2012-01-26 2013-08-01 Qualcomm Incorporated Rules for merging blocks of connected components in natural images
US9064191B2 (en) 2012-01-26 2015-06-23 Qualcomm Incorporated Lower modifier detection and extraction from devanagari text images to improve OCR performance
US9047540B2 (en) 2012-07-19 2015-06-02 Qualcomm Incorporated Trellis based word decoder with reverse pass
US9183458B2 (en) 2012-07-19 2015-11-10 Qualcomm Incorporated Parameter selection and coarse localization of interest regions for MSER processing
US9262699B2 (en) 2012-07-19 2016-02-16 Qualcomm Incorporated Method of handling complex variants of words through prefix-tree based decoding for Devanagiri OCR
US9076242B2 (en) * 2012-07-19 2015-07-07 Qualcomm Incorporated Automatic correction of skew in natural images and video
US9141874B2 (en) 2012-07-19 2015-09-22 Qualcomm Incorporated Feature extraction and use with a probability density function (PDF) divergence metric
DE102013005658A1 (de) * 2013-04-02 2014-10-02 Docuware Gmbh Capture of a document
US9141865B2 (en) * 2013-10-28 2015-09-22 Itseez, Inc. Fast single-pass interest operator for text and object detection
US9465774B2 (en) 2014-04-02 2016-10-11 Benoit Maison Optical character recognition system using multiple images and method of use
GB2525170A (en) 2014-04-07 2015-10-21 Nokia Technologies Oy Stereo viewing
JP2015207181A (ja) * 2014-04-22 2015-11-19 ソニー株式会社 Information processing apparatus, information processing method, and computer program
CN104200236B (zh) * 2014-08-22 2018-10-26 浙江生辉照明有限公司 DPM-based fast object detection method
US9639951B2 (en) * 2014-10-23 2017-05-02 Khalifa University of Science, Technology & Research Object detection and tracking using depth data
KR102448565B1 (ko) 2014-12-11 2022-09-29 삼성전자주식회사 User terminal device and control method thereof
US9256775B1 (en) * 2014-12-23 2016-02-09 Toshiba Tec Kabushiki Kaisha Image recognition apparatus and commodity information processing apparatus
US9953216B2 (en) * 2015-01-13 2018-04-24 Google Llc Systems and methods for performing actions in response to user gestures in captured images
US9830508B1 (en) 2015-01-30 2017-11-28 Quest Consultants LLC Systems and methods of extracting text from a digital image
US9984287B2 (en) * 2015-03-05 2018-05-29 Wipro Limited Method and image processing apparatus for performing optical character recognition (OCR) of an article
US9466001B1 (en) * 2015-04-07 2016-10-11 Toshiba Tec Kabushiki Kaisha Image processing apparatus and computer-readable storage medium
US9619701B2 (en) * 2015-05-20 2017-04-11 Xerox Corporation Using motion tracking and image categorization for document indexing and validation
US10242277B1 (en) * 2015-07-08 2019-03-26 Amazon Technologies, Inc. Validating digital content rendering
US10721407B1 (en) * 2015-09-23 2020-07-21 Charles W. Moyes Real-time image capture system
US10121232B1 (en) * 2015-12-23 2018-11-06 Evernote Corporation Visual quality of photographs with handwritten content
WO2017120660A1 (en) 2016-01-12 2017-07-20 Esight Corp. Language element vision augmentation methods and devices
US10002435B2 (en) * 2016-01-29 2018-06-19 Google Llc Detecting motion in images
US20170286383A1 (en) * 2016-03-30 2017-10-05 Microsoft Technology Licensing, Llc Augmented imaging assistance for visual impairment
RU2613849C1 (ru) * 2016-05-13 2017-03-21 Общество с ограниченной ответственностью "Аби Девелопмент" Optical character recognition of a series of images
CN105975955B (zh) * 2016-05-27 2019-07-02 北京医拍智能科技有限公司 Method for detecting text regions in an image
US10210384B2 (en) 2016-07-25 2019-02-19 Intuit Inc. Optical character recognition (OCR) accuracy by combining results across video frames
JP6531738B2 (ja) * 2016-08-08 2019-06-19 京セラドキュメントソリューションズ株式会社 Image processing apparatus
JP6917688B2 (ja) * 2016-09-02 2021-08-11 株式会社東芝 Form reading apparatus, form reading method, program, and form reading system
RU2640296C1 (ru) * 2016-12-06 2017-12-27 Общество с ограниченной ответственностью "Аби Девелопмент" Method and device for determining the suitability of a document for optical character recognition (OCR) on a server
BE1025006B1 (fr) * 2017-02-27 2018-09-25 I.R.I.S. Computer-implemented method and optical character recognition system
JP6448696B2 (ja) * 2017-03-22 2019-01-09 株式会社東芝 Information processing apparatus, method, and program
CN107194890B (zh) * 2017-05-18 2020-07-28 上海兆芯集成电路有限公司 Method and apparatus for improving image quality using multiple resolutions
CN107194891B (zh) 2017-05-18 2020-11-10 上海兆芯集成电路有限公司 Method for improving image quality and virtual reality device
WO2019017961A1 (en) * 2017-07-21 2019-01-24 Hewlett-Packard Development Company, L.P. OPTICAL RECOGNITION OF CHARACTERS BY CONSENSUS OF DATA SETS
EP3659066A4 (en) * 2017-07-25 2021-02-24 Hewlett-Packard Development Company, L.P. DETERMINATIONS OF SHARPNESS OF CHARACTER RECOGNITION
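CN108229483A (zh) * 2018-01-11 2018-06-29 中国计量大学 Doorplate embossed character recognition device based on Caffe and soft triggering
JP2019211595A (ja) * 2018-06-04 2019-12-12 富士ゼロックス株式会社 Display control device, program, and display system
CN110609877B (zh) * 2018-06-14 2023-04-18 百度在线网络技术(北京)有限公司 Picture acquisition method, apparatus, device, and computer storage medium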
US20200004815A1 (en) * 2018-06-29 2020-01-02 Microsoft Technology Licensing, Llc Text entity detection and recognition from images
CN110766014B (zh) * 2018-09-06 2020-05-29 邬国锐 Bill information positioning method, system, and computer-readable storage medium
US11373400B1 (en) * 2019-03-18 2022-06-28 Express Scripts Strategic Development, Inc. Methods and systems for image processing to present data in augmented reality
US11631266B2 (en) * 2019-04-02 2023-04-18 Wilco Source Inc Automated document intake and processing system
US11687796B2 (en) 2019-04-17 2023-06-27 International Business Machines Corporation Document type-specific quality model
CN113993374A (zh) * 2019-06-21 2022-01-28 松下知识产权经营株式会社 Animal information management system and animal information management method
US11176410B2 (en) * 2019-10-27 2021-11-16 John Snow Labs Inc. Preprocessing images for OCR using character pixel height estimation and cycle generative adversarial networks for better character recognition
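CN111444794B (zh) * 2020-03-13 2023-12-12 安诚迈科(北京)信息技术有限公司 OCR-based bill recognition assistance method, device, storage medium, and apparatus
CN111639566B (zh) * 2020-05-19 2024-08-09 浙江大华技术股份有限公司 Method and apparatus for extracting form information
CN111709414A (zh) * 2020-06-29 2020-09-25 济南浪潮高新科技投资发展有限公司 AR device and text recognition method, apparatus, and computer-readable storage medium thereof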
EP3933678A1 (en) * 2020-06-30 2022-01-05 Ricoh Company, Ltd. Information processing system, data output system, image processing method, and carrier means
US11417079B2 (en) * 2020-07-14 2022-08-16 International Business Machines Corporation Viewfinder assistant for visually impaired
TWI790471B (zh) * 2020-08-26 2023-01-21 財團法人工業技術研究院 Deep-learning-based image correction method and system
US11494944B2 (en) 2020-11-18 2022-11-08 Disney Enterprises, Inc. Automatic low contrast detection
US11544828B2 (en) 2020-11-18 2023-01-03 Disney Enterprises, Inc. Automatic occlusion detection
JP2022092837A (ja) * 2020-12-11 2022-06-23 株式会社東海理化電機製作所 Control device and program
US11893784B2 (en) 2021-05-14 2024-02-06 Abbyy Development Inc. Assessment of image quality for optical character recognition using machine learning
CN113221801B (zh) * 2021-05-24 2023-08-18 北京奇艺世纪科技有限公司 Version number information recognition method, apparatus, electronic device, and readable storage medium
US12342070B2 (en) 2021-08-12 2025-06-24 Google Llc Low power machine learning using real-time captured regions of interest
WO2023019247A1 (en) * 2021-08-12 2023-02-16 Google Llc Low power machine learning using real-time captured regions of interest
WO2023094861A1 (en) * 2021-11-25 2023-06-01 L&T Technology Services Limited A system and method for visual text transformation
EP4242988A1 (en) 2022-03-11 2023-09-13 Tata Consultancy Services Limited Method and system to detect a text from multimedia content captured at a scene
AU2023249062B2 (en) * 2022-04-08 2025-12-04 Thomson Reuters Enterprise Centre Gmbh System and method for machine learning document partitioning
US12425713B2 (en) 2023-08-07 2025-09-23 Motorola Solutions, Inc. Imaging system with object recognition feedback

Citations (3)

Publication number Priority date Publication date Assignee Title
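CN101609505A (zh) * 2008-06-19 2009-12-23 三星电子株式会社 Method and apparatus for recognizing characters
CN101689328A (zh) * 2008-06-11 2010-03-31 松下电器产业株式会社 Pattern recognition apparatus, pattern recognition method, image processing apparatus, and image processing method
CN101753846A (zh) * 2008-12-05 2010-06-23 三星电子株式会社 Apparatus and method for automatically adjusting character size using a camera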

Family Cites Families (15)

Publication number Priority date Publication date Assignee Title
JPH08106510A (ja) * 1994-10-05 1996-04-23 Casio Comput Co Ltd Character reading device and character recognition device
US7734500B1 (en) * 2001-10-17 2010-06-08 United Toll Systems, Inc. Multiple RF read zone system
US6922487B2 (en) * 2001-11-02 2005-07-26 Xerox Corporation Method and apparatus for capturing text images
JP2007520934A (ja) * 2003-12-24 2007-07-26 ウオーカー ディジタル、エルエルシー Method and apparatus for automatically capturing and managing images
US8320708B2 (en) 2004-04-02 2012-11-27 K-Nfb Reading Technology, Inc. Tilt adjustment for optical character recognition in portable reading machine
US8600989B2 (en) 2004-10-01 2013-12-03 Ricoh Co., Ltd. Method and system for image matching in a mixed media environment
JP2006186414A (ja) * 2004-12-24 2006-07-13 Canon Software Inc Image reading apparatus and method, image reading system, program, and storage medium
US7903878B2 (en) 2006-03-30 2011-03-08 Loquitur, Inc. Capturing and presenting text during optical character recognition
US8098934B2 (en) 2006-06-29 2012-01-17 Google Inc. Using extracted image text
US8577118B2 (en) 2008-01-18 2013-11-05 Mitek Systems Systems for mobile image capture and remittance processing
US9842331B2 (en) 2008-01-18 2017-12-12 Mitek Systems, Inc. Systems and methods for mobile image capture and processing of checks
CN101639760A (zh) 2009-08-27 2010-02-03 上海合合信息科技发展有限公司 Contact information input method and system
EP2333695B1 (en) 2009-12-10 2017-08-02 beyo GmbH Method for optimized camera position finding for systems with optical character recognition
US8675923B2 (en) 2010-07-21 2014-03-18 Intuit Inc. Providing feedback about an image of a financial document
US20120030103A1 (en) 2010-07-27 2012-02-02 Gregory Hughes Image-Based Submission and Verification of Redemption Codes

Patent Citations (3)

Publication number Priority date Publication date Assignee Title
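CN101689328A (zh) * 2008-06-11 2010-03-31 松下电器产业株式会社 Pattern recognition apparatus, pattern recognition method, image processing apparatus, and image processing method
CN101609505A (zh) * 2008-06-19 2009-12-23 三星电子株式会社 Method and apparatus for recognizing characters
CN101753846A (zh) * 2008-12-05 2010-06-23 三星电子株式会社 Apparatus and method for automatically adjusting character size using a camera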

Non-Patent Citations (3)

Title
A video based interface to textual information for the visually impaired; Ali Zandifar et al.; "Multimodal Interfaces, 2002, Proceedings. Fourth IEEE International Conference on 14-16 Oct. 2002"; IEEE; 20021014; Sections 1, 3.1, 3.2, Fig. 1 *
Camera-based analysis of text and documents: a survey; Jian Liang et al.; "International Journal of Document Analysis and Recognition (IJDAR)"; Springer; 20050701; Vol. 7, No. 2-3; Sections 2.4, 2.5, 3, 3.1, 4, 4.3, 5, 5.1 *
Extracting low resolution text with an active camera for OCR; Majid Mirmehdi et al.; "Proceedings of the IX Spanish Symposium on Pattern Recognition and Image Processing, 1 May 2001"; 20010501; Section 3, Figs. 1-2 *

Also Published As

Publication number Publication date
JP6129987B2 (ja) 2017-05-17
EP2932437A1 (en) 2015-10-21
WO2014092978A1 (en) 2014-06-19
JP2015537325A (ja) 2015-12-24
CN104871180A (zh) 2015-08-26
US9317764B2 (en) 2016-04-19
US20140168478A1 (en) 2014-06-19

Similar Documents

Publication Publication Date Title
CN104871180B (zh) Text image quality based feedback for OCR
US9171204B2 (en) Method of perspective correction for devanagari text
US8831381B2 (en) Detecting and correcting skew in regions of text in natural images
JP5775225B2 (ja) Text detection using multi-layer connected components with histograms
US9076242B2 (en) Automatic correction of skew in natural images and video
US9262699B2 (en) Method of handling complex variants of words through prefix-tree based decoding for Devanagiri OCR
Ye et al. Text detection and recognition in imagery: A survey
US9141874B2 (en) Feature extraction and use with a probability density function (PDF) divergence metric
US20140023275A1 (en) Redundant aspect ratio decoding of devanagari characters
CN105574513A (zh) Text detection method and apparatus
JP2018045691A (ja) Image viewpoint conversion apparatus and method
Bilgin et al. Road sign recognition system on Raspberry Pi
KR20160146355A (ko) Method and apparatus for detecting text in an image
Wang et al. Multiorientation scene text detection via coarse-to-fine supervision-based convolutional networks
Nor et al. Image segmentation and text extraction: application to the extraction of textual information in scene images
Liu Digits Recognition on Medical Device
Nor et al. A new visual signature for content-based indexing of low resolution documents

Legal Events

Date Code Title Description
PB01 Publication
C10 Entry into substantive examination
SE01 Entry into force of request for substantive examination
GR01 Patent grant
CF01 Termination of patent right due to non-payment of annual fee

Granted publication date: 20170503

Termination date: 20181122