JP2012003756A5 - - Google Patents

Download PDF

Info

Publication number
JP2012003756A5
JP2012003756A5 JP2011129862A JP2011129862A JP2012003756A5 JP 2012003756 A5 JP2012003756 A5 JP 2012003756A5 JP 2011129862 A JP2011129862 A JP 2011129862A JP 2011129862 A JP2011129862 A JP 2011129862A JP 2012003756 A5 JP2012003756 A5 JP 2012003756A5
Authority
JP
Japan
Prior art keywords
components
word
column
height
subword
Prior art date
Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
Granted
Application number
JP2011129862A
Other languages
English (en)
Japanese (ja)
Other versions
JP5355625B2 (ja
JP2012003756A (ja
Filing date
Publication date
Priority claimed from US12/814,448 external-priority patent/US8218875B2/en
Application filed filed Critical
Publication of JP2012003756A publication Critical patent/JP2012003756A/ja
Publication of JP2012003756A5 publication Critical patent/JP2012003756A5/ja
Application granted granted Critical
Publication of JP5355625B2 publication Critical patent/JP5355625B2/ja
Expired - Fee Related legal-status Critical Current
Anticipated expiration legal-status Critical

Links

JP2011129862A 2010-06-12 2011-06-10 光学式文字認識用に画像を前処理するための方法およびシステム Expired - Fee Related JP5355625B2 (ja)

Applications Claiming Priority (2)

Application Number Priority Date Filing Date Title
US12/814,448 2010-06-12
US12/814,448 US8218875B2 (en) 2010-06-12 2010-06-12 Method and system for preprocessing an image for optical character recognition

Publications (3)

Publication Number Publication Date
JP2012003756A JP2012003756A (ja) 2012-01-05
JP2012003756A5 true JP2012003756A5 (enExample) 2013-07-18
JP5355625B2 JP5355625B2 (ja) 2013-11-27

Family

ID=44654616

Family Applications (1)

Application Number Title Priority Date Filing Date
JP2011129862A Expired - Fee Related JP5355625B2 (ja) 2010-06-12 2011-06-10 光学式文字認識用に画像を前処理するための方法およびシステム

Country Status (3)

Country Link
US (2) US8218875B2 (enExample)
EP (1) EP2395453A3 (enExample)
JP (1) JP5355625B2 (enExample)

Families Citing this family (20)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
US8218875B2 (en) 2010-06-12 2012-07-10 Hussein Khalid Al-Omari Method and system for preprocessing an image for optical character recognition
US8542926B2 (en) * 2010-11-19 2013-09-24 Microsoft Corporation Script-agnostic text reflow for document images
US9734132B1 (en) * 2011-12-20 2017-08-15 Amazon Technologies, Inc. Alignment and reflow of displayed character images
JP5994251B2 (ja) * 2012-01-06 2016-09-21 富士ゼロックス株式会社 画像処理装置及びプログラム
EP2836962A4 (en) * 2012-04-12 2016-07-27 Tata Consultancy Services Ltd SYSTEM AND METHOD FOR DETECTION AND SEGMENTATION OF CHARACTERISTIC MATTERS FOR OPTICAL CHARACTER RECOGNITION (OCR)
EP2662802A1 (en) * 2012-05-09 2013-11-13 King Abdulaziz City for Science & Technology (KACST) Method and system for preprocessing an image for optical character recognition
US9785240B2 (en) * 2013-03-18 2017-10-10 Fuji Xerox Co., Ltd. Systems and methods for content-aware selection
JP5986051B2 (ja) * 2013-05-12 2016-09-06 キング・アブドゥルアジズ・シティ・フォー・サイエンス・アンド・テクノロジー(ケイ・エイ・シィ・エス・ティ)King Abdulaziz City For Science And Technology (Kacst) アラビア語テキストを自動的に認識するための方法
US20160098597A1 (en) * 2013-06-18 2016-04-07 Abbyy Development Llc Methods and systems that generate feature symbols with associated parameters in order to convert images to electronic documents
US9235755B2 (en) * 2013-08-15 2016-01-12 Konica Minolta Laboratory U.S.A., Inc. Removal of underlines and table lines in document images while preserving intersecting character strokes
US9292739B1 (en) * 2013-12-12 2016-03-22 A9.Com, Inc. Automated recognition of text utilizing multiple images
US9288362B2 (en) 2014-02-03 2016-03-15 King Fahd University Of Petroleum And Minerals Technique for skew detection of printed arabic documents
US9367766B2 (en) * 2014-07-22 2016-06-14 Adobe Systems Incorporated Text line detection in images
JP2016181111A (ja) * 2015-03-24 2016-10-13 富士ゼロックス株式会社 画像処理装置、及び画像処理プログラム
CN106156766B (zh) 2015-03-25 2020-02-18 阿里巴巴集团控股有限公司 文本行分类器的生成方法及装置
US10430649B2 (en) 2017-07-14 2019-10-01 Adobe Inc. Text region detection in digital images using image tag filtering
US11366968B2 (en) * 2019-07-29 2022-06-21 Intuit Inc. Region proposal networks for automated bounding box detection and text segmentation
US11270153B2 (en) 2020-02-19 2022-03-08 Northrop Grumman Systems Corporation System and method for whole word conversion of text in image
JP7528542B2 (ja) * 2020-06-03 2024-08-06 株式会社リコー 画像処理装置、方法およびプログラム
FR3155939A1 (fr) * 2023-11-27 2025-05-30 Orange Procédé d’analyse d’au moins une image, dispositif électronique et produit programme d’ordinateur correspondant

Family Cites Families (26)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
US5058182A (en) * 1988-05-02 1991-10-15 The Research Foundation Of State Univ. Of New York Method and apparatus for handwritten character recognition
US5224179A (en) * 1988-12-20 1993-06-29 At&T Bell Laboratories Image skeletonization method
US5680479A (en) * 1992-04-24 1997-10-21 Canon Kabushiki Kaisha Method and apparatus for character recognition
JP3253356B2 (ja) * 1992-07-06 2002-02-04 株式会社リコー 文書画像の領域識別方法
US5987170A (en) * 1992-09-28 1999-11-16 Matsushita Electric Industrial Co., Ltd. Character recognition machine utilizing language processing
US5410611A (en) * 1993-12-17 1995-04-25 Xerox Corporation Method for identifying word bounding boxes in text
CA2166248C (en) * 1995-12-28 2000-01-04 Abdel Naser Al-Karmi Optical character recognition of handwritten or cursive text
JPH11232378A (ja) * 1997-12-09 1999-08-27 Canon Inc デジタルカメラ、そのデジタルカメラを用いた文書処理システム、コンピュータ可読の記憶媒体、及び、プログラムコード送出装置
JP4323606B2 (ja) * 1999-03-01 2009-09-02 理想科学工業株式会社 文書画像傾き検出装置
US7298903B2 (en) * 2001-06-28 2007-11-20 Microsoft Corporation Method and system for separating text and drawings in digital ink
US7062090B2 (en) * 2002-06-28 2006-06-13 Microsoft Corporation Writing guide for a free-form document editor
US20040096102A1 (en) * 2002-11-18 2004-05-20 Xerox Corporation Methodology for scanned color document segmentation
US7499588B2 (en) * 2004-05-20 2009-03-03 Microsoft Corporation Low resolution OCR for camera acquired documents
US8139828B2 (en) * 2005-10-21 2012-03-20 Carestream Health, Inc. Method for enhanced visualization of medical images
JP4757001B2 (ja) * 2005-11-25 2011-08-24 キヤノン株式会社 画像処理装置、画像処理方法
US7668394B2 (en) * 2005-12-21 2010-02-23 Lexmark International, Inc. Background intensity correction of a scan of a document
US7724957B2 (en) * 2006-07-31 2010-05-25 Microsoft Corporation Two tiered text recognition
JP4988842B2 (ja) * 2007-06-28 2012-08-01 富士通株式会社 表データ生成プログラム、表データ生成方法および表データ生成装置
US20110043869A1 (en) * 2007-12-21 2011-02-24 Nec Corporation Information processing system, its method and program
US8027539B2 (en) * 2008-01-11 2011-09-27 Sharp Laboratories Of America, Inc. Method and apparatus for determining an orientation of a document including Korean characters
US8009928B1 (en) * 2008-01-23 2011-08-30 A9.Com, Inc. Method and system for detecting and recognizing text in images
US8150160B2 (en) * 2009-03-26 2012-04-03 King Fahd University Of Petroleum & Minerals Automatic Arabic text image optical character recognition method
TWI394098B (zh) * 2009-06-03 2013-04-21 Nat Univ Chung Cheng Shredding Method Based on File Image Texture Feature
US8086039B2 (en) * 2010-02-05 2011-12-27 Palo Alto Research Center Incorporated Fine-grained visual document fingerprinting for accurate document comparison and retrieval
US20110280481A1 (en) * 2010-05-17 2011-11-17 Microsoft Corporation User correction of errors arising in a textual document undergoing optical character recognition (ocr) process
US8218875B2 (en) 2010-06-12 2012-07-10 Hussein Khalid Al-Omari Method and system for preprocessing an image for optical character recognition

Similar Documents

Publication Publication Date Title
JP2012003756A5 (enExample)
JP2011243201A5 (enExample)
JP5355625B2 (ja) 光学式文字認識用に画像を前処理するための方法およびシステム
KR101733539B1 (ko) 문자인식장치 및 그 제어방법
KR102280401B1 (ko) 장애물의 하단 라인을 기준으로 roi를 검출하는 학습 방법 및 학습 장치 그리고 이를 이용한 테스트 방법 및 테스트 장치
US8194983B2 (en) Method and system for preprocessing an image for optical character recognition
JP2011018338A5 (enExample)
KR102279361B1 (ko) 장애물을 검출하는 학습 방법 및 학습 장치 그리고 이를 이용한 테스트 방법 및 테스트 장치
JP2012518223A5 (enExample)
CN102870399A (zh) 在ocr过程中将词语位图分割为单个字符或字形
CN107944451B (zh) 一种藏文古籍文档的行切分方法及系统
WO2017079055A3 (en) 2d image processing for extrusion into 3d objects
KR102280406B1 (ko) 근접 장애물의 하단 라인과 상단 라인을 검출하여 객체 존재성을 검출하는 학습 방법 및 학습 장치 그리고 이를 이용한 테스트 방법 및 테스트 장치
US10354111B2 (en) Primary localization method and system for QR codes
CN109727363B (zh) 一种在票据中识别大写金额的方法
US20160379029A1 (en) High capacity 2d color barcode design and processing method for camera based applications
CN105512600A (zh) 一种基于互信息与特征提取的车牌识别方法
CN102073862B (zh) 一种快速的文档图像版面结构计算方法
US20160171710A1 (en) Edge Detection System And Methods
Jain et al. A comparison paper on skew detection of scanned document images based on horizontal and vertical projection profile analysis
CN107437084B (zh) 一种脱机手写体文本识别的字符重心定位方法
JP2013235574A5 (enExample)
Singh Imagenet winning cnn architectures–a review
US20230419508A1 (en) Image processing device and method of detecting objects crossing a crossline and a direction the objects crosses the crossline
Arefin et al. Bangla handwritten characters recognition by using distance-based segmentation and histogram oriented gradients