KR101795823B1 - 광학 문자 인식되는 텍스트 영상의 텍스트 개선 기법 - Google Patents

광학 문자 인식되는 텍스트 영상의 텍스트 개선 기법 Download PDF

Info

Publication number
KR101795823B1
KR101795823B1 KR1020127023496A KR20127023496A KR101795823B1 KR 101795823 B1 KR101795823 B1 KR 101795823B1 KR 1020127023496 A KR1020127023496 A KR 1020127023496A KR 20127023496 A KR20127023496 A KR 20127023496A KR 101795823 B1 KR101795823 B1 KR 101795823B1
Authority
KR
South Korea
Prior art keywords
image
background
foreground
text
intensity
Prior art date
Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
Active
Application number
KR1020127023496A
Other languages
English (en)
Korean (ko)
Other versions
KR20130016213A (ko
Inventor
사사 갈릭
드조르드제 니젬체빅
보딘 드레세빅
Original Assignee
지구 홀딩스 리미티드
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
Filing date
Publication date
Application filed by 지구 홀딩스 리미티드 filed Critical 지구 홀딩스 리미티드
Publication of KR20130016213A publication Critical patent/KR20130016213A/ko
Application granted granted Critical
Publication of KR101795823B1 publication Critical patent/KR101795823B1/ko
Active legal-status Critical Current
Anticipated expiration legal-status Critical

Links

Images

Classifications

    • GPHYSICS
    • G06COMPUTING OR CALCULATING; COUNTING
    • G06VIMAGE OR VIDEO RECOGNITION OR UNDERSTANDING
    • G06V10/00Arrangements for image or video recognition or understanding
    • G06V10/40Extraction of image or video features
    • G06V10/44Local feature extraction by analysis of parts of the pattern, e.g. by detecting edges, contours, loops, corners, strokes or intersections; Connectivity analysis, e.g. of connected components
    • G06V10/457Local feature extraction by analysis of parts of the pattern, e.g. by detecting edges, contours, loops, corners, strokes or intersections; Connectivity analysis, e.g. of connected components by analysing connectivity, e.g. edge linking, connected component analysis or slices
    • GPHYSICS
    • G06COMPUTING OR CALCULATING; COUNTING
    • G06VIMAGE OR VIDEO RECOGNITION OR UNDERSTANDING
    • G06V30/00Character recognition; Recognising digital ink; Document-oriented image-based pattern recognition
    • G06V30/10Character recognition
    • G06V30/16Image preprocessing
    • G06V30/162Quantising the image signal
    • GPHYSICS
    • G06COMPUTING OR CALCULATING; COUNTING
    • G06VIMAGE OR VIDEO RECOGNITION OR UNDERSTANDING
    • G06V10/00Arrangements for image or video recognition or understanding
    • G06V10/20Image preprocessing
    • G06V10/28Quantising the image, e.g. histogram thresholding for discrimination between background and foreground patterns
    • GPHYSICS
    • G06COMPUTING OR CALCULATING; COUNTING
    • G06VIMAGE OR VIDEO RECOGNITION OR UNDERSTANDING
    • G06V30/00Character recognition; Recognising digital ink; Document-oriented image-based pattern recognition
    • G06V30/10Character recognition
    • G06V30/18Extraction of features or characteristics of the image
    • G06V30/1801Detecting partial patterns, e.g. edges or contours, or configurations, e.g. loops, corners, strokes or intersections
    • G06V30/18076Detecting partial patterns, e.g. edges or contours, or configurations, e.g. loops, corners, strokes or intersections by analysing connectivity, e.g. edge linking, connected component analysis or slices
    • GPHYSICS
    • G06COMPUTING OR CALCULATING; COUNTING
    • G06VIMAGE OR VIDEO RECOGNITION OR UNDERSTANDING
    • G06V2201/00Indexing scheme relating to image or video recognition or understanding
    • G06V2201/01Solutions for problems related to non-uniform document background
    • GPHYSICS
    • G06COMPUTING OR CALCULATING; COUNTING
    • G06VIMAGE OR VIDEO RECOGNITION OR UNDERSTANDING
    • G06V30/00Character recognition; Recognising digital ink; Document-oriented image-based pattern recognition
    • G06V30/10Character recognition

Landscapes

  • Engineering & Computer Science (AREA)
  • Physics & Mathematics (AREA)
  • General Physics & Mathematics (AREA)
  • Multimedia (AREA)
  • Theoretical Computer Science (AREA)
  • Computer Vision & Pattern Recognition (AREA)
  • Character Input (AREA)
  • Facsimile Image Signal Circuits (AREA)
  • Image Analysis (AREA)
  • Image Processing (AREA)
KR1020127023496A 2010-03-10 2011-03-07 광학 문자 인식되는 텍스트 영상의 텍스트 개선 기법 Active KR101795823B1 (ko)

Applications Claiming Priority (3)

Application Number Priority Date Filing Date Title
US12/720,732 US8526732B2 (en) 2010-03-10 2010-03-10 Text enhancement of a textual image undergoing optical character recognition
US12/720,732 2010-03-10
PCT/US2011/027439 WO2011112522A2 (en) 2010-03-10 2011-03-07 Text enhancement of a textual image undergoing optical character recognition

Publications (2)

Publication Number Publication Date
KR20130016213A KR20130016213A (ko) 2013-02-14
KR101795823B1 true KR101795823B1 (ko) 2017-11-08

Family

ID=44560016

Family Applications (1)

Application Number Title Priority Date Filing Date
KR1020127023496A Active KR101795823B1 (ko) 2010-03-10 2011-03-07 광학 문자 인식되는 텍스트 영상의 텍스트 개선 기법

Country Status (8)

Country Link
US (1) US8526732B2 (enExample)
EP (1) EP2545499B1 (enExample)
JP (1) JP5754065B2 (enExample)
KR (1) KR101795823B1 (enExample)
CN (1) CN102782706B (enExample)
CA (1) CA2790402A1 (enExample)
ES (1) ES2773719T3 (enExample)
WO (1) WO2011112522A2 (enExample)

Families Citing this family (40)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
US12165754B2 (en) 2010-09-01 2024-12-10 Apixio, Llc Systems and methods for improved optical character recognition of health records
US11610653B2 (en) * 2010-09-01 2023-03-21 Apixio, Inc. Systems and methods for improved optical character recognition of health records
US20120106845A1 (en) * 2010-10-30 2012-05-03 Prakash Reddy Replacing word with image of word
US9064191B2 (en) 2012-01-26 2015-06-23 Qualcomm Incorporated Lower modifier detection and extraction from devanagari text images to improve OCR performance
US20130194448A1 (en) * 2012-01-26 2013-08-01 Qualcomm Incorporated Rules for merging blocks of connected components in natural images
US8606011B1 (en) * 2012-06-07 2013-12-10 Amazon Technologies, Inc. Adaptive thresholding for image recognition
JP2014006614A (ja) * 2012-06-22 2014-01-16 Sony Corp 画像処理装置、画像処理方法、並びにプログラム
US9183458B2 (en) 2012-07-19 2015-11-10 Qualcomm Incorporated Parameter selection and coarse localization of interest regions for MSER processing
US9262699B2 (en) 2012-07-19 2016-02-16 Qualcomm Incorporated Method of handling complex variants of words through prefix-tree based decoding for Devanagiri OCR
US9141874B2 (en) 2012-07-19 2015-09-22 Qualcomm Incorporated Feature extraction and use with a probability density function (PDF) divergence metric
US9076242B2 (en) 2012-07-19 2015-07-07 Qualcomm Incorporated Automatic correction of skew in natural images and video
US9047540B2 (en) 2012-07-19 2015-06-02 Qualcomm Incorporated Trellis based word decoder with reverse pass
US8787702B1 (en) * 2012-11-30 2014-07-22 Accusoft Corporation Methods and apparatus for determining and/or modifying image orientation
US9256798B2 (en) * 2013-01-31 2016-02-09 Aurasma Limited Document alteration based on native text analysis and OCR
GB2514410A (en) 2013-05-24 2014-11-26 Ibm Image scaling for images including low resolution text
WO2015071923A1 (ja) * 2013-11-12 2015-05-21 三菱電機株式会社 運転支援画像生成装置、運転支援画像表示装置、運転支援画像表示システム、及び運転支援画像生成プログラム
KR102159389B1 (ko) 2014-03-17 2020-09-24 삼성디스플레이 주식회사 디지털 비디오 데이터를 보정하기 위한 보정 데이터 산출방법과 이를 이용하여 생성한 룩-업 테이블을 포함하는 유기전계발광 표시장치
US9536161B1 (en) 2014-06-17 2017-01-03 Amazon Technologies, Inc. Visual and audio recognition for scene change events
CN105718926A (zh) * 2014-12-03 2016-06-29 夏普株式会社 一种文本检测的方法和装置
JP2016143310A (ja) * 2015-02-04 2016-08-08 ソニー株式会社 情報処理装置、画像処理方法及びプログラム
CN106156766B (zh) 2015-03-25 2020-02-18 阿里巴巴集团控股有限公司 文本行分类器的生成方法及装置
CN105245756B (zh) * 2015-09-28 2018-05-29 珠海奔图电子有限公司 图像处理方法及系统
US9916492B1 (en) * 2017-03-21 2018-03-13 SkySlope, Inc. Image processing and analysis for UID overlap avoidance
RU2657181C1 (ru) * 2017-09-01 2018-06-08 Общество с ограниченной ответственностью "Аби Продакшн" Способ улучшения качества распознавания отдельного кадра
CN110533049B (zh) * 2018-05-23 2023-05-02 富士通株式会社 提取印章图像的方法和装置
CN111986095B (zh) * 2019-05-22 2024-03-19 上海哔哩哔哩科技有限公司 基于边缘提取的图像处理方法及图像处理装置
CN111080554B (zh) * 2019-12-20 2023-08-04 成都极米科技股份有限公司 一种投影内容中字幕区域增强方法、装置及可读存储介质
US11205084B2 (en) * 2020-02-17 2021-12-21 Wipro Limited Method and system for evaluating an image quality for optical character recognition (OCR)
US11386687B2 (en) 2020-03-30 2022-07-12 Wipro Limited System and method for reconstructing an image
CN111507352B (zh) * 2020-04-16 2021-09-28 腾讯科技(深圳)有限公司 一种图像处理方法、装置、计算机设备以及存储介质
CN111753832B (zh) * 2020-07-02 2023-12-08 杭州睿琪软件有限公司 图像处理方法、图像处理装置、电子设备和存储介质
US11494944B2 (en) 2020-11-18 2022-11-08 Disney Enterprises, Inc. Automatic low contrast detection
US11544828B2 (en) 2020-11-18 2023-01-03 Disney Enterprises, Inc. Automatic occlusion detection
CN112906686B (zh) 2021-03-11 2024-11-15 北京小米移动软件有限公司 文字识别方法、装置、电子设备及存储介质
JP7137170B1 (ja) * 2021-03-22 2022-09-14 楽天グループ株式会社 情報処理装置、情報処理方法およびプログラム
CN113793403B (zh) * 2021-08-19 2023-09-22 西南科技大学 一种模拟绘画过程的文本合成图像方法
CN114241256B (zh) * 2021-11-30 2024-12-17 郑州信大先进技术研究院 基于神经网络的训练样本图像增强方法及系统
US11749006B2 (en) * 2021-12-15 2023-09-05 Intuit Inc. Optical character recognition quality evaluation and optimization
CN118172377B (zh) * 2022-12-09 2025-04-08 蔚来移动科技有限公司 文本的前景轮廓、水印图获取方法、系统、装置及介质
CN116071763B (zh) * 2023-03-06 2023-06-16 山东薪火书业有限公司 基于文字识别的教辅图书智能校编系统

Citations (2)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
JP2002157552A (ja) * 2000-11-22 2002-05-31 Oki Electric Ind Co Ltd 光学式文字読取装置
JP2008113446A (ja) * 2002-09-05 2008-05-15 Ricoh Co Ltd 画像処理装置、画像処理プログラムおよび記録媒体

Family Cites Families (18)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
JPH0256688A (ja) * 1988-08-23 1990-02-26 Toyota Central Res & Dev Lab Inc 文字切出し装置
US5513304A (en) * 1993-04-19 1996-04-30 Xerox Corporation Method and apparatus for enhanced automatic determination of text line dependent parameters
US5384864A (en) 1993-04-19 1995-01-24 Xerox Corporation Method and apparatus for automatic determination of text line, word and character cell spatial features
US5915039A (en) * 1996-11-12 1999-06-22 International Business Machines Corporation Method and means for extracting fixed-pitch characters on noisy images with complex background prior to character recognition
KR100480024B1 (ko) 1997-12-31 2005-08-01 엘지전자 주식회사 획의두께정보를이용한모음인식방법
US6301386B1 (en) * 1998-12-09 2001-10-09 Ncr Corporation Methods and apparatus for gray image based text identification
JP2003203205A (ja) * 2002-01-08 2003-07-18 Ricoh Co Ltd 文字認識装置、文字認識方法、およびその方法をコンピュータに実行させるプログラム、並びにそのプログラムを記録したコンピュータ読み取り可能な記録媒体
US20030198386A1 (en) * 2002-04-19 2003-10-23 Huitao Luo System and method for identifying and extracting character strings from captured image data
JP4118749B2 (ja) * 2002-09-05 2008-07-16 株式会社リコー 画像処理装置、画像処理プログラムおよび記憶媒体
JP2004199622A (ja) * 2002-12-20 2004-07-15 Ricoh Co Ltd 画像処理装置、画像処理方法、記録媒体およびプログラム
US7236632B2 (en) * 2003-04-11 2007-06-26 Ricoh Company, Ltd. Automated techniques for comparing contents of images
JP4259950B2 (ja) * 2003-08-08 2009-04-30 株式会社リコー 画像認識装置、画像認識プログラムおよび記録媒体
US8086050B2 (en) * 2004-08-25 2011-12-27 Ricoh Co., Ltd. Multi-resolution segmentation and fill
TWI248754B (en) * 2004-11-08 2006-02-01 Avision Inc Image acquiring device with background filtering function
US7953295B2 (en) * 2006-06-29 2011-05-31 Google Inc. Enhancing text in images
JP2008187327A (ja) 2007-01-29 2008-08-14 Sharp Corp 画像処理装置およびこれを備えた画像形成装置
US8223395B2 (en) * 2007-07-20 2012-07-17 Sharp Laboratories Of America, Inc. Methods and systems for refining text color in a digital image
US8320674B2 (en) * 2008-09-03 2012-11-27 Sony Corporation Text localization for image and video OCR

Patent Citations (2)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
JP2002157552A (ja) * 2000-11-22 2002-05-31 Oki Electric Ind Co Ltd 光学式文字読取装置
JP2008113446A (ja) * 2002-09-05 2008-05-15 Ricoh Co Ltd 画像処理装置、画像処理プログラムおよび記録媒体

Also Published As

Publication number Publication date
CA2790402A1 (en) 2011-09-15
JP2013527513A (ja) 2013-06-27
WO2011112522A3 (en) 2011-11-03
US8526732B2 (en) 2013-09-03
EP2545499B1 (en) 2020-01-08
ES2773719T3 (es) 2020-07-14
WO2011112522A2 (en) 2011-09-15
EP2545499A2 (en) 2013-01-16
KR20130016213A (ko) 2013-02-14
JP5754065B2 (ja) 2015-07-22
CN102782706A (zh) 2012-11-14
CN102782706B (zh) 2014-07-23
EP2545499A4 (en) 2017-08-30
US20110222768A1 (en) 2011-09-15

Similar Documents

Publication Publication Date Title
KR101795823B1 (ko) 광학 문자 인식되는 텍스트 영상의 텍스트 개선 기법
JP4423298B2 (ja) デジタル画像におけるテキスト状エッジの強調
EP1910994B1 (en) Binarization of an image
US8417033B2 (en) Gradient based background segmentation and enhancement of images
CN105740876B (zh) 一种图像预处理方法及装置
JP2008148298A (ja) 画像における異なった内容の領域を識別する方法、画像における異なった内容の領域を識別する装置、および画像における異なった内容の領域を識別するコンピュータ・プログラムを具現するコンピュータ読み取り可能な媒体
CN109214996B (zh) 一种图像处理方法及装置
US10769478B2 (en) Convolutional neutral network identification efficiency increasing method and related convolutional neutral network identification efficiency increasing device
JP4852059B2 (ja) 文書画像の2値化性能を改善するノイズ除去装置及びノイズ除去プログラム
US8989493B1 (en) Method and apparatus for identifying regions of an image to be filtered during processing of the image
CN115272362A (zh) 一种数字病理全场图像有效区域分割方法、装置
KR20150099116A (ko) Ocr를 이용한 컬러 문자 인식 방법 및 그 장치
JP3906221B2 (ja) 画像処理方法及び画像処理装置
JP3989341B2 (ja) 画像処理装置
Boiangiu et al. Methods of bitonal image conversion for modern and classic documents
CN115829848B (zh) 处理图形符号的方法、装置和计算机可读存储介质
Boiangiu et al. Bitonal image creation for automatic content conversion
Cooksey et al. Rapid image binarization with morphological operators
KR100260923B1 (ko) 화상의 국부 이치화 장치 및 방법
CN119599953B (zh) 一种病理图像有效坐标区域自动生成系统
KR100416496B1 (ko) 다중 역치값을 이용한 이치화 방법
KR100514734B1 (ko) 디지털 화질 개선방법 및 장치
CN113129246A (zh) 一种文档图片的处理方法、装置及电子设备
HK1226175A1 (zh) 一种图像预处理方法及装置
ASHWINI et al. MORPHOLOGICAL BACKGROUND DETECTION AND ENHANCEMENT OF IMAGES WITH POOR LIGHTING USING CUMULATIVE HISTOGRAM ANALYSIS.

Legal Events

Date Code Title Description
PA0105 International application

Patent event date: 20120907

Patent event code: PA01051R01D

Comment text: International Patent Application

PG1501 Laying open of application
N231 Notification of change of applicant
PN2301 Change of applicant

Patent event date: 20150715

Comment text: Notification of Change of Applicant

Patent event code: PN23011R01D

A201 Request for examination
PA0201 Request for examination

Patent event code: PA02012R01D

Patent event date: 20160204

Comment text: Request for Examination of Application

E902 Notification of reason for refusal
PE0902 Notice of grounds for rejection

Comment text: Notification of reason for refusal

Patent event date: 20170720

Patent event code: PE09021S01D

E701 Decision to grant or registration of patent right
PE0701 Decision of registration

Patent event code: PE07011S01D

Comment text: Decision to Grant Registration

Patent event date: 20171019

GRNT Written decision to grant
PR0701 Registration of establishment

Comment text: Registration of Establishment

Patent event date: 20171102

Patent event code: PR07011E01D

PR1002 Payment of registration fee

Payment date: 20171102

End annual number: 3

Start annual number: 1

PG1601 Publication of registration
PR1001 Payment of annual fee

Payment date: 20221021

Start annual number: 6

End annual number: 6

PR1001 Payment of annual fee

Payment date: 20231020

Start annual number: 7

End annual number: 7

PR1001 Payment of annual fee

Payment date: 20241028

Start annual number: 8

End annual number: 8