CN102782706A - Text enhancement of a textual image undergoing optical character recognition - Google Patents
- Publication number
- CN102782706A
- Authority
- CN
- China
- Prior art keywords
- image
- background
- row
- pixel
- text
- Prior art date
- Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
- Granted
Classifications
-
- G—PHYSICS
- G06—COMPUTING; CALCULATING OR COUNTING
- G06V—IMAGE OR VIDEO RECOGNITION OR UNDERSTANDING
- G06V10/00—Arrangements for image or video recognition or understanding
- G06V10/40—Extraction of image or video features
- G06V10/44—Local feature extraction by analysis of parts of the pattern, e.g. by detecting edges, contours, loops, corners, strokes or intersections; Connectivity analysis, e.g. of connected components
- G06V10/457—Local feature extraction by analysis of parts of the pattern, e.g. by detecting edges, contours, loops, corners, strokes or intersections; Connectivity analysis, e.g. of connected components by analysing connectivity, e.g. edge linking, connected component analysis or slices
-
- G—PHYSICS
- G06—COMPUTING; CALCULATING OR COUNTING
- G06V—IMAGE OR VIDEO RECOGNITION OR UNDERSTANDING
- G06V30/00—Character recognition; Recognising digital ink; Document-oriented image-based pattern recognition
- G06V30/10—Character recognition
- G06V30/16—Image preprocessing
- G06V30/162—Quantising the image signal
-
- G—PHYSICS
- G06—COMPUTING; CALCULATING OR COUNTING
- G06V—IMAGE OR VIDEO RECOGNITION OR UNDERSTANDING
- G06V10/00—Arrangements for image or video recognition or understanding
- G06V10/20—Image preprocessing
- G06V10/28—Quantising the image, e.g. histogram thresholding for discrimination between background and foreground patterns
-
- G—PHYSICS
- G06—COMPUTING; CALCULATING OR COUNTING
- G06V—IMAGE OR VIDEO RECOGNITION OR UNDERSTANDING
- G06V30/00—Character recognition; Recognising digital ink; Document-oriented image-based pattern recognition
- G06V30/10—Character recognition
- G06V30/18—Extraction of features or characteristics of the image
- G06V30/1801—Detecting partial patterns, e.g. edges or contours, or configurations, e.g. loops, corners, strokes or intersections
- G06V30/18076—Detecting partial patterns, e.g. edges or contours, or configurations, e.g. loops, corners, strokes or intersections by analysing connectivity, e.g. edge linking, connected component analysis or slices
-
- G—PHYSICS
- G06—COMPUTING; CALCULATING OR COUNTING
- G06V—IMAGE OR VIDEO RECOGNITION OR UNDERSTANDING
- G06V2201/00—Indexing scheme relating to image or video recognition or understanding
- G06V2201/01—Solutions for problems related to non-uniform document background
-
- G—PHYSICS
- G06—COMPUTING; CALCULATING OR COUNTING
- G06V—IMAGE OR VIDEO RECOGNITION OR UNDERSTANDING
- G06V30/00—Character recognition; Recognising digital ink; Document-oriented image-based pattern recognition
- G06V30/10—Character recognition
Landscapes
- Engineering & Computer Science (AREA)
- Physics & Mathematics (AREA)
- General Physics & Mathematics (AREA)
- Multimedia (AREA)
- Theoretical Computer Science (AREA)
- Computer Vision & Pattern Recognition (AREA)
- Character Input (AREA)
- Facsimile Image Signal Circuits (AREA)
- Image Processing (AREA)
- Image Analysis (AREA)
Claims (15)
Applications Claiming Priority (4)
Application Number | Priority Date | Filing Date | Title |
---|---|---|---|
US12/720732 | 2010-03-10 | | |
US12/720,732 US8526732B2 (en) | 2010-03-10 | 2010-03-10 | Text enhancement of a textual image undergoing optical character recognition |
US12/720,732 | 2010-03-10 | | |
PCT/US2011/027439 WO2011112522A2 (en) | 2010-03-10 | 2011-03-07 | Text enhancement of a textual image undergoing optical character recognition |
Publications (2)
Publication Number | Publication Date |
---|---|
CN102782706A (zh) | 2012-11-14 |
CN102782706B (zh) | 2014-07-23 |
Family
ID=44560016
Family Applications (1)
Application Number | Priority Date | Filing Date | Title |
---|---|---|---|
CN201180013195.8A, Active, CN102782706B (zh) | 2010-03-10 | 2011-03-07 | Text enhancement of a textual image undergoing optical character recognition |
Country Status (8)
Country | Link |
---|---|
US (1) | US8526732B2 (zh) |
EP (1) | EP2545499B1 (zh) |
JP (1) | JP5754065B2 (zh) |
KR (1) | KR101795823B1 (zh) |
CN (1) | CN102782706B (zh) |
CA (1) | CA2790402A1 (zh) |
ES (1) | ES2773719T3 (zh) |
WO (1) | WO2011112522A2 (zh) |
Cited By (5)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
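WO2016086877A1 (zh) * | 2014-12-03 | 2016-06-09 | 夏普株式会社 | Text detection method and apparatus |
CN105723432A (zh) * | 2013-11-12 | 2016-06-29 | 三菱电机株式会社 | Driving assistance image generation device, driving assistance image display device, driving assistance image display system, and driving assistance image generation program |
WO2017054637A1 (zh) * | 2015-09-28 | 2017-04-06 | 珠海赛纳打印科技股份有限公司 | Image processing method and system |
CN111080554A (zh) * | 2019-12-20 | 2020-04-28 | 成都极米科技股份有限公司 | Method and device for enhancing a subtitle region in projected content, and readable storage medium |
CN111986095A (zh) * | 2019-05-22 | 2020-11-24 | 上海哔哩哔哩科技有限公司 | Image processing method and image processing device based on edge extraction |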
Families Citing this family (33)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
US11610653B2 (en) * | 2010-09-01 | 2023-03-21 | Apixio, Inc. | Systems and methods for improved optical character recognition of health records |
US20120106845A1 (en) * | 2010-10-30 | 2012-05-03 | Prakash Reddy | Replacing word with image of word |
US9053361B2 (en) * | 2012-01-26 | 2015-06-09 | Qualcomm Incorporated | Identifying regions of text to merge in a natural image or video frame |
US9064191B2 (en) | 2012-01-26 | 2015-06-23 | Qualcomm Incorporated | Lower modifier detection and extraction from devanagari text images to improve OCR performance |
US8606011B1 (en) * | 2012-06-07 | 2013-12-10 | Amazon Technologies, Inc. | Adaptive thresholding for image recognition |
JP2014006614A (ja) * | 2012-06-22 | 2014-01-16 | Sony Corp | Image processing device, image processing method, and program |
US9262699B2 (en) | 2012-07-19 | 2016-02-16 | Qualcomm Incorporated | Method of handling complex variants of words through prefix-tree based decoding for Devanagiri OCR |
US9047540B2 (en) | 2012-07-19 | 2015-06-02 | Qualcomm Incorporated | Trellis based word decoder with reverse pass |
US9076242B2 (en) | 2012-07-19 | 2015-07-07 | Qualcomm Incorporated | Automatic correction of skew in natural images and video |
US9014480B2 (en) | 2012-07-19 | 2015-04-21 | Qualcomm Incorporated | Identifying a maximally stable extremal region (MSER) in an image by skipping comparison of pixels in the region |
US9141874B2 (en) | 2012-07-19 | 2015-09-22 | Qualcomm Incorporated | Feature extraction and use with a probability density function (PDF) divergence metric |
US8787702B1 (en) * | 2012-11-30 | 2014-07-22 | Accusoft Corporation | Methods and apparatus for determining and/or modifying image orientation |
US9256798B2 (en) * | 2013-01-31 | 2016-02-09 | Aurasma Limited | Document alteration based on native text analysis and OCR |
GB2514410A (en) | 2013-05-24 | 2014-11-26 | Ibm | Image scaling for images including low resolution text |
KR102159389B1 (ko) | 2014-03-17 | 2020-09-24 | 삼성디스플레이 주식회사 | Method for calculating compensation data for correcting digital video data, and organic light-emitting display device including a look-up table generated using the same |
US9536161B1 (en) | 2014-06-17 | 2017-01-03 | Amazon Technologies, Inc. | Visual and audio recognition for scene change events |
JP2016143310A (ja) * | 2015-02-04 | 2016-08-08 | ソニー株式会社 | Information processing device, image processing method, and program |
CN106156766B (zh) | 2015-03-25 | 2020-02-18 | 阿里巴巴集团控股有限公司 | Method and device for generating a text line classifier |
US9916492B1 (en) * | 2017-03-21 | 2018-03-13 | SkySlope, Inc. | Image processing and analysis for UID overlap avoidance |
RU2657181C1 (ru) * | 2017-09-01 | 2018-06-08 | Общество с ограниченной ответственностью "Аби Продакшн" | Method for improving the recognition quality of a single frame |
CN110533049B (zh) * | 2018-05-23 | 2023-05-02 | 富士通株式会社 | Method and device for extracting a seal image |
US11205084B2 (en) * | 2020-02-17 | 2021-12-21 | Wipro Limited | Method and system for evaluating an image quality for optical character recognition (OCR) |
US11386687B2 (en) | 2020-03-30 | 2022-07-12 | Wipro Limited | System and method for reconstructing an image |
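CN111507352B (zh) * | 2020-04-16 | 2021-09-28 | 腾讯科技(深圳)有限公司 | Image processing method and apparatus, computer device, and storage medium |
CN111753832B (zh) * | 2020-07-02 | 2023-12-08 | 杭州睿琪软件有限公司 | Image processing method, image processing device, electronic device, and storage medium |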
US11544828B2 (en) | 2020-11-18 | 2023-01-03 | Disney Enterprises, Inc. | Automatic occlusion detection |
US11494944B2 (en) | 2020-11-18 | 2022-11-08 | Disney Enterprises, Inc. | Automatic low contrast detection |
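CN112906686A (zh) | 2021-03-11 | 2021-06-04 | 北京小米移动软件有限公司 | Character recognition method and apparatus, electronic device, and storage medium |
JP7137170B1 (ja) * | 2021-03-22 | 2022-09-14 | 楽天グループ株式会社 | Information processing device, information processing method, and program |
CN113793403B (zh) * | 2021-08-19 | 2023-09-22 | 西南科技大学 | Text-to-image synthesis method that simulates the painting process |
CN114241256A (zh) * | 2021-11-30 | 2022-03-25 | 郑州信大先进技术研究院 | Neural-network-based training sample image enhancement method and system |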
US11749006B2 (en) * | 2021-12-15 | 2023-09-05 | Intuit Inc. | Optical character recognition quality evaluation and optimization |
CN116071763B (zh) * | 2023-03-06 | 2023-06-16 | 山东薪火书业有限公司 | Intelligent proofreading and editing system for teaching-aid books based on character recognition |
Citations (6)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
KR970002420B1 (ko) * | 1993-04-19 | 1997-03-05 | 가또 마사오 | Method and apparatus for automatically determining spatial features of document lines, words, and character cells |
US6301386B1 (en) * | 1998-12-09 | 2001-10-09 | Ncr Corporation | Methods and apparatus for gray image based text identification |
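KR100480024B1 (ko) * | 1997-12-31 | 2005-08-01 | 엘지전자 주식회사 | Vowel recognition method using stroke thickness information |
CN1744657A (zh) * | 2004-08-25 | 2006-03-08 | 株式会社理光 | Multi-resolution segmentation and fill |
JP2008187327A (ja) * | 2007-01-29 | 2008-08-14 | Sharp Corp | Image processing device and image forming apparatus including the same |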
US20100054585A1 (en) * | 2008-09-03 | 2010-03-04 | Jean-Pierre Guillou | Text localization for image and video OCR |
Family Cites Families (14)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
JPH0256688A (ja) * | 1988-08-23 | 1990-02-26 | Toyota Central Res & Dev Lab Inc | Character segmentation device |
US5513304A (en) | 1993-04-19 | 1996-04-30 | Xerox Corporation | Method and apparatus for enhanced automatic determination of text line dependent parameters |
US5915039A (en) | 1996-11-12 | 1999-06-22 | International Business Machines Corporation | Method and means for extracting fixed-pitch characters on noisy images with complex background prior to character recognition |
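JP2002157552A (ja) * | 2000-11-22 | 2002-05-31 | Oki Electric Ind Co Ltd | Optical character reader |
JP2003203205A (ja) * | 2002-01-08 | 2003-07-18 | Ricoh Co Ltd | Character recognition device, character recognition method, program for causing a computer to execute the method, and computer-readable recording medium storing the program |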
US20030198386A1 (en) | 2002-04-19 | 2003-10-23 | Huitao Luo | System and method for identifying and extracting character strings from captured image data |
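JP4118749B2 (ja) * | 2002-09-05 | 2008-07-16 | 株式会社リコー | Image processing device, image processing program, and storage medium |
JP4350778B2 (ja) * | 2002-09-05 | 2009-10-21 | 株式会社リコー | Image processing device, image processing program, and recording medium |
JP2004199622A (ja) * | 2002-12-20 | 2004-07-15 | Ricoh Co Ltd | Image processing device, image processing method, recording medium, and program |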
US7236632B2 (en) | 2003-04-11 | 2007-06-26 | Ricoh Company, Ltd. | Automated techniques for comparing contents of images |
JP4259950B2 (ja) * | 2003-08-08 | 2009-04-30 | 株式会社リコー | Image recognition device, image recognition program, and recording medium |
TWI248754B (en) | 2004-11-08 | 2006-02-01 | Avision Inc | Image acquiring device with background filtering function |
US7953295B2 (en) | 2006-06-29 | 2011-05-31 | Google Inc. | Enhancing text in images |
US8223395B2 (en) * | 2007-07-20 | 2012-07-17 | Sharp Laboratories Of America, Inc. | Methods and systems for refining text color in a digital image |
2010
- 2010-03-10: US, application US12/720,732, publication US8526732B2, status Active
2011
- 2011-03-07: KR, application KR1020127023496A, publication KR101795823B1, status IP Right Grant
- 2011-03-07: CA, application CA2790402A, publication CA2790402A1, status Abandoned
- 2011-03-07: EP, application EP11753880.1A, publication EP2545499B1, status Active
- 2011-03-07: WO, application PCT/US2011/027439, publication WO2011112522A2, status Application Filing
- 2011-03-07: ES, application ES11753880T, publication ES2773719T3, status Active
- 2011-03-07: JP, application JP2012557155A, publication JP5754065B2, status Active
- 2011-03-07: CN, application CN201180013195.8A, publication CN102782706B, status Active
Patent Citations (6)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
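KR970002420B1 (ko) * | 1993-04-19 | 1997-03-05 | 가또 마사오 | Method and apparatus for automatically determining spatial features of document lines, words, and character cells |
KR100480024B1 (ko) * | 1997-12-31 | 2005-08-01 | 엘지전자 주식회사 | Vowel recognition method using stroke thickness information |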
US6301386B1 (en) * | 1998-12-09 | 2001-10-09 | Ncr Corporation | Methods and apparatus for gray image based text identification |
CN1744657A (zh) * | 2004-08-25 | 2006-03-08 | 株式会社理光 | Multi-resolution segmentation and fill |
JP2008187327A (ja) * | 2007-01-29 | 2008-08-14 | Sharp Corp | Image processing device and image forming apparatus including the same |
US20100054585A1 (en) * | 2008-09-03 | 2010-03-04 | Jean-Pierre Guillou | Text localization for image and video OCR |
Cited By (8)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
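CN105723432A (zh) * | 2013-11-12 | 2016-06-29 | 三菱电机株式会社 | Driving assistance image generation device, driving assistance image display device, driving assistance image display system, and driving assistance image generation program |
WO2016086877A1 (zh) * | 2014-12-03 | 2016-06-09 | 夏普株式会社 | Text detection method and apparatus |
WO2017054637A1 (zh) * | 2015-09-28 | 2017-04-06 | 珠海赛纳打印科技股份有限公司 | Image processing method and system |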
US10497080B2 (en) | 2015-09-28 | 2019-12-03 | Zhuhai Seine Technology Co., Ltd. | Method and apparatus for image processing |
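CN111986095A (zh) * | 2019-05-22 | 2020-11-24 | 上海哔哩哔哩科技有限公司 | Image processing method and image processing device based on edge extraction |
CN111986095B (zh) * | 2019-05-22 | 2024-03-19 | 上海哔哩哔哩科技有限公司 | Image processing method and image processing device based on edge extraction |
CN111080554A (zh) * | 2019-12-20 | 2020-04-28 | 成都极米科技股份有限公司 | Method and device for enhancing a subtitle region in projected content, and readable storage medium |
CN111080554B (zh) * | 2019-12-20 | 2023-08-04 | 成都极米科技股份有限公司 | Method and device for enhancing a subtitle region in projected content, and readable storage medium |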
Also Published As
Publication number | Publication date |
---|---|
CN102782706B (zh) | 2014-07-23 |
KR101795823B1 (ko) | 2017-11-08 |
CA2790402A1 (en) | 2011-09-15 |
EP2545499A2 (en) | 2013-01-16 |
JP5754065B2 (ja) | 2015-07-22 |
US8526732B2 (en) | 2013-09-03 |
US20110222768A1 (en) | 2011-09-15 |
WO2011112522A2 (en) | 2011-09-15 |
ES2773719T3 (es) | 2020-07-14 |
EP2545499B1 (en) | 2020-01-08 |
JP2013527513A (ja) | 2013-06-27 |
KR20130016213A (ko) | 2013-02-14 |
EP2545499A4 (en) | 2017-08-30 |
WO2011112522A3 (en) | 2011-11-03 |
Similar Documents
Publication | Publication Date | Title |
---|---|---|
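CN102782706B (zh) | | Text enhancement of a textual image undergoing optical character recognition |
CN101689300B (zh) | | Image segmentation and enhancement |
EP3309703A1 (en) | | Method and system for decoding qr code based on weighted average grey method |
CN107784669A (zh) | | Method for light spot extraction and centroid determination |
US20070253040A1 (en) | | Color scanning to enhance bitonal image |
CN110210440B (zh) | | Table image layout analysis method and system |
CN105374015A (zh) | | Binarization method for low-quality document images based on local contrast and stroke width estimation |
CN109961416B (zh) | | Business license information extraction method based on multi-scale fusion of morphological gradients |
US20060210164A1 (en) | | Image processing device |
JP4852059B2 (ja) | | Noise removal device and noise removal program for improving binarization performance of document images |
CN114529715B (zh) | | Image recognition method and system based on edge extraction |
CN109741273A (zh) | | Automatic processing and scoring method for low-quality images photographed with a mobile phone |
Saini | | Document image binarization techniques, developments and related issues: a review |
CN108205678B (zh) | | Nameplate character recognition processing method for images with bright-spot interference |
Nomura et al. | | A new method for degraded color image binarization based on adaptive lightning on grayscale versions |
CN111445402A (zh) | | Image denoising method and device |
CN105721738A (zh) | | Preprocessing method for color scanned document images |
CN113012079B (zh) | | Low-brightness vehicle underbody image enhancement method, device, and storage medium |
CN110298799B (zh) | | PCB image positioning correction method |
Cooksey et al. | | Rapid image binarization with morphological operators |
Zhang et al. | | Using Gaussian Kernels to Remove Uneven Shading from a Document Image |
CN114565633B (zh) | | Color image edge extraction method based on conceptual structure elements and matrix norm |
Chaudhari et al. | | Document image binarization using threshold segmentation |
Das et al. | | Adaptive method for multi colored text binarization |
Castro et al. | | Restoration of double-sided ancient music documents with bleed-through |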
Legal Events
Code | Title | Description |
---|---|---|
C06 | Publication | |
PB01 | Publication | |
C10 | Entry into substantive examination | |
SE01 | Entry into force of request for substantive examination | |
C14 | Grant of patent or utility model | |
GR01 | Patent grant | |
ASS | Succession or assignment of patent right | Owner name: MICROSOFT TECHNOLOGY LICENSING LLC. Free format text: FORMER OWNER: MICROSOFT CORP. Effective date: 20150506 |
C41 | Transfer of patent application or patent right or utility model | |
TR01 | Transfer of patent right | Effective date of registration: 20150506. Address after: Washington State. Patentee after: Microsoft Technology Licensing, LLC. Address before: Washington State. Patentee before: Microsoft Corp. |
C41 | Transfer of patent application or patent right or utility model | |
TR01 | Transfer of patent right | Effective date of registration: 20160809. Address after: Grand Cayman, Georgetown, Cayman Islands. Patentee after: IValley Holding Co., Ltd. Address before: Washington State. Patentee before: Microsoft Technology Licensing, LLC |