CN103946866B - 与直方图一起使用多层连接分量的文本检测 - Google Patents
与直方图一起使用多层连接分量的文本检测 Download PDFInfo
- Publication number
- CN103946866B CN103946866B CN201280056944.XA CN201280056944A CN103946866B CN 103946866 B CN103946866 B CN 103946866B CN 201280056944 A CN201280056944 A CN 201280056944A CN 103946866 B CN103946866 B CN 103946866B
- Authority
- CN
- China
- Prior art keywords
- histogram
- ratio set
- connection component
- spatial bins
- bins
- Prior art date
- Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
- Expired - Fee Related
Links
Classifications
-
- G—PHYSICS
- G06—COMPUTING OR CALCULATING; COUNTING
- G06V—IMAGE OR VIDEO RECOGNITION OR UNDERSTANDING
- G06V20/00—Scenes; Scene-specific elements
- G06V20/60—Type of objects
- G06V20/62—Text, e.g. of license plates, overlay texts or captions on TV images
-
- G—PHYSICS
- G06—COMPUTING OR CALCULATING; COUNTING
- G06T—IMAGE DATA PROCESSING OR GENERATION, IN GENERAL
- G06T5/00—Image enhancement or restoration
- G06T5/40—Image enhancement or restoration using histogram techniques
-
- G—PHYSICS
- G06—COMPUTING OR CALCULATING; COUNTING
- G06V—IMAGE OR VIDEO RECOGNITION OR UNDERSTANDING
- G06V10/00—Arrangements for image or video recognition or understanding
- G06V10/20—Image preprocessing
- G06V10/26—Segmentation of patterns in the image field; Cutting or merging of image elements to establish the pattern region, e.g. clustering-based techniques; Detection of occlusion
-
- G—PHYSICS
- G06—COMPUTING OR CALCULATING; COUNTING
- G06V—IMAGE OR VIDEO RECOGNITION OR UNDERSTANDING
- G06V30/00—Character recognition; Recognising digital ink; Document-oriented image-based pattern recognition
- G06V30/10—Character recognition
- G06V30/18—Extraction of features or characteristics of the image
- G06V30/1801—Detecting partial patterns, e.g. edges or contours, or configurations, e.g. loops, corners, strokes or intersections
- G06V30/18076—Detecting partial patterns, e.g. edges or contours, or configurations, e.g. loops, corners, strokes or intersections by analysing connectivity, e.g. edge linking, connected component analysis or slices
-
- G—PHYSICS
- G06—COMPUTING OR CALCULATING; COUNTING
- G06V—IMAGE OR VIDEO RECOGNITION OR UNDERSTANDING
- G06V30/00—Character recognition; Recognising digital ink; Document-oriented image-based pattern recognition
- G06V30/10—Character recognition
Landscapes
- Engineering & Computer Science (AREA)
- Physics & Mathematics (AREA)
- General Physics & Mathematics (AREA)
- Theoretical Computer Science (AREA)
- Multimedia (AREA)
- Computer Vision & Pattern Recognition (AREA)
- Character Input (AREA)
- Image Analysis (AREA)
- Facsimile Image Signal Circuits (AREA)
- Image Processing (AREA)
Applications Claiming Priority (3)
| Application Number | Priority Date | Filing Date | Title |
|---|---|---|---|
| US13/301,103 | 2011-11-21 | ||
| US13/301,103 US8611662B2 (en) | 2011-11-21 | 2011-11-21 | Text detection using multi-layer connected components with histograms |
| PCT/FI2012/050994 WO2013076358A1 (en) | 2011-11-21 | 2012-10-17 | Text detection using multi-layer connected components with histograms |
Publications (2)
| Publication Number | Publication Date |
|---|---|
| CN103946866A CN103946866A (zh) | 2014-07-23 |
| CN103946866B true CN103946866B (zh) | 2018-06-01 |
Family
ID=48427024
Family Applications (1)
| Application Number | Title | Priority Date | Filing Date |
|---|---|---|---|
| CN201280056944.XA Expired - Fee Related CN103946866B (zh) | 2011-11-21 | 2012-10-17 | 与直方图一起使用多层连接分量的文本检测 |
Country Status (7)
| Country | Link |
|---|---|
| US (1) | US8611662B2 (enExample) |
| EP (1) | EP2783328B1 (enExample) |
| JP (1) | JP5775225B2 (enExample) |
| KR (1) | KR101617681B1 (enExample) |
| CN (1) | CN103946866B (enExample) |
| IN (1) | IN2014CN04624A (enExample) |
| WO (1) | WO2013076358A1 (enExample) |
Families Citing this family (23)
| Publication number | Priority date | Publication date | Assignee | Title |
|---|---|---|---|---|
| US8731296B2 (en) * | 2011-04-21 | 2014-05-20 | Seiko Epson Corporation | Contact text detection in scanned images |
| US9064191B2 (en) | 2012-01-26 | 2015-06-23 | Qualcomm Incorporated | Lower modifier detection and extraction from devanagari text images to improve OCR performance |
| US9053361B2 (en) | 2012-01-26 | 2015-06-09 | Qualcomm Incorporated | Identifying regions of text to merge in a natural image or video frame |
| US9076242B2 (en) | 2012-07-19 | 2015-07-07 | Qualcomm Incorporated | Automatic correction of skew in natural images and video |
| US9183458B2 (en) * | 2012-07-19 | 2015-11-10 | Qualcomm Incorporated | Parameter selection and coarse localization of interest regions for MSER processing |
| US9262699B2 (en) | 2012-07-19 | 2016-02-16 | Qualcomm Incorporated | Method of handling complex variants of words through prefix-tree based decoding for Devanagiri OCR |
| US9141874B2 (en) | 2012-07-19 | 2015-09-22 | Qualcomm Incorporated | Feature extraction and use with a probability density function (PDF) divergence metric |
| US9047540B2 (en) | 2012-07-19 | 2015-06-02 | Qualcomm Incorporated | Trellis based word decoder with reverse pass |
| US9047528B1 (en) * | 2013-02-19 | 2015-06-02 | Amazon Technologies, Inc. | Identifying characters in grid-based text |
| US9928572B1 (en) | 2013-12-20 | 2018-03-27 | Amazon Technologies, Inc. | Label orientation |
| US9460357B2 (en) * | 2014-01-08 | 2016-10-04 | Qualcomm Incorporated | Processing text images with shadows |
| US9858304B2 (en) * | 2014-04-15 | 2018-01-02 | Raytheon Company | Computing cross-correlations for sparse data |
| US9183636B1 (en) * | 2014-04-16 | 2015-11-10 | I.R.I.S. | Line segmentation method |
| CN104182750B (zh) * | 2014-07-14 | 2017-08-01 | 上海交通大学 | 一种在自然场景图像中基于极值连通域的中文检测方法 |
| WO2016014020A1 (en) | 2014-07-21 | 2016-01-28 | Hewlett-Packard Development Company, L.P. | Radial histogram matching |
| US9235757B1 (en) * | 2014-07-24 | 2016-01-12 | Amazon Technologies, Inc. | Fast text detection |
| CN104751147A (zh) * | 2015-04-16 | 2015-07-01 | 成都汇智远景科技有限公司 | 一种图像识别方法 |
| CN104766095A (zh) * | 2015-04-16 | 2015-07-08 | 成都汇智远景科技有限公司 | 一种移动终端图像识别方法 |
| US9471990B1 (en) * | 2015-10-20 | 2016-10-18 | Interra Systems, Inc. | Systems and methods for detection of burnt-in text in a video |
| US10083353B2 (en) * | 2016-10-28 | 2018-09-25 | Intuit Inc. | Identifying document forms using digital fingerprints |
| CN107688806B (zh) * | 2017-08-21 | 2021-04-20 | 西北工业大学 | 一种基于仿射变换的自由场景文本检测方法 |
| CN108985288B (zh) * | 2018-07-17 | 2022-06-14 | 电子科技大学 | 一种基于TGMSERs的SAR图像溢油检测方法 |
| CN110008950A (zh) * | 2019-03-13 | 2019-07-12 | 南京大学 | 一种对形状鲁棒的自然场景中文本检测的方法 |
Family Cites Families (9)
| Publication number | Priority date | Publication date | Assignee | Title |
|---|---|---|---|---|
| JPS6413687A (en) * | 1987-07-07 | 1989-01-18 | Nec Corp | System for detecting character row |
| US5920655A (en) | 1995-02-10 | 1999-07-06 | Canon Kabushiki Kaisha | Binarization image processing for multi-level image data |
| JP3868654B2 (ja) * | 1998-03-27 | 2007-01-17 | 株式会社リコー | 画像処理装置 |
| JP4418726B2 (ja) * | 2004-10-01 | 2010-02-24 | 日本電信電話株式会社 | 文字列探索装置、探索方法およびこの方法のプログラム |
| US7570816B2 (en) | 2005-03-31 | 2009-08-04 | Microsoft Corporation | Systems and methods for detecting text |
| CN100565559C (zh) * | 2007-03-14 | 2009-12-02 | 中国科学院自动化研究所 | 基于连通分量和支持向量机的图像文本定位方法和装置 |
| CN101615252B (zh) * | 2008-06-25 | 2012-07-04 | 中国科学院自动化研究所 | 一种自适应图像文本信息提取方法 |
| US8189917B2 (en) | 2008-09-25 | 2012-05-29 | Sharp Laboratories Of America, Inc. | Methods and systems for locating text in a digital image |
| CN102163284B (zh) * | 2011-04-11 | 2013-02-27 | 西安电子科技大学 | 面向中文环境的复杂场景文本定位方法 |
-
2011
- 2011-11-21 US US13/301,103 patent/US8611662B2/en active Active
-
2012
- 2012-10-17 EP EP12851984.0A patent/EP2783328B1/en not_active Not-in-force
- 2012-10-17 IN IN4624CHN2014 patent/IN2014CN04624A/en unknown
- 2012-10-17 WO PCT/FI2012/050994 patent/WO2013076358A1/en not_active Ceased
- 2012-10-17 JP JP2014537674A patent/JP5775225B2/ja not_active Expired - Fee Related
- 2012-10-17 CN CN201280056944.XA patent/CN103946866B/zh not_active Expired - Fee Related
- 2012-10-17 KR KR1020147016856A patent/KR101617681B1/ko not_active Expired - Fee Related
Non-Patent Citations (1)
| Title |
|---|
| "TextFinder: An Automatic System to Detect and Recognize Text In Images";Victor Wu 等;《IEEE TRANSACTIONS ON PATTERN ANALYSIS AND MACHINE INTELLIGENCE》;19991130;第21卷(第11期);1-6 * |
Also Published As
| Publication number | Publication date |
|---|---|
| CN103946866A (zh) | 2014-07-23 |
| US20130129216A1 (en) | 2013-05-23 |
| EP2783328B1 (en) | 2018-08-22 |
| WO2013076358A1 (en) | 2013-05-30 |
| KR20140091762A (ko) | 2014-07-22 |
| JP2014531097A (ja) | 2014-11-20 |
| KR101617681B1 (ko) | 2016-05-11 |
| EP2783328A1 (en) | 2014-10-01 |
| EP2783328A4 (en) | 2016-09-28 |
| US8611662B2 (en) | 2013-12-17 |
| JP5775225B2 (ja) | 2015-09-09 |
| IN2014CN04624A (enExample) | 2015-09-18 |
Similar Documents
| Publication | Publication Date | Title |
|---|---|---|
| CN103946866B (zh) | 与直方图一起使用多层连接分量的文本检测 | |
| CN110348294B (zh) | Pdf文档中图表的定位方法、装置及计算机设备 | |
| CN104871180B (zh) | 用于ocr的基于文本图像质量的反馈 | |
| US9076056B2 (en) | Text detection in natural images | |
| CN110717497B (zh) | 图像相似度匹配方法、装置及计算机可读存储介质 | |
| Tabassum et al. | Text detection using MSER and stroke width transform | |
| US20200302135A1 (en) | Method and apparatus for localization of one-dimensional barcodes | |
| Shehu et al. | Character recognition using correlation & hamming distance | |
| KR101732359B1 (ko) | 이미지 내의 텍스트를 검출하는 방법 및 장치 | |
| US10496894B2 (en) | System and method for text localization in images | |
| Shetty et al. | Ote-OCR based text recognition and extraction from video frames | |
| Dave et al. | OCR text detector and audio convertor | |
| Vasilopoulos et al. | Unified layout analysis and text localization framework | |
| Wang et al. | Multiorientation scene text detection via coarse-to-fine supervision-based convolutional networks | |
| Valiente et al. | A process for text recognition of generic identification documents over cloud computing | |
| CN113743413B (zh) | 一种结合图像语义信息的视觉slam方法及系统 | |
| Selokar et al. | Automatic number plate recognition system using a fast stroke-based method | |
| Liu | Digits Recognition on Medical Device | |
| Nor et al. | Image segmentation and text extraction: application to the extraction of textual information in scene images | |
| Samuel et al. | Automatic Text Segmentation and Recognition in Natural Scene Images Using Msocr | |
| Shekar et al. | Text localization in video/scene images using Kirsch Directional Masks | |
| Soumya et al. | Text extraction from images: a survey | |
| Shabana et al. | Text detection and recognition in natural images | |
| Jambekar | A Review of Optical Character Recognition System for Recognition of Printed Text | |
| CN120496076A (zh) | 基于ocr技术的端子3d环形字符识别方法、装置、设备及介质 |
Legal Events
| Date | Code | Title | Description |
|---|---|---|---|
| C06 | Publication | ||
| PB01 | Publication | ||
| C10 | Entry into substantive examination | ||
| SE01 | Entry into force of request for substantive examination | ||
| C41 | Transfer of patent application or patent right or utility model | ||
| TA01 | Transfer of patent application right |
Effective date of registration: 20160201 Address after: Espoo, Finland Applicant after: Technology Co., Ltd. of Nokia Address before: Espoo, Finland Applicant before: Nokia Oyj |
|
| GR01 | Patent grant | ||
| GR01 | Patent grant | ||
| CF01 | Termination of patent right due to non-payment of annual fee |
Granted publication date: 20180601 Termination date: 20201017 |
|
| CF01 | Termination of patent right due to non-payment of annual fee |