CN103946866B - 与直方图一起使用多层连接分量的文本检测 - Google Patents

与直方图一起使用多层连接分量的文本检测 Download PDF

Info

Publication number
CN103946866B
CN103946866B CN201280056944.XA CN201280056944A CN103946866B CN 103946866 B CN103946866 B CN 103946866B CN 201280056944 A CN201280056944 A CN 201280056944A CN 103946866 B CN103946866 B CN 103946866B
Authority
CN
China
Prior art keywords
histogram
ratio set
connection component
spatial bins
bins
Prior art date
Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
Expired - Fee Related
Application number
CN201280056944.XA
Other languages
English (en)
Chinese (zh)
Other versions
CN103946866A (zh
Inventor
S-H·蔡
V·帕拉梅斯瓦兰
R·格泽茨克祖克
Current Assignee (The listed assignees may be inaccurate. Google has not performed a legal analysis and makes no representation or warranty as to the accuracy of the list.)
Nokia Technologies Oy
Original Assignee
Nokia Technologies Oy
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
Filing date
Publication date
Application filed by Nokia Technologies Oy filed Critical Nokia Technologies Oy
Publication of CN103946866A publication Critical patent/CN103946866A/zh
Application granted granted Critical
Publication of CN103946866B publication Critical patent/CN103946866B/zh
Expired - Fee Related legal-status Critical Current
Anticipated expiration legal-status Critical

Links

Classifications

    • GPHYSICS
    • G06COMPUTING OR CALCULATING; COUNTING
    • G06VIMAGE OR VIDEO RECOGNITION OR UNDERSTANDING
    • G06V20/00Scenes; Scene-specific elements
    • G06V20/60Type of objects
    • G06V20/62Text, e.g. of license plates, overlay texts or captions on TV images
    • GPHYSICS
    • G06COMPUTING OR CALCULATING; COUNTING
    • G06TIMAGE DATA PROCESSING OR GENERATION, IN GENERAL
    • G06T5/00Image enhancement or restoration
    • G06T5/40Image enhancement or restoration using histogram techniques
    • GPHYSICS
    • G06COMPUTING OR CALCULATING; COUNTING
    • G06VIMAGE OR VIDEO RECOGNITION OR UNDERSTANDING
    • G06V10/00Arrangements for image or video recognition or understanding
    • G06V10/20Image preprocessing
    • G06V10/26Segmentation of patterns in the image field; Cutting or merging of image elements to establish the pattern region, e.g. clustering-based techniques; Detection of occlusion
    • GPHYSICS
    • G06COMPUTING OR CALCULATING; COUNTING
    • G06VIMAGE OR VIDEO RECOGNITION OR UNDERSTANDING
    • G06V30/00Character recognition; Recognising digital ink; Document-oriented image-based pattern recognition
    • G06V30/10Character recognition
    • G06V30/18Extraction of features or characteristics of the image
    • G06V30/1801Detecting partial patterns, e.g. edges or contours, or configurations, e.g. loops, corners, strokes or intersections
    • G06V30/18076Detecting partial patterns, e.g. edges or contours, or configurations, e.g. loops, corners, strokes or intersections by analysing connectivity, e.g. edge linking, connected component analysis or slices
    • GPHYSICS
    • G06COMPUTING OR CALCULATING; COUNTING
    • G06VIMAGE OR VIDEO RECOGNITION OR UNDERSTANDING
    • G06V30/00Character recognition; Recognising digital ink; Document-oriented image-based pattern recognition
    • G06V30/10Character recognition

Landscapes

  • Engineering & Computer Science (AREA)
  • Physics & Mathematics (AREA)
  • General Physics & Mathematics (AREA)
  • Theoretical Computer Science (AREA)
  • Multimedia (AREA)
  • Computer Vision & Pattern Recognition (AREA)
  • Character Input (AREA)
  • Image Analysis (AREA)
  • Facsimile Image Signal Circuits (AREA)
  • Image Processing (AREA)
CN201280056944.XA 2011-11-21 2012-10-17 与直方图一起使用多层连接分量的文本检测 Expired - Fee Related CN103946866B (zh)

Applications Claiming Priority (3)

Application Number Priority Date Filing Date Title
US13/301,103 2011-11-21
US13/301,103 US8611662B2 (en) 2011-11-21 2011-11-21 Text detection using multi-layer connected components with histograms
PCT/FI2012/050994 WO2013076358A1 (en) 2011-11-21 2012-10-17 Text detection using multi-layer connected components with histograms

Publications (2)

Publication Number Publication Date
CN103946866A CN103946866A (zh) 2014-07-23
CN103946866B true CN103946866B (zh) 2018-06-01

Family

ID=48427024

Family Applications (1)

Application Number Title Priority Date Filing Date
CN201280056944.XA Expired - Fee Related CN103946866B (zh) 2011-11-21 2012-10-17 与直方图一起使用多层连接分量的文本检测

Country Status (7)

Country Link
US (1) US8611662B2 (enExample)
EP (1) EP2783328B1 (enExample)
JP (1) JP5775225B2 (enExample)
KR (1) KR101617681B1 (enExample)
CN (1) CN103946866B (enExample)
IN (1) IN2014CN04624A (enExample)
WO (1) WO2013076358A1 (enExample)

Families Citing this family (23)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
US8731296B2 (en) * 2011-04-21 2014-05-20 Seiko Epson Corporation Contact text detection in scanned images
US9064191B2 (en) 2012-01-26 2015-06-23 Qualcomm Incorporated Lower modifier detection and extraction from devanagari text images to improve OCR performance
US9053361B2 (en) 2012-01-26 2015-06-09 Qualcomm Incorporated Identifying regions of text to merge in a natural image or video frame
US9076242B2 (en) 2012-07-19 2015-07-07 Qualcomm Incorporated Automatic correction of skew in natural images and video
US9183458B2 (en) * 2012-07-19 2015-11-10 Qualcomm Incorporated Parameter selection and coarse localization of interest regions for MSER processing
US9262699B2 (en) 2012-07-19 2016-02-16 Qualcomm Incorporated Method of handling complex variants of words through prefix-tree based decoding for Devanagiri OCR
US9141874B2 (en) 2012-07-19 2015-09-22 Qualcomm Incorporated Feature extraction and use with a probability density function (PDF) divergence metric
US9047540B2 (en) 2012-07-19 2015-06-02 Qualcomm Incorporated Trellis based word decoder with reverse pass
US9047528B1 (en) * 2013-02-19 2015-06-02 Amazon Technologies, Inc. Identifying characters in grid-based text
US9928572B1 (en) 2013-12-20 2018-03-27 Amazon Technologies, Inc. Label orientation
US9460357B2 (en) * 2014-01-08 2016-10-04 Qualcomm Incorporated Processing text images with shadows
US9858304B2 (en) * 2014-04-15 2018-01-02 Raytheon Company Computing cross-correlations for sparse data
US9183636B1 (en) * 2014-04-16 2015-11-10 I.R.I.S. Line segmentation method
CN104182750B (zh) * 2014-07-14 2017-08-01 上海交通大学 一种在自然场景图像中基于极值连通域的中文检测方法
WO2016014020A1 (en) 2014-07-21 2016-01-28 Hewlett-Packard Development Company, L.P. Radial histogram matching
US9235757B1 (en) * 2014-07-24 2016-01-12 Amazon Technologies, Inc. Fast text detection
CN104751147A (zh) * 2015-04-16 2015-07-01 成都汇智远景科技有限公司 一种图像识别方法
CN104766095A (zh) * 2015-04-16 2015-07-08 成都汇智远景科技有限公司 一种移动终端图像识别方法
US9471990B1 (en) * 2015-10-20 2016-10-18 Interra Systems, Inc. Systems and methods for detection of burnt-in text in a video
US10083353B2 (en) * 2016-10-28 2018-09-25 Intuit Inc. Identifying document forms using digital fingerprints
CN107688806B (zh) * 2017-08-21 2021-04-20 西北工业大学 一种基于仿射变换的自由场景文本检测方法
CN108985288B (zh) * 2018-07-17 2022-06-14 电子科技大学 一种基于TGMSERs的SAR图像溢油检测方法
CN110008950A (zh) * 2019-03-13 2019-07-12 南京大学 一种对形状鲁棒的自然场景中文本检测的方法

Family Cites Families (9)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
JPS6413687A (en) * 1987-07-07 1989-01-18 Nec Corp System for detecting character row
US5920655A (en) 1995-02-10 1999-07-06 Canon Kabushiki Kaisha Binarization image processing for multi-level image data
JP3868654B2 (ja) * 1998-03-27 2007-01-17 株式会社リコー 画像処理装置
JP4418726B2 (ja) * 2004-10-01 2010-02-24 日本電信電話株式会社 文字列探索装置、探索方法およびこの方法のプログラム
US7570816B2 (en) 2005-03-31 2009-08-04 Microsoft Corporation Systems and methods for detecting text
CN100565559C (zh) * 2007-03-14 2009-12-02 中国科学院自动化研究所 基于连通分量和支持向量机的图像文本定位方法和装置
CN101615252B (zh) * 2008-06-25 2012-07-04 中国科学院自动化研究所 一种自适应图像文本信息提取方法
US8189917B2 (en) 2008-09-25 2012-05-29 Sharp Laboratories Of America, Inc. Methods and systems for locating text in a digital image
CN102163284B (zh) * 2011-04-11 2013-02-27 西安电子科技大学 面向中文环境的复杂场景文本定位方法

Non-Patent Citations (1)

* Cited by examiner, † Cited by third party
Title
"TextFinder: An Automatic System to Detect and Recognize Text In Images";Victor Wu 等;《IEEE TRANSACTIONS ON PATTERN ANALYSIS AND MACHINE INTELLIGENCE》;19991130;第21卷(第11期);1-6 *

Also Published As

Publication number Publication date
CN103946866A (zh) 2014-07-23
US20130129216A1 (en) 2013-05-23
EP2783328B1 (en) 2018-08-22
WO2013076358A1 (en) 2013-05-30
KR20140091762A (ko) 2014-07-22
JP2014531097A (ja) 2014-11-20
KR101617681B1 (ko) 2016-05-11
EP2783328A1 (en) 2014-10-01
EP2783328A4 (en) 2016-09-28
US8611662B2 (en) 2013-12-17
JP5775225B2 (ja) 2015-09-09
IN2014CN04624A (enExample) 2015-09-18

Similar Documents

Publication Publication Date Title
CN103946866B (zh) 与直方图一起使用多层连接分量的文本检测
CN110348294B (zh) Pdf文档中图表的定位方法、装置及计算机设备
CN104871180B (zh) 用于ocr的基于文本图像质量的反馈
US9076056B2 (en) Text detection in natural images
CN110717497B (zh) 图像相似度匹配方法、装置及计算机可读存储介质
Tabassum et al. Text detection using MSER and stroke width transform
US20200302135A1 (en) Method and apparatus for localization of one-dimensional barcodes
Shehu et al. Character recognition using correlation & hamming distance
KR101732359B1 (ko) 이미지 내의 텍스트를 검출하는 방법 및 장치
US10496894B2 (en) System and method for text localization in images
Shetty et al. Ote-OCR based text recognition and extraction from video frames
Dave et al. OCR text detector and audio convertor
Vasilopoulos et al. Unified layout analysis and text localization framework
Wang et al. Multiorientation scene text detection via coarse-to-fine supervision-based convolutional networks
Valiente et al. A process for text recognition of generic identification documents over cloud computing
CN113743413B (zh) 一种结合图像语义信息的视觉slam方法及系统
Selokar et al. Automatic number plate recognition system using a fast stroke-based method
Liu Digits Recognition on Medical Device
Nor et al. Image segmentation and text extraction: application to the extraction of textual information in scene images
Samuel et al. Automatic Text Segmentation and Recognition in Natural Scene Images Using Msocr
Shekar et al. Text localization in video/scene images using Kirsch Directional Masks
Soumya et al. Text extraction from images: a survey
Shabana et al. Text detection and recognition in natural images
Jambekar A Review of Optical Character Recognition System for Recognition of Printed Text
CN120496076A (zh) 基于ocr技术的端子3d环形字符识别方法、装置、设备及介质

Legal Events

Date Code Title Description
C06 Publication
PB01 Publication
C10 Entry into substantive examination
SE01 Entry into force of request for substantive examination
C41 Transfer of patent application or patent right or utility model
TA01 Transfer of patent application right

Effective date of registration: 20160201

Address after: Espoo, Finland

Applicant after: Technology Co., Ltd. of Nokia

Address before: Espoo, Finland

Applicant before: Nokia Oyj

GR01 Patent grant
GR01 Patent grant
CF01 Termination of patent right due to non-payment of annual fee

Granted publication date: 20180601

Termination date: 20201017

CF01 Termination of patent right due to non-payment of annual fee