JP5775225B2 - Text detection using multi-layer connected components with histograms - Google Patents

Text detection using multi-layer connected components with histograms

Info

Publication number
JP5775225B2
JP5775225B2 JP2014537674A JP2014537674A JP5775225B2 JP 5775225 B2 JP5775225 B2 JP 5775225B2 JP 2014537674 A JP2014537674 A JP 2014537674A JP 2014537674 A JP2014537674 A JP 2014537674A JP 5775225 B2 JP5775225 B2 JP 5775225B2
Authority
JP
Japan
Prior art keywords
spatial
bins
histogram
bin
connected components
Prior art date
Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
Expired - Fee Related
Application number
JP2014537674A
Other languages
English (en)
Japanese (ja)
Other versions
JP2014531097A (ja)
Inventor
シャン−シュアン ツァイ
ヴァスデーヴ パラメスワラン
ラデク グジェシュチャク
Current Assignee (The listed assignees may be inaccurate. Google has not performed a legal analysis and makes no representation or warranty as to the accuracy of the list.)
Nokia Oyj
Original Assignee
Nokia Oyj
Nokia Inc
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
2011-11-21
Filing date
2012-10-17
Publication date
2015-09-09
Application filed by Nokia Oyj, Nokia Inc
Publication of JP2014531097A
Application granted
Publication of JP5775225B2
Expired - Fee Related
Anticipated expiration

Classifications

    • G PHYSICS
    • G06 COMPUTING OR CALCULATING; COUNTING
    • G06V IMAGE OR VIDEO RECOGNITION OR UNDERSTANDING
    • G06V20/00 Scenes; Scene-specific elements
    • G06V20/60 Type of objects
    • G06V20/62 Text, e.g. of license plates, overlay texts or captions on TV images
    • G PHYSICS
    • G06 COMPUTING OR CALCULATING; COUNTING
    • G06T IMAGE DATA PROCESSING OR GENERATION, IN GENERAL
    • G06T5/00 Image enhancement or restoration
    • G06T5/40 Image enhancement or restoration using histogram techniques
    • G PHYSICS
    • G06 COMPUTING OR CALCULATING; COUNTING
    • G06V IMAGE OR VIDEO RECOGNITION OR UNDERSTANDING
    • G06V10/00 Arrangements for image or video recognition or understanding
    • G06V10/20 Image preprocessing
    • G06V10/26 Segmentation of patterns in the image field; Cutting or merging of image elements to establish the pattern region, e.g. clustering-based techniques; Detection of occlusion
    • G PHYSICS
    • G06 COMPUTING OR CALCULATING; COUNTING
    • G06V IMAGE OR VIDEO RECOGNITION OR UNDERSTANDING
    • G06V30/00 Character recognition; Recognising digital ink; Document-oriented image-based pattern recognition
    • G06V30/10 Character recognition
    • G06V30/18 Extraction of features or characteristics of the image
    • G06V30/1801 Detecting partial patterns, e.g. edges or contours, or configurations, e.g. loops, corners, strokes or intersections
    • G06V30/18076 Detecting partial patterns, e.g. edges or contours, or configurations, e.g. loops, corners, strokes or intersections by analysing connectivity, e.g. edge linking, connected component analysis or slices
    • G PHYSICS
    • G06 COMPUTING OR CALCULATING; COUNTING
    • G06V IMAGE OR VIDEO RECOGNITION OR UNDERSTANDING
    • G06V30/00 Character recognition; Recognising digital ink; Document-oriented image-based pattern recognition
    • G06V30/10 Character recognition

Landscapes

  • Engineering & Computer Science (AREA)
  • Physics & Mathematics (AREA)
  • General Physics & Mathematics (AREA)
  • Theoretical Computer Science (AREA)
  • Multimedia (AREA)
  • Computer Vision & Pattern Recognition (AREA)
  • Image Analysis (AREA)
  • Character Input (AREA)
  • Facsimile Image Signal Circuits (AREA)
  • Image Processing (AREA)
JP2014537674A 2011-11-21 2012-10-17 Text detection using multi-layer connected components with histograms Expired - Fee Related JP5775225B2 (ja)

Applications Claiming Priority (3)

Application Number Priority Date Filing Date Title
US13/301,103 2011-11-21
US13/301,103 US8611662B2 (en) 2011-11-21 2011-11-21 Text detection using multi-layer connected components with histograms
PCT/FI2012/050994 WO2013076358A1 (en) 2011-11-21 2012-10-17 Text detection using multi-layer connected components with histograms

Publications (2)

Publication Number Publication Date
JP2014531097A (ja) 2014-11-20
JP5775225B2 (ja) 2015-09-09

Family

ID=48427024

Family Applications (1)

Application Number Title Priority Date Filing Date
JP2014537674A Expired - Fee Related JP5775225B2 (ja) 2011-11-21 2012-10-17 Text detection using multi-layer connected components with histograms

Country Status (7)

Country Link
US (1) US8611662B2 (en)
EP (1) EP2783328B1 (en)
JP (1) JP5775225B2 (en)
KR (1) KR101617681B1 (en)
CN (1) CN103946866B (en)
IN (1) IN2014CN04624A (en)
WO (1) WO2013076358A1 (en)

Families Citing this family (23)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
US8731296B2 (en) * 2011-04-21 2014-05-20 Seiko Epson Corporation Contact text detection in scanned images
US9064191B2 (en) 2012-01-26 2015-06-23 Qualcomm Incorporated Lower modifier detection and extraction from devanagari text images to improve OCR performance
US9053361B2 (en) 2012-01-26 2015-06-09 Qualcomm Incorporated Identifying regions of text to merge in a natural image or video frame
US9076242B2 (en) 2012-07-19 2015-07-07 Qualcomm Incorporated Automatic correction of skew in natural images and video
US9183458B2 (en) * 2012-07-19 2015-11-10 Qualcomm Incorporated Parameter selection and coarse localization of interest regions for MSER processing
US9262699B2 (en) 2012-07-19 2016-02-16 Qualcomm Incorporated Method of handling complex variants of words through prefix-tree based decoding for Devanagiri OCR
US9141874B2 (en) 2012-07-19 2015-09-22 Qualcomm Incorporated Feature extraction and use with a probability density function (PDF) divergence metric
US9047540B2 (en) 2012-07-19 2015-06-02 Qualcomm Incorporated Trellis based word decoder with reverse pass
US9047528B1 (en) * 2013-02-19 2015-06-02 Amazon Technologies, Inc. Identifying characters in grid-based text
US9928572B1 (en) 2013-12-20 2018-03-27 Amazon Technologies, Inc. Label orientation
US9460357B2 (en) * 2014-01-08 2016-10-04 Qualcomm Incorporated Processing text images with shadows
US9858304B2 (en) * 2014-04-15 2018-01-02 Raytheon Company Computing cross-correlations for sparse data
US9183636B1 (en) * 2014-04-16 2015-11-10 I.R.I.S. Line segmentation method
CN104182750B (zh) * 2014-07-14 2017-08-01 上海交通大学 Chinese text detection method based on extremal connected regions in natural scene images
WO2016014020A1 (en) 2014-07-21 2016-01-28 Hewlett-Packard Development Company, L.P. Radial histogram matching
US9235757B1 (en) * 2014-07-24 2016-01-12 Amazon Technologies, Inc. Fast text detection
CN104751147A (zh) * 2015-04-16 2015-07-01 成都汇智远景科技有限公司 Image recognition method
CN104766095A (zh) * 2015-04-16 2015-07-08 成都汇智远景科技有限公司 Image recognition method for mobile terminals
US9471990B1 (en) * 2015-10-20 2016-10-18 Interra Systems, Inc. Systems and methods for detection of burnt-in text in a video
US10083353B2 (en) * 2016-10-28 2018-09-25 Intuit Inc. Identifying document forms using digital fingerprints
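CN107688806B (zh) * 2017-08-21 2021-04-20 西北工业大学 Free-scene text detection method based on affine transformation
CN108985288B (zh) * 2018-07-17 2022-06-14 电子科技大学 SAR image oil spill detection method based on TGMSERs
CN110008950A (zh) * 2019-03-13 2019-07-12 南京大学 Shape-robust method for text detection in natural scenes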

Family Cites Families (9)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
JPS6413687A (en) * 1987-07-07 1989-01-18 Nec Corp System for detecting character row
US5920655A (en) 1995-02-10 1999-07-06 Canon Kabushiki Kaisha Binarization image processing for multi-level image data
JP3868654B2 (ja) * 1998-03-27 2007-01-17 株式会社リコー Image processing apparatus
JP4418726B2 (ja) * 2004-10-01 2010-02-24 日本電信電話株式会社 Character string search device, search method, and program for the method
US7570816B2 (en) 2005-03-31 2009-08-04 Microsoft Corporation Systems and methods for detecting text
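CN100565559C (zh) * 2007-03-14 2009-12-02 中国科学院自动化研究所 Image text localization method and apparatus based on connected components and support vector machines
CN101615252B (zh) * 2008-06-25 2012-07-04 中国科学院自动化研究所 Adaptive method for extracting text information from images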
US8189917B2 (en) 2008-09-25 2012-05-29 Sharp Laboratories Of America, Inc. Methods and systems for locating text in a digital image
CN102163284B (zh) * 2011-04-11 2013-02-27 西安电子科技大学 Complex-scene text localization method for Chinese-language environments

Also Published As

Publication number Publication date
CN103946866A (zh) 2014-07-23
US20130129216A1 (en) 2013-05-23
EP2783328B1 (en) 2018-08-22
WO2013076358A1 (en) 2013-05-30
KR20140091762A (ko) 2014-07-22
JP2014531097A (ja) 2014-11-20
CN103946866B (zh) 2018-06-01
KR101617681B1 (ko) 2016-05-11
EP2783328A1 (en) 2014-10-01
EP2783328A4 (en) 2016-09-28
US8611662B2 (en) 2013-12-17
IN2014CN04624A (en) 2015-09-18

Similar Documents

Publication Publication Date Title
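JP5775225B2 (ja) Text detection using multi-layer connected components with histograms
CN110414507B (zh) License plate recognition method, apparatus, computer device, and storage medium
CN104871180B (zh) Text image quality based feedback for OCR
US9053361B2 (en) Identifying regions of text to merge in a natural image or video frame
CN109918987B (zh) Method and apparatus for recognizing keywords in video subtitles
CN109583345B (zh) Road recognition method, apparatus, computer device, and computer-readable storage medium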
US9076056B2 (en) Text detection in natural images
US9171224B2 (en) Method of improving contrast for text extraction and recognition applications
WO2014092979A1 (en) Method of perspective correction for devanagari text
Tabassum et al. Text detection using MSER and stroke width transform
CN110852311A (zh) Method and apparatus for locating three-dimensional human hand keypoints
Yasmeen et al. Text detection and classification from low quality natural images
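CN112686122A (zh) Method, apparatus, electronic device, and storage medium for detecting human bodies and shadows
KR101732359B1 (ko) Method and apparatus for detecting text in an image
CN113228105A (zh) Image processing method, apparatus, and electronic device
CN115270841A (zh) Barcode detection method, apparatus, storage medium, and computer device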
Vidhyalakshmi et al. Text detection in natural images with hybrid stroke feature transform and high performance deep Convnet computing
Wang et al. Multiorientation scene text detection via coarse-to-fine supervision-based convolutional networks
CN113743413B (zh) Visual SLAM method and system incorporating image semantic information
Arai et al. Text extraction from TV commercial using blob extraction method
Liu Digits Recognition on Medical Device
Nor et al. Image segmentation and text extraction: application to the extraction of textual information in scene images
CN114170536B (zh) Recognition and detection method, apparatus, and system for partially occluded targets
Shabana et al. Text detection and recognition in natural images
Yang et al. A skeleton based binarization approach for video text recognition

Legal Events

Date Code Title Description
A621 Written request for application examination

Free format text: JAPANESE INTERMEDIATE CODE: A621

Effective date: 20140425

A977 Report on retrieval

Free format text: JAPANESE INTERMEDIATE CODE: A971007

Effective date: 20150304

A131 Notification of reasons for refusal

Free format text: JAPANESE INTERMEDIATE CODE: A131

Effective date: 20150310

A521 Request for written amendment filed

Free format text: JAPANESE INTERMEDIATE CODE: A523

Effective date: 20150529

TRDD Decision of grant or rejection written
A01 Written decision to grant a patent or to grant a registration (utility model)

Free format text: JAPANESE INTERMEDIATE CODE: A01

Effective date: 20150622

A61 First payment of annual fees (during grant procedure)

Free format text: JAPANESE INTERMEDIATE CODE: A61

Effective date: 20150702

R150 Certificate of patent or registration of utility model

Ref document number: 5775225

Country of ref document: JP

Free format text: JAPANESE INTERMEDIATE CODE: R150

S111 Request for change of ownership or part of ownership

Free format text: JAPANESE INTERMEDIATE CODE: R313113

R350 Written notification of registration of transfer

Free format text: JAPANESE INTERMEDIATE CODE: R350

R250 Receipt of annual fees

Free format text: JAPANESE INTERMEDIATE CODE: R250

R250 Receipt of annual fees

Free format text: JAPANESE INTERMEDIATE CODE: R250

LAPS Cancellation because of no payment of annual fees