IN2014CN04624A - - Google Patents

Download PDF

Info

Publication number
IN2014CN04624A
IN2014CN04624A IN4624CHN2014A IN2014CN04624A IN 2014CN04624 A IN2014CN04624 A IN 2014CN04624A IN 4624CHN2014 A IN4624CHN2014 A IN 4624CHN2014A IN 2014CN04624 A IN2014CN04624 A IN 2014CN04624A
Authority
IN
India
Prior art keywords
bins
connected components
scale sets
scale
spatial
Prior art date
Application number
Inventor
Shang Hsuan Tsai
Vasudev Parameswaran
Radek Grzeszczuk
Original Assignee
Nokia Corp
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
Filing date
Publication date
Application filed by Nokia Corp filed Critical Nokia Corp
Publication of IN2014CN04624A publication Critical patent/IN2014CN04624A/en

Links

Classifications

    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06VIMAGE OR VIDEO RECOGNITION OR UNDERSTANDING
    • G06V20/00Scenes; Scene-specific elements
    • G06V20/60Type of objects
    • G06V20/62Text, e.g. of license plates, overlay texts or captions on TV images
    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06TIMAGE DATA PROCESSING OR GENERATION, IN GENERAL
    • G06T5/00Image enhancement or restoration
    • G06T5/40Image enhancement or restoration using histogram techniques
    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06VIMAGE OR VIDEO RECOGNITION OR UNDERSTANDING
    • G06V10/00Arrangements for image or video recognition or understanding
    • G06V10/20Image preprocessing
    • G06V10/26Segmentation of patterns in the image field; Cutting or merging of image elements to establish the pattern region, e.g. clustering-based techniques; Detection of occlusion
    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06VIMAGE OR VIDEO RECOGNITION OR UNDERSTANDING
    • G06V30/00Character recognition; Recognising digital ink; Document-oriented image-based pattern recognition
    • G06V30/10Character recognition
    • G06V30/18Extraction of features or characteristics of the image
    • G06V30/1801Detecting partial patterns, e.g. edges or contours, or configurations, e.g. loops, corners, strokes or intersections
    • G06V30/18076Detecting partial patterns, e.g. edges or contours, or configurations, e.g. loops, corners, strokes or intersections by analysing connectivity, e.g. edge linking, connected component analysis or slices
    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06VIMAGE OR VIDEO RECOGNITION OR UNDERSTANDING
    • G06V30/00Character recognition; Recognising digital ink; Document-oriented image-based pattern recognition
    • G06V30/10Character recognition

Landscapes

  • Engineering & Computer Science (AREA)
  • Physics & Mathematics (AREA)
  • General Physics & Mathematics (AREA)
  • Theoretical Computer Science (AREA)
  • Multimedia (AREA)
  • Computer Vision & Pattern Recognition (AREA)
  • Image Analysis (AREA)
  • Character Input (AREA)
  • Image Processing (AREA)
  • Facsimile Image Signal Circuits (AREA)

Abstract

A digital image is converted to a multiple level image and multiple scale sets are formed from connected components of the multiple level image such that different ones of the scale sets define different size spatial bins. For each of the multiple scale sets there is generated a count of connected components extracted from the respective scale set for each spatial bin; and adjacent spatial bins which represent connected components are linked. Then the connected components from the different scale sets are merged and text line detection is performed on the merged connected components. In one embodiment each of the scale sets is a histogram and prior to linking all bins with less than a predetermined count are filtered out; and each histogram is extended such that counts of adjacent horizontal and vertical bins are added (single region bins are filtered out) and the linking is on the extended histograms.
IN4624CHN2014 2011-11-21 2012-10-17 IN2014CN04624A (en)

Applications Claiming Priority (2)

Application Number Priority Date Filing Date Title
US13/301,103 US8611662B2 (en) 2011-11-21 2011-11-21 Text detection using multi-layer connected components with histograms
PCT/FI2012/050994 WO2013076358A1 (en) 2011-11-21 2012-10-17 Text detection using multi-layer connected components with histograms

Publications (1)

Publication Number Publication Date
IN2014CN04624A true IN2014CN04624A (en) 2015-09-18

Family

ID=48427024

Family Applications (1)

Application Number Title Priority Date Filing Date
IN4624CHN2014 IN2014CN04624A (en) 2011-11-21 2012-10-17

Country Status (7)

Country Link
US (1) US8611662B2 (en)
EP (1) EP2783328B1 (en)
JP (1) JP5775225B2 (en)
KR (1) KR101617681B1 (en)
CN (1) CN103946866B (en)
IN (1) IN2014CN04624A (en)
WO (1) WO2013076358A1 (en)

Families Citing this family (23)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
US8731296B2 (en) * 2011-04-21 2014-05-20 Seiko Epson Corporation Contact text detection in scanned images
US9053361B2 (en) 2012-01-26 2015-06-09 Qualcomm Incorporated Identifying regions of text to merge in a natural image or video frame
US9064191B2 (en) 2012-01-26 2015-06-23 Qualcomm Incorporated Lower modifier detection and extraction from devanagari text images to improve OCR performance
US9141874B2 (en) 2012-07-19 2015-09-22 Qualcomm Incorporated Feature extraction and use with a probability density function (PDF) divergence metric
US9076242B2 (en) 2012-07-19 2015-07-07 Qualcomm Incorporated Automatic correction of skew in natural images and video
US9047540B2 (en) 2012-07-19 2015-06-02 Qualcomm Incorporated Trellis based word decoder with reverse pass
US9014480B2 (en) 2012-07-19 2015-04-21 Qualcomm Incorporated Identifying a maximally stable extremal region (MSER) in an image by skipping comparison of pixels in the region
US9262699B2 (en) 2012-07-19 2016-02-16 Qualcomm Incorporated Method of handling complex variants of words through prefix-tree based decoding for Devanagiri OCR
US9047528B1 (en) * 2013-02-19 2015-06-02 Amazon Technologies, Inc. Identifying characters in grid-based text
US9928572B1 (en) 2013-12-20 2018-03-27 Amazon Technologies, Inc. Label orientation
US9460357B2 (en) * 2014-01-08 2016-10-04 Qualcomm Incorporated Processing text images with shadows
US9858304B2 (en) * 2014-04-15 2018-01-02 Raytheon Company Computing cross-correlations for sparse data
US9183636B1 (en) * 2014-04-16 2015-11-10 I.R.I.S. Line segmentation method
CN104182750B (en) * 2014-07-14 2017-08-01 上海交通大学 A kind of Chinese detection method based on extreme value connected domain in natural scene image
WO2016014020A1 (en) 2014-07-21 2016-01-28 Hewlett-Packard Development Company, L.P. Radial histogram matching
US9235757B1 (en) * 2014-07-24 2016-01-12 Amazon Technologies, Inc. Fast text detection
CN104766095A (en) * 2015-04-16 2015-07-08 成都汇智远景科技有限公司 Mobile terminal image identification method
CN104751147A (en) * 2015-04-16 2015-07-01 成都汇智远景科技有限公司 Image recognition method
US9471990B1 (en) * 2015-10-20 2016-10-18 Interra Systems, Inc. Systems and methods for detection of burnt-in text in a video
US10083353B2 (en) * 2016-10-28 2018-09-25 Intuit Inc. Identifying document forms using digital fingerprints
CN107688806B (en) * 2017-08-21 2021-04-20 西北工业大学 Affine transformation-based free scene text detection method
CN108985288B (en) * 2018-07-17 2022-06-14 电子科技大学 TGMSERs-based SAR image oil spill detection method
CN110008950A (en) * 2019-03-13 2019-07-12 南京大学 The method of text detection in the natural scene of a kind of pair of shape robust

Family Cites Families (9)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
JPS6413687A (en) * 1987-07-07 1989-01-18 Nec Corp System for detecting character row
US5920655A (en) 1995-02-10 1999-07-06 Canon Kabushiki Kaisha Binarization image processing for multi-level image data
JP3868654B2 (en) * 1998-03-27 2007-01-17 株式会社リコー Image processing device
JP4418726B2 (en) * 2004-10-01 2010-02-24 日本電信電話株式会社 Character string search device, search method, and program for this method
US7570816B2 (en) 2005-03-31 2009-08-04 Microsoft Corporation Systems and methods for detecting text
CN100565559C (en) * 2007-03-14 2009-12-02 中国科学院自动化研究所 Image text location method and device based on connected component and support vector machine
CN101615252B (en) * 2008-06-25 2012-07-04 中国科学院自动化研究所 Method for extracting text information from adaptive images
US8189917B2 (en) 2008-09-25 2012-05-29 Sharp Laboratories Of America, Inc. Methods and systems for locating text in a digital image
CN102163284B (en) * 2011-04-11 2013-02-27 西安电子科技大学 Chinese environment-oriented complex scene text positioning method

Also Published As

Publication number Publication date
JP2014531097A (en) 2014-11-20
EP2783328B1 (en) 2018-08-22
WO2013076358A1 (en) 2013-05-30
KR20140091762A (en) 2014-07-22
EP2783328A1 (en) 2014-10-01
EP2783328A4 (en) 2016-09-28
CN103946866B (en) 2018-06-01
CN103946866A (en) 2014-07-23
KR101617681B1 (en) 2016-05-11
US8611662B2 (en) 2013-12-17
JP5775225B2 (en) 2015-09-09
US20130129216A1 (en) 2013-05-23

Similar Documents

Publication Publication Date Title
IN2014CN04624A (en)
Aad et al. Search for long-lived, heavy particles in final states with a muon and multi-track displaced vertex in proton–proton collisions at s= 7TeV with the ATLAS detector
MX343875B (en) Method and system for determining image similarity.
GB2544237A (en) Method and system for identifying relevant media content
BR112015014945A2 (en) remote phototestimography monitoring system, remote photoplestimography monitoring method, and computer program
TWI563400B (en) Method, computer program product and system for extracting semantic relationships from table structures in electronic documents
MX2011013468A (en) Disambiguating pointers by imaging multiple touch-input zones.
GB201312213D0 (en) Compact and robust signature for large scale visual search,retrieval and classification
UA115570C2 (en) Method and system for processing ore-containing material
GB2541608A (en) Selection of thumbnails for video segments
EP2600309A3 (en) Foreground subject detection
EP2385441A3 (en) Mobile terminal and image display method therein
MX2017012505A (en) Setting different background model sensitivities by user defined regions and background filters.
WO2012135220A3 (en) Real-time depth extraction using stereo correspondence
GB2550777A (en) Classification and storage of documents
GB201203858D0 (en) Automated processing of documents
MX2014014763A (en) Image processing method and apparatus.
GB2513815A (en) System, method, and interfaces for work product management
IN2013MU03662A (en)
MX2015009736A (en) Picture sorting method and apparatus.
MX355628B (en) System and method for counting zooplankton.
GB201216254D0 (en) Method, apparatus and manufacture for smiling face detection
WO2014201065A3 (en) User experience for capturing and reconciling items
MY179329A (en) Three-dimensional object detection device
JP2015204023A5 (en)