EP2275974A3 - System and method for classifying connected groups of foreground pixels in scanned document images according to the type of marking - Google Patents

System and method for classifying connected groups of foreground pixels in scanned document images according to the type of marking Download PDF

Info

Publication number
EP2275974A3
EP2275974A3 EP10168842.2A EP10168842A EP2275974A3 EP 2275974 A3 EP2275974 A3 EP 2275974A3 EP 10168842 A EP10168842 A EP 10168842A EP 2275974 A3 EP2275974 A3 EP 2275974A3
Authority
EP
European Patent Office
Prior art keywords
marking
classifier
images
fragments
type
Prior art date
Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
Ceased
Application number
EP10168842.2A
Other languages
German (de)
English (en)
French (fr)
Other versions
EP2275974A2 (en
Inventor
Prateek Sarkar
Eric Saund
Current Assignee (The listed assignees may be inaccurate. Google has not performed a legal analysis and makes no representation or warranty as to the accuracy of the list.)
Palo Alto Research Center Inc
Original Assignee
Palo Alto Research Center Inc
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
Filing date
Publication date
Application filed by Palo Alto Research Center Inc filed Critical Palo Alto Research Center Inc
Publication of EP2275974A2 publication Critical patent/EP2275974A2/en
Publication of EP2275974A3 publication Critical patent/EP2275974A3/en
Ceased legal-status Critical Current

Links

Classifications

    • GPHYSICS
    • G06COMPUTING OR CALCULATING; COUNTING
    • G06VIMAGE OR VIDEO RECOGNITION OR UNDERSTANDING
    • G06V30/00Character recognition; Recognising digital ink; Document-oriented image-based pattern recognition
    • G06V30/10Character recognition
    • G06V30/14Image acquisition
    • G06V30/148Segmentation of character regions
    • G06V30/15Cutting or merging image elements, e.g. region growing, watershed or clustering-based techniques
    • GPHYSICS
    • G06COMPUTING OR CALCULATING; COUNTING
    • G06FELECTRIC DIGITAL DATA PROCESSING
    • G06F18/00Pattern recognition
    • G06F18/20Analysing
    • G06F18/21Design or setup of recognition systems or techniques; Extraction of features in feature space; Blind source separation
    • G06F18/214Generating training patterns; Bootstrap methods, e.g. bagging or boosting
    • GPHYSICS
    • G06COMPUTING OR CALCULATING; COUNTING
    • G06VIMAGE OR VIDEO RECOGNITION OR UNDERSTANDING
    • G06V30/00Character recognition; Recognising digital ink; Document-oriented image-based pattern recognition
    • G06V30/10Character recognition
    • G06V30/18Extraction of features or characteristics of the image
    • G06V30/1801Detecting partial patterns, e.g. edges or contours, or configurations, e.g. loops, corners, strokes or intersections
    • G06V30/18076Detecting partial patterns, e.g. edges or contours, or configurations, e.g. loops, corners, strokes or intersections by analysing connectivity, e.g. edge linking, connected component analysis or slices
    • GPHYSICS
    • G06COMPUTING OR CALCULATING; COUNTING
    • G06VIMAGE OR VIDEO RECOGNITION OR UNDERSTANDING
    • G06V30/00Character recognition; Recognising digital ink; Document-oriented image-based pattern recognition
    • G06V30/10Character recognition
    • G06V30/19Recognition using electronic means
    • G06V30/191Design or setup of recognition systems or techniques; Extraction of features in feature space; Clustering techniques; Blind source separation
    • G06V30/19147Obtaining sets of training patterns; Bootstrap methods, e.g. bagging or boosting
    • GPHYSICS
    • G06COMPUTING OR CALCULATING; COUNTING
    • G06VIMAGE OR VIDEO RECOGNITION OR UNDERSTANDING
    • G06V30/00Character recognition; Recognising digital ink; Document-oriented image-based pattern recognition
    • G06V30/40Document-oriented image-based pattern recognition
    • G06V30/41Analysis of document content
    • G06V30/413Classification of content, e.g. text, photographs or tables
    • GPHYSICS
    • G06COMPUTING OR CALCULATING; COUNTING
    • G06VIMAGE OR VIDEO RECOGNITION OR UNDERSTANDING
    • G06V30/00Character recognition; Recognising digital ink; Document-oriented image-based pattern recognition
    • G06V30/10Character recognition

Landscapes

  • Engineering & Computer Science (AREA)
  • Computer Vision & Pattern Recognition (AREA)
  • Theoretical Computer Science (AREA)
  • Physics & Mathematics (AREA)
  • General Physics & Mathematics (AREA)
  • Multimedia (AREA)
  • Artificial Intelligence (AREA)
  • Data Mining & Analysis (AREA)
  • Life Sciences & Earth Sciences (AREA)
  • Bioinformatics & Cheminformatics (AREA)
  • Bioinformatics & Computational Biology (AREA)
  • Evolutionary Biology (AREA)
  • Evolutionary Computation (AREA)
  • General Engineering & Computer Science (AREA)
  • Image Analysis (AREA)
  • Character Input (AREA)
EP10168842.2A 2009-07-10 2010-07-08 System and method for classifying connected groups of foreground pixels in scanned document images according to the type of marking Ceased EP2275974A3 (en)

Applications Claiming Priority (1)

Application Number Priority Date Filing Date Title
US12/501,187 US8442319B2 (en) 2009-07-10 2009-07-10 System and method for classifying connected groups of foreground pixels in scanned document images according to the type of marking

Publications (2)

Publication Number Publication Date
EP2275974A2 EP2275974A2 (en) 2011-01-19
EP2275974A3 true EP2275974A3 (en) 2014-12-31

Family

ID=43034592

Family Applications (1)

Application Number Title Priority Date Filing Date
EP10168842.2A Ceased EP2275974A3 (en) 2009-07-10 2010-07-08 System and method for classifying connected groups of foreground pixels in scanned document images according to the type of marking

Country Status (3)

Country Link
US (1) US8442319B2 (enExample)
EP (1) EP2275974A3 (enExample)
JP (1) JP5379085B2 (enExample)

Families Citing this family (40)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
US8649600B2 (en) * 2009-07-10 2014-02-11 Palo Alto Research Center Incorporated System and method for segmenting text lines in documents
US8452086B2 (en) * 2009-07-10 2013-05-28 Palo Alto Research Center Incorporated System and user interface for machine-assisted human labeling of pixels in an image
US8442319B2 (en) * 2009-07-10 2013-05-14 Palo Alto Research Center Incorporated System and method for classifying connected groups of foreground pixels in scanned document images according to the type of marking
US20120092374A1 (en) * 2010-10-19 2012-04-19 Apple Inc. Systems, methods, and computer-readable media for placing a representation of the captured signature in a document
CN102622724A (zh) * 2011-01-27 2012-08-01 鸿富锦精密工业(深圳)有限公司 外观专利图像切割方法及系统
DE102011082866A1 (de) * 2011-09-16 2013-03-21 Olaf Rudolph Verfahren zum Erkennen einer oder meherer gleichzeitig auftretender Teilenladungs-Quellen
US8792730B2 (en) * 2012-03-07 2014-07-29 Ricoh Co., Ltd. Classification and standardization of field images associated with a field in a form
US9152884B2 (en) * 2012-06-05 2015-10-06 Drvision Technologies Llc Teachable pattern scoring method
US8879855B2 (en) * 2012-08-17 2014-11-04 Nec Laboratories America, Inc. Image segmentation for large-scale fine-grained recognition
US9280206B2 (en) * 2012-08-20 2016-03-08 Samsung Electronics Co., Ltd. System and method for perceiving images with multimodal feedback
US9235781B2 (en) * 2013-08-09 2016-01-12 Kabushiki Kaisha Toshiba Method of, and apparatus for, landmark location
US9245205B1 (en) * 2013-10-16 2016-01-26 Xerox Corporation Supervised mid-level features for word image representation
US9940511B2 (en) * 2014-05-30 2018-04-10 Kofax, Inc. Machine print, hand print, and signature discrimination
US10120843B2 (en) 2014-08-26 2018-11-06 International Business Machines Corporation Generation of parsable data for deep parsing
US11100650B2 (en) * 2016-03-31 2021-08-24 Sony Depthsensing Solutions Sa/Nv Method for foreground and background determination in an image
US10607228B1 (en) * 2016-08-24 2020-03-31 Jpmorgan Chase Bank, N.A. Dynamic rule strategy and fraud detection system and method
US10354161B2 (en) * 2017-06-05 2019-07-16 Intuit, Inc. Detecting font size in a digital image
US10163022B1 (en) * 2017-06-22 2018-12-25 StradVision, Inc. Method for learning text recognition, method for recognizing text using the same, and apparatus for learning text recognition, apparatus for recognizing text using the same
US11416546B2 (en) * 2018-03-20 2022-08-16 Hulu, LLC Content type detection in videos using multiple classifiers
CN108960290A (zh) * 2018-06-08 2018-12-07 Oppo广东移动通信有限公司 图像处理方法、装置、计算机可读存储介质和电子设备
US10685261B2 (en) * 2018-06-11 2020-06-16 GM Global Technology Operations LLC Active segmention of scanned images based on deep reinforcement learning for OCR applications
JP7262993B2 (ja) * 2018-12-19 2023-04-24 キヤノン株式会社 画像処理システム、画像処理方法、画像処理装置
US11462037B2 (en) 2019-01-11 2022-10-04 Walmart Apollo, Llc System and method for automated analysis of electronic travel data
US10671892B1 (en) 2019-03-31 2020-06-02 Hyper Labs, Inc. Apparatuses, methods, and systems for 3-channel dynamic contextual script recognition using neural network image analytics and 4-tuple machine learning with enhanced templates and context data
US11106891B2 (en) 2019-09-09 2021-08-31 Morgan Stanley Services Group Inc. Automated signature extraction and verification
JP7431005B2 (ja) * 2019-09-20 2024-02-14 Toppanエッジ株式会社 学習データ生成装置、学習データ生成方法、及びプログラム
US11200411B2 (en) * 2019-10-16 2021-12-14 The Toronto-Dominion Bank Training a card type classifier with simulated card images
US12175337B2 (en) * 2020-08-04 2024-12-24 Bentley Systems, Incorporated Techniques for extracting machine-readable information from P and IDs
KR102509343B1 (ko) * 2020-11-17 2023-03-13 아주대학교산학협력단 이미지의 레이아웃 분석 방법 및 시스템
US11704352B2 (en) 2021-05-03 2023-07-18 Bank Of America Corporation Automated categorization and assembly of low-quality images into electronic documents
US11798258B2 (en) 2021-05-03 2023-10-24 Bank Of America Corporation Automated categorization and assembly of low-quality images into electronic documents
US11881041B2 (en) 2021-09-02 2024-01-23 Bank Of America Corporation Automated categorization and processing of document images of varying degrees of quality
US11409951B1 (en) 2021-09-24 2022-08-09 International Business Machines Corporation Facilitating annotation of document elements
CN113657559B (zh) * 2021-10-18 2022-02-08 广州天鹏计算机科技有限公司 基于机器学习的胸部扫描图像分类方法
US12367694B2 (en) * 2021-10-29 2025-07-22 Samsung Electronics Co., Ltd. Methods and systems for semantically segmenting a source text image based on a text area threshold determination
US20230162520A1 (en) * 2021-11-23 2023-05-25 Abbyy Development Inc. Identifying writing systems utilized in documents
US12348560B2 (en) * 2022-04-25 2025-07-01 Palo Alto Networks, Inc. Detecting phishing PDFs with an image-based deep learning approach
US12159021B1 (en) * 2022-06-30 2024-12-03 Amazon Technologies, Inc. Semantic detection and rendering in digital content
US12406519B1 (en) 2022-07-29 2025-09-02 Bentley Systems, Incorporated Techniques for extracting links and connectivity from schematic diagrams
US12288411B2 (en) 2022-10-06 2025-04-29 Bentley Systems, Incorporated Techniques for extracting associations between text labels and symbols and links in schematic diagrams

Citations (3)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
US5434953A (en) * 1992-03-20 1995-07-18 Xerox Corporation Use of fast textured reduction for discrimination of document image components
US6301386B1 (en) * 1998-12-09 2001-10-09 Ncr Corporation Methods and apparatus for gray image based text identification
EP1327955A2 (en) * 2002-01-11 2003-07-16 Hewlett-Packard Company Text extraction from a compound document

Family Cites Families (43)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
US5181255A (en) * 1990-12-13 1993-01-19 Xerox Corporation Segmentation of handwriting and machine printed text
US5402504A (en) * 1989-12-08 1995-03-28 Xerox Corporation Segmentation of text styles
US5202933A (en) * 1989-12-08 1993-04-13 Xerox Corporation Segmentation of text and graphics
US5201011A (en) * 1991-11-19 1993-04-06 Xerox Corporation Method and apparatus for image hand markup detection using morphological techniques
US5369714A (en) * 1991-11-19 1994-11-29 Xerox Corporation Method and apparatus for determining the frequency of phrases in a document without document image decoding
US6009196A (en) * 1995-11-28 1999-12-28 Xerox Corporation Method for classifying non-running text in an image
US5892842A (en) * 1995-12-14 1999-04-06 Xerox Corporation Automatic method of identifying sentence boundaries in a document image
US5956468A (en) * 1996-07-12 1999-09-21 Seiko Epson Corporation Document segmentation system
US5778092A (en) * 1996-12-20 1998-07-07 Xerox Corporation Method and apparatus for compressing color or gray scale documents
US6411733B1 (en) * 1998-11-25 2002-06-25 Xerox Corporation Method and apparatus for separating document image object types
US6377710B1 (en) * 1998-11-25 2002-04-23 Xerox Corporation Method and apparatus for extracting the skeleton of a binary figure by contour-based erosion
US6587583B1 (en) * 1999-09-17 2003-07-01 Kurzweil Educational Systems, Inc. Compression/decompression algorithm for image documents having text, graphical and color content
US6771816B1 (en) * 2000-01-19 2004-08-03 Adobe Systems Incorporated Generating a text mask for representing text pixels
SE0000205D0 (sv) * 2000-01-25 2000-01-25 Siemens Elema Ab Ventilator
US7136082B2 (en) * 2002-01-25 2006-11-14 Xerox Corporation Method and apparatus to convert digital ink images for use in a structured text/graphics editor
US7036077B2 (en) * 2002-03-22 2006-04-25 Xerox Corporation Method for gestural interpretation in a system for selecting and arranging visible material in document images
US7086013B2 (en) * 2002-03-22 2006-08-01 Xerox Corporation Method and system for overloading loop selection commands in a system for selecting and arranging visible material in document images
US6903751B2 (en) * 2002-03-22 2005-06-07 Xerox Corporation System and method for editing electronic images
JP3800208B2 (ja) * 2002-07-26 2006-07-26 松下電工株式会社 画像処理方法
US7177483B2 (en) * 2002-08-29 2007-02-13 Palo Alto Research Center Incorporated. System and method for enhancement of document images
US7079687B2 (en) * 2003-03-06 2006-07-18 Seiko Epson Corporation Method and apparatus for segmentation of compound documents
US7379594B2 (en) * 2004-01-28 2008-05-27 Sharp Laboratories Of America, Inc. Methods and systems for automatic detection of continuous-tone regions in document images
US7260276B2 (en) * 2004-06-30 2007-08-21 Sharp Laboratories Of America, Inc. Methods and systems for complexity estimation and complexity-based selection
JP2006072839A (ja) * 2004-09-03 2006-03-16 Ricoh Co Ltd 画像処理方法、画像処理装置、画像処理プログラム及び記録媒体
US7970171B2 (en) * 2007-01-18 2011-06-28 Ricoh Co., Ltd. Synthetic image and video generation from ground truth data
US7570816B2 (en) * 2005-03-31 2009-08-04 Microsoft Corporation Systems and methods for detecting text
GB0510793D0 (en) 2005-05-26 2005-06-29 Bourbay Ltd Segmentation of digital images
US7783117B2 (en) * 2005-08-12 2010-08-24 Seiko Epson Corporation Systems and methods for generating background and foreground images for document compression
US7899258B2 (en) * 2005-08-12 2011-03-01 Seiko Epson Corporation Systems and methods to convert images into high-quality compressed documents
JP4329764B2 (ja) * 2006-01-17 2009-09-09 コニカミノルタビジネステクノロジーズ株式会社 画像処理装置および罫線抽出プログラム
US7734094B2 (en) * 2006-06-28 2010-06-08 Microsoft Corporation Techniques for filtering handwriting recognition results
US7792353B2 (en) * 2006-10-31 2010-09-07 Hewlett-Packard Development Company, L.P. Retraining a machine-learning classifier using re-labeled training samples
AU2006252019B2 (en) * 2006-12-13 2012-06-28 Canon Kabushiki Kaisha Method and Apparatus for Dynamic Connector Analysis
US8417033B2 (en) * 2007-04-27 2013-04-09 Hewlett-Packard Development Company, L.P. Gradient based background segmentation and enhancement of images
US7907778B2 (en) * 2007-08-13 2011-03-15 Seiko Epson Corporation Segmentation-based image labeling
US7936923B2 (en) * 2007-08-31 2011-05-03 Seiko Epson Corporation Image background suppression
US7958068B2 (en) * 2007-12-12 2011-06-07 International Business Machines Corporation Method and apparatus for model-shared subspace boosting for multi-label classification
US8180112B2 (en) * 2008-01-21 2012-05-15 Eastman Kodak Company Enabling persistent recognition of individuals in images
US8111923B2 (en) * 2008-08-14 2012-02-07 Xerox Corporation System and method for object class localization and semantic class based image segmentation
US8261180B2 (en) * 2009-04-28 2012-09-04 Lexmark International, Inc. Automatic forms processing systems and methods
US8442319B2 (en) * 2009-07-10 2013-05-14 Palo Alto Research Center Incorporated System and method for classifying connected groups of foreground pixels in scanned document images according to the type of marking
US8452086B2 (en) * 2009-07-10 2013-05-28 Palo Alto Research Center Incorporated System and user interface for machine-assisted human labeling of pixels in an image
US8649600B2 (en) * 2009-07-10 2014-02-11 Palo Alto Research Center Incorporated System and method for segmenting text lines in documents

Patent Citations (3)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
US5434953A (en) * 1992-03-20 1995-07-18 Xerox Corporation Use of fast textured reduction for discrimination of document image components
US6301386B1 (en) * 1998-12-09 2001-10-09 Ncr Corporation Methods and apparatus for gray image based text identification
EP1327955A2 (en) * 2002-01-11 2003-07-16 Hewlett-Packard Company Text extraction from a compound document

Also Published As

Publication number Publication date
US8442319B2 (en) 2013-05-14
EP2275974A2 (en) 2011-01-19
JP5379085B2 (ja) 2013-12-25
US20110007366A1 (en) 2011-01-13
JP2011018338A (ja) 2011-01-27

Similar Documents

Publication Publication Date Title
EP2275974A3 (en) System and method for classifying connected groups of foreground pixels in scanned document images according to the type of marking
EP1918873A3 (en) Image processing method, medium and system
EP1327955A3 (en) Text extraction from a compound document
EP3125158A3 (en) Method and device for displaying images
EP2003600A3 (en) Method and apparatus for recognizing characters in a document image
CN103955660B (zh) 一种批量二维码图像识别方法
EP3832541A3 (en) Method and apparatus for recognizing text
EP2731054A3 (en) Method and device for recognizing document image, and photographing method using the same
EP1944721A3 (en) Image processing apparatus, method and program product thereof
EP2701098A3 (en) Region refocusing for data-driven object localization
EP2154631A3 (en) System and method for object class localization and semantic class based image segmentation
WO2015017796A3 (en) Learning systems and methods
EP2230593A3 (en) Job management apparatus, control method, and program
JP2011018338A5 (enExample)
EP4250143A3 (en) Systems and methods for spectral unmixing of microscopic images using pixel grouping
WO2008097552A3 (en) System and method for cell analysis in microscopy
EP2275973A3 (en) System and method for segmenting text lines in documents
WO2008002536A3 (en) Computer-implemented method for automated object recognition and classification in scenes using segment-based object extraction
EP2991004A3 (en) Method and apparatus for labeling training samples
EP1942313A3 (en) Apparatus and method of measuring distance using structured light
WO2007028166A3 (en) A system and method for detecting text in real-world color images
EP3214603A3 (en) Image processing
EP3343909A3 (en) Information processing apparatus and information processing method
WO2008019020A3 (en) Methods for discriminating moving objects in motion image sequences
WO2007001820A3 (en) Image superresolution through edge extraction and contrast enhancement

Legal Events

Date Code Title Description
PUAI Public reference made under article 153(3) epc to a published international application that has entered the european phase

Free format text: ORIGINAL CODE: 0009012

AK Designated contracting states

Kind code of ref document: A2

Designated state(s): AL AT BE BG CH CY CZ DE DK EE ES FI FR GB GR HR HU IE IS IT LI LT LU LV MC MK MT NL NO PL PT RO SE SI SK SM TR

AX Request for extension of the european patent

Extension state: BA ME RS

PUAL Search report despatched

Free format text: ORIGINAL CODE: 0009013

AK Designated contracting states

Kind code of ref document: A3

Designated state(s): AL AT BE BG CH CY CZ DE DK EE ES FI FR GB GR HR HU IE IS IT LI LT LU LV MC MK MT NL NO PL PT RO SE SI SK SM TR

AX Request for extension of the european patent

Extension state: BA ME RS

RIC1 Information provided on ipc code assigned before grant

Ipc: G06K 9/34 20060101ALI20141121BHEP

Ipc: G06K 9/62 20060101ALI20141121BHEP

Ipc: G06K 9/46 20060101ALI20141121BHEP

Ipc: G06K 9/20 20060101AFI20141121BHEP

17P Request for examination filed

Effective date: 20150630

RBV Designated contracting states (corrected)

Designated state(s): AL AT BE BG CH CY CZ DE DK EE ES FI FR GB GR HR HU IE IS IT LI LT LU LV MC MK MT NL NO PL PT RO SE SI SK SM TR

17Q First examination report despatched

Effective date: 20151118

APBK Appeal reference recorded

Free format text: ORIGINAL CODE: EPIDOSNREFNE

APBN Date of receipt of notice of appeal recorded

Free format text: ORIGINAL CODE: EPIDOSNNOA2E

APAF Appeal reference modified

Free format text: ORIGINAL CODE: EPIDOSCREFNE

APBT Appeal procedure closed

Free format text: ORIGINAL CODE: EPIDOSNNOA9E

STAA Information on the status of an ep patent application or granted ep patent

Free format text: STATUS: THE APPLICATION HAS BEEN REFUSED

18R Application refused

Effective date: 20170914