JP2011018338A5 - - Google Patents

Download PDF

Info

Publication number
JP2011018338A5
JP2011018338A5 JP2010156620A JP2010156620A JP2011018338A5 JP 2011018338 A5 JP2011018338 A5 JP 2011018338A5 JP 2010156620 A JP2010156620 A JP 2010156620A JP 2010156620 A JP2010156620 A JP 2010156620A JP 2011018338 A5 JP2011018338 A5 JP 2011018338A5
Authority
JP
Japan
Prior art keywords
classifier
fragment
fragments
dividing
image
Prior art date
Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
Granted
Application number
JP2010156620A
Other languages
English (en)
Japanese (ja)
Other versions
JP5379085B2 (ja
JP2011018338A (ja
Filing date
Publication date
Priority claimed from US12/501,187 external-priority patent/US8442319B2/en
Application filed filed Critical
Publication of JP2011018338A publication Critical patent/JP2011018338A/ja
Publication of JP2011018338A5 publication Critical patent/JP2011018338A5/ja
Application granted granted Critical
Publication of JP5379085B2 publication Critical patent/JP5379085B2/ja
Expired - Fee Related legal-status Critical Current
Anticipated expiration legal-status Critical

Links

JP2010156620A 2009-07-10 2010-07-09 スキャンされた文書画像内の前景画素群の連結グループをマーキング種類に基づき分類する方法及びシステム Expired - Fee Related JP5379085B2 (ja)

Applications Claiming Priority (2)

Application Number Priority Date Filing Date Title
US12/501,187 2009-07-10
US12/501,187 US8442319B2 (en) 2009-07-10 2009-07-10 System and method for classifying connected groups of foreground pixels in scanned document images according to the type of marking

Publications (3)

Publication Number Publication Date
JP2011018338A JP2011018338A (ja) 2011-01-27
JP2011018338A5 true JP2011018338A5 (enExample) 2013-08-22
JP5379085B2 JP5379085B2 (ja) 2013-12-25

Family

ID=43034592

Family Applications (1)

Application Number Title Priority Date Filing Date
JP2010156620A Expired - Fee Related JP5379085B2 (ja) 2009-07-10 2010-07-09 スキャンされた文書画像内の前景画素群の連結グループをマーキング種類に基づき分類する方法及びシステム

Country Status (3)

Country Link
US (1) US8442319B2 (enExample)
EP (1) EP2275974A3 (enExample)
JP (1) JP5379085B2 (enExample)

Families Citing this family (40)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
US8452086B2 (en) * 2009-07-10 2013-05-28 Palo Alto Research Center Incorporated System and user interface for machine-assisted human labeling of pixels in an image
US8442319B2 (en) * 2009-07-10 2013-05-14 Palo Alto Research Center Incorporated System and method for classifying connected groups of foreground pixels in scanned document images according to the type of marking
US8649600B2 (en) * 2009-07-10 2014-02-11 Palo Alto Research Center Incorporated System and method for segmenting text lines in documents
US20120092374A1 (en) * 2010-10-19 2012-04-19 Apple Inc. Systems, methods, and computer-readable media for placing a representation of the captured signature in a document
CN102622724A (zh) * 2011-01-27 2012-08-01 鸿富锦精密工业(深圳)有限公司 外观专利图像切割方法及系统
DE102011082866A1 (de) 2011-09-16 2013-03-21 Olaf Rudolph Verfahren zum Erkennen einer oder meherer gleichzeitig auftretender Teilenladungs-Quellen
US8792730B2 (en) * 2012-03-07 2014-07-29 Ricoh Co., Ltd. Classification and standardization of field images associated with a field in a form
US9152884B2 (en) * 2012-06-05 2015-10-06 Drvision Technologies Llc Teachable pattern scoring method
US8879855B2 (en) * 2012-08-17 2014-11-04 Nec Laboratories America, Inc. Image segmentation for large-scale fine-grained recognition
US9280206B2 (en) * 2012-08-20 2016-03-08 Samsung Electronics Co., Ltd. System and method for perceiving images with multimodal feedback
US9235781B2 (en) * 2013-08-09 2016-01-12 Kabushiki Kaisha Toshiba Method of, and apparatus for, landmark location
US9245205B1 (en) * 2013-10-16 2016-01-26 Xerox Corporation Supervised mid-level features for word image representation
US9940511B2 (en) * 2014-05-30 2018-04-10 Kofax, Inc. Machine print, hand print, and signature discrimination
US10120843B2 (en) 2014-08-26 2018-11-06 International Business Machines Corporation Generation of parsable data for deep parsing
US11100650B2 (en) * 2016-03-31 2021-08-24 Sony Depthsensing Solutions Sa/Nv Method for foreground and background determination in an image
US10607228B1 (en) * 2016-08-24 2020-03-31 Jpmorgan Chase Bank, N.A. Dynamic rule strategy and fraud detection system and method
US10354161B2 (en) * 2017-06-05 2019-07-16 Intuit, Inc. Detecting font size in a digital image
US10163022B1 (en) * 2017-06-22 2018-12-25 StradVision, Inc. Method for learning text recognition, method for recognizing text using the same, and apparatus for learning text recognition, apparatus for recognizing text using the same
US11416546B2 (en) * 2018-03-20 2022-08-16 Hulu, LLC Content type detection in videos using multiple classifiers
CN108960290A (zh) * 2018-06-08 2018-12-07 Oppo广东移动通信有限公司 图像处理方法、装置、计算机可读存储介质和电子设备
US10685261B2 (en) * 2018-06-11 2020-06-16 GM Global Technology Operations LLC Active segmention of scanned images based on deep reinforcement learning for OCR applications
JP7262993B2 (ja) * 2018-12-19 2023-04-24 キヤノン株式会社 画像処理システム、画像処理方法、画像処理装置
US11462037B2 (en) 2019-01-11 2022-10-04 Walmart Apollo, Llc System and method for automated analysis of electronic travel data
US10671892B1 (en) * 2019-03-31 2020-06-02 Hyper Labs, Inc. Apparatuses, methods, and systems for 3-channel dynamic contextual script recognition using neural network image analytics and 4-tuple machine learning with enhanced templates and context data
US11106891B2 (en) 2019-09-09 2021-08-31 Morgan Stanley Services Group Inc. Automated signature extraction and verification
JP7431005B2 (ja) * 2019-09-20 2024-02-14 Toppanエッジ株式会社 学習データ生成装置、学習データ生成方法、及びプログラム
US11200411B2 (en) * 2019-10-16 2021-12-14 The Toronto-Dominion Bank Training a card type classifier with simulated card images
US12175337B2 (en) * 2020-08-04 2024-12-24 Bentley Systems, Incorporated Techniques for extracting machine-readable information from P and IDs
KR102509343B1 (ko) * 2020-11-17 2023-03-13 아주대학교산학협력단 이미지의 레이아웃 분석 방법 및 시스템
US11704352B2 (en) 2021-05-03 2023-07-18 Bank Of America Corporation Automated categorization and assembly of low-quality images into electronic documents
US11798258B2 (en) 2021-05-03 2023-10-24 Bank Of America Corporation Automated categorization and assembly of low-quality images into electronic documents
US11881041B2 (en) 2021-09-02 2024-01-23 Bank Of America Corporation Automated categorization and processing of document images of varying degrees of quality
US11409951B1 (en) 2021-09-24 2022-08-09 International Business Machines Corporation Facilitating annotation of document elements
CN113657559B (zh) * 2021-10-18 2022-02-08 广州天鹏计算机科技有限公司 基于机器学习的胸部扫描图像分类方法
US12367694B2 (en) * 2021-10-29 2025-07-22 Samsung Electronics Co., Ltd. Methods and systems for semantically segmenting a source text image based on a text area threshold determination
US20230162520A1 (en) * 2021-11-23 2023-05-25 Abbyy Development Inc. Identifying writing systems utilized in documents
US12348560B2 (en) * 2022-04-25 2025-07-01 Palo Alto Networks, Inc. Detecting phishing PDFs with an image-based deep learning approach
US12159021B1 (en) * 2022-06-30 2024-12-03 Amazon Technologies, Inc. Semantic detection and rendering in digital content
US12406519B1 (en) 2022-07-29 2025-09-02 Bentley Systems, Incorporated Techniques for extracting links and connectivity from schematic diagrams
US12288411B2 (en) 2022-10-06 2025-04-29 Bentley Systems, Incorporated Techniques for extracting associations between text labels and symbols and links in schematic diagrams

Family Cites Families (46)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
US5202933A (en) 1989-12-08 1993-04-13 Xerox Corporation Segmentation of text and graphics
US5402504A (en) 1989-12-08 1995-03-28 Xerox Corporation Segmentation of text styles
US5181255A (en) 1990-12-13 1993-01-19 Xerox Corporation Segmentation of handwriting and machine printed text
US5201011A (en) 1991-11-19 1993-04-06 Xerox Corporation Method and apparatus for image hand markup detection using morphological techniques
US5369714A (en) * 1991-11-19 1994-11-29 Xerox Corporation Method and apparatus for determining the frequency of phrases in a document without document image decoding
US5434953A (en) * 1992-03-20 1995-07-18 Xerox Corporation Use of fast textured reduction for discrimination of document image components
US6009196A (en) * 1995-11-28 1999-12-28 Xerox Corporation Method for classifying non-running text in an image
US5892842A (en) 1995-12-14 1999-04-06 Xerox Corporation Automatic method of identifying sentence boundaries in a document image
US5956468A (en) * 1996-07-12 1999-09-21 Seiko Epson Corporation Document segmentation system
US5778092A (en) 1996-12-20 1998-07-07 Xerox Corporation Method and apparatus for compressing color or gray scale documents
US6377710B1 (en) 1998-11-25 2002-04-23 Xerox Corporation Method and apparatus for extracting the skeleton of a binary figure by contour-based erosion
US6411733B1 (en) 1998-11-25 2002-06-25 Xerox Corporation Method and apparatus for separating document image object types
US6301386B1 (en) 1998-12-09 2001-10-09 Ncr Corporation Methods and apparatus for gray image based text identification
US6587583B1 (en) 1999-09-17 2003-07-01 Kurzweil Educational Systems, Inc. Compression/decompression algorithm for image documents having text, graphical and color content
US6771816B1 (en) 2000-01-19 2004-08-03 Adobe Systems Incorporated Generating a text mask for representing text pixels
SE0000205D0 (sv) * 2000-01-25 2000-01-25 Siemens Elema Ab Ventilator
US8103104B2 (en) * 2002-01-11 2012-01-24 Hewlett-Packard Development Company, L.P. Text extraction and its application to compound document image compression
US7136082B2 (en) 2002-01-25 2006-11-14 Xerox Corporation Method and apparatus to convert digital ink images for use in a structured text/graphics editor
US7036077B2 (en) 2002-03-22 2006-04-25 Xerox Corporation Method for gestural interpretation in a system for selecting and arranging visible material in document images
US6903751B2 (en) 2002-03-22 2005-06-07 Xerox Corporation System and method for editing electronic images
US7086013B2 (en) 2002-03-22 2006-08-01 Xerox Corporation Method and system for overloading loop selection commands in a system for selecting and arranging visible material in document images
JP3800208B2 (ja) * 2002-07-26 2006-07-26 松下電工株式会社 画像処理方法
US7177483B2 (en) 2002-08-29 2007-02-13 Palo Alto Research Center Incorporated. System and method for enhancement of document images
US7079687B2 (en) * 2003-03-06 2006-07-18 Seiko Epson Corporation Method and apparatus for segmentation of compound documents
US7379594B2 (en) 2004-01-28 2008-05-27 Sharp Laboratories Of America, Inc. Methods and systems for automatic detection of continuous-tone regions in document images
US7260276B2 (en) * 2004-06-30 2007-08-21 Sharp Laboratories Of America, Inc. Methods and systems for complexity estimation and complexity-based selection
JP2006072839A (ja) * 2004-09-03 2006-03-16 Ricoh Co Ltd 画像処理方法、画像処理装置、画像処理プログラム及び記録媒体
US7970171B2 (en) 2007-01-18 2011-06-28 Ricoh Co., Ltd. Synthetic image and video generation from ground truth data
US7570816B2 (en) * 2005-03-31 2009-08-04 Microsoft Corporation Systems and methods for detecting text
GB0510793D0 (en) 2005-05-26 2005-06-29 Bourbay Ltd Segmentation of digital images
US7899258B2 (en) * 2005-08-12 2011-03-01 Seiko Epson Corporation Systems and methods to convert images into high-quality compressed documents
US7783117B2 (en) 2005-08-12 2010-08-24 Seiko Epson Corporation Systems and methods for generating background and foreground images for document compression
JP4329764B2 (ja) * 2006-01-17 2009-09-09 コニカミノルタビジネステクノロジーズ株式会社 画像処理装置および罫線抽出プログラム
US7734094B2 (en) 2006-06-28 2010-06-08 Microsoft Corporation Techniques for filtering handwriting recognition results
US7792353B2 (en) * 2006-10-31 2010-09-07 Hewlett-Packard Development Company, L.P. Retraining a machine-learning classifier using re-labeled training samples
AU2006252019B2 (en) * 2006-12-13 2012-06-28 Canon Kabushiki Kaisha Method and Apparatus for Dynamic Connector Analysis
US8417033B2 (en) * 2007-04-27 2013-04-09 Hewlett-Packard Development Company, L.P. Gradient based background segmentation and enhancement of images
US7907778B2 (en) * 2007-08-13 2011-03-15 Seiko Epson Corporation Segmentation-based image labeling
US7936923B2 (en) * 2007-08-31 2011-05-03 Seiko Epson Corporation Image background suppression
US7958068B2 (en) * 2007-12-12 2011-06-07 International Business Machines Corporation Method and apparatus for model-shared subspace boosting for multi-label classification
US8180112B2 (en) 2008-01-21 2012-05-15 Eastman Kodak Company Enabling persistent recognition of individuals in images
US8111923B2 (en) * 2008-08-14 2012-02-07 Xerox Corporation System and method for object class localization and semantic class based image segmentation
US8261180B2 (en) * 2009-04-28 2012-09-04 Lexmark International, Inc. Automatic forms processing systems and methods
US8442319B2 (en) * 2009-07-10 2013-05-14 Palo Alto Research Center Incorporated System and method for classifying connected groups of foreground pixels in scanned document images according to the type of marking
US8452086B2 (en) 2009-07-10 2013-05-28 Palo Alto Research Center Incorporated System and user interface for machine-assisted human labeling of pixels in an image
US8649600B2 (en) 2009-07-10 2014-02-11 Palo Alto Research Center Incorporated System and method for segmenting text lines in documents

Similar Documents

Publication Publication Date Title
JP2011018338A5 (enExample)
JP5379085B2 (ja) スキャンされた文書画像内の前景画素群の連結グループをマーキング種類に基づき分類する方法及びシステム
Minetto et al. T-HOG: An effective gradient-based descriptor for single line text regions
US8606010B2 (en) Identifying text pixels in scanned images
CN108805116B (zh) 图像文本检测方法及其系统
CN106796647B (zh) 场景文本检测系统和方法
US10242295B2 (en) Method and apparatus for generating, updating classifier, detecting objects and image processing device
WO2016107103A1 (zh) 图像主体区域的识别方法及装置
CN1737822A (zh) 用于照相机获得的文件的低分辨率光学字符识别
JP2015032308A (ja) 畳み込みニューラルネットワークの分類器、及びその分類方法、訓練方法
JP2007235951A (ja) 車両画像認識装置およびその方法
Kumar et al. OTCYMIST: Otsu-Canny minimal spanning tree for born-digital images
Shivakumara et al. Gradient-angular-features for word-wise video script identification
JP2007537542A5 (enExample)
CN106156777A (zh) 文本图片检测方法及装置
CN109919149B (zh) 基于物体检测模型的物体标注方法及相关设备
JP2016143408A (ja) 交通標識から英数字を抽出/認識するコンピュータ実施システムおよび方法
US20120052473A1 (en) Learning apparatus, learning method, and computer program product
KR101484043B1 (ko) 차량 식별자 인식 시스템 및 그 방법
JP2014229314A (ja) テキスト検出の方法及び装置
Xue Optical character recognition
Qin et al. Video scene text frames categorization for text detection and recognition
JP6377214B2 (ja) テキスト検出方法および装置
Seuret et al. Pixel level handwritten and printed content discrimination in scanned documents
Tikader et al. Histogram of oriented gradients for English-Bengali script recognition