JP4251586B2 - Image processing method and apparatus - Google Patents

Image processing method and apparatus

Info

Publication number
JP4251586B2
Authority
JP
Japan
Prior art keywords
row
cell
assigning
image
grid line
Prior art date
Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
Expired - Fee Related
Application number
JP00023999A
Other languages
English (en)
Japanese (ja)
Other versions
JPH11259655A5
JPH11259655A (ja)
Inventor
イン ワン シン
Current Assignee (The listed assignees may be inaccurate. Google has not performed a legal analysis and makes no representation or warranty as to the accuracy of the list.)
Canon Inc
Original Assignee
Canon Inc
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
Filing date
Publication date
Application filed by Canon Inc
Publication of JPH11259655A
Publication of JPH11259655A5
Application granted
Publication of JP4251586B2
Anticipated expiration
Expired - Fee Related

Classifications

    • G: PHYSICS
    • G06: COMPUTING OR CALCULATING; COUNTING
    • G06V: IMAGE OR VIDEO RECOGNITION OR UNDERSTANDING
    • G06V30/00: Character recognition; Recognising digital ink; Document-oriented image-based pattern recognition
    • G06V30/40: Document-oriented image-based pattern recognition
    • G06V30/41: Analysis of document content
    • G06V30/412: Layout analysis of documents structured with printed lines or input boxes, e.g. business forms or tables
    • G: PHYSICS
    • G06: COMPUTING OR CALCULATING; COUNTING
    • G06V: IMAGE OR VIDEO RECOGNITION OR UNDERSTANDING
    • G06V30/00: Character recognition; Recognising digital ink; Document-oriented image-based pattern recognition
    • G06V30/40: Document-oriented image-based pattern recognition
    • G06V30/41: Analysis of document content
    • G06V30/414: Extracting the geometrical structure, e.g. layout tree; Block segmentation, e.g. bounding boxes for graphics or text

Landscapes

  • Engineering & Computer Science (AREA)
  • Computer Vision & Pattern Recognition (AREA)
  • Physics & Mathematics (AREA)
  • Artificial Intelligence (AREA)
  • General Physics & Mathematics (AREA)
  • Multimedia (AREA)
  • Theoretical Computer Science (AREA)
  • Computer Graphics (AREA)
  • Geometry (AREA)
  • Character Input (AREA)
  • Image Analysis (AREA)
JP00023999A 1998-01-05 1999-01-04 Image processing method and apparatus Expired - Fee Related JP4251586B2 (ja)

Applications Claiming Priority (2)

Application Number Priority Date Filing Date Title
US09/002684 1998-01-05
US09/002,684 US6173073B1 (en) 1998-01-05 1998-01-05 System for analyzing table images

Publications (3)

Publication Number Publication Date
JPH11259655A (ja) 1999-09-24
JPH11259655A5 2006-04-06
JP4251586B2 (ja) 2009-04-08

Family

ID=21701969

Family Applications (1)

Application Number Title Priority Date Filing Date
JP00023999A Expired - Fee Related JP4251586B2 (ja) 1998-01-05 1999-01-04 Image processing method and apparatus

Country Status (5)

Country Link
US (1) US6173073B1
EP (1) EP0927950B1
JP (1) JP4251586B2
CN (1) CN1143239C
DE (1) DE69825856D1

Families Citing this family (51)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
US7099507B2 (en) * 1998-11-05 2006-08-29 Ricoh Company, Ltd Method and system for extracting title from document image
JP2000339301A (ja) * 1999-03-23 2000-12-08 Canon Inc Document segmentation apparatus and method, and storage medium storing the program therefor
US6757870B1 (en) * 2000-03-22 2004-06-29 Hewlett-Packard Development Company, L.P. Automatic table detection method and system
JP2002032770A (ja) * 2000-06-23 2002-01-31 Internatl Business Mach Corp <Ibm> Document processing method, document processing system, and medium
US6778700B2 (en) * 2001-03-14 2004-08-17 Electronics For Imaging, Inc. Method and apparatus for text detection
CA2391692C (en) 2002-07-15 2006-07-04 Allan Williams Computer database with adaptive storage space architecture
US20040066538A1 (en) * 2002-10-04 2004-04-08 Rozzi William A. Conversion of halftone bitmaps to continuous tone representations
US7308159B2 (en) * 2004-01-16 2007-12-11 Enuclia Semiconductor, Inc. Image processing system and method with dynamically controlled pixel processing
US8095871B2 (en) * 2004-05-06 2012-01-10 Siemens Corporation System and method for GUI supported specifications for automating form field extraction with database mapping
US7668404B2 (en) * 2004-06-30 2010-02-23 Lexmark International, Inc. Method and system of deskewing an image using monochrome conversion to separate foreground from background
US7707488B2 (en) * 2006-02-09 2010-04-27 Microsoft Corporation Analyzing lines to detect tables in documents
JP4135752B2 (ja) * 2006-06-12 2008-08-20 コニカミノルタビジネステクノロジーズ株式会社 Image processing apparatus, image processing method, and image processing program
JP4665933B2 (ja) * 2006-07-04 2011-04-06 セイコーエプソン株式会社 Document editing support apparatus, program, and storage medium
US7801358B2 (en) * 2006-11-03 2010-09-21 Google Inc. Methods and systems for analyzing data in media material having layout
JP4988842B2 (ja) * 2007-06-28 2012-08-01 富士通株式会社 Table data generation program, table data generation method, and table data generation apparatus
US8155442B2 (en) * 2008-02-04 2012-04-10 The Neat Company, Inc. Method and apparatus for modifying the histogram of an image
CN101551859B (zh) * 2008-03-31 2012-01-04 夏普株式会社 Image discrimination device and image retrieval device
US8144986B2 (en) * 2008-09-05 2012-03-27 The Neat Company, Inc. Method and apparatus for binarization threshold calculation
US8473467B2 (en) * 2009-01-02 2013-06-25 Apple Inc. Content profiling to dynamically configure content processing
US8261180B2 (en) * 2009-04-28 2012-09-04 Lexmark International, Inc. Automatic forms processing systems and methods
US8214733B2 (en) * 2010-04-28 2012-07-03 Lexmark International, Inc. Automatic forms processing systems and methods
JP2013500527A (ja) * 2009-07-30 2013-01-07 オセ−テクノロジーズ・ベー・ヴエー Automatic location of tables in documents
US20110032266A1 (en) * 2009-08-07 2011-02-10 Delphi Technologies, Inc. Glare detection and mitigation method for a photo-sensitive display device
CN101866335B (zh) * 2010-06-14 2012-12-12 深圳市万兴软件有限公司 Table processing method and apparatus for document conversion
CN101984426B (zh) * 2010-10-21 2013-04-10 优视科技有限公司 Method and apparatus for character segmentation of web page images
US8543911B2 (en) 2011-01-18 2013-09-24 Apple Inc. Ordering document content based on reading flow
US8380753B2 (en) 2011-01-18 2013-02-19 Apple Inc. Reconstruction of lists in a document
US8731296B2 (en) * 2011-04-21 2014-05-20 Seiko Epson Corporation Contact text detection in scanned images
KR101872564B1 (ko) 2012-01-23 2018-06-28 마이크로소프트 테크놀로지 라이센싱, 엘엘씨 Borderless table detection engine
US8942489B2 (en) 2012-01-23 2015-01-27 Microsoft Corporation Vector graphics classification engine
CN103377177B (zh) * 2012-04-27 2016-03-30 北大方正集团有限公司 Method and apparatus for recognizing tables in a digital fixed-layout file
US9953008B2 (en) 2013-01-18 2018-04-24 Microsoft Technology Licensing, Llc Grouping fixed format document elements to preserve graphical data semantics after reflow by manipulating a bounding box vertically and horizontally
US9047528B1 (en) * 2013-02-19 2015-06-02 Amazon Technologies, Inc. Identifying characters in grid-based text
US10076751B2 (en) 2013-12-30 2018-09-18 General Electric Company Systems and methods for reagent storage
US9399216B2 (en) 2013-12-30 2016-07-26 General Electric Company Fluid transport in microfluidic applications with sensors for detecting fluid presence and pressure
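JP6435636B2 (ja) * 2014-05-15 2018-12-12 富士ゼロックス株式会社 Information processing apparatus and information processing program
CN104050487B (zh) * 2014-06-06 2017-06-16 华东师范大学 Method for discriminating mail image orientation based on layout information analysis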
US9235757B1 (en) * 2014-07-24 2016-01-12 Amazon Technologies, Inc. Fast text detection
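CN105426834B (zh) * 2015-11-17 2019-02-22 中国传媒大学 Method for detecting table images based on projection features and structural features
CN106446881B (zh) * 2016-07-29 2019-05-21 北京交通大学 Method for extracting test result information from medical laboratory report images
CN106156761B (zh) * 2016-08-10 2020-01-10 北京交通大学 Method for detecting and recognizing tables in images captured by mobile terminals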
US10242257B2 (en) * 2017-05-18 2019-03-26 Wipro Limited Methods and devices for extracting text from documents
US10410386B2 (en) * 2017-09-15 2019-09-10 Konica Minolta Laboratory U.S.A., Inc. Table cell validation
US11650970B2 (en) 2018-03-09 2023-05-16 International Business Machines Corporation Extracting structure and semantics from tabular data
CN108470021B (zh) * 2018-03-26 2022-06-03 阿博茨德(北京)科技有限公司 Method and apparatus for locating tables in PDF documents
US11200413B2 (en) 2018-07-31 2021-12-14 International Business Machines Corporation Table recognition in portable document format documents
EP3966730A2 (en) * 2019-05-08 2022-03-16 Vrije Universiteit Brussel Computer implemented method for segmenting a binarized document
US11062133B2 (en) * 2019-06-24 2021-07-13 International Business Machines Corporation Data structure generation for tabular information in scanned images
CN114357958A (zh) * 2020-09-30 2022-04-15 中移(苏州)软件技术有限公司 Table extraction method, apparatus, device, and storage medium
US12260662B2 (en) * 2021-04-15 2025-03-25 Microsoft Technology Licensing, Llc Inferring structure information from table images
US11829701B1 (en) * 2022-06-30 2023-11-28 Accenture Global Solutions Limited Heuristics-based processing of electronic document contents

Family Cites Families (17)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
US5185813A (en) 1988-01-19 1993-02-09 Kabushiki Kaisha Toshiba Document image processing apparatus
US5131053A (en) 1988-08-10 1992-07-14 Caere Corporation Optical character recognition method and apparatus
US5101448A (en) 1988-08-24 1992-03-31 Hitachi, Ltd. Method and apparatus for processing a document by utilizing an image
US5129012A (en) 1989-03-25 1992-07-07 Sony Corporation Detecting line segments and predetermined patterns in an optically scanned document
JP2812982B2 (ja) * 1989-04-05 1998-10-22 株式会社リコー Table recognition method
JP2940936B2 (ja) 1989-06-06 1999-08-25 株式会社リコー Table region identification method
US5448692A (en) * 1991-03-27 1995-09-05 Ricoh Company, Ltd. Digital image processing device involving processing of areas of image, based on respective contour line traces
JPH05250357A (ja) 1992-03-05 1993-09-28 Ricoh Co Ltd Image reading and correction apparatus and corrected image forming apparatus
US5335290A (en) 1992-04-06 1994-08-02 Ricoh Corporation Segmentation of text, picture and lines of a document image
US5680479A (en) 1992-04-24 1997-10-21 Canon Kabushiki Kaisha Method and apparatus for character recognition
US5594814A (en) * 1992-10-19 1997-01-14 Fast; Bruce B. OCR image preprocessing method for image enhancement of scanned documents
JP2789971B2 (ja) * 1992-10-27 1998-08-27 富士ゼロックス株式会社 Table recognition apparatus
US5485566A (en) 1993-10-29 1996-01-16 Xerox Corporation Method of finding columns in tabular documents
US5588072A (en) 1993-12-22 1996-12-24 Canon Kabushiki Kaisha Method and apparatus for selecting blocks of image data from image data having both horizontally- and vertically-oriented blocks
US5689342A (en) 1994-11-17 1997-11-18 Canon Kabushiki Kaisha Image processing method and apparatus which orders text areas which have been extracted from an image
US5661818A (en) 1995-01-27 1997-08-26 Eastman Kodak Company Method and system for detecting grids in a digital image
US5848186A (en) * 1995-08-11 1998-12-08 Canon Kabushiki Kaisha Feature extraction system for identifying text within a table image

Also Published As

Publication number Publication date
EP0927950A2 (en) 1999-07-07
EP0927950B1 (en) 2004-08-25
EP0927950A3 (en) 2001-10-17
CN1143239C (zh) 2004-03-24
US6173073B1 (en) 2001-01-09
DE69825856D1 (de) 2004-09-30
JPH11259655A (ja) 1999-09-24
CN1237745A (zh) 1999-12-08

Similar Documents

Publication Publication Date Title
JP4251586B2 (ja) Image processing method and apparatus
US6512848B2 (en) Page analysis system
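JP3302147B2 (ja) Document image processing method
JP4577421B2 (ja) Image processing apparatus and image processing program
JP4655335B2 (ja) Image recognition apparatus, image recognition method, and computer-readable recording medium storing an image recognition program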
US20110044539A1 (en) Information processing device, computer readable medium storing information processing program, and information processing method
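JP2010020468A (ja) Image processing apparatus, image processing method, program therefor, and storage medium
JP2000200350A (ja) Information processing method and apparatus
CN102473278B (zh) Image processing apparatus, image processing method, and storage medium
JP4408495B2 (ja) Image processing method and image processing apparatus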
US7783108B2 (en) Document management method and apparatus
US7528986B2 (en) Image forming apparatus, image forming method, program therefor, and storage medium
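JP4785655B2 (ja) Document processing apparatus and document processing method
JP5020698B2 (ja) Image processing apparatus, image processing method, and image processing program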
US20100100811A1 (en) Information processing apparatus and layout processing method
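JP5159588B2 (ja) Image processing apparatus, image processing method, and computer program
JP2002232679A (ja) Image processing method and apparatus, computer program, and storage medium
JP4040905B2 (ja) Reduced image display apparatus, method, program, and recording medium storing the program
JP2002175532A (ja) Image processing apparatus, image processing method, and storage medium storing an image processing program
JP7243981B2 (ja) Page region classification apparatus and program therefor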
US8059138B2 (en) Image processing and arranging system, image processing and arranging method, and computer readable medium for image processing and arranging
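JP7700541B2 (ja) Image processing apparatus, image processing method, and program
JP4329370B2 (ja) Image data classification apparatus and program
JP4587167B2 (ja) Image processing apparatus and image processing method
JP2002049890A (ja) Image recognition apparatus, image recognition method, and computer-readable recording medium storing an image recognition program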

Legal Events

Date Code Title Description
A521 Written amendment

Free format text: JAPANESE INTERMEDIATE CODE: A523

Effective date: 20051228

A621 Written request for application examination

Free format text: JAPANESE INTERMEDIATE CODE: A621

Effective date: 20051228

RD01 Notification of change of attorney

Free format text: JAPANESE INTERMEDIATE CODE: A7426

Effective date: 20051228

RD03 Notification of appointment of power of attorney

Free format text: JAPANESE INTERMEDIATE CODE: A7423

Effective date: 20051228

RD04 Notification of resignation of power of attorney

Free format text: JAPANESE INTERMEDIATE CODE: A7424

Effective date: 20080723

RD04 Notification of resignation of power of attorney

Free format text: JAPANESE INTERMEDIATE CODE: A7424

Effective date: 20080807

A977 Report on retrieval

Free format text: JAPANESE INTERMEDIATE CODE: A971007

Effective date: 20081008

A131 Notification of reasons for refusal

Free format text: JAPANESE INTERMEDIATE CODE: A131

Effective date: 20081014

A521 Written amendment

Free format text: JAPANESE INTERMEDIATE CODE: A523

Effective date: 20081215

TRDD Decision of grant or rejection written
A01 Written decision to grant a patent or to grant a registration (utility model)

Free format text: JAPANESE INTERMEDIATE CODE: A01

Effective date: 20090116

A01 Written decision to grant a patent or to grant a registration (utility model)

Free format text: JAPANESE INTERMEDIATE CODE: A01

A61 First payment of annual fees (during grant procedure)

Free format text: JAPANESE INTERMEDIATE CODE: A61

Effective date: 20090119

R150 Certificate of patent or registration of utility model

Free format text: JAPANESE INTERMEDIATE CODE: R150

FPAY Renewal fee payment (event date is renewal date of database)

Free format text: PAYMENT UNTIL: 20120130

Year of fee payment: 3

FPAY Renewal fee payment (event date is renewal date of database)

Free format text: PAYMENT UNTIL: 20130130

Year of fee payment: 4

FPAY Renewal fee payment (event date is renewal date of database)

Free format text: PAYMENT UNTIL: 20140130

Year of fee payment: 5

LAPS Cancellation because of no payment of annual fees