JP4077904B2 - 情報処理装置およびその方法 - Google Patents

情報処理装置およびその方法 Download PDF

Info

Publication number
JP4077904B2
JP4077904B2 JP16020597A JP16020597A JP4077904B2 JP 4077904 B2 JP4077904 B2 JP 4077904B2 JP 16020597 A JP16020597 A JP 16020597A JP 16020597 A JP16020597 A JP 16020597A JP 4077904 B2 JP4077904 B2 JP 4077904B2
Authority
JP
Japan
Prior art keywords
frame
outline
area
tracing
black
Prior art date
Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
Expired - Fee Related
Application number
JP16020597A
Other languages
English (en)
Japanese (ja)
Other versions
JPH1083431A (ja
JPH1083431A5 (enExample
Inventor
ヤン ワング シン
Current Assignee (The listed assignees may be inaccurate. Google has not performed a legal analysis and makes no representation or warranty as to the accuracy of the list.)
Canon Inc
Original Assignee
Canon Inc
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
Filing date
Publication date
Application filed by Canon Inc filed Critical Canon Inc
Publication of JPH1083431A publication Critical patent/JPH1083431A/ja
Publication of JPH1083431A5 publication Critical patent/JPH1083431A5/ja
Application granted granted Critical
Publication of JP4077904B2 publication Critical patent/JP4077904B2/ja
Anticipated expiration legal-status Critical
Expired - Fee Related legal-status Critical Current

Links

Images

Classifications

    • GPHYSICS
    • G06COMPUTING OR CALCULATING; COUNTING
    • G06VIMAGE OR VIDEO RECOGNITION OR UNDERSTANDING
    • G06V30/00Character recognition; Recognising digital ink; Document-oriented image-based pattern recognition
    • G06V30/40Document-oriented image-based pattern recognition
    • G06V30/41Analysis of document content
    • G06V30/412Layout analysis of documents structured with printed lines or input boxes, e.g. business forms or tables
    • GPHYSICS
    • G06COMPUTING OR CALCULATING; COUNTING
    • G06VIMAGE OR VIDEO RECOGNITION OR UNDERSTANDING
    • G06V30/00Character recognition; Recognising digital ink; Document-oriented image-based pattern recognition
    • G06V30/10Character recognition
    • G06V30/14Image acquisition
    • G06V30/148Segmentation of character regions
    • G06V30/155Removing patterns interfering with the pattern to be recognised, such as ruled lines or underlines
    • GPHYSICS
    • G06COMPUTING OR CALCULATING; COUNTING
    • G06VIMAGE OR VIDEO RECOGNITION OR UNDERSTANDING
    • G06V30/00Character recognition; Recognising digital ink; Document-oriented image-based pattern recognition
    • G06V30/10Character recognition

Landscapes

  • Engineering & Computer Science (AREA)
  • Computer Vision & Pattern Recognition (AREA)
  • Physics & Mathematics (AREA)
  • General Physics & Mathematics (AREA)
  • Multimedia (AREA)
  • Theoretical Computer Science (AREA)
  • Artificial Intelligence (AREA)
  • Character Input (AREA)
  • Image Analysis (AREA)
  • Processing Or Creating Images (AREA)
JP16020597A 1996-06-17 1997-06-17 情報処理装置およびその方法 Expired - Fee Related JP4077904B2 (ja)

Applications Claiming Priority (2)

Application Number Priority Date Filing Date Title
US08/664,675 US6157738A (en) 1996-06-17 1996-06-17 System for extracting attached text
US08/664675 1996-06-17

Publications (3)

Publication Number Publication Date
JPH1083431A JPH1083431A (ja) 1998-03-31
JPH1083431A5 JPH1083431A5 (enExample) 2007-01-25
JP4077904B2 true JP4077904B2 (ja) 2008-04-23

Family

ID=24666972

Family Applications (1)

Application Number Title Priority Date Filing Date
JP16020597A Expired - Fee Related JP4077904B2 (ja) 1996-06-17 1997-06-17 情報処理装置およびその方法

Country Status (4)

Country Link
US (1) US6157738A (enExample)
EP (1) EP0814422B1 (enExample)
JP (1) JP4077904B2 (enExample)
DE (1) DE69718243T2 (enExample)

Families Citing this family (18)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
US6112216A (en) * 1997-12-19 2000-08-29 Microsoft Corporation Method and system for editing a table in a document
US6330357B1 (en) * 1999-04-07 2001-12-11 Raf Technology, Inc. Extracting user data from a scanned image of a pre-printed form
JP3204259B2 (ja) * 1999-10-06 2001-09-04 インターナショナル・ビジネス・マシーンズ・コーポレーション 文字列抽出方法、手書き文字列抽出方法、文字列抽出装置、および画像処理装置
JP3425408B2 (ja) * 2000-05-31 2003-07-14 株式会社東芝 文書読取装置
EP1271403B1 (en) * 2001-06-26 2005-03-09 Nokia Corporation Method and device for character location in images from digital camera
JP2004088585A (ja) * 2002-08-28 2004-03-18 Fuji Xerox Co Ltd 画像処理システムおよびその方法
JP4897520B2 (ja) * 2006-03-20 2012-03-14 株式会社リコー 情報配信システム
US20070253615A1 (en) * 2006-04-26 2007-11-01 Yuan-Hsiang Chang Method and system for banknote recognition
US8331680B2 (en) * 2008-06-23 2012-12-11 International Business Machines Corporation Method of gray-level optical segmentation and isolation using incremental connected components
CN102314608A (zh) * 2010-06-30 2012-01-11 汉王科技股份有限公司 文字图像中行提取的方法和装置
US20130163871A1 (en) * 2011-12-22 2013-06-27 General Electric Company System and method for segmenting image data to identify a character-of-interest
US9842281B2 (en) * 2014-06-05 2017-12-12 Xerox Corporation System for automated text and halftone segmentation
US20160055376A1 (en) * 2014-06-21 2016-02-25 iQG DBA iQGATEWAY LLC Method and system for identification and extraction of data from structured documents
CN104268545B (zh) * 2014-09-15 2017-09-29 同方知网(北京)技术有限公司 一种电子档版式文件中的表格区域识别与内容栅格化方法
JP6173542B1 (ja) * 2016-08-10 2017-08-02 株式会社Pfu 画像処理装置、画像処理方法、および、プログラム
CN115240214A (zh) * 2021-04-09 2022-10-25 华南理工大学广州学院 一种表格结构识别方法
CN113221778B (zh) * 2021-05-19 2022-05-10 北京航空航天大学杭州创新研究院 手写表格的检测与识别方法及装置
CN113901950A (zh) * 2021-11-05 2022-01-07 上海派拉软件股份有限公司 一种高准确率的表格ocr识别方法及系统

Family Cites Families (4)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
US4377803A (en) * 1980-07-02 1983-03-22 International Business Machines Corporation Algorithm for the segmentation of printed fixed pitch documents
JPS63268081A (ja) * 1987-04-17 1988-11-04 インタ−ナショナル・ビジネス・マシ−ンズ・コ−ポレ−ション 文書の文字を認識する方法及び装置
US5588072A (en) * 1993-12-22 1996-12-24 Canon Kabushiki Kaisha Method and apparatus for selecting blocks of image data from image data having both horizontally- and vertically-oriented blocks
US5848186A (en) * 1995-08-11 1998-12-08 Canon Kabushiki Kaisha Feature extraction system for identifying text within a table image

Also Published As

Publication number Publication date
EP0814422A3 (en) 1998-01-28
US6157738A (en) 2000-12-05
DE69718243T2 (de) 2003-08-28
EP0814422A2 (en) 1997-12-29
EP0814422B1 (en) 2003-01-08
DE69718243D1 (de) 2003-02-13
JPH1083431A (ja) 1998-03-31

Similar Documents

Publication Publication Date Title
JP4077904B2 (ja) 情報処理装置およびその方法
JP3950498B2 (ja) イメージ処理方法及び装置
US6173073B1 (en) System for analyzing table images
US5893127A (en) Generator for document with HTML tagged table having data elements which preserve layout relationships of information in bitmap image of original document
CN114004204B (zh) 基于计算机视觉的表格结构重建与文字提取方法和系统
US6903751B2 (en) System and method for editing electronic images
EP0758775B1 (en) Feature extraction system
JP3359095B2 (ja) 画像処理方法及び装置
US5987171A (en) Page analysis system
EP0690415B1 (en) Editing scanned document images using simple interpretations
US6711292B2 (en) Block selection of table features
US5509092A (en) Method and apparatus for generating information on recognized characters
JPH0668300A (ja) 文書画像のレイアウトモデルを作成する方法及び装置
JPH08185474A (ja) 文書画像分割装置
US9189459B2 (en) Document image layout apparatus
JPH10513284A (ja) 二進イメージに対する空白ページ及び文字枠の自動決定
JP4390523B2 (ja) 最小領域による合成画像の分割
JPH08320914A (ja) 表認識方法および装置
JPH0612540B2 (ja) 文書作成支援装置
JP2004282701A5 (enExample)
JP2008108114A (ja) 文書処理装置および文書処理方法
JP4574347B2 (ja) 画像処理装置、方法及びプログラム
JPH02138674A (ja) 文書処理方法及び装置
JPH07296109A (ja) 画像処理方法とその装置

Legal Events

Date Code Title Description
A521 Request for written amendment filed

Free format text: JAPANESE INTERMEDIATE CODE: A523

Effective date: 20040531

A621 Written request for application examination

Free format text: JAPANESE INTERMEDIATE CODE: A621

Effective date: 20040531

RD01 Notification of change of attorney

Free format text: JAPANESE INTERMEDIATE CODE: A7426

Effective date: 20040531

RD03 Notification of appointment of power of attorney

Free format text: JAPANESE INTERMEDIATE CODE: A7423

Effective date: 20040531

A521 Request for written amendment filed

Free format text: JAPANESE INTERMEDIATE CODE: A523

Effective date: 20061127

A131 Notification of reasons for refusal

Free format text: JAPANESE INTERMEDIATE CODE: A131

Effective date: 20071026

A521 Request for written amendment filed

Free format text: JAPANESE INTERMEDIATE CODE: A523

Effective date: 20071225

TRDD Decision of grant or rejection written
A01 Written decision to grant a patent or to grant a registration (utility model)

Free format text: JAPANESE INTERMEDIATE CODE: A01

Effective date: 20080128

A61 First payment of annual fees (during grant procedure)

Free format text: JAPANESE INTERMEDIATE CODE: A61

Effective date: 20080204

R150 Certificate of patent or registration of utility model

Free format text: JAPANESE INTERMEDIATE CODE: R150

FPAY Renewal fee payment (event date is renewal date of database)

Free format text: PAYMENT UNTIL: 20110208

Year of fee payment: 3

FPAY Renewal fee payment (event date is renewal date of database)

Free format text: PAYMENT UNTIL: 20120208

Year of fee payment: 4

FPAY Renewal fee payment (event date is renewal date of database)

Free format text: PAYMENT UNTIL: 20130208

Year of fee payment: 5

FPAY Renewal fee payment (event date is renewal date of database)

Free format text: PAYMENT UNTIL: 20140208

Year of fee payment: 6

LAPS Cancellation because of no payment of annual fees