TW275115B - Intelligent analyzing and processing system for missive-table document - Google Patents

Intelligent analyzing and processing system for missive-table document

Info

Publication number
TW275115B
TW275115B TW83108000A TW83108000A TW275115B TW 275115 B TW275115 B TW 275115B TW 83108000 A TW83108000 A TW 83108000A TW 83108000 A TW83108000 A TW 83108000A TW 275115 B TW275115 B TW 275115B
Authority
TW
Taiwan
Prior art keywords
segment
missive
block
unit
horizontal
Prior art date
Application number
TW83108000A
Other languages
Chinese (zh)
Inventor
Tiee-Min Wu
Jin-Shii Guo
Keh-Hwa Shyu
Guang-Yaw Jang
Bor-Shuenn Jeng
Pei-Yi Ding
Duen-Wen Bai
Original Assignee
Telecomm Lab Dgt Motc
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
Filing date
Publication date
Application filed by Telecomm Lab Dgt Motc filed Critical Telecomm Lab Dgt Motc
Priority to TW83108000A priority Critical patent/TW275115B/en
Application granted granted Critical
Publication of TW275115B publication Critical patent/TW275115B/en

Links

Abstract

An intelligent analyzing and processing system for missive-table document comprises: one monitor; one scanner; one pre-processing unit for receiving digital image converted by scanner via host, and executing run-length operation to every horizontal scan line to get segment combined by all continuous pixels of the horizontal line, in the meantime viewing all continuous segment of the first horizontal scan line as block with width 1, and pertableing block growth operation with all continuous segment of next horizontal scan line one by one to separate each independent block part in image, then pertableing merging test of text or segment to every independent block; one segment and text extracting unit for differentiating block processed by pre-processing unit into segment block or text block by their features, if the length of block belonged to segment is far larger than its width, then the block is one horizontal segment, otherwise, the block is one vertical segment, and for general block the ratio of its longest side to the other side is usually less than one constant value, so the segment and text extracting unit utilizes the block aspect ratio as differentiation basis; one segment cross point extracting unit for checking and recording if every horizontal segment has cross point with vertical segment, and considering there exists fork due to broken cross point, this unit set one w value as tolerance value of cross broken fork, and w>0; one missive table frame confirming unit by taking advantage of one dependent relationship between four corner points of every frame in missive table, so this unit could find dependent ordinal numberrelationship between them to confirm all frames in missive table from the ordinal numbers of horizontal and vertical segments passing through each cross point in missive table; one Chinese/English character recognizing unit for converting font image scanned by scanner into inner code tableat to save storage space; in this unit extracting font features from font image via analysis, and pertableing recognition matching with font template data to find inner code corresponding to the font and save with the inner code; one missive table document feature database for creating one missive table document database from all features(such as horizontal segment, vertical segment, frame and character inner code in frame) generated by analysis-processed missive table according to one fixed data structure as future missive table document data index and missive table recognition.
TW83108000A 1994-08-31 1994-08-31 Intelligent analyzing and processing system for missive-table document TW275115B (en)

Priority Applications (1)

Application Number Priority Date Filing Date Title
TW83108000A TW275115B (en) 1994-08-31 1994-08-31 Intelligent analyzing and processing system for missive-table document

Applications Claiming Priority (1)

Application Number Priority Date Filing Date Title
TW83108000A TW275115B (en) 1994-08-31 1994-08-31 Intelligent analyzing and processing system for missive-table document

Publications (1)

Publication Number Publication Date
TW275115B true TW275115B (en) 1996-05-01

Family

ID=51397267

Family Applications (1)

Application Number Title Priority Date Filing Date
TW83108000A TW275115B (en) 1994-08-31 1994-08-31 Intelligent analyzing and processing system for missive-table document

Country Status (1)

Country Link
TW (1) TW275115B (en)

Cited By (1)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN106326854A (en) * 2016-08-19 2017-01-11 掌阅科技股份有限公司 Open fixed-layout document paragraph identification method

Cited By (2)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN106326854A (en) * 2016-08-19 2017-01-11 掌阅科技股份有限公司 Open fixed-layout document paragraph identification method
CN106326854B (en) * 2016-08-19 2019-09-06 掌阅科技股份有限公司 A kind of format document paragraph recognition methods

Similar Documents

Publication Publication Date Title
Pratt et al. Combined symbol matching facsimile data compression system
KR100523898B1 (en) Identification, separation and compression of multiple forms with mutants
US5956422A (en) Processor based method for extracting tablets from printed documents
US7295694B2 (en) MICR-based optical character recognition system and method
EP0658042B1 (en) Dropped-form document image compression
EP0567344B1 (en) Method and apparatus for character recognition
US5818965A (en) Consolidation of equivalence classes of scanned symbols
EP0843277A3 (en) Page analysis system
KR920001359A (en) Image Processing System and Method for Document Data
EP0720114A2 (en) Method and apparatus for detecting and interpreting textual captions in digital video signals
US6351559B1 (en) User-enclosed region extraction from scanned document images
EP0184682A3 (en) Digital scanner
EP0851659A3 (en) Information processing system and method therefor
US20020039439A1 (en) Interpretation of coloured documents
US4684997A (en) Machine for the reading, processing and compression of documents
US20030012438A1 (en) Multiple size reductions for image segmentation
US6690492B2 (en) Image processing method and apparatus
US5864629A (en) Character recognition methods and apparatus for locating and extracting predetermined data from a document
EP0516576A2 (en) Method of discriminating between text and graphics
Tompkins et al. A fast segmentation algorithm for bi-level image compression using JBIG2
US6289122B1 (en) Intelligent detection of text on a page
TW275115B (en) Intelligent analyzing and processing system for missive-table document
US6487311B1 (en) OCR-based image compression
US5721790A (en) Methods and apparatus for separating integer and fractional portions of a financial amount
JPH0291789A (en) Character recognizing system