TW275115B - Intelligent analyzing and processing system for missive-table document - Google Patents
Intelligent analyzing and processing system for missive-table documentInfo
- Publication number
- TW275115B TW275115B TW83108000A TW83108000A TW275115B TW 275115 B TW275115 B TW 275115B TW 83108000 A TW83108000 A TW 83108000A TW 83108000 A TW83108000 A TW 83108000A TW 275115 B TW275115 B TW 275115B
- Authority
- TW
- Taiwan
- Prior art keywords
- segment
- missive
- block
- unit
- horizontal
- Prior art date
Links
Abstract
An intelligent analyzing and processing system for missive-table document comprises: one monitor; one scanner; one pre-processing unit for receiving digital image converted by scanner via host, and executing run-length operation to every horizontal scan line to get segment combined by all continuous pixels of the horizontal line, in the meantime viewing all continuous segment of the first horizontal scan line as block with width 1, and pertableing block growth operation with all continuous segment of next horizontal scan line one by one to separate each independent block part in image, then pertableing merging test of text or segment to every independent block; one segment and text extracting unit for differentiating block processed by pre-processing unit into segment block or text block by their features, if the length of block belonged to segment is far larger than its width, then the block is one horizontal segment, otherwise, the block is one vertical segment, and for general block the ratio of its longest side to the other side is usually less than one constant value, so the segment and text extracting unit utilizes the block aspect ratio as differentiation basis; one segment cross point extracting unit for checking and recording if every horizontal segment has cross point with vertical segment, and considering there exists fork due to broken cross point, this unit set one w value as tolerance value of cross broken fork, and w>0; one missive table frame confirming unit by taking advantage of one dependent relationship between four corner points of every frame in missive table, so this unit could find dependent ordinal numberrelationship between them to confirm all frames in missive table from the ordinal numbers of horizontal and vertical segments passing through each cross point in missive table; one Chinese/English character recognizing unit for converting font image scanned by scanner into inner code tableat to save storage space; in this unit extracting font features from font image via analysis, and pertableing recognition matching with font template data to find inner code corresponding to the font and save with the inner code; one missive table document feature database for creating one missive table document database from all features(such as horizontal segment, vertical segment, frame and character inner code in frame) generated by analysis-processed missive table according to one fixed data structure as future missive table document data index and missive table recognition.
Priority Applications (1)
Application Number | Priority Date | Filing Date | Title |
---|---|---|---|
TW83108000A TW275115B (en) | 1994-08-31 | 1994-08-31 | Intelligent analyzing and processing system for missive-table document |
Applications Claiming Priority (1)
Application Number | Priority Date | Filing Date | Title |
---|---|---|---|
TW83108000A TW275115B (en) | 1994-08-31 | 1994-08-31 | Intelligent analyzing and processing system for missive-table document |
Publications (1)
Publication Number | Publication Date |
---|---|
TW275115B true TW275115B (en) | 1996-05-01 |
Family
ID=51397267
Family Applications (1)
Application Number | Title | Priority Date | Filing Date |
---|---|---|---|
TW83108000A TW275115B (en) | 1994-08-31 | 1994-08-31 | Intelligent analyzing and processing system for missive-table document |
Country Status (1)
Country | Link |
---|---|
TW (1) | TW275115B (en) |
Cited By (1)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
CN106326854A (en) * | 2016-08-19 | 2017-01-11 | 掌阅科技股份有限公司 | Open fixed-layout document paragraph identification method |
-
1994
- 1994-08-31 TW TW83108000A patent/TW275115B/en active
Cited By (2)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
CN106326854A (en) * | 2016-08-19 | 2017-01-11 | 掌阅科技股份有限公司 | Open fixed-layout document paragraph identification method |
CN106326854B (en) * | 2016-08-19 | 2019-09-06 | 掌阅科技股份有限公司 | A kind of format document paragraph recognition methods |
Similar Documents
Publication | Publication Date | Title |
---|---|---|
Pratt et al. | Combined symbol matching facsimile data compression system | |
KR100523898B1 (en) | Identification, separation and compression of multiple forms with mutants | |
US5956422A (en) | Processor based method for extracting tablets from printed documents | |
US7295694B2 (en) | MICR-based optical character recognition system and method | |
EP0658042B1 (en) | Dropped-form document image compression | |
EP0567344B1 (en) | Method and apparatus for character recognition | |
US5818965A (en) | Consolidation of equivalence classes of scanned symbols | |
EP0843277A3 (en) | Page analysis system | |
KR920001359A (en) | Image Processing System and Method for Document Data | |
EP0720114A2 (en) | Method and apparatus for detecting and interpreting textual captions in digital video signals | |
US6351559B1 (en) | User-enclosed region extraction from scanned document images | |
EP0184682A3 (en) | Digital scanner | |
EP0851659A3 (en) | Information processing system and method therefor | |
US20020039439A1 (en) | Interpretation of coloured documents | |
US4684997A (en) | Machine for the reading, processing and compression of documents | |
US20030012438A1 (en) | Multiple size reductions for image segmentation | |
US6690492B2 (en) | Image processing method and apparatus | |
US5864629A (en) | Character recognition methods and apparatus for locating and extracting predetermined data from a document | |
EP0516576A2 (en) | Method of discriminating between text and graphics | |
Tompkins et al. | A fast segmentation algorithm for bi-level image compression using JBIG2 | |
US6289122B1 (en) | Intelligent detection of text on a page | |
TW275115B (en) | Intelligent analyzing and processing system for missive-table document | |
US6487311B1 (en) | OCR-based image compression | |
US5721790A (en) | Methods and apparatus for separating integer and fractional portions of a financial amount | |
JPH0291789A (en) | Character recognizing system |