CN101876967B - Method for generating PDF text paragraphs - Google Patents
Method for generating PDF text paragraphs Download PDFInfo
- Publication number
- CN101876967B CN101876967B CN2010101363998A CN201010136399A CN101876967B CN 101876967 B CN101876967 B CN 101876967B CN 2010101363998 A CN2010101363998 A CN 2010101363998A CN 201010136399 A CN201010136399 A CN 201010136399A CN 101876967 B CN101876967 B CN 101876967B
- Authority
- CN
- China
- Prior art keywords
- text
- line
- literal
- adjacent
- texts
- Prior art date
- Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
- Expired - Fee Related
Links
- 238000000034 method Methods 0.000 title claims abstract description 37
- 239000012634 fragment Substances 0.000 claims description 27
- 230000008859 change Effects 0.000 claims description 19
- 239000000284 extract Substances 0.000 claims description 12
- 230000002093 peripheral effect Effects 0.000 claims description 4
- 230000000694 effects Effects 0.000 abstract description 2
- 230000008569 process Effects 0.000 description 9
- 238000010586 diagram Methods 0.000 description 6
- 238000000605 extraction Methods 0.000 description 4
- 238000006243 chemical reaction Methods 0.000 description 2
- 230000002950 deficient Effects 0.000 description 2
- 241001269238 Data Species 0.000 description 1
- 230000009286 beneficial effect Effects 0.000 description 1
- 230000008901 benefit Effects 0.000 description 1
- 230000015572 biosynthetic process Effects 0.000 description 1
- 238000005516 engineering process Methods 0.000 description 1
- 230000006872 improvement Effects 0.000 description 1
- 239000000203 mixture Substances 0.000 description 1
- 230000004048 modification Effects 0.000 description 1
- 238000012986 modification Methods 0.000 description 1
- 230000008520 organization Effects 0.000 description 1
- 238000011084 recovery Methods 0.000 description 1
Images
Abstract
Description
Claims (4)
Priority Applications (1)
Application Number | Priority Date | Filing Date | Title |
---|---|---|---|
CN2010101363998A CN101876967B (en) | 2010-03-25 | 2010-03-25 | Method for generating PDF text paragraphs |
Applications Claiming Priority (1)
Application Number | Priority Date | Filing Date | Title |
---|---|---|---|
CN2010101363998A CN101876967B (en) | 2010-03-25 | 2010-03-25 | Method for generating PDF text paragraphs |
Publications (2)
Publication Number | Publication Date |
---|---|
CN101876967A CN101876967A (en) | 2010-11-03 |
CN101876967B true CN101876967B (en) | 2012-05-02 |
Family
ID=43019525
Family Applications (1)
Application Number | Title | Priority Date | Filing Date |
---|---|---|---|
CN2010101363998A Expired - Fee Related CN101876967B (en) | 2010-03-25 | 2010-03-25 | Method for generating PDF text paragraphs |
Country Status (1)
Country | Link |
---|---|
CN (1) | CN101876967B (en) |
Families Citing this family (20)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
CN102479215B (en) * | 2010-11-30 | 2013-10-30 | 汉王科技股份有限公司 | Automatic file exporting method and electronic reading device |
CN102546577A (en) * | 2010-12-27 | 2012-07-04 | 北京大学 | Compression and decompression method and system for format data |
CN102890826B (en) * | 2011-08-12 | 2015-09-09 | 北京多看科技有限公司 | A kind of method of scanned version document re-ranking version |
CN102306294A (en) * | 2011-08-23 | 2012-01-04 | 深圳市万兴软件有限公司 | Method and system for extracting image from portable document format (PDF) file page |
CN102306143A (en) * | 2011-09-22 | 2012-01-04 | 汉王科技股份有限公司 | Method and system for generating and editing PDF (portable document format) document |
CN102722475A (en) * | 2012-05-09 | 2012-10-10 | 深圳市万兴软件有限公司 | Method for converting form in portable document format (PDF) document into Excel form |
US20140280186A1 (en) * | 2013-03-15 | 2014-09-18 | International Business Machines Corporation | Crowdsourcing and consolidating user notes taken in a virtual meeting |
CN104063364A (en) * | 2013-03-19 | 2014-09-24 | 福建福昕软件开发股份有限公司北京分公司 | PDF document recognition method |
CN104516868B (en) * | 2013-09-30 | 2018-03-06 | 北大方正集团有限公司 | The streaming restoring method and system in a kind of space of a whole page space |
CN105354174B (en) * | 2014-08-22 | 2018-04-10 | 北大方正集团有限公司 | For exporting the composition method and device of epub formatted files |
CN104199805B (en) * | 2014-09-11 | 2017-10-20 | 清华大学 | Text joining method and device |
CN104850316B (en) * | 2015-04-29 | 2019-02-12 | 小米科技有限责任公司 | E-book font method of adjustment and device |
CN105373526B (en) * | 2015-10-23 | 2019-02-15 | 北大方正集团有限公司 | A kind of white space processing method and system in electronic document |
CN107391457B (en) * | 2017-07-26 | 2020-10-27 | 成都科来软件有限公司 | Document segmentation method and device based on text line |
CN107783956B (en) * | 2017-11-23 | 2019-03-15 | 掌阅科技股份有限公司 | Composition method, electronic equipment and the computer storage medium of text information |
CN109815453A (en) * | 2018-12-25 | 2019-05-28 | 东软集团股份有限公司 | Document method of partition, device, storage medium and electronic equipment |
CN109948518B (en) * | 2019-03-18 | 2023-06-09 | 武汉汉王大数据技术有限公司 | Neural network-based PDF document content text paragraph aggregation method |
CN110222324B (en) * | 2019-05-21 | 2022-11-08 | 上海阿几网络技术有限公司 | Automatic layout device based on character paragraph structure and word size change rate |
CN112307713A (en) * | 2020-10-27 | 2021-02-02 | 广州朗国电子科技有限公司 | Automatic text typesetting method and system based on Android system |
CN117217172A (en) * | 2023-11-09 | 2023-12-12 | 金蝶征信有限公司 | Table information acquisition method, apparatus, computer device, and storage medium |
Family Cites Families (3)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
EP0702322B1 (en) * | 1994-09-12 | 2002-02-13 | Adobe Systems Inc. | Method and apparatus for identifying words described in a portable electronic document |
CN1278260C (en) * | 2004-02-06 | 2006-10-04 | 珠海金山软件股份有限公司 | Typesetting method |
CN101206639B (en) * | 2007-12-20 | 2012-05-23 | 北大方正集团有限公司 | Method for indexing complex impression based on PDF |
-
2010
- 2010-03-25 CN CN2010101363998A patent/CN101876967B/en not_active Expired - Fee Related
Also Published As
Publication number | Publication date |
---|---|
CN101876967A (en) | 2010-11-03 |
Similar Documents
Publication | Publication Date | Title |
---|---|---|
CN101876967B (en) | Method for generating PDF text paragraphs | |
CN110163030B (en) | PDF framed table extraction method based on image information | |
CN100578432C (en) | Method for directly writing handwriting information | |
CN101206639B (en) | Method for indexing complex impression based on PDF | |
US20070126793A1 (en) | Digital content creation system, digital content creation method, and program product | |
CN101101588B (en) | Document editing device, program, and storage medium | |
CN1312611C (en) | Placement system, programm and method | |
JP5189497B2 (en) | Form creation system, network system using the same, and form creation method. | |
EP2002352B1 (en) | Applying effects to a merged text path | |
CN1936882A (en) | Paging form data-processing method and system | |
US8799761B2 (en) | Method and system for repurposing a spreadsheet to save paper and ink | |
CN105139334A (en) | Multiline text watermark production device | |
AU660313B2 (en) | Method and apparatus for automated page layout of text and graphic elements | |
JP5950700B2 (en) | Image processing apparatus, image processing method, and program | |
CN104424174B (en) | Document processing system and document processing method | |
CN103488619B (en) | Method and device for processing document file | |
CN103970890B (en) | Real-time webpage data generation method and device | |
CN113962193A (en) | Table typesetting method and device, electronic equipment and storage medium | |
JP6152633B2 (en) | Display control apparatus and program | |
CN112307725A (en) | Method for adding table information on two-dimensional drawing interface | |
CN111126007A (en) | HTML (Hypertext markup language) -based medical record document paging algorithm | |
CN104112287A (en) | Method and device for segmenting characters in picture | |
JP6540546B2 (en) | Information processing apparatus and program | |
JP2000207393A (en) | Character arrangement outputting device | |
CN101571882A (en) | System and method for generating minimum outline of characters |
Legal Events
Date | Code | Title | Description |
---|---|---|---|
C06 | Publication | ||
PB01 | Publication | ||
C10 | Entry into substantive examination | ||
SE01 | Entry into force of request for substantive examination | ||
C14 | Grant of patent or utility model | ||
GR01 | Patent grant | ||
C56 | Change in the name or address of the patentee |
Owner name: SHENZHEN WONDERSHARE INFORMATION TECHNOLOGY CO., L Free format text: FORMER NAME: SHENZHEN WONDERSHARE SOFTWARE CO., LTD. |
|
CP03 | Change of name, title or address |
Address after: 518057 Guangdong city of Shenzhen province Nanshan District Gao Xin Road, room 9 building on the north side of block A901 No. 006 TCL Industry Research Institute building A A Building 8 floor Patentee after: SHENZHEN WONDERSHARE INFORMATION TECHNOLOGY Co.,Ltd. Address before: Room 9, block A901 building on the north side of a building 518057 North TCL A of Guangdong Province, Shenzhen city Nanshan District South Road West ten high new technology Patentee before: WONDERSHARE SOFTWARE Co.,Ltd. |
|
CP03 | Change of name, title or address | ||
CP03 | Change of name, title or address |
Address after: 850000 Tibet autonomous region, Lhasa City, New District, west of the East Ring Road, 1-4 road to the north, south of 1-3 Road, Liu Dong building, east of the 8 unit 6, floor 2, No. Patentee after: WONDERSHARE TECHNOLOGY CO.,LTD. Address before: 518057 Guangdong city of Shenzhen province Nanshan District Gao Xin Road, room 9 building on the north side of block A901 No. 006 TCL Industry Research Institute building A A Building 8 floor Patentee before: SHENZHEN WONDERSHARE INFORMATION TECHNOLOGY Co.,Ltd. |
|
CF01 | Termination of patent right due to non-payment of annual fee |
Granted publication date: 20120502 |