CN109522277A - A kind of method and apparatus that multi-format document is read - Google Patents

A kind of method and apparatus that multi-format document is read Download PDF

Info

Publication number
CN109522277A
CN109522277A CN201811175073.9A CN201811175073A CN109522277A CN 109522277 A CN109522277 A CN 109522277A CN 201811175073 A CN201811175073 A CN 201811175073A CN 109522277 A CN109522277 A CN 109522277A
Authority
CN
China
Prior art keywords
file
page
pdf
read
support
Prior art date
Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
Pending
Application number
CN201811175073.9A
Other languages
Chinese (zh)
Inventor
何中
汤海泉
何书
陈明敏
严伟
戴建峰
姚童
何登
王斌
Current Assignee (The listed assignees may be inaccurate. Google has not performed a legal analysis and makes no representation or warranty as to the accuracy of the list.)
JIANGSU ZHONGWEI TECHNOLOGY SOFTWARE SYSTEM Co Ltd
Original Assignee
JIANGSU ZHONGWEI TECHNOLOGY SOFTWARE SYSTEM Co Ltd
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
Filing date
Publication date
Application filed by JIANGSU ZHONGWEI TECHNOLOGY SOFTWARE SYSTEM Co Ltd filed Critical JIANGSU ZHONGWEI TECHNOLOGY SOFTWARE SYSTEM Co Ltd
Priority to CN201811175073.9A priority Critical patent/CN109522277A/en
Publication of CN109522277A publication Critical patent/CN109522277A/en
Pending legal-status Critical Current

Links

Landscapes

  • Information Retrieval, Db Structures And Fs Structures Therefor (AREA)
  • Document Processing Apparatus (AREA)

Abstract

The invention discloses the method and apparatus that a kind of multi-format document is read, the following steps are included: file pre-processes, parsing extracts, imports multiple files, click processing, Study document type directly filters out the file that do not support, for the file of support, different processing libraries is loaded, is preloaded.File rendering, computed altitude, it is automatically introduced into following file, default load page 3, preferential load document page 3, if first file is less than page 3, so subsequent file is filled automatically, file synthesis, export, by the All Files of importing, successively be converted to pdf, it is not converted if original is pdf, otherwise enters conversion process, pop-up saves file frame after conversion end, select storing path, input saves filename and carries out saving a variety of files of invention support, and multiple file consolidations rollings check that support, which merges, is converted into same file.

Description

A kind of method and apparatus that multi-format document is read
Technical field
The present invention relates to the method and apparatus technical field that multi-format document is read, specially a kind of multi-format document is read Method and apparatus.
Background technique
File format (or file type) refers to the specific coding mode to information that computer uses to store information, It is the data of internal reservoir for identification.Than if any storage picture, some storage programs, some storage text informations.It is each Category information, can one or more file formats be stored in computer storage in.Each file format usually have it is a kind of or A variety of extension name can be used to identify, it is also possible to without extension name.The tray that extension name can help application program to identify Formula.
For hard disk drive or the storage of any computer, effective information only has 0 and 1 two kind.So computer must design There is corresponding mode to carry out the conversion of one bit of information.There is different storage formats for different information.
1, existing different files, which need to install different programs, can just read, and changeover program is needed to open, very numerous It is trivial;Switching is often also required to even if 2, of a sort multiple files, when checking to be checked, it is not convenient enough;3, not identical text Part cannot be converted to unified file and be checked, it would therefore be highly desirable to which a kind of improved technology is come in the presence of solving the prior art This problem.
Summary of the invention
The purpose of the present invention is to provide the method and apparatus that a kind of multi-format document is read, and can support a variety of files, more A file consolidation integration, which rolls, to be checked, supports merging to be converted into same file, to solve mentioned above in the background art ask Topic.
To achieve the above object, the invention provides the following technical scheme: a kind of multi-format document read method, including with Lower step:
Step 1: file pretreatment is parsed, is extracted, and imports multiple files, click processing, and Study document type will not prop up The file held directly filters out, and for the file of support, loads different processing libraries, is preloaded.
Step 2: file rendering, computed altitude are automatically introduced into following file, default load page 3, preferential load document 3 Page, if first file is less than page 3, subsequent file is filled automatically.
Step 3: file synthesis, the All Files of importing are successively converted to pdf by export, if original is Pdf is not converted then, otherwise enters conversion process, and pop-up saves file frame after conversion end, selects storing path, and input saves text Part name is saved.
Preferably, it is preloaded in the step 1 as properties: file content, file total page number, file rendering library.
Preferably, loading mode is as follows in the step 2: when opening page 1, while loading page 2, slides into page 2 When, page 3 is loaded, 3 page datas can be cached at this time, when sliding into third page, page 4 is loaded, the data of first page is emptied, with this Analogize, after a file rolls, loads the first page of next file automatically into screen.
Preferably, described device includes:
Processor;
The memory of executable instruction for storage processor;
Wherein, the processor is configured to:
When importing file, for judging file type, if it is determined that the file type that do not support, is directly skipped, if It is judged as the file type of support, loading processing library;
After introducing file, calculates and loading page caches, will be preloaded in the next page for reading the page, empty and reading page The data of the prevpage of one page in front;
The file of non-pdf format is converted into pdf format.
Preferably, described device includes:
Hard disk;
Medium for storing data;
Wherein, the hard disk is configured as:
Deposit and reading process library, and can loading processing library;
Deposit and reading file cache;
Deposit and reading file data.
Compared with prior art, the beneficial effects of the present invention are:
(1) a variety of files are supported.
(2) multiple file consolidation rollings are checked.
(3) it supports to merge to be converted into same file.
Detailed description of the invention
Fig. 1 is flow diagram of the present invention.
Specific embodiment
The following is a clear and complete description of the technical scheme in the embodiments of the invention, it is clear that described embodiment Only a part of the embodiment of the present invention, instead of all the embodiments.Based on the embodiments of the present invention, the common skill in this field Art personnel every other embodiment obtained without making creative work belongs to the model that the present invention protects It encloses.
As shown in Figure 1, the present invention provides a kind of technical solution: a kind of method that multi-format document is read, including following step It is rapid:
Step 1: file pretreatment is parsed, is extracted, and imports multiple files, click processing, and Study document type will not prop up The file held directly filters out, and for the file of support, loads different processing libraries, is preloaded.
Step 2: file rendering, computed altitude are automatically introduced into following file, default load page 3, preferential load document 3 Page, if first file is less than page 3, subsequent file is filled automatically.
Step 3: file synthesis, the All Files of importing are successively converted to pdf by export, if original is Pdf is not converted then, otherwise enters conversion process, and pop-up saves file frame after conversion end, selects storing path, and input saves text Part name is saved.
Embodiment one:
By " file 1.doc " (page 2), " file 2.pdf " (page 4), " file 3.xls " (page 1), " file 4.txt " (1 Page), " file 5.html " (page 1), (page 1) of " file 6.jpg " importing, click processing, Load Game item is completed to preload, first First it is shown that " file 1.doc " page 1, downward page turning, successively it is shown that " file 1.doc " page 2, " file 2.pdf " Page 1, " file 2.pdf " page 2, " file 2.pdf " page 3, " file 2.pdf " page 4, " file 3.xls " page 1, " text Part 4.txt " page 1, " file 5.html " page 1, " file 6.jpg " page 1 click export button, read conversion stripes, successively Display be " file 1.doc " file conversion in, " file 3.xls " file conversion in, " file 4.txt " file conversion in, " file In the conversion of 5.html " file, in the conversion of " file 6.jpg " file, finally pop-up saves file frame, and suffix is defaulted as .pdf, according to It is secondary that file is saved as into " file 1.pdf ", " file 3.pdf ", " file 4.pdf ", " file 5.pdf ", " file 6.pdf ", it is complete At all operations.
Embodiment two:
Based on embodiment one, " file 3.xls " is replaced with into " file 3.avi "
By " file 1.doc " (page 2), " file 2.pdf " (page 4), " file 3.avi ", " file 4.txt " (page 1), " text Part 5.html " (page 1), " file 6.jpg " (page 1) importing, click processing, Load Game item pop up the warning dialog box " " file 3.ayi " be do not support file format ", click confirming button, continue Load Game item, complete preload, first it is shown that " file 1.doc " page 1, downward page turning, successively it is shown that " file 1.doc " page 2, " file 2.pdf " page 1, " file 2.pdf " page 2, " file 2.pdf " page 3, " file 2.pdf " page 4, " file 4.txt " page 1, " file 5.html " Page 1, " file 6.jpg " page 1, click export button, read conversion stripes, successively display be " file 1.doc " file conversion in, In the conversion of " file 4.txt " file, in the conversion of " file 5.html " file, in the conversion of " file 6.jpg " file, finally pop-up is protected Deposit file frame, suffix is defaulted as .pdf, successively by file save as " file 1.pdf ", " file 4.pdf ", " file 5.pdf ", " file 6.pdf ", completes all operations.
Embodiment three:
Based on embodiment one, " file 1.doc " is replaced with into " file 1.pdf "
By " file 1.pdf " (page 1), " file 2.pdf " (page 4), " file 3.xls " (page 1), " file 4.txt " (1 Page), " file 5.html " (page 1), (page 1) of " file 6.jpg " importing, click processing, Load Game item is completed to preload, first First it is shown that " file 1.pdf " page 1, downward page turning, successively it is shown that " file 2.pdf " page 1, " file 2.pdf " Page 2, " file 2.pdf " page 3, " file 2.pdf " page 4, " file 3.xls " (page 1), " file 4.txt " page 1, " text Part 5.html " page 1, " file 6.jpg " page 1 click export button, read conversion stripes, and successively display is " file 3.xls " File conversion in, " file 4.txt " file conversion in, " file 5.html " file conversion in, " file 6.jpg " file conversion In, finally pop-up saves file frame, and suffix is defaulted as .pdf, successively by file save as " file 3.pdf ", " file 4.pdf ", " file 5.pdf ", " file 6.pdf ", completes all operations.
Example IV:
All import " .pdf " formatted file
By " file 1.pdf " (page 1), " file 2.pdf " (page 1), " file 3.pdf " (page 2), " file 4.pdf " (page 2) It importing, click processing, Load Game item is completed to preload, first it is shown that " file 1.pdf " page 1, downward page turning, according to It is secondary that " file 2.pdf " page 1, " file 3.pdf " page 1, " file 3.pdf " page 2, " file 4.pdf " the 1st is shown Page, " file 4.pdf " page 2 click export button, read conversion stripes, and then display is saved and finished, and completes all operations.
Embodiment five:
It all imports non-" .pdf " and supports formatted file
By " file 1.doc " (page 2), " file 2.ppt " (page 3), " file 3.xls " (page 1), " file 4.txt " (1 Page), " file 5.html " (page 1), (page 1) of " file 6.jpg " importing, click processing, Load Game item is completed to preload, first First it is shown that " file 1.doc " page 1, downward page turning, successively it is shown that " file 1.doc " page 2, " file 2.ppt " Page 1, " file 2.ppt " page 2, " file 2.ppt " page 3, " file 3.xls " page 1, " file 4.txt " page 1, " text Part 5.html " page 1, " file 6.jpg " page 1 click export button, read conversion stripes, and successively display is " file 1.doc " File conversion in, " file 2.ppt " file conversion in, " file 3.xls " file conversion in, " file 4.txt " file conversion in, In the conversion of " file 5.html " file, in the conversion of " file 6.jpg " file, finally pop-up saves file frame, and suffix is defaulted as .pdf, file is successively saved as into " file 1.pdf ", " file 2.pdf ", " file 3.pdf ", " file 4.pdf ", " file 5.pdf ", " file 6.pdf ", complete all operations.
Embodiment six:
All import non-supporting formatted file
By " file 1.avi ", " file 2.rm ", " file 3.iso ", " file 4.tmp ", " file 5.mid ", " file 6.bak " is imported, click processing, Load Game item, pop-up the warning dialog box " " file 1.avi " is not support file format ", point Confirming button is hit, pop-up the warning dialog box " " file 2.rm " is not support file format " clicks confirming button, pop-up warning pair Words frame " " file 3.iso " is not support file format " clicks confirming button, and " " file 4.tmp " is not to pop-up the warning dialog box Support file format ", confirming button is clicked, pop-up the warning dialog box " " file 5.mid " is not support file format " is clicked true Determine button, pop-up the warning dialog box " " file 6.bak " is not support file format " clicks confirming button, completes to preload, with Afterwards it is shown that blank page, clicks close button, complete all operations.
According to embodiment one~six, the present invention, which has, supports a variety of files, and multiple file consolidation rollings are checked, supports Merging is converted into the beneficial effects such as same file.
Embodiment is related to a kind of terminal device, terminal device desktop computer, notebook, palm PC, mobile phone or cloud Server etc. is held to calculate equipment, the mobile terminal device may include but be not limited to processor, memory, those skilled in the art It is appreciated that above-mentioned is only example of the present invention for making comments and instructions across lattice, the restriction of mobile terminal device is not constituted, can wrap It includes more than example perhaps less component and perhaps combines certain components or different components, such as the terminal device is also It may include input-output equipment, network access equipment, bus etc..
Memory of the processor for the executable instruction of storage processor, wherein the processor is configured to: it imports When file, for judging file type, if it is determined that the file type that do not support, is directly skipped, if it is determined that support File type, loading processing library;It after introducing file, calculates and loading page caches, pre-add will be carried out in the next page for reading the page It carries, empties the data in the prevpage for reading page prevpage;The file of non-pdf format is converted into pdf format.
Memory is configured as storing various types of data to support the operation in mobile terminal device.These data Example includes the instruction of any application or method for operating on the terminal device.Memory can be by any kind of Volatibility or non-volatile memory device or their combination are realized, such as static random access memory (SRAM), electrically erasable Except programmable read only memory (EEPROM), Erasable Programmable Read Only Memory EPROM (EPROM), programmable read only memory (PROM) or flash memory, hard disk used in embodiments herein, medium for storing data;Wherein, the hard disk is matched It is set to: deposit and reading process library, and can loading processing library;Deposit and reading file cache;Deposit and reading file data.
It is obvious to a person skilled in the art that invention is not limited to the details of the above exemplary embodiments, Er Qie In the case where without departing substantially from spirit or essential attributes of the invention, the present invention can be realized in other specific forms.Therefore, no matter From the point of view of which point, the present embodiments are to be considered as illustrative and not restrictive, and the scope of the present invention is by appended power Benefit requires rather than above description limits, it is intended that all by what is fallen within the meaning and scope of the equivalent elements of the claims Variation is included in the present invention.Any reference signs in the claims should not be construed as limiting the involved claims.This Outside, it is clear that one word of " comprising " does not exclude other units or steps, and odd number is not excluded for plural number.System, device or computer installation power Multiple units, module or the device stated in benefit requirement can also be passed through software or hard by the same unit, module or device Part is realized.The first, the second equal words are used to indicate names, and are not indicated any particular order.
It although an embodiment of the present invention has been shown and described, for the ordinary skill in the art, can be with A variety of variations, modification, replacement can be carried out to these embodiments without departing from the principles and spirit of the present invention by understanding And modification, the scope of the present invention is defined by the appended.

Claims (5)

1. a kind of method that multi-format document is read, it is characterised in that: the following steps are included:
Step 1: file pretreatment is parsed, is extracted, and imports multiple files, click processing, Study document type, by what is do not supported File directly filters out, and for the file of support, loads different processing libraries, is preloaded.
Step 2: file rendering, computed altitude are automatically introduced into following file, default load page 3, preferential load document page 3, If first file is less than page 3, subsequent file is filled automatically.
Step 3: the All Files of importing are successively converted to pdf by file synthesis, export, if original is pdf It does not convert, otherwise enters conversion process, pop-up saves file frame after conversion end, selects storing path, and input saves filename It is saved.
2. the method that a kind of multi-format document according to claim 1 is read, it is characterised in that: pre-add in the step 1 It is loaded with as properties: file content, file total page number, file rendering library.
3. the method that a kind of multi-format document according to claim 1 is read, it is characterised in that: loaded in the step 2 Mode is as follows: when opening page 1, while page 2, when sliding into page 2 loaded, loads page 3,3 page datas can be cached at this time, When sliding into third page, page 4 is loaded, the data of first page are emptied, and so on, after a file rolls, automatically The first page of next file is loaded into screen.
4. the device that a kind of multi-format document according to claim 1 is read, it is characterised in that: described device includes:
Processor;
The memory of executable instruction for storage processor;
Wherein, the processor is configured to:
When importing file, for judging file type, if it is determined that the file type that do not support, is directly skipped, if it is determined that For the file type of support, loading processing library;
After introducing file, calculates and loading page caches, will preload, and be emptied before reading the page in the next page for reading the page The data of the prevpage of one page;
The file of non-pdf format is converted into pdf format.
5. the device that a kind of multi-format document according to claim 1 is read, it is characterised in that: described device includes:
Hard disk;
Medium for storing data;
Wherein, the hard disk is configured as:
Deposit and reading process library, and can loading processing library;
Deposit and reading file cache;
Deposit and reading file data.
CN201811175073.9A 2018-10-10 2018-10-10 A kind of method and apparatus that multi-format document is read Pending CN109522277A (en)

Priority Applications (1)

Application Number Priority Date Filing Date Title
CN201811175073.9A CN109522277A (en) 2018-10-10 2018-10-10 A kind of method and apparatus that multi-format document is read

Applications Claiming Priority (1)

Application Number Priority Date Filing Date Title
CN201811175073.9A CN109522277A (en) 2018-10-10 2018-10-10 A kind of method and apparatus that multi-format document is read

Publications (1)

Publication Number Publication Date
CN109522277A true CN109522277A (en) 2019-03-26

Family

ID=65770091

Family Applications (1)

Application Number Title Priority Date Filing Date
CN201811175073.9A Pending CN109522277A (en) 2018-10-10 2018-10-10 A kind of method and apparatus that multi-format document is read

Country Status (1)

Country Link
CN (1) CN109522277A (en)

Cited By (2)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN116755593A (en) * 2023-08-11 2023-09-15 江苏中威科技软件系统有限公司 Method for combining or combining information with file aggregation whiteboard for reading and operating
CN117892695A (en) * 2024-03-13 2024-04-16 江苏中威科技软件系统有限公司 Method for carrying multi-format file by DLF file

Citations (5)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
US20040230700A1 (en) * 2003-05-15 2004-11-18 Canon Kabushiki Kaisha Information transmission method and information transmission apparatus
CN103533073A (en) * 2013-10-23 2014-01-22 北京网秦天下科技有限公司 File management system and method for mobile equipment
CN105045802A (en) * 2015-05-22 2015-11-11 杭州亿方云网络科技有限公司 Message-driven multi-type file preview system
CN107526619A (en) * 2017-09-04 2017-12-29 江苏中威科技软件系统有限公司 The load mode of format data stream file
CN108108478A (en) * 2018-01-04 2018-06-01 中煤航测遥感集团有限公司 Conversion method of data format, system and electronic equipment

Patent Citations (5)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
US20040230700A1 (en) * 2003-05-15 2004-11-18 Canon Kabushiki Kaisha Information transmission method and information transmission apparatus
CN103533073A (en) * 2013-10-23 2014-01-22 北京网秦天下科技有限公司 File management system and method for mobile equipment
CN105045802A (en) * 2015-05-22 2015-11-11 杭州亿方云网络科技有限公司 Message-driven multi-type file preview system
CN107526619A (en) * 2017-09-04 2017-12-29 江苏中威科技软件系统有限公司 The load mode of format data stream file
CN108108478A (en) * 2018-01-04 2018-06-01 中煤航测遥感集团有限公司 Conversion method of data format, system and electronic equipment

Cited By (4)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN116755593A (en) * 2023-08-11 2023-09-15 江苏中威科技软件系统有限公司 Method for combining or combining information with file aggregation whiteboard for reading and operating
CN116755593B (en) * 2023-08-11 2023-10-24 江苏中威科技软件系统有限公司 Method for combining or combining information with file aggregation whiteboard for reading and operating
CN117892695A (en) * 2024-03-13 2024-04-16 江苏中威科技软件系统有限公司 Method for carrying multi-format file by DLF file
CN117892695B (en) * 2024-03-13 2024-05-28 江苏中威科技软件系统有限公司 Method for carrying multi-format file by DLF file

Similar Documents

Publication Publication Date Title
CA2831381C (en) Recovery of tenant data across tenant moves
CN103488732A (en) Generation method and device of static pages
US9460069B2 (en) Generation of test data using text analytics
US20140237343A1 (en) Method and system for optimizing rendering of data tables
CN104601691A (en) Method and system for increasing loading speed of Web site resource
US20120158742A1 (en) Managing documents using weighted prevalence data for statements
US7370060B2 (en) System and method for user edit merging with preservation of unrepresented data
CN103067480A (en) Synchronized method and system of network disk
US11030163B2 (en) System for tracking and displaying changes in a set of related electronic documents
CN109767274B (en) Method and system for carrying out associated storage on massive invoice data
CN104765849A (en) Method and system for acquiring copied data source information
US20120060086A1 (en) Removing style corruption from extensible markup language documents
CN106445815A (en) Automated testing method and device
CN113076731A (en) Report file generation method and device, computer equipment and storage medium
CN109522277A (en) A kind of method and apparatus that multi-format document is read
CN113835692A (en) Dictionary data processing method and device, electronic equipment and computer storage medium
CN109697019A (en) The method and system of data write-in based on FAT file system
US8131728B2 (en) Processing large sized relationship-specifying markup language documents
CN113010542B (en) Service data processing method, device, computer equipment and storage medium
CN110377891B (en) Method, device and equipment for generating event analysis article and computer readable storage medium
CN110795920B (en) Document generation method and device
US9069884B2 (en) Processing special attributes within a file
CN105574164A (en) Excel document data analysis method and device
CN108536715B (en) Preview page generation method, device, equipment and storage medium
KR101174398B1 (en) Apparatus and method for recommanding contents

Legal Events

Date Code Title Description
PB01 Publication
PB01 Publication
SE01 Entry into force of request for substantive examination
SE01 Entry into force of request for substantive examination
WD01 Invention patent application deemed withdrawn after publication

Application publication date: 20190326

WD01 Invention patent application deemed withdrawn after publication