CN109597980A - PDF document dividing method, device and electronic equipment - Google Patents

PDF document dividing method, device and electronic equipment Download PDF

Info

Publication number
CN109597980A
CN109597980A CN201811502370.XA CN201811502370A CN109597980A CN 109597980 A CN109597980 A CN 109597980A CN 201811502370 A CN201811502370 A CN 201811502370A CN 109597980 A CN109597980 A CN 109597980A
Authority
CN
China
Prior art keywords
document
subdocument
pages
target
pdf source
Prior art date
Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
Pending
Application number
CN201811502370.XA
Other languages
Chinese (zh)
Inventor
李譞
Current Assignee (The listed assignees may be inaccurate. Google has not performed a legal analysis and makes no representation or warranty as to the accuracy of the list.)
Wanxing Polytron Technologies Inc
Original Assignee
Wanxing Polytron Technologies Inc
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
Filing date
Publication date
Application filed by Wanxing Polytron Technologies Inc filed Critical Wanxing Polytron Technologies Inc
Priority to CN201811502370.XA priority Critical patent/CN109597980A/en
Publication of CN109597980A publication Critical patent/CN109597980A/en
Pending legal-status Critical Current

Links

Classifications

    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06FELECTRIC DIGITAL DATA PROCESSING
    • G06F40/00Handling natural language data
    • G06F40/10Text processing
    • G06F40/166Editing, e.g. inserting or deleting

Abstract

The present invention provides a kind of PDF document dividing method, device and electronic equipments, are related to electronic document editing and processing technical field, this method comprises: receiving the cutting operation parameter of the target PDF source document that user terminal uploads and input;According to cutting operation parameter, target PDF source document is split, obtains the corresponding output subdocument of target PDF source document;Output subdocument is sent to user terminal.The present invention is able to ascend the personalization level of PDF document segmentation.

Description

PDF document dividing method, device and electronic equipment
Technical field
The present invention relates to electronic document editing and processing technical fields, more particularly, to a kind of PDF document dividing method, device And electronic equipment.
Background technique
PDF (Portable Document Format, Portable document format) document is common, widely used electronics Document format, content of pages use fixed format, and the content between same page is not on the data store without any connection.Mesh Before, usually PDF document is split according to fixed page quantity, if user has specific segmentation demand, user It needs to carry out multi-pass operation to be likely to complete its purpose for dividing PDF document;Meanwhile only being split according to the page, generation File size is related with the content of the page, is unknown, it is impossible to meet user such as to file size limitation specific demand. Such mode is not able to satisfy the individual demand of user.
Summary of the invention
In view of this, the purpose of the present invention is to provide a kind of PDF document dividing method, device and electronic equipment, it can Promote the personalization level of PDF document segmentation.
In a first aspect, the embodiment of the invention provides a kind of PDF document dividing methods, comprising: receive what user terminal uploaded Target PDF source document and the cutting operation parameter of input;According to cutting operation parameter, target PDF source document is split, is obtained To the corresponding output subdocument of target PDF source document;Output subdocument is sent to user terminal.
With reference to first aspect, the embodiment of the invention provides the first possible embodiments of first aspect, wherein point Cutting operating parameter is document size threshold value;It is above-mentioned according to cutting operation parameter, target PDF source document is split, mesh is obtained The step of mark PDF source document corresponding output subdocument includes: to count when target PDF source document is greater than document size threshold value The number of pages of target PDF source document;When the number of pages of target PDF source document is greater than 1, target PDF source document is divided into the first mesh Mark document and the second destination document;Wherein, the number of pages of first object document is equal to the number of pages or the second mesh of the second destination document The number of pages of mark document adds 1;Judge whether the size of first object document is greater than document size threshold value;If so, working as first object When the number of pages of document is greater than 1, using first object document as target PDF source document, above-mentioned steps are re-executed: being grasped according to segmentation Make parameter, target PDF source document is split, obtains the corresponding output subdocument of target PDF source document;If not, according to First object document and the second destination document determine output subdocument.
The possible embodiment of with reference to first aspect the first, the embodiment of the present invention are supplied to the second of first aspect The possible embodiment of kind, wherein it is above-mentioned according to first object document and the second destination document, determine the step of output subdocument Suddenly, comprising: the top n target pages in the second destination document are rejected from the second destination document, obtain the second current document; Wherein, the value of N is the number of pages of the second destination document divided by determined by rounding up after 2;Top n target pages are added the One destination document obtains the first current document;Judge whether the size of the first current document is greater than document size threshold value;If It is no, according to the number of pages of the second current document, determine output subdocument;If so, determining output subdocument according to the value of N.
The possible embodiment of second with reference to first aspect, the embodiment of the invention provides the third of first aspect Possible embodiment, wherein the above-mentioned number of pages according to the second current document, the step of determining output subdocument, comprising: when the When the number of pages of two current documents is greater than 1, the first current document is determined as first object document, and the second current document is determined After the second destination document, above-mentioned steps are re-executed: according to first object document and the second destination document, determining output Ziwen Shelves;When the number of pages of the second current document is 1, the first current document is determined as the first output subdocument, and first is exported The corresponding page of subdocument is rejected from the target PDF source document that user terminal uploads, and obtains the first PDF source document;By the first PDF Source document re-executes above-mentioned steps as target PDF source document: according to cutting operation parameter, carrying out to target PDF source document Segmentation, obtains the corresponding output subdocument of target PDF source document.
The possible embodiment of second with reference to first aspect, the embodiment of the invention provides the 4th kind of first aspect Possible embodiment, wherein the above-mentioned value according to N determines that the step of exporting subdocument includes: when the value of N is greater than 1 When, top n target pages are combined as after the second destination document, re-execute above-mentioned steps: according to first object Document and the second destination document determine output subdocument;When the value of N is 1, first object document is determined as the second output Subdocument;The corresponding page of second output subdocument is rejected from the target PDF source document that user terminal uploads, obtains second PDF source document;Using the 2nd PDF source document as target PDF source document, above-mentioned steps are re-executed: according to cutting operation parameter, PDF source document is split, the corresponding output subdocument of PDF source document is obtained.
The possible embodiment of with reference to first aspect the first, the embodiment of the invention provides the 5th kind of first aspect Possible embodiment, wherein before whether the size for judging first object document is greater than document size threshold value, the above method Further include: when first object document and the second destination document meet the first preset condition, by first object document and the second mesh Mark document is determined as exporting subdocument;Wherein, the first preset condition are as follows: the size of first object document and the second destination document Size no more than document size threshold value, and, the number of pages of first object document and the number of pages of the second destination document mutually add up Equal to the number of pages for the target PDF source document that user terminal uploads.
The possible embodiment of with reference to first aspect the first, the embodiment of the invention provides the 6th kind of first aspect Possible embodiment, the above method further include: when target PDF source document meets the second preset condition or first object document When meeting third preset condition, preset miscue information is sent to user terminal;Wherein, the second preset condition are as follows: target The size of PDF source document is greater than document size threshold value, and, the number of pages of target PDF source document is 1;Third preset condition are as follows: first Destination document size is greater than document size threshold value, and, the number of pages of first object document is 1.
Second aspect, the embodiment of the invention provides a kind of PDF document segmenting devices, comprising: receiving module, for receiving The cutting operation parameter of target PDF source document and input that user terminal uploads;Document segmentation module, for being joined according to cutting operation Number, is split target PDF source document, obtains the corresponding output subdocument of target PDF source document;Sending module, being used for will Output subdocument is sent to user terminal.
The third aspect is deposited in memory the embodiment of the invention provides a kind of electronic equipment, including memory and processor The computer program that can be run on a processor is contained, processor realizes first aspect to first aspect when executing computer program The step of 6th kind of possible embodiment described in any item methods.
Fourth aspect, the embodiment of the invention provides a kind of computer readable storage medium, computer readable storage mediums On be stored with computer program, when computer program is run by processor execute first aspect to first aspect the 6th kind of possibility Embodiment described in any item methods the step of.
The embodiment of the present invention bring it is following the utility model has the advantages that
The embodiment of the invention provides a kind of PDF document dividing method, device and electronic equipments, can receive on user terminal The target PDF source document of biography and the cutting operation parameter of input, then according to cutting operation parameter, to target PDF source document into Row segmentation obtains the corresponding output subdocument of target PDF source document, then output subdocument is sent to user terminal.The present invention is real Apply the aforesaid way of example offer compared to the prior art in PDF document in such a way that fixed number of pages is split, can be according to User's custom parameter is split PDF document, is more suitable for user and specifically divides demand, PDF document is effectively promoted The personalization level of segmentation.
Other features and advantages of the present invention will illustrate in the following description, also, partly become from specification It obtains it is clear that understand through the implementation of the invention.The objectives and other advantages of the invention are in specification, claims And specifically noted structure is achieved and obtained in attached drawing.
To enable the above objects, features and advantages of the present invention to be clearer and more comprehensible, preferred embodiment is cited below particularly, and cooperate Appended attached drawing, is described in detail below.
Detailed description of the invention
It, below will be to specific in order to illustrate more clearly of the specific embodiment of the invention or technical solution in the prior art Embodiment or attached drawing needed to be used in the description of the prior art be briefly described, it should be apparent that, it is described below Attached drawing is some embodiments of the present invention, for those of ordinary skill in the art, before not making the creative labor It puts, is also possible to obtain other drawings based on these drawings.
Fig. 1 is a kind of flow chart of PDF document dividing method provided in an embodiment of the present invention;
Fig. 2 is the flow chart of another PDF document dividing method provided in an embodiment of the present invention;
Fig. 3 is a kind of structural block diagram of PDF document segmenting device provided in an embodiment of the present invention;
Fig. 4 is the structural schematic diagram of a kind of electronic equipment provided in an embodiment of the present invention.
Specific embodiment
In order to make the object, technical scheme and advantages of the embodiment of the invention clearer, below in conjunction with attached drawing to the present invention Technical solution be clearly and completely described, it is clear that described embodiments are some of the embodiments of the present invention, rather than Whole embodiments.Based on the embodiments of the present invention, those of ordinary skill in the art are not making creative work premise Under every other embodiment obtained, shall fall within the protection scope of the present invention.
PDF document is common, widely used electronic file form, and content of pages uses fixed format, not same page it Between content on the data store without any connection.Currently, usually dividing according to fixed page quantity PDF document It cuts, if user has specific segmentation demand, user needs to carry out multi-pass operation and is likely to complete its segmentation PDF document Purpose;Meanwhile only split according to the page, the file size of generation and the content of the page are related, be it is unknown, can not expire Specific demand of the sufficient user such as to file size limitation.Such mode is not able to satisfy the individual demand of user.
Based on this, a kind of PDF document dividing method, device and electronic equipment provided in an embodiment of the present invention can be promoted The personalization level of PDF document segmentation.
For convenient for understanding the present embodiment, first to PDF document dividing method disclosed in the embodiment of the present invention into Row is discussed in detail, a kind of flow chart of PDF document dividing method shown in Figure 1, this method comprises:
Step S102 receives the cutting operation parameter of the target PDF source document that user terminal uploads and input;
Step S104 is split target PDF source document according to cutting operation parameter, obtains target PDF source document pair The output subdocument answered;
Cutting operation parameter can be the page number of specified page, according to the customized page number of user to target PDF source document point It cuts specified page required for obtaining user, when practical application, the source PDF can be adjusted according to corresponding setting when extracting the page The sequence of the page in document can also be corresponding by the page in segmentation if the page contains bookmark information in PDF source document Bookmark information extracts together;Cutting operation parameter can be specific document size threshold value, according to the customized text of user Shelves size threshold value is split PDF source document, makes the size of its corresponding each output subdocument no more than document size Threshold value.
Output subdocument is sent to user terminal by step S106.
In the specific implementation, an online division management system can be constructed in advance, which includes user interface, and Division management backstage;Wherein, it is provided on user interface and uploads file interface and cutting operation parameter setting interface, respectively For receiving the target PDF source document of user's upload and the cutting operation parameter of input;Division management backstage is used to pass through user Interactive interface obtains target PDF source document and cutting operation parameter, online in real time according to cutting operation parameters on target PDF source document Shelves are split, and the result of segmentation is fed back to user by user interface.In this way, want in user When dividing PDF document, without installing additional software, multiple subdocuments of PDF document can be obtained online, compared to related skill Mode based on installation software segmentation PDF document in art, can effectively promote the Experience Degree of user.
The embodiment of the invention provides a kind of PDF document dividing methods, can receive the target PDF source document of user terminal upload Shelves and the cutting operation parameter of input are split target PDF source document, obtain target then according to cutting operation parameter The corresponding output subdocument of PDF source document, then output subdocument is sent to user terminal.Above-mentioned side provided in an embodiment of the present invention Formula compared to the prior art in PDF document in such a way that fixed number of pages is split, can be according to user's custom parameter pair PDF document is split, and is more suitable for user and is specifically divided demand, and the personalization level of PDF document segmentation is effectively promoted.
Specifically, above-mentioned cutting operation parameter is document size threshold value;The embodiment of the invention also provides another PDF texts Above-mentioned steps S104 is shown in detail as shown in Fig. 2, on the basis of Fig. 1 in the flow chart of shelves dividing method, namely according to point The step of cutting operating parameter, target PDF source document is split, obtaining target PDF source document corresponding output subdocument packet It includes:
Step S202 counts the number of pages of target PDF source document when target PDF source document is greater than document size threshold value.
When target PDF source document is not more than the case where document size threshold value if it exists, then using target PDF source document as defeated Subdocument is sent directly to user terminal out.
Target PDF source document is divided into first object text when the number of pages of target PDF source document is greater than 1 by step S204 Shelves and the second destination document;Wherein, the number of pages of first object document is equal to the number of pages or the second target text of the second destination document The number of pages of shelves adds 1.
In view of the number of pages of the target PDF source document of user terminal upload may be odd number, to target PDF source document two Point, it avoids generating mistake when target PDF source document being also divided into first object document and the second destination document, it is unified to set The number of pages of first object document is the number of pages of target PDF source document divided by rounding up after 2 namely the number of pages of the second destination document Add 1;When the number of pages of target PDF source document is even number, the number of pages of first object document is equal to the number of pages of the second destination document.
Step S206, judges whether the size of first object document is greater than document size threshold value;If so, executing step S208;If not, executing step S210.
Step S208, when the number of pages of first object document is greater than 1, using first object document as target PDF source document, It re-executes above-mentioned steps: according to cutting operation parameter, target PDF source document being split, obtains target PDF source document pair The output subdocument answered;
Step S210 determines output subdocument according to first object document and the second destination document.
Above-mentioned another PDF document dividing method provided in an embodiment of the present invention, using dichotomy to target PDF source document It is split, and whether document size threshold value is greater than according to document size threshold decision first object document, in first object text It when shelves are greater than document size threshold value, is looped to determine after first object document is determined as target PDF source document, until first object Document be not more than document size threshold value when, according at this time first object document and the second destination document determine output subdocument, First output subdocument for meeting the customized document size threshold value of user can be quickly determined in this way.
In a kind of optional mode, above-mentioned steps S210, namely according to first object document and the second destination document, really Surely subdocument is exported, can refer to following steps implementation:
Top n target pages in second destination document are rejected from the second destination document, obtain second by step (1) Current document;Wherein, the value of N is the number of pages of the second destination document divided by determined by rounding up after 2.
Top n target pages are added first object document, obtain the first current document by step (2).
Step (3), judges whether the size of the first current document is greater than document size threshold value;If not, executing step (4);If it is execution step (5).
Step (4) determines output subdocument according to the number of pages of the second current document;
The number of pages of second current document can be divided into two kinds of situations, and corresponding both of these case determines the mode of output subdocument not Together, specifically, it is as follows:
When the number of pages of the second current document be greater than 1 when, the first current document is determined as first object document, and by the After two current documents are determined as the second destination document, above-mentioned steps are re-executed: according to first object document and the second target text Shelves determine output subdocument;
When the number of pages of the second current document is 1, the first current document is determined as the first output subdocument, and by first The corresponding page of output subdocument is rejected from the target PDF source document that user terminal uploads, and obtains the first PDF source document;
Using the first PDF source document as target PDF source document, above-mentioned steps are re-executed: right according to cutting operation parameter Target PDF source document is split, and obtains the corresponding output subdocument of target PDF source document.
Step (5) determines output subdocument according to the value of N.
The value of N can be divided into two kinds of situations, and corresponding both of these case determines that the mode of output subdocument is different, specifically, It is as follows:
When the value of N is greater than 1, top n target pages are combined as after the second destination document, are re-executed Above-mentioned steps: according to first object document and the second destination document, output subdocument is determined;
When the value of N is 1, first object document is determined as the second output subdocument;
The corresponding page of second output subdocument is rejected from the target PDF source document that user terminal uploads, obtains second PDF source document;
It using the 2nd PDF source document as target PDF source document, re-execute the steps: according to cutting operation parameter, to PDF Source document is split, and obtains the corresponding output subdocument of PDF source document.
In conclusion it is provided in an embodiment of the present invention above-mentioned according to first object document and the second destination document, it determines defeated The specific embodiment of subdocument out can be obtained first by constantly updating first object document or the second destination document One document size and the closest output subdocument of document size threshold value, the source target PDF that then user terminal is uploaded again The remaining page in document carries out aforesaid operations again, obtains the multiple and close output subdocument of document size threshold value, so that Output subdocument all meets user for the particular demands of document size threshold value.
Further, it is contemplated that there may be the target PDF source documents uploaded to user terminal to carry out first time dimidiate cut behaviour The case where size of obtained first object document and the second destination document is no more than document size threshold value when making, to reduce Subsequent unnecessary judgement process, promotes the efficiency of aforesaid way, is executing above-mentioned steps step S206, namely judge the first mesh Whether the size of mark document is greater than before document size threshold value, the above method further include:
When first object document and the second destination document meet the first preset condition, by first object document and the second mesh Mark document is determined as exporting subdocument;Wherein, the first preset condition are as follows: the size of first object document and the second destination document Size no more than document size threshold value, and, the number of pages of first object document and the number of pages of the second destination document mutually add up Equal to the number of pages for the target PDF source document that user terminal uploads.
Further, it is contemplated that when practical application, it is understood that there may be the size of target PDF source document or first object document Size is greater than document size threshold value, and target PDF source document or first object document only have page 1 and can not be split again Situation does not meet the execution logic of above method process.The above method further include: preset when target PDF source document meets second When condition or first object document meet third preset condition, preset miscue information is sent to user terminal;Wherein, Second preset condition are as follows: the size of target PDF source document is greater than document size threshold value, and, the number of pages of target PDF source document is 1; Third preset condition are as follows: first object document size is greater than document size threshold value, and, the number of pages of first object document is 1.It is practical In application, being based on aforementioned online division management system, preset miscue information can be fed back to by user interface User, miscue information may include error reason, and particular content can be arranged according to the actual situation, be not limited herein.
The corresponding above method, the embodiment of the invention provides a kind of PDF document segmenting devices, referring to Fig. 3, the device packet It includes:
Receiving module 302, for receiving the target PDF source document of user terminal upload and the cutting operation parameter of input;
Document segmentation module 304, for being split to target PDF source document, obtaining target according to cutting operation parameter The corresponding output subdocument of PDF source document;
Sending module 306 is sent to user terminal for that will export subdocument.
The embodiment of the invention provides a kind of PDF document segmenting devices, can receive the target PDF source document of user terminal upload Shelves and the cutting operation parameter of input are split target PDF source document, obtain target then according to cutting operation parameter The corresponding output subdocument of PDF source document, then output subdocument is sent to user terminal.The embodiment of the present invention is compared to existing skill PDF document can divide PDF document according to user's custom parameter in such a way that fixed number of pages is split in art It cuts, is more suitable for user and specifically divides demand, the personalization level of PDF document segmentation is effectively promoted.
The technical effect of device provided by the present embodiment, realization principle and generation is identical with previous embodiment, for letter It describes, Installation practice part does not refer to place, can refer to corresponding contents in preceding method embodiment.
Further, the present embodiment additionally provides a kind of electronic equipment, including memory, processor, is stored in memory The computer program that can be run on a processor, processor realize above-mentioned PDF document dividing method when executing computer program Step.
The structural schematic diagram of a kind of electronic equipment shown in Figure 4 shows electronic equipment 400, comprising: processor 40, memory 41, bus 42 and communication interface 43, processor 40, communication interface 43 and memory 41 are connected by bus 42;Place Reason device 40 is for executing the executable module deposited and stored in 41, such as computer program.
Wherein, memory 41 may include high-speed random access memory (RAM, Random Access Memory), It may further include non-labile memory (non-volatile memory), for example, at least a magnetic disk storage.By extremely A few communication interface 43 (can be wired or wireless) is realized logical between the system network element and at least one other network element Letter connection, can be used internet, wide area network, local network, Metropolitan Area Network (MAN) etc..
Bus 42 can be isa bus, pci bus or eisa bus etc..It is total that bus can be divided into address bus, data Line, control bus etc..Only to be indicated with a four-headed arrow in Fig. 4, it is not intended that an only bus or one convenient for indicating The bus of seed type.
Wherein, memory 41 is for storing program 401, and processor 40 executes program 401 after receiving and executing instruction, Method performed by the device that the stream process that aforementioned any embodiment of the embodiment of the present invention discloses defines can be applied to processor In 40, or realized by processor 40.
Processor 40 may be a kind of IC chip, the processing capacity with signal.During realization, above-mentioned side Each step of method can be completed by the integrated logic circuit of the hardware in processor 40 or the instruction of software form.Above-mentioned Processor 40 can be general processor, including central processing unit (Central Processing Unit, abbreviation CPU), network Processor (Network Processor, abbreviation NP) etc.;It can also be digital signal processor (Digital Signal Processing, abbreviation DSP), specific integrated circuit (Application Specific Integrated Circuit, referred to as ASIC), ready-made programmable gate array (Field-Programmable Gate Array, abbreviation FPGA) or other are programmable Logical device, discrete gate or transistor logic, discrete hardware components.It may be implemented or execute in the embodiment of the present invention Disclosed each method, step and logic diagram.General processor can be microprocessor or the processor is also possible to appoint What conventional processor etc..The step of method in conjunction with disclosed in the embodiment of the present invention, can be embodied directly in hardware decoding processing Device executes completion, or in decoding processor hardware and software module combination execute completion.Software module can be located at Machine memory, flash memory, read-only memory, programmable read only memory or electrically erasable programmable memory, register etc. are originally In the storage medium of field maturation.The storage medium is located at memory 41, and processor 40 reads the information in memory 41, in conjunction with Its hardware completes the step of above method.
Further, the embodiment of the invention also provides a kind of computer readable storage medium, computer readable storage mediums On be stored with computer program, any of the above-described PDF document dividing method is executed when which is run by processor Step.Specific implementation can be found in embodiment of the method, and details are not described herein.
The unit as illustrated by the separation member may or may not be physically separated, aobvious as unit The component shown may or may not be physical unit, it can and it is in one place, or may be distributed over multiple In network unit.It can select some or all of unit therein according to the actual needs to realize the mesh of this embodiment scheme 's.
It, can also be in addition, the functional units in various embodiments of the present invention may be integrated into one processing unit It is that each unit physically exists alone, can also be integrated in one unit with two or more units.
It, can be with if the function is realized in the form of SFU software functional unit and when sold or used as an independent product It is stored in the executable non-volatile computer-readable storage medium of a processor.Based on this understanding, of the invention Technical solution substantially the part of the part that contributes to existing technology or the technical solution can be with software in other words The form of product embodies, which is stored in a storage medium, including some instructions use so that One computer equipment (can be personal computer, server or the network equipment etc.) executes each embodiment institute of the present invention State all or part of the steps of method.And storage medium above-mentioned includes: USB flash disk, mobile hard disk, read-only memory (ROM, Read- Only Memory), random access memory (RAM, Random Access Memory), magnetic or disk etc. are various can be with Store the medium of program code.
Finally, it should be noted that embodiment described above, only a specific embodiment of the invention, to illustrate the present invention Technical solution, rather than its limitations, scope of protection of the present invention is not limited thereto, although with reference to the foregoing embodiments to this hair It is bright to be described in detail, those skilled in the art should understand that: anyone skilled in the art In the technical scope disclosed by the present invention, it can still modify to technical solution documented by previous embodiment or can be light It is readily conceivable that variation or equivalent replacement of some of the technical features;And these modifications, variation or replacement, do not make The essence of corresponding technical solution is detached from the spirit and scope of technical solution of the embodiment of the present invention, should all cover in protection of the invention Within the scope of.Therefore, the protection scope of the present invention shall be subject to the protection scope of the claims.

Claims (10)

1. a kind of PDF document dividing method characterized by comprising
Receive the cutting operation parameter of the target PDF source document that user terminal uploads and input;
According to the cutting operation parameter, the target PDF source document is split, obtains the target PDF source document pair The output subdocument answered;
The output subdocument is sent to the user terminal.
2. the method according to claim 1, wherein the cutting operation parameter is document size threshold value;It is described According to the cutting operation parameter, the target PDF source document is split, it is corresponding to obtain the target PDF source document Export subdocument the step of include:
When the target PDF source document is greater than the document size threshold value, the number of pages of the target PDF source document is counted;
When the number of pages of the target PDF source document be greater than 1 when, by the target PDF source document be divided into first object document and Second destination document;Wherein, the number of pages of the first object document is equal to the number of pages or described the of second destination document The number of pages of two destination documents adds 1;
Judge whether the size of the first object document is greater than the document size threshold value;
If so, when the number of pages of the first object document is greater than 1, using the first object document as the target PDF Source document re-executes above-mentioned steps: according to the cutting operation parameter, being split, obtains to the target PDF source document The corresponding output subdocument of the target PDF source document;
If not, determining the output subdocument according to the first object document and second destination document.
3. according to the method described in claim 2, it is characterized in that, described according to the first object document and second mesh The step of marking document, determining the output subdocument, comprising:
Top n target pages in second destination document are rejected from second destination document, it is current to obtain second Document;Wherein, the value of the N is the number of pages of second destination document divided by determined by rounding up after 2;
The first object document is added in target pages described in top n, obtains the first current document;
Judge whether the size of first current document is greater than the document size threshold value;
If not, determining the output subdocument according to the number of pages of second current document;
If so, determining the output subdocument according to the value of the N.
4. according to the method described in claim 3, it is characterized in that, according to the number of pages of second current document, described in determination The step of exporting subdocument, comprising:
When the number of pages of second current document is greater than 1, first current document is determined as the first object document, And after second current document is determined as second destination document, above-mentioned steps are re-executed: according to first mesh Document and second destination document are marked, determines the output subdocument;
When the number of pages of second current document is 1, first current document is determined as the first output subdocument, and will Described first exports the corresponding page of subdocument rejects from the target PDF source document that the user terminal uploads, and obtains the first PDF Source document;
Using the first PDF source document as the target PDF source document, above-mentioned steps are re-executed: being grasped according to the segmentation Make parameter, the target PDF source document is split, obtains the corresponding output subdocument of the target PDF source document.
5. according to the method described in claim 3, it is characterized in that, the value according to the N, determines the output Ziwen Shelves the step of include:
When the value of the N is greater than 1, target pages described in top n are combined as after second destination document, It re-executes above-mentioned steps: according to the first object document and second destination document, determining the output subdocument;
When the value of the N is 1, the first object document is determined as the second output subdocument;
The corresponding page of the second output subdocument is rejected from the target PDF source document that the user terminal uploads, is obtained 2nd PDF source document;
Using the 2nd PDF source document as the target PDF source document, above-mentioned steps are re-executed: being grasped according to the segmentation Make parameter, the PDF source document is split, obtains the corresponding output subdocument of the PDF source document.
6. according to the method described in claim 2, it is characterized in that, judging whether the size of the first object document is greater than Before the document size threshold value, the method also includes:
When the first object document and second destination document meet the first preset condition, by the first object document It is determined as the output subdocument with second destination document;Wherein, first preset condition are as follows: the first object The size of document and the size of second destination document no more than the document size threshold value, and, first object text The number of pages of shelves and the number of pages of second destination document mutually sum to the page for the target PDF source document that the user terminal uploads Number.
7. according to the method described in claim 2, it is characterized in that, the method also includes:
When the target PDF source document the second preset condition of satisfaction or the first object document meet third preset condition When, preset miscue information is sent to the user terminal;
Wherein, second preset condition are as follows: the size of the target PDF source document is greater than the document size threshold value, and, institute The number of pages for stating target PDF source document is 1;The third preset condition are as follows: the first object document size is greater than the document Size threshold value, and, the number of pages of the first object document is 1.
8. a kind of PDF document segmenting device characterized by comprising
Receiving module, for receiving the target PDF source document of user terminal upload and the cutting operation parameter of input;
Document segmentation module, for being split to the target PDF source document, obtaining institute according to the cutting operation parameter State the corresponding output subdocument of target PDF source document;
Sending module, for the output subdocument to be sent to the user terminal.
9. a kind of electronic equipment, which is characterized in that including memory and processor, being stored in the memory can be at the place The computer program run on reason device, the processor realize that the claims 1 to 7 are any when executing the computer program The step of method described in item.
10. a kind of computer readable storage medium, computer program, feature are stored on the computer readable storage medium The step of being, the described in any item methods of the claims 1 to 7 executed when the computer program is run by processor.
CN201811502370.XA 2018-12-07 2018-12-07 PDF document dividing method, device and electronic equipment Pending CN109597980A (en)

Priority Applications (1)

Application Number Priority Date Filing Date Title
CN201811502370.XA CN109597980A (en) 2018-12-07 2018-12-07 PDF document dividing method, device and electronic equipment

Applications Claiming Priority (1)

Application Number Priority Date Filing Date Title
CN201811502370.XA CN109597980A (en) 2018-12-07 2018-12-07 PDF document dividing method, device and electronic equipment

Publications (1)

Publication Number Publication Date
CN109597980A true CN109597980A (en) 2019-04-09

Family

ID=65962069

Family Applications (1)

Application Number Title Priority Date Filing Date
CN201811502370.XA Pending CN109597980A (en) 2018-12-07 2018-12-07 PDF document dividing method, device and electronic equipment

Country Status (1)

Country Link
CN (1) CN109597980A (en)

Cited By (2)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN111680491A (en) * 2020-05-27 2020-09-18 北京字节跳动科技有限公司 Document information extraction method and device and electronic equipment
CN112036123A (en) * 2020-08-31 2020-12-04 北京奇虎鸿腾科技有限公司 PDF (Portable document Format) generation method, device and equipment based on webpage and storage medium

Citations (8)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
JP2007272339A (en) * 2006-03-30 2007-10-18 Canon Inc Electronic document processing system
CN102855224A (en) * 2011-06-30 2013-01-02 北大方正集团有限公司 Display method and display device of electronic documents
CN103377175A (en) * 2012-04-26 2013-10-30 Sap股份公司 Structured document converting based on partition
CN103455534A (en) * 2013-04-28 2013-12-18 北界创想(北京)软件有限公司 Document clustering method and device
CN103544262A (en) * 2013-10-16 2014-01-29 银江股份有限公司 XML-based stream page release method and system
CN103593333A (en) * 2013-10-16 2014-02-19 小米科技有限责任公司 Electronic book document processing method, terminal and electronic equipment
CN106599183A (en) * 2016-12-13 2017-04-26 北京致远互联软件股份有限公司 Document online previewing method and system
CN107391478A (en) * 2017-08-15 2017-11-24 北京北信源软件股份有限公司 A kind of online document edit methods and device

Patent Citations (8)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
JP2007272339A (en) * 2006-03-30 2007-10-18 Canon Inc Electronic document processing system
CN102855224A (en) * 2011-06-30 2013-01-02 北大方正集团有限公司 Display method and display device of electronic documents
CN103377175A (en) * 2012-04-26 2013-10-30 Sap股份公司 Structured document converting based on partition
CN103455534A (en) * 2013-04-28 2013-12-18 北界创想(北京)软件有限公司 Document clustering method and device
CN103544262A (en) * 2013-10-16 2014-01-29 银江股份有限公司 XML-based stream page release method and system
CN103593333A (en) * 2013-10-16 2014-02-19 小米科技有限责任公司 Electronic book document processing method, terminal and electronic equipment
CN106599183A (en) * 2016-12-13 2017-04-26 北京致远互联软件股份有限公司 Document online previewing method and system
CN107391478A (en) * 2017-08-15 2017-11-24 北京北信源软件股份有限公司 A kind of online document edit methods and device

Non-Patent Citations (2)

* Cited by examiner, † Cited by third party
Title
李兰友等: "面向Web的PDF文档构建技术", 《计算机与现代化》 *
百科全说: "怎样才能使pdf文件分开?", 《HTTPS://WWW.BKQS.COM.CN/CONTENT/X34056LPK.HTML》 *

Cited By (3)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN111680491A (en) * 2020-05-27 2020-09-18 北京字节跳动科技有限公司 Document information extraction method and device and electronic equipment
CN111680491B (en) * 2020-05-27 2024-02-02 北京字跳网络技术有限公司 Method and device for extracting document information and electronic equipment
CN112036123A (en) * 2020-08-31 2020-12-04 北京奇虎鸿腾科技有限公司 PDF (Portable document Format) generation method, device and equipment based on webpage and storage medium

Similar Documents

Publication Publication Date Title
CN107766328B (en) Text information extraction method of structured text, storage medium and server
CN106961454A (en) Document down loading method, device and terminal device
CN109241003B (en) File management method and device
CN109597980A (en) PDF document dividing method, device and electronic equipment
CN104731645A (en) Task scheduling method and device and data downloading method and device
CN107657030A (en) Collect method, apparatus, terminal device and storage medium that user reads data
CN105550179A (en) Webpage collection method and browser plug-in
CN112235422B (en) Data processing method and device, computer readable storage medium and electronic device
CN105247481A (en) Web page output selection
CN113407254A (en) Form generation method and device, electronic equipment and storage medium
CN107870921B (en) Log data processing method and device
CN106021582B (en) Method for filtering position information, method and device for extracting effective webpage information
CN107911315B (en) Message classification method and network equipment
CN111414395A (en) Data processing method, system and computer equipment
CN109063142B (en) Webpage resource pushing method, server and storage medium
CN109033189B (en) Compression method and device of link structure log, server and readable storage medium
CN110634018A (en) Feature depiction method, recognition method and related device for lost user
CN115935909A (en) File generation method and device and electronic equipment
US10459983B2 (en) Method and device of hierarchical document filtering
US20140215328A1 (en) Method, terminal, and server for displaying file
CN113746932A (en) Network request merging method and device, electronic device and computer program product
CN104933055B (en) Webpage identification method and webpage identification device
CN109840080B (en) Character attribute comparison method and device, storage medium and electronic equipment
CN111144509B (en) Method, device and computer for classifying system application programs
CN110928902A (en) Query method and system for acquiring cloud platform terminal data aiming at paging

Legal Events

Date Code Title Description
PB01 Publication
PB01 Publication
SE01 Entry into force of request for substantive examination
SE01 Entry into force of request for substantive examination
RJ01 Rejection of invention patent application after publication
RJ01 Rejection of invention patent application after publication

Application publication date: 20190409