CN109597980A - PDF document dividing method, device and electronic equipment - Google Patents
PDF document dividing method, device and electronic equipment
- Publication number
- CN109597980A (application CN201811502370.XA)
- Authority
- CN
- China
- Prior art keywords
- document
- subdocument
- pages
- target
- pdf source
- Prior art date
- Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
- Pending
Classifications
- G—PHYSICS
- G06—COMPUTING; CALCULATING OR COUNTING
- G06F—ELECTRIC DIGITAL DATA PROCESSING
- G06F40/00—Handling natural language data
- G06F40/10—Text processing
- G06F40/166—Editing, e.g. inserting or deleting
Abstract
The present invention provides a PDF document dividing method, device and electronic equipment, relating to the technical field of electronic document editing and processing. The method comprises: receiving a target PDF source document uploaded by a user terminal and an input splitting operation parameter; splitting the target PDF source document according to the splitting operation parameter to obtain the output subdocuments corresponding to the target PDF source document; and sending the output subdocuments to the user terminal. The invention can improve the degree of personalization of PDF document splitting.
Description
Technical field
The present invention relates to the technical field of electronic document editing and processing, and in particular to a PDF document dividing method, device and electronic equipment.
Background
PDF (Portable Document Format) is a common, widely used electronic document format. Page content uses a fixed layout, and the content of different pages has no connection in data storage. At present, PDF documents are usually split by a fixed number of pages, so a user with a specific splitting requirement may need several operations to achieve the desired split. Moreover, when a document is split by page only, the size of each generated file depends on the content of its pages and is not known in advance, so specific requirements such as a limit on file size cannot be met. This approach therefore cannot satisfy users' individual needs.
Summary of the invention
In view of this, the purpose of the present invention is to provide a PDF document dividing method, device and electronic equipment that can improve the degree of personalization of PDF document splitting.
In a first aspect, an embodiment of the present invention provides a PDF document dividing method, comprising: receiving a target PDF source document uploaded by a user terminal and an input splitting operation parameter; splitting the target PDF source document according to the splitting operation parameter to obtain the output subdocuments corresponding to the target PDF source document; and sending the output subdocuments to the user terminal.
With reference to the first aspect, an embodiment of the present invention provides a first possible implementation of the first aspect, in which the splitting operation parameter is a document size threshold, and the step of splitting the target PDF source document according to the splitting operation parameter to obtain the corresponding output subdocuments comprises: when the target PDF source document is larger than the document size threshold, counting the number of pages of the target PDF source document; when the number of pages of the target PDF source document is greater than 1, dividing the target PDF source document into a first target document and a second target document, wherein the number of pages of the first target document equals the number of pages of the second target document, or that number plus 1; judging whether the size of the first target document is greater than the document size threshold; if so, when the number of pages of the first target document is greater than 1, taking the first target document as the target PDF source document and re-executing the above step of splitting the target PDF source document according to the splitting operation parameter to obtain the corresponding output subdocuments; if not, determining the output subdocument according to the first target document and the second target document.
With reference to the first possible implementation of the first aspect, an embodiment of the present invention provides a second possible implementation of the first aspect, in which the step of determining the output subdocument according to the first target document and the second target document comprises: removing the first N target pages of the second target document from the second target document to obtain a second current document, wherein N is the number of pages of the second target document divided by 2 and rounded up; adding the first N target pages to the first target document to obtain a first current document; judging whether the size of the first current document is greater than the document size threshold; if not, determining the output subdocument according to the number of pages of the second current document; if so, determining the output subdocument according to the value of N.
With reference to the second possible implementation of the first aspect, an embodiment of the present invention provides a third possible implementation of the first aspect, in which the step of determining the output subdocument according to the number of pages of the second current document comprises: when the number of pages of the second current document is greater than 1, taking the first current document as the first target document and the second current document as the second target document, and then re-executing the step of determining the output subdocument according to the first target document and the second target document; when the number of pages of the second current document is 1, determining the first current document as a first output subdocument, and removing the pages corresponding to the first output subdocument from the target PDF source document uploaded by the user terminal to obtain a first PDF source document; taking the first PDF source document as the target PDF source document and re-executing the step of splitting the target PDF source document according to the splitting operation parameter to obtain the corresponding output subdocuments.
With reference to the second possible implementation of the first aspect, an embodiment of the present invention provides a fourth possible implementation of the first aspect, in which the step of determining the output subdocument according to the value of N comprises: when N is greater than 1, taking the first N target pages together as the second target document and then re-executing the step of determining the output subdocument according to the first target document and the second target document; when N is 1, determining the first target document as a second output subdocument; removing the pages corresponding to the second output subdocument from the target PDF source document uploaded by the user terminal to obtain a second PDF source document; taking the second PDF source document as the target PDF source document and re-executing the step of splitting the target PDF source document according to the splitting operation parameter to obtain the corresponding output subdocuments.
With reference to the first possible implementation of the first aspect, an embodiment of the present invention provides a fifth possible implementation of the first aspect, in which, before judging whether the size of the first target document is greater than the document size threshold, the method further comprises: when the first target document and the second target document satisfy a first preset condition, determining the first target document and the second target document as output subdocuments; wherein the first preset condition is that the size of the first target document and the size of the second target document are each no greater than the document size threshold, and the number of pages of the first target document plus the number of pages of the second target document equals the number of pages of the target PDF source document uploaded by the user terminal.
With reference to the first possible implementation of the first aspect, an embodiment of the present invention provides a sixth possible implementation of the first aspect, in which the method further comprises: when the target PDF source document satisfies a second preset condition or the first target document satisfies a third preset condition, sending preset error prompt information to the user terminal; wherein the second preset condition is that the size of the target PDF source document is greater than the document size threshold and the number of pages of the target PDF source document is 1; and the third preset condition is that the size of the first target document is greater than the document size threshold and the number of pages of the first target document is 1.
In a second aspect, an embodiment of the present invention provides a PDF document dividing device, comprising: a receiving module, configured to receive a target PDF source document uploaded by a user terminal and an input splitting operation parameter; a document splitting module, configured to split the target PDF source document according to the splitting operation parameter to obtain the output subdocuments corresponding to the target PDF source document; and a sending module, configured to send the output subdocuments to the user terminal.
In a third aspect, an embodiment of the present invention provides electronic equipment comprising a memory and a processor, the memory storing a computer program that can run on the processor, wherein the processor, when executing the computer program, implements the steps of the method according to any one of the first aspect to the sixth possible implementation of the first aspect.
In a fourth aspect, an embodiment of the present invention provides a computer-readable storage medium storing a computer program which, when run by a processor, executes the steps of the method according to any one of the first aspect to the sixth possible implementation of the first aspect.
The embodiments of the present invention bring the following beneficial effects:
The embodiments of the present invention provide a PDF document dividing method, device and electronic equipment, which receive a target PDF source document uploaded by a user terminal and an input splitting operation parameter, split the target PDF source document according to the splitting operation parameter to obtain the corresponding output subdocuments, and then send the output subdocuments to the user terminal. Compared with the prior-art approach of splitting a PDF document by a fixed number of pages, this approach splits the PDF document according to user-defined parameters, is better suited to the user's specific splitting requirements, and effectively improves the degree of personalization of PDF document splitting.
Other features and advantages of the present invention will be set forth in the following description and will in part become apparent from the description or be understood by practising the invention. The objectives and other advantages of the invention are realized and obtained by the structure particularly pointed out in the description, the claims and the accompanying drawings.
To make the above objects, features and advantages of the present invention clearer and easier to understand, preferred embodiments are described in detail below with reference to the accompanying drawings.
Brief description of the drawings
In order to explain the specific embodiments of the present invention or the technical solutions in the prior art more clearly, the drawings needed in the description of the specific embodiments or the prior art are briefly introduced below. Obviously, the drawings described below show some embodiments of the present invention, and those of ordinary skill in the art can obtain other drawings from them without creative effort.
Fig. 1 is a flow chart of a PDF document dividing method provided by an embodiment of the present invention;
Fig. 2 is a flow chart of another PDF document dividing method provided by an embodiment of the present invention;
Fig. 3 is a structural block diagram of a PDF document dividing device provided by an embodiment of the present invention;
Fig. 4 is a structural schematic diagram of electronic equipment provided by an embodiment of the present invention.
Detailed description of the embodiments
To make the objects, technical solutions and advantages of the embodiments of the present invention clearer, the technical solutions of the present invention are described clearly and completely below with reference to the accompanying drawings. Obviously, the described embodiments are some, but not all, of the embodiments of the present invention. All other embodiments obtained by those of ordinary skill in the art based on the embodiments of the present invention without creative effort shall fall within the protection scope of the present invention.
PDF is a common, widely used electronic document format. Page content uses a fixed layout, and the content of different pages has no connection in data storage. At present, PDF documents are usually split by a fixed number of pages, so a user with a specific splitting requirement may need several operations to achieve the desired split. Moreover, when a document is split by page only, the size of each generated file depends on the content of its pages and is not known in advance, so specific requirements such as a limit on file size cannot be met. This approach therefore cannot satisfy users' individual needs.
Based on this, the PDF document dividing method, device and electronic equipment provided by the embodiments of the present invention can improve the degree of personalization of PDF document splitting.
To facilitate understanding of this embodiment, the PDF document dividing method disclosed in the embodiments of the present invention is first described in detail. Referring to the flow chart of a PDF document dividing method shown in Fig. 1, the method comprises:
Step S102: receive the target PDF source document uploaded by the user terminal and the input splitting operation parameter.
Step S104: split the target PDF source document according to the splitting operation parameter to obtain the output subdocuments corresponding to the target PDF source document.
The splitting operation parameter may be the page numbers of specified pages, in which case the target PDF source document is split according to the user-defined page numbers to obtain the pages the user requires. In practice, the order of the pages taken from the PDF source document can be adjusted according to a corresponding setting when the pages are extracted, and if pages in the PDF source document carry bookmark information, the bookmark information corresponding to each page can be extracted together with it. The splitting operation parameter may also be a specific document size threshold, in which case the PDF source document is split according to the user-defined document size threshold so that the size of each resulting output subdocument does not exceed the threshold.
Step S106: send the output subdocuments to the user terminal.
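As a minimal sketch of the page-number variant of the splitting operation parameter, the snippet below models a document as a plain Python list with one entry per page. The function name `extract_pages` and the list-based page model are assumptions made for illustration; the patent does not prescribe an API, and a real implementation would rely on a PDF library to copy pages and any associated bookmarks.

```python
from typing import List, Sequence, TypeVar

P = TypeVar("P")


def extract_pages(pages: Sequence[P], page_numbers: Sequence[int]) -> List[P]:
    """Return the requested pages in the order the user listed them.

    Assumption: `page_numbers` are 1-based, as a user would type them.
    """
    for number in page_numbers:
        if not 1 <= number <= len(pages):
            raise ValueError(f"page {number} is outside 1..{len(pages)}")
    # Listing pages in a different order reorders them in the output.
    return [pages[number - 1] for number in page_numbers]


source = ["cover", "intro", "body", "appendix"]  # hypothetical page contents
selected = extract_pages(source, [3, 1])
```

Here `selected` holds the third page followed by the first, which illustrates how the page order could be adjusted at extraction time.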
In a specific implementation, an online splitting management system may be built in advance, comprising a user interaction interface and a splitting management backend. The user interaction interface provides a file upload interface and a splitting operation parameter setting interface, which receive the target PDF source document uploaded by the user and the input splitting operation parameter respectively. The splitting management backend obtains the target PDF source document and the splitting operation parameter through the user interaction interface, splits the target PDF source document online in real time according to the parameter, and feeds the result back to the user through the user interaction interface. In this way, a user who wants to split a PDF document can obtain its subdocuments online without installing additional software, which can effectively improve the user experience compared with related-art approaches that require installed software.
The embodiment of the present invention provides a PDF document dividing method that receives a target PDF source document uploaded by a user terminal and an input splitting operation parameter, splits the target PDF source document according to the splitting operation parameter to obtain the corresponding output subdocuments, and then sends the output subdocuments to the user terminal. Compared with the prior-art approach of splitting a PDF document by a fixed number of pages, this approach splits the PDF document according to user-defined parameters, is better suited to the user's specific splitting requirements, and effectively improves the degree of personalization of PDF document splitting.
Specifically, the above splitting operation parameter is a document size threshold. An embodiment of the present invention further provides a flow chart of another PDF document dividing method, shown in Fig. 2, which on the basis of Fig. 1 details step S104, namely the step of splitting the target PDF source document according to the splitting operation parameter to obtain the corresponding output subdocuments. This step comprises:
Step S202: when the target PDF source document is larger than the document size threshold, count the number of pages of the target PDF source document. If the target PDF source document is not larger than the document size threshold, it is sent directly to the user terminal as the output subdocument.
Step S204: when the number of pages of the target PDF source document is greater than 1, divide the target PDF source document into a first target document and a second target document, wherein the number of pages of the first target document equals the number of pages of the second target document, or that number plus 1.
The number of pages of the target PDF source document uploaded by the user terminal may be odd. To avoid errors when bisecting the target PDF source document into the first target document and the second target document, the number of pages of the first target document is uniformly set to the number of pages of the target PDF source document divided by 2 and rounded up. When the page count is odd, this is the number of pages of the second target document plus 1; when the page count is even, the number of pages of the first target document equals that of the second target document.
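The rounding rule of step S204 can be sketched as follows. This is an illustrative helper only: the name `bisect_pages` and the list-of-pages model are assumptions, not part of the patent.

```python
from math import ceil
from typing import List, Sequence, Tuple, TypeVar

P = TypeVar("P")


def bisect_pages(pages: Sequence[P]) -> Tuple[List[P], List[P]]:
    """Split pages into a first and second target document (step S204).

    The first half receives ceil(n / 2) pages, so it is one page longer
    than the second half when n is odd and equal in length when n is even.
    """
    half = ceil(len(pages) / 2)
    return list(pages[:half]), list(pages[half:])


first, second = bisect_pages(list(range(1, 8)))  # 7 pages: 4 + 3
```

With seven pages the first target document gets pages 1 to 4 and the second gets pages 5 to 7; with six pages each gets three.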
Step S206: judge whether the size of the first target document is greater than the document size threshold; if so, execute step S208; if not, execute step S210.
Step S208: when the number of pages of the first target document is greater than 1, take the first target document as the target PDF source document and re-execute the above step of splitting the target PDF source document according to the splitting operation parameter to obtain the corresponding output subdocuments.
Step S210: determine the output subdocument according to the first target document and the second target document.
The above PDF document dividing method uses bisection to split the target PDF source document and judges whether the first target document is larger than the document size threshold. While the first target document is larger than the threshold, it is taken as the target PDF source document and the judgment is repeated; once the first target document is no larger than the threshold, the output subdocument is determined from the first target document and the second target document at that point. In this way the first output subdocument satisfying the user-defined document size threshold can be determined quickly.
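A hedged sketch of the halving loop in steps S202 to S208 is given below. It assumes, purely for illustration, that each page is represented by its size in bytes and that a document's size is the sum of its pages; real PDF files share resources between pages, so an actual implementation would measure the serialized subdocument instead. The function name `shrink_first` is hypothetical.

```python
from math import ceil
from typing import List, Sequence, Tuple


def shrink_first(page_sizes: Sequence[int], threshold: int) -> Tuple[List[int], List[int]]:
    """Halve the leading part until it fits under the threshold.

    Returns (first target document, second target document) as they stand
    when step S206 finally answers "no". Assumption: document size is the
    sum of the per-page sizes.
    """
    first: List[int] = list(page_sizes)
    second: List[int] = []
    while sum(first) > threshold:
        if len(first) == 1:
            # A single oversized page cannot be split any further.
            raise ValueError("a single page exceeds the size threshold")
        half = ceil(len(first) / 2)
        first, second = first[:half], first[half:]
    return first, second


first, second = shrink_first([4, 4, 4, 4, 4, 4, 4], threshold=10)
```

For seven pages of size 4 and a threshold of 10, the loop halves 7 pages to 4 and then to 2, leaving a two-page first target document and a two-page second target document.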
In an optional mode, the above step S210, namely determining the output subdocument according to the first target document and the second target document, can be implemented with reference to the following steps:
Step (1): remove the first N target pages of the second target document from the second target document to obtain a second current document, wherein N is the number of pages of the second target document divided by 2 and rounded up.
Step (2): add the first N target pages to the first target document to obtain a first current document.
Step (3): judge whether the size of the first current document is greater than the document size threshold; if not, execute step (4); if so, execute step (5).
Step (4): determine the output subdocument according to the number of pages of the second current document.
The number of pages of the second current document falls into two cases, and the output subdocument is determined differently in each case, as follows:
When the number of pages of the second current document is greater than 1, the first current document is taken as the first target document and the second current document as the second target document, and the above step of determining the output subdocument according to the first target document and the second target document is re-executed.
When the number of pages of the second current document is 1, the first current document is determined as the first output subdocument, and the pages corresponding to the first output subdocument are removed from the target PDF source document uploaded by the user terminal to obtain a first PDF source document.
The first PDF source document is then taken as the target PDF source document, and the above step of splitting the target PDF source document according to the splitting operation parameter to obtain the corresponding output subdocuments is re-executed.
Step (5): determine the output subdocument according to the value of N.
The value of N falls into two cases, and the output subdocument is determined differently in each case, as follows:
When N is greater than 1, the first N target pages are taken together as the second target document, and the above step of determining the output subdocument according to the first target document and the second target document is re-executed.
When N is 1, the first target document is determined as the second output subdocument.
The pages corresponding to the second output subdocument are removed from the target PDF source document uploaded by the user terminal to obtain a second PDF source document.
The second PDF source document is then taken as the target PDF source document, and the above step of splitting the target PDF source document according to the splitting operation parameter to obtain the corresponding output subdocuments is re-executed.
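Steps (1) to (5) amount to a binary search for how many pages of the second target document can still be appended to the first. The sketch below is one possible reading, under the same illustrative assumption that a document's size is the sum of its per-page byte counts; `grow_first` is a hypothetical name, and the function is expected to be called when the first and second target documents together exceed the threshold, as is the case on entering step S210.

```python
from math import ceil
from typing import List, Sequence


def grow_first(first: Sequence[int], second: Sequence[int], threshold: int) -> List[int]:
    """Append as many leading pages of `second` to `first` as still fit."""
    first, second = list(first), list(second)
    while second:
        n = ceil(len(second) / 2)          # step (1): N = ceil(pages / 2)
        candidate = first + second[:n]     # step (2): first current document
        if sum(candidate) <= threshold:    # step (3) answers "no": step (4)
            first, second = candidate, second[n:]
        elif n > 1:                        # step (5), N > 1: retry with the N pages
            second = second[:n]
        else:                              # step (5), N == 1: first is final
            second = []
    return first


output = grow_first([1, 1, 1, 1, 1], [1, 1, 1, 1, 1], threshold=7)
```

Under the stated precondition, a single leftover page in the second current document can never fit, so the loop ends in the same place as the case of step (4) in which one page remains.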
In summary, the above specific implementation of determining the output subdocument according to the first target document and the second target document first obtains, by continually updating the first target document or the second target document, one output subdocument whose size is closest to the document size threshold. The same operations are then performed on the remaining pages of the target PDF source document uploaded by the user terminal, yielding multiple output subdocuments whose sizes are close to the document size threshold, so that every output subdocument satisfies the user's specific requirement on document size.
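Putting the pieces together, the whole size-bounded procedure can be sketched as an outer loop over the remaining pages. This is an illustrative reading rather than the patent's definitive implementation: pages are modelled by hypothetical byte counts, document size is assumed to be their sum, and `split_by_size` is an invented name.

```python
from math import ceil
from typing import List, Sequence


def split_by_size(page_sizes: Sequence[int], threshold: int) -> List[List[int]]:
    """Split pages into consecutive subdocuments no larger than `threshold`."""
    outputs: List[List[int]] = []
    remaining = list(page_sizes)
    while remaining:
        if sum(remaining) <= threshold:      # S202: already small enough
            outputs.append(remaining)
            break
        first, second = remaining, []
        while sum(first) > threshold:        # S204 to S208: halve until it fits
            if len(first) == 1:
                raise ValueError("a single page exceeds the size threshold")
            half = ceil(len(first) / 2)
            first, second = first[:half], first[half:]
        while second:                        # S210: steps (1) to (5)
            n = ceil(len(second) / 2)
            candidate = first + second[:n]
            if sum(candidate) <= threshold:
                first, second = candidate, second[n:]
            else:
                second = second[:n] if n > 1 else []
        outputs.append(first)
        remaining = remaining[len(first):]   # continue with the leftover pages
    return outputs


parts = split_by_size([1] * 10, threshold=7)
```

For ten pages of size 1 and a threshold of 7, the sketch produces a seven-page subdocument followed by a three-page one, so the first output is as close to the threshold as the page boundaries allow.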
Further, when the target PDF source document uploaded by the user terminal is bisected for the first time, the sizes of the resulting first target document and second target document may both already be no greater than the document size threshold. To avoid unnecessary subsequent judgments and improve efficiency, before executing step S206, namely judging whether the size of the first target document is greater than the document size threshold, the method further comprises:
When the first target document and the second target document satisfy the first preset condition, determining the first target document and the second target document as output subdocuments; wherein the first preset condition is that the size of the first target document and the size of the second target document are each no greater than the document size threshold, and the number of pages of the first target document plus the number of pages of the second target document equals the number of pages of the target PDF source document uploaded by the user terminal.
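The first preset condition is a simple predicate, sketched below. The name `halves_are_final` and the sum-of-page-sizes model are assumptions made for illustration.

```python
from typing import Sequence


def halves_are_final(
    first: Sequence[int],
    second: Sequence[int],
    uploaded_page_count: int,
    threshold: int,
) -> bool:
    """Check the first preset condition before step S206.

    Both halves must fit under the threshold, and together they must
    cover every page of the document the user originally uploaded.
    """
    both_fit = sum(first) <= threshold and sum(second) <= threshold
    covers_upload = len(first) + len(second) == uploaded_page_count
    return both_fit and covers_upload


ok = halves_are_final([3, 3], [3], uploaded_page_count=3, threshold=6)
```

The page-count check restricts the shortcut to the very first bisection of the uploaded document; halves produced by later rounds cover fewer pages and fail it.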
Further, in practice the size of the target PDF source document or of the first target document may be greater than the document size threshold while that document has only 1 page and cannot be split further, a situation that does not fit the execution logic of the above method flow. The method therefore further comprises: when the target PDF source document satisfies the second preset condition or the first target document satisfies the third preset condition, sending preset error prompt information to the user terminal; wherein the second preset condition is that the size of the target PDF source document is greater than the document size threshold and the number of pages of the target PDF source document is 1, and the third preset condition is that the size of the first target document is greater than the document size threshold and the number of pages of the first target document is 1. In practice, based on the aforementioned online splitting management system, the preset error prompt information can be fed back to the user through the user interaction interface. The error prompt information may include the cause of the error, and its specific content can be set according to the actual situation, which is not limited here.
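The second and third preset conditions share the same shape, so a single check can serve both, as in the sketch below. The function name and the wording of the message are hypothetical; the patent leaves the content of the error prompt open.

```python
from typing import Optional, Sequence


def unsplittable_error(page_sizes: Sequence[int], threshold: int) -> Optional[str]:
    """Return an error prompt if a one-page document exceeds the threshold.

    Applies equally to the target PDF source document (second preset
    condition) and the first target document (third preset condition).
    Assumption: document size is the sum of the per-page sizes.
    """
    if len(page_sizes) == 1 and sum(page_sizes) > threshold:
        # Hypothetical message text; the actual content is configurable.
        return "A single page is larger than the requested size limit."
    return None


message = unsplittable_error([20], threshold=10)
```

A caller would send `message` to the user terminal when it is not `None` and continue splitting otherwise.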
Corresponding to the above method, an embodiment of the present invention provides a PDF document dividing device. Referring to Fig. 3, the device comprises:
A receiving module 302, configured to receive the target PDF source document uploaded by the user terminal and the input splitting operation parameter;
A document splitting module 304, configured to split the target PDF source document according to the splitting operation parameter to obtain the output subdocuments corresponding to the target PDF source document;
A sending module 306, configured to send the output subdocuments to the user terminal.
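The three modules of Fig. 3 could be wired together roughly as follows. Every class and function name here is an assumption made for illustration, documents are modelled as lists of per-page byte counts, and the splitter passed in is a deliberately trivial placeholder rather than the size-bounded procedure described above.

```python
from dataclasses import dataclass
from typing import Callable, List

Pages = List[int]  # assumption: one byte count per page
Splitter = Callable[[Pages, int], List[Pages]]


@dataclass
class SplitRequest:
    pages: Pages
    threshold: int


class ReceivingModule:
    """Accepts the uploaded document and the splitting parameter."""

    def receive(self, pages: Pages, threshold: int) -> SplitRequest:
        return SplitRequest(list(pages), threshold)


class DocumentSplittingModule:
    """Delegates the actual splitting to an injected strategy."""

    def __init__(self, splitter: Splitter) -> None:
        self._splitter = splitter

    def split(self, request: SplitRequest) -> List[Pages]:
        return self._splitter(request.pages, request.threshold)


class SendingModule:
    """Records what would be returned to the user terminal."""

    def __init__(self) -> None:
        self.sent: List[Pages] = []

    def send(self, subdocuments: List[Pages]) -> None:
        self.sent.extend(subdocuments)


def one_page_each(pages: Pages, threshold: int) -> List[Pages]:
    """Placeholder splitter: every page becomes its own subdocument."""
    return [[page] for page in pages]


receiver, sender = ReceivingModule(), SendingModule()
splitting = DocumentSplittingModule(one_page_each)
sender.send(splitting.split(receiver.receive([4, 4, 4], threshold=10)))
```

Injecting the splitter keeps the module boundaries of the device independent of which splitting operation parameter is in use.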
The embodiment of the present invention provides a PDF document dividing device that receives a target PDF source document uploaded by a user terminal and an input splitting operation parameter, splits the target PDF source document according to the splitting operation parameter to obtain the corresponding output subdocuments, and then sends the output subdocuments to the user terminal. Compared with the prior-art approach of splitting a PDF document by a fixed number of pages, the embodiment splits the PDF document according to user-defined parameters, is better suited to the user's specific splitting requirements, and effectively improves the degree of personalization of PDF document splitting.
The implementation principle and technical effects of the device provided by this embodiment are the same as those of the foregoing method embodiment. For brevity, where the device embodiment does not mention something, reference may be made to the corresponding content of the foregoing method embodiment.
Further, this embodiment also provides electronic equipment comprising a memory and a processor, the memory storing a computer program that can run on the processor, wherein the processor implements the steps of the above PDF document dividing method when executing the computer program.
Referring to the structural schematic diagram of electronic equipment shown in Fig. 4, the electronic equipment 400 comprises a processor 40, a memory 41, a bus 42 and a communication interface 43, where the processor 40, the communication interface 43 and the memory 41 are connected through the bus 42. The processor 40 is used to execute executable modules stored in the memory 41, such as computer programs.
The memory 41 may include high-speed random access memory (RAM) and may also include non-volatile memory, for example at least one disk memory. The communication connection between this system network element and at least one other network element is realized through at least one communication interface 43 (which may be wired or wireless), and may use the Internet, a wide area network, a local area network, a metropolitan area network and the like.
The bus 42 may be an ISA bus, a PCI bus, an EISA bus or the like. The bus may be divided into an address bus, a data bus, a control bus and so on. For ease of illustration, only one bidirectional arrow is shown in Fig. 4, but this does not mean that there is only one bus or one type of bus.
The memory 41 is used to store a program 401, and the processor 40 executes the program 401 after receiving an execution instruction. The method performed by the device defined by the flow disclosed in any of the foregoing embodiments of the present invention may be applied to the processor 40 or implemented by the processor 40.
The processor 40 may be an integrated circuit chip with signal processing capability. During implementation, each step of the above method may be completed by an integrated logic circuit of hardware in the processor 40 or by instructions in the form of software. The processor 40 may be a general-purpose processor, including a central processing unit (Central Processing Unit, CPU), a network processor (Network Processor, NP), and the like; it may also be a digital signal processor (Digital Signal Processor, DSP), an application-specific integrated circuit (Application Specific Integrated Circuit, ASIC), a field-programmable gate array (Field-Programmable Gate Array, FPGA) or another programmable logic device, a discrete gate or transistor logic device, or a discrete hardware component. It can implement or execute the methods, steps and logic block diagrams disclosed in the embodiments of the present invention. The general-purpose processor may be a microprocessor, or the processor may be any conventional processor. The steps of the method disclosed in the embodiments of the present invention may be executed directly by a hardware decoding processor, or by a combination of hardware and software modules in a decoding processor. The software module may be located in a storage medium that is mature in the art, such as random access memory, flash memory, read-only memory, programmable read-only memory, electrically erasable programmable memory, or a register. The storage medium is located in the memory 41; the processor 40 reads the information in the memory 41 and completes the steps of the above method in combination with its hardware.
Further, an embodiment of the present invention also provides a computer-readable storage medium storing a computer program which, when run by a processor, executes the steps of any of the above PDF document dividing methods. For the specific implementation, refer to the method embodiments; details are not repeated here.
The units described as separate components may or may not be physically separate, and the components shown as units may or may not be physical units; that is, they may be located in one place or distributed over multiple network units. Some or all of the units may be selected according to actual needs to achieve the purpose of the solution of this embodiment.
In addition, the functional units in the embodiments of the present invention may be integrated into one processing unit, each unit may exist physically alone, or two or more units may be integrated into one unit.
If the functions are implemented in the form of software functional units and sold or used as an independent product, they may be stored in a processor-executable non-volatile computer-readable storage medium. Based on this understanding, the technical solution of the present invention in essence, or the part that contributes to the prior art, or a part of the technical solution, may be embodied in the form of a software product. The computer software product is stored in a storage medium and includes several instructions that cause a computer device (which may be a personal computer, a server, a network device, or the like) to execute all or some of the steps of the methods described in the embodiments of the present invention. The aforementioned storage medium includes various media that can store program code, such as a USB flash drive, a removable hard disk, read-only memory (ROM, Read-Only Memory), random access memory (RAM, Random Access Memory), a magnetic disk, or an optical disc.
Finally, it should be noted that the embodiments described above are only specific embodiments of the present invention, intended to illustrate its technical solution rather than to limit it, and the protection scope of the present invention is not limited thereto. Although the present invention has been described in detail with reference to the foregoing embodiments, those of ordinary skill in the art should understand that anyone skilled in the art may, within the technical scope disclosed by the present invention, still modify the technical solutions recorded in the foregoing embodiments, readily conceive of variations, or make equivalent replacements of some of the technical features. Such modifications, variations or replacements do not cause the essence of the corresponding technical solutions to depart from the spirit and scope of the technical solutions of the embodiments of the present invention, and shall all be covered within the protection scope of the present invention. Therefore, the protection scope of the present invention shall be subject to the protection scope of the claims.
Claims (10)
1. A PDF document dividing method, characterized by comprising:
receiving a target PDF source document uploaded by a user terminal and a cutting operation parameter input at the user terminal;
splitting the target PDF source document according to the cutting operation parameter to obtain an output subdocument corresponding to the target PDF source document; and
sending the output subdocument to the user terminal.
2. The method according to claim 1, characterized in that the cutting operation parameter is a document size threshold, and the step of splitting the target PDF source document according to the cutting operation parameter to obtain the output subdocument corresponding to the target PDF source document comprises:
when the size of the target PDF source document is greater than the document size threshold, counting the number of pages of the target PDF source document;
when the number of pages of the target PDF source document is greater than 1, dividing the target PDF source document into a first target document and a second target document, wherein the number of pages of the first target document equals the number of pages of the second target document, or equals the number of pages of the second target document plus 1;
judging whether the size of the first target document is greater than the document size threshold;
if so, when the number of pages of the first target document is greater than 1, taking the first target document as the target PDF source document and re-executing the above step of splitting the target PDF source document according to the cutting operation parameter to obtain the output subdocument corresponding to the target PDF source document;
if not, determining the output subdocument according to the first target document and the second target document.
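The halving loop of claim 2 can be sketched in Python as follows. This is a minimal illustration under stated assumptions, not the patented implementation: a document is modeled as a list of per-page sizes in bytes, and the document size is assumed to be the sum of its page sizes (real PDF files share fonts and other resources, so their size is not strictly additive). The names `doc_size`, `halve` and `shrink_first_half` are hypothetical.

```python
from typing import List, Tuple

Doc = List[int]  # assumed model: one integer per page, its size in bytes


def doc_size(doc: Doc) -> int:
    """Assumed size model: the sum of the page sizes."""
    return sum(doc)


def halve(doc: Doc) -> Tuple[Doc, Doc]:
    """Split so the first part has as many pages as the second, or one more."""
    mid = (len(doc) + 1) // 2
    return doc[:mid], doc[mid:]


def shrink_first_half(source: Doc, threshold: int) -> Tuple[Doc, Doc]:
    """Halve repeatedly until the first part no longer exceeds the threshold.

    Returns the (first, second) pair produced by the last halving.
    """
    if doc_size(source) <= threshold:
        raise ValueError("source already fits; no split is needed")
    current = source
    while True:
        if len(current) == 1:
            # A single oversized page cannot be halved any further.
            raise ValueError("a single page exceeds the size threshold")
        first, second = halve(current)
        if doc_size(first) <= threshold:
            return first, second
        current = first  # treat the first part as the new source document


first, second = shrink_first_half([4] * 8, threshold=10)
print(first, second)  # [4, 4] [4, 4]
```

In the example, eight 4-byte pages are halved twice: the first half of four pages (16 bytes) is still too large, while the next half of two pages (8 bytes) fits under the threshold of 10.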
3. The method according to claim 2, characterized in that the step of determining the output subdocument according to the first target document and the second target document comprises:
removing the first N target pages of the second target document from the second target document to obtain a second current document, wherein the value of N is the number of pages of the second target document divided by 2 and rounded up;
adding the first N target pages to the first target document to obtain a first current document;
judging whether the size of the first current document is greater than the document size threshold;
if not, determining the output subdocument according to the number of pages of the second current document;
if so, determining the output subdocument according to the value of N.
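A single pass of the page-moving step in claim 3 might look like the sketch below. It assumes the same simplified model as before (a document is a list of per-page byte sizes, and size is their sum); the function name `move_front_pages` and the sample values are hypothetical.

```python
from typing import List, Tuple

Doc = List[int]  # assumed model: one integer per page, its size in bytes


def move_front_pages(first: Doc, second: Doc) -> Tuple[Doc, Doc, int]:
    """Move the first N pages of `second` to the end of `first`.

    N is the page count of `second` divided by 2 and rounded up.
    Returns (first_current, second_current, n).
    """
    n = (len(second) + 1) // 2  # integer form of ceil(len(second) / 2)
    first_current = first + second[:n]
    second_current = second[n:]
    return first_current, second_current, n


threshold = 12
first_current, second_current, n = move_front_pages([4, 4], [1, 2, 3, 4, 5])
too_big = sum(first_current) > threshold
print(n, first_current, second_current, too_big)
# 3 [4, 4, 1, 2, 3] [4, 5] True
```

Here the second document has 5 pages, so N is 3. The enlarged first document weighs 14 bytes, which exceeds the threshold of 12, so the method would continue along the "if so" branch and decide based on the value of N.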
4. The method according to claim 3, characterized in that the step of determining the output subdocument according to the number of pages of the second current document comprises:
when the number of pages of the second current document is greater than 1, taking the first current document as the first target document and the second current document as the second target document, and then re-executing the above step of determining the output subdocument according to the first target document and the second target document;
when the number of pages of the second current document is 1, determining the first current document as a first output subdocument, and removing the pages corresponding to the first output subdocument from the target PDF source document uploaded by the user terminal to obtain a first PDF source document;
taking the first PDF source document as the target PDF source document and re-executing the above step of splitting the target PDF source document according to the cutting operation parameter to obtain the output subdocument corresponding to the target PDF source document.
5. The method according to claim 3, characterized in that the step of determining the output subdocument according to the value of N comprises:
when the value of N is greater than 1, combining the first N target pages into the second target document, and then re-executing the above step of determining the output subdocument according to the first target document and the second target document;
when the value of N is 1, determining the first target document as a second output subdocument;
removing the pages corresponding to the second output subdocument from the target PDF source document uploaded by the user terminal to obtain a second PDF source document;
taking the second PDF source document as the target PDF source document and re-executing the above step of splitting the target PDF source document according to the cutting operation parameter to obtain the output subdocument corresponding to the target PDF source document.
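Taken together, claims 2 to 5 describe a search for the longest run of leading pages that stays within the size threshold, repeated on whatever pages remain. The sketch below is one possible reading, not the patented implementation. It assumes a simplified model in which a document is a list of per-page byte sizes and document size is their sum; the names `halve`, `grow_first` and `split_by_size` are hypothetical. The early exit of claim 6 is left out, and a single oversized page simply raises `ValueError`.

```python
from typing import List, Tuple

Doc = List[int]  # assumed model: one integer per page, its size in bytes


def halve(doc: Doc) -> Tuple[Doc, Doc]:
    """First part has as many pages as the second part, or one more."""
    mid = (len(doc) + 1) // 2
    return doc[:mid], doc[mid:]


def grow_first(first: Doc, second: Doc, threshold: int) -> Doc:
    """Extend `first` with leading pages of `second` while it still fits."""
    while second:
        n = (len(second) + 1) // 2          # N: half of second, rounded up
        moved, rest = second[:n], second[n:]
        candidate = first + moved           # the "first current document"
        if sum(candidate) <= threshold:
            if len(rest) <= 1:
                return candidate            # claim 4: output found
            first, second = candidate, rest
        elif n == 1:
            return first                    # claim 5: output found
        else:
            second = moved                  # claim 5: retry within the N pages
    return first


def split_by_size(source: Doc, threshold: int) -> List[Doc]:
    """Split `source` into consecutive parts that each fit the threshold."""
    outputs: List[Doc] = []
    remaining = list(source)
    while remaining:
        if sum(remaining) <= threshold:
            outputs.append(remaining)       # the rest fits as one part
            break
        first, second = remaining, []
        while sum(first) > threshold:       # claim 2: halve until first fits
            if len(first) == 1:
                raise ValueError("a single page exceeds the size threshold")
            first, second = halve(first)
        output = grow_first(first, second, threshold)
        outputs.append(output)
        remaining = remaining[len(output):]  # remove the emitted pages
    return outputs


print(split_by_size([1] * 10, threshold=7))
# [[1, 1, 1, 1, 1, 1, 1], [1, 1, 1]]
```

Under this model, `first` plus `second` always exceeds the threshold when `grow_first` starts and after every update, which is why the search can stop once only one untested page is left in the second current document: adding that page would recreate a document already known to be too large.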
6. The method according to claim 2, characterized in that, before judging whether the size of the first target document is greater than the document size threshold, the method further comprises:
when the first target document and the second target document satisfy a first preset condition, determining the first target document and the second target document as output subdocuments, wherein the first preset condition is: the size of the first target document and the size of the second target document are each no greater than the document size threshold, and the sum of the number of pages of the first target document and the number of pages of the second target document equals the number of pages of the target PDF source document uploaded by the user terminal.
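The first preset condition of claim 6 reduces to a two-part predicate, sketched below under the same simplifying assumption that a document is a list of per-page byte sizes whose sum is the document size. The function name `meets_first_condition` and its parameter names are hypothetical.

```python
from typing import List

Doc = List[int]  # assumed model: one integer per page, its size in bytes


def meets_first_condition(
    first: Doc, second: Doc, uploaded_page_count: int, threshold: int
) -> bool:
    """True when both halves fit and together cover every uploaded page."""
    both_fit = sum(first) <= threshold and sum(second) <= threshold
    covers_upload = len(first) + len(second) == uploaded_page_count
    return both_fit and covers_upload


print(meets_first_condition([3, 3], [3, 2], uploaded_page_count=4, threshold=6))  # True
print(meets_first_condition([3, 3], [3, 2], uploaded_page_count=8, threshold=6))  # False
```

On one reading, the page-count test limits this shortcut to a halving that covers the whole uploaded document, so that both halves can be returned directly without any further page moving.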
7. The method according to claim 2, characterized in that the method further comprises:
when the target PDF source document satisfies a second preset condition or the first target document satisfies a third preset condition, sending preset error prompt information to the user terminal;
wherein the second preset condition is: the size of the target PDF source document is greater than the document size threshold, and the number of pages of the target PDF source document is 1; and the third preset condition is: the size of the first target document is greater than the document size threshold, and the number of pages of the first target document is 1.
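The two error conditions of claim 7 both describe a single page that is larger than the threshold and therefore cannot be split by page. A minimal sketch follows, again assuming documents are lists of per-page byte sizes; the function name `error_prompt_for` and the prompt text `ERROR_PROMPT` are hypothetical placeholders, since the source does not give the wording of the preset prompt.

```python
from typing import List, Optional

Doc = List[int]  # assumed model: one integer per page, its size in bytes

# Hypothetical wording; the source only says the prompt is preset.
ERROR_PROMPT = "A single page exceeds the size threshold and cannot be split."


def error_prompt_for(
    source: Doc, first: Optional[Doc], threshold: int
) -> Optional[str]:
    """Return the preset prompt if the second or third condition holds."""
    second_condition = sum(source) > threshold and len(source) == 1
    third_condition = (
        first is not None and sum(first) > threshold and len(first) == 1
    )
    return ERROR_PROMPT if second_condition or third_condition else None


print(error_prompt_for([20], None, threshold=10) is not None)      # True
print(error_prompt_for([20, 6], [20], threshold=10) is not None)   # True
print(error_prompt_for([4, 4], [4], threshold=10) is not None)     # False
```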
8. A PDF document dividing device, characterized by comprising:
a receiving module, configured to receive a target PDF source document uploaded by a user terminal and a cutting operation parameter input at the user terminal;
a document segmentation module, configured to split the target PDF source document according to the cutting operation parameter to obtain an output subdocument corresponding to the target PDF source document; and
a sending module, configured to send the output subdocument to the user terminal.
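The three modules of claim 8 can be pictured as three pluggable callables wired in sequence. The sketch below is illustrative only: the class name `PdfDividingDevice`, its field names, and the stub splitter (which cuts a fixed number of pages per part and is not the size-threshold method of the claims) are all assumptions, and a document is modeled as a plain list of page values.

```python
from dataclasses import dataclass
from typing import Callable, List, Tuple

Doc = List[int]  # assumed model: one integer per page


@dataclass
class PdfDividingDevice:
    """Hypothetical wiring of the receiving, segmentation and sending modules."""

    receive: Callable[[], Tuple[Doc, int]]   # returns (source, parameter)
    split: Callable[[Doc, int], List[Doc]]   # returns the output subdocuments
    send: Callable[[List[Doc]], None]        # delivers them to the user terminal

    def run(self) -> None:
        source, parameter = self.receive()
        self.send(self.split(source, parameter))


def split_every(doc: Doc, pages_per_part: int) -> List[Doc]:
    """Placeholder splitter: fixed page count per part."""
    return [doc[i:i + pages_per_part] for i in range(0, len(doc), pages_per_part)]


sent: List[List[Doc]] = []  # stands in for the user terminal
device = PdfDividingDevice(
    receive=lambda: ([1, 2, 3, 4], 2),
    split=split_every,
    send=sent.append,
)
device.run()
print(sent)  # [[[1, 2], [3, 4]]]
```

Keeping the splitter behind a callable means the cutting operation parameter can take different forms (a size threshold, a page count) without changing the receiving or sending side.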
9. An electronic device, characterized by comprising a memory and a processor, wherein the memory stores a computer program that can run on the processor, and the processor implements the steps of the method according to any one of claims 1 to 7 when executing the computer program.
10. A computer-readable storage medium storing a computer program, characterized in that the computer program, when run by a processor, executes the steps of the method according to any one of claims 1 to 7.
Priority Applications (1)
Application Number | Priority Date | Filing Date | Title |
---|---|---|---|
CN201811502370.XA CN109597980A (en) | 2018-12-07 | 2018-12-07 | PDF document dividing method, device and electronic equipment |
Applications Claiming Priority (1)
Application Number | Priority Date | Filing Date | Title |
---|---|---|---|
CN201811502370.XA CN109597980A (en) | 2018-12-07 | 2018-12-07 | PDF document dividing method, device and electronic equipment |
Publications (1)
Publication Number | Publication Date |
---|---|
CN109597980A true CN109597980A (en) | 2019-04-09 |
Family ID=65962069
Family Applications (1)
Application Number | Title | Priority Date | Filing Date |
---|---|---|---|
CN201811502370.XA Pending CN109597980A (en) | 2018-12-07 | 2018-12-07 | PDF document dividing method, device and electronic equipment |
Country Status (1)
Country | Link |
---|---|
CN (1) | CN109597980A (en) |
Cited By (2)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
CN111680491A (en) * | 2020-05-27 | 2020-09-18 | 北京字节跳动科技有限公司 | Document information extraction method and device and electronic equipment |
CN112036123A (en) * | 2020-08-31 | 2020-12-04 | 北京奇虎鸿腾科技有限公司 | PDF (Portable document Format) generation method, device and equipment based on webpage and storage medium |
Citations (8)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
JP2007272339A (en) * | 2006-03-30 | 2007-10-18 | Canon Inc | Electronic document processing system |
CN102855224A (en) * | 2011-06-30 | 2013-01-02 | 北大方正集团有限公司 | Display method and display device of electronic documents |
CN103377175A (en) * | 2012-04-26 | 2013-10-30 | Sap股份公司 | Structured document converting based on partition |
CN103455534A (en) * | 2013-04-28 | 2013-12-18 | 北界创想(北京)软件有限公司 | Document clustering method and device |
CN103544262A (en) * | 2013-10-16 | 2014-01-29 | 银江股份有限公司 | XML-based stream page release method and system |
CN103593333A (en) * | 2013-10-16 | 2014-02-19 | 小米科技有限责任公司 | Electronic book document processing method, terminal and electronic equipment |
CN106599183A (en) * | 2016-12-13 | 2017-04-26 | 北京致远互联软件股份有限公司 | Document online previewing method and system |
CN107391478A (en) * | 2017-08-15 | 2017-11-24 | 北京北信源软件股份有限公司 | A kind of online document edit methods and device |
Non-Patent Citations (2)
Title |
---|
Li Lanyou et al.: "Web-oriented PDF document construction technology", 《Computer and Modernization》 * |
Baike Quanshuo: "How can a PDF file be split?", 《HTTPS://WWW.BKQS.COM.CN/CONTENT/X34056LPK.HTML》 * |
Cited By (3)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
CN111680491A (en) * | 2020-05-27 | 2020-09-18 | 北京字节跳动科技有限公司 | Document information extraction method and device and electronic equipment |
CN111680491B (en) * | 2020-05-27 | 2024-02-02 | 北京字跳网络技术有限公司 | Method and device for extracting document information and electronic equipment |
CN112036123A (en) * | 2020-08-31 | 2020-12-04 | 北京奇虎鸿腾科技有限公司 | PDF (Portable document Format) generation method, device and equipment based on webpage and storage medium |
Similar Documents
Publication | Publication Date | Title |
---|---|---|
CN107766328B (en) | Text information extraction method of structured text, storage medium and server | |
CN106961454A (en) | Document down loading method, device and terminal device | |
CN109241003B (en) | File management method and device | |
CN109597980A (en) | PDF document dividing method, device and electronic equipment | |
CN104731645A (en) | Task scheduling method and device and data downloading method and device | |
CN107657030A (en) | Collect method, apparatus, terminal device and storage medium that user reads data | |
CN105550179A (en) | Webpage collection method and browser plug-in | |
CN112235422B (en) | Data processing method and device, computer readable storage medium and electronic device | |
CN105247481A (en) | Web page output selection | |
CN113407254A (en) | Form generation method and device, electronic equipment and storage medium | |
CN107870921B (en) | Log data processing method and device | |
CN106021582B (en) | Method for filtering position information, method and device for extracting effective webpage information | |
CN107911315B (en) | Message classification method and network equipment | |
CN111414395A (en) | Data processing method, system and computer equipment | |
CN109063142B (en) | Webpage resource pushing method, server and storage medium | |
CN109033189B (en) | Compression method and device of link structure log, server and readable storage medium | |
CN110634018A (en) | Feature depiction method, recognition method and related device for lost user | |
CN115935909A (en) | File generation method and device and electronic equipment | |
US10459983B2 (en) | Method and device of hierarchical document filtering | |
US20140215328A1 (en) | Method, terminal, and server for displaying file | |
CN113746932A (en) | Network request merging method and device, electronic device and computer program product | |
CN104933055B (en) | Webpage identification method and webpage identification device | |
CN109840080B (en) | Character attribute comparison method and device, storage medium and electronic equipment | |
CN111144509B (en) | Method, device and computer for classifying system application programs | |
CN110928902A (en) | Query method and system for acquiring cloud platform terminal data aiming at paging |
Legal Events
Code | Title | Description |
---|---|---|
PB01 | Publication | |
SE01 | Entry into force of request for substantive examination | |
RJ01 | Rejection of invention patent application after publication | Application publication date: 20190409 |