CN115048339A - Method and device for efficiently browsing pdf document - Google Patents


Publication number
CN115048339A
Authority
CN
China
Prior art keywords
record
pdf
mark
document
user
Prior art date
Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
Granted
Application number
CN202210443657.XA
Other languages
Chinese (zh)
Other versions
CN115048339B (en
Inventor
林礼挺
Current Assignee (The listed assignees may be inaccurate. Google has not performed a legal analysis and makes no representation or warranty as to the accuracy of the list.)
Wuhan Feiyu Technology Co ltd
Original Assignee
Wuhan Feiyu Technology Co ltd
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
Filing date
Publication date
Application filed by Wuhan Feiyu Technology Co ltd filed Critical Wuhan Feiyu Technology Co ltd
Priority to CN202210443657.XA priority Critical patent/CN115048339B/en
Publication of CN115048339A publication Critical patent/CN115048339A/en
Application granted granted Critical
Publication of CN115048339B publication Critical patent/CN115048339B/en
Active legal-status Critical Current
Anticipated expiration legal-status Critical

Classifications

    • G PHYSICS
    • G06 COMPUTING; CALCULATING OR COUNTING
    • G06F ELECTRIC DIGITAL DATA PROCESSING
    • G06F16/00 Information retrieval; Database structures therefor; File system structures therefor
    • G06F16/10 File systems; File servers
    • G06F16/14 Details of searching files based on file metadata
    • G PHYSICS
    • G06 COMPUTING; CALCULATING OR COUNTING
    • G06F ELECTRIC DIGITAL DATA PROCESSING
    • G06F16/00 Information retrieval; Database structures therefor; File system structures therefor
    • G06F16/10 File systems; File servers
    • G06F16/13 File access structures, e.g. distributed indices
    • G PHYSICS
    • G06 COMPUTING; CALCULATING OR COUNTING
    • G06F ELECTRIC DIGITAL DATA PROCESSING
    • G06F16/00 Information retrieval; Database structures therefor; File system structures therefor
    • G06F16/10 File systems; File servers
    • G06F16/17 Details of further file system functions
    • G06F16/1727 Details of free space management performed by the file system
    • H ELECTRICITY
    • H04 ELECTRIC COMMUNICATION TECHNIQUE
    • H04L TRANSMISSION OF DIGITAL INFORMATION, e.g. TELEGRAPHIC COMMUNICATION
    • H04L63/00 Network architectures or network communication protocols for network security
    • H04L63/08 Network architectures or network communication protocols for network security for authentication of entities
    • H04L63/083 Network architectures or network communication protocols for network security for authentication of entities using passwords

Abstract

The invention relates to the technical field of computer document application, and provides a method and a device for efficiently browsing pdf documents. When a first user logs in to an application, a default account login process is completed using the account and password from the first user's previous login. When the first user opens a pdf document, the application obtains the pdf document name and searches a server for records containing the same or similar keywords as those contained in the name. After the application obtains one or more records, it generates a record list in the application window for the operator to select one or more records. After the first user selects a record, the marked content is loaded into the document currently open in the application according to the pdf document page number, the mark type, the mark start coordinate value and the mark end coordinate value under that record. The method and the device allow marks made during earlier browsing of a pdf document to be inherited, and so improve the efficiency of pdf document browsing.

Description

Method and device for efficiently browsing pdf document
Technical Field
The invention relates to the technical field of computer document application, in particular to a method and a device for efficiently browsing pdf documents.
Background
PDF is short for Portable Document Format, a file format developed by Adobe Systems for exchanging files independently of application programs, operating systems, and hardware. PDF files are based on the PostScript imaging model and are intended to give accurate colors and accurate printing results on any printer, i.e., a PDF faithfully reproduces every character, color, and image of the original.
PDF is popular across platforms because it can carry watermarks and, compared with Word documents, resists editing. It is therefore widely used as a secondary format for storing publicly released documents; for example, the national intellectual property office mostly publishes patent documents, examination opinion documents and the like as PDF files or images. PDF is also highly compatible with image conversion, that is, original images can be converted into a complete PDF document.
In many existing pdf reading tools, marks are applied as an independent overlay: the marks are loaded on top of the pdf document content to produce the marking effect. Marking is a necessary function when browsing documents, especially when a document is long and its key points are scattered. In the prior art, however, pdf documents are mostly browsed by each user individually, which is inefficient and fails to deliver a genuinely efficient browsing experience for pdf reading tools in specific application scenarios.
In view of the above, overcoming the drawbacks of the prior art is an urgent problem in the art.
Disclosure of Invention
The technical problem addressed by the invention arises in fields such as intellectual property, where published patent documents that the public can download from official or third-party channels are highly consistent in content. The invention particularly considers the scenario in which a pdf patent document is downloaded and handled as a comparison file. In that scenario, during analysis of the review opinion and of the patent, the downloaded comparison file is marked extensively. However, in the prior art, and especially in patent agencies, agent departures and temporary reassignments make it very likely that the first-round and second-round review opinion responses for one patent are handled by different agents; where a third round exists, all three responses may be completed by different agents. The documents available to an agent who takes over midway are mostly the amendment comparison pages and the opinion statement backed up on the agency system. The matching comparison file, whether because of agency rules or because of limited server storage, is in practice often not preserved. The agent taking over must then download the comparison file and read it from scratch, whereas the work would be far more efficient if the comparison file could be browsed on the basis of the marks made for the previous response. This is the prior-art problem scenario that the invention is intended to solve.
A further technical problem to be solved by the invention is how, given that different people have different working habits, to transfer the "marking" information to the server provided by the invention as far as possible for agents who prefer to print the comparison file and work on paper.
The invention adopts the following technical scheme:
in a first aspect, the present invention provides a method for efficiently browsing pdf documents, including:
when a first user logs in an application, a default account login process is completed by using the account and the password of the first user historical login;
when a first user opens a pdf document, an application acquires the pdf document name, and searches a record containing the same or similar keywords in a server according to the keywords contained in the pdf document name;
the record is composed of one or more items of pdf document page number, mark type, mark start coordinate value and mark end coordinate value, and the corresponding record is also associated with account information and time of a mark user;
after the application acquires one or more records, generating a record list in an application window interface for an operator to select one or more records;
and after the first user selects one record, loading the marked content into the document opened by the current application according to the page number, the mark type, the mark start coordinate value and the mark end coordinate value of the pdf document under the corresponding record.
Preferably, the method further comprises:
when the first user saves the modification of the current pdf document, the corresponding relevant mark existing in the current document is recorded locally in a log mode; synchronizing the records to a server according to a preassigned synchronization mode;
if the first user performs the storage under the condition that the mark is newly added after loading the history record of the first user, the server replaces the history record of the user under the same pdf document name in a covering mode;
if the first user also includes records of other account numbers in the process of loading the records, the server further analyzes newly added items in the records stored by the current user and record items of historical other account numbers, and performs record storage in a mode of combining the newly added items and historical other account number information.
Preferably, when the first user selects a plurality of records and further chooses merged presentation, the marked content of the selected records is loaded into the document opened by the current application according to the page number, the mark type, the mark start coordinate value and the mark end coordinate value of the pdf document under each record;
when the first user selects a plurality of records and further chooses independent presentation, a corresponding number of pdf document copies is made according to the number of records, and the marked content under each record is loaded into the corresponding pdf document copy according to the page number, the mark type, the mark start coordinate value and the mark end coordinate value of the pdf document under that record.
Preferably, the pdf file name includes the default name given when the pdf document is downloaded from a specified website, and any keywords the user has added to the default name; searching the server for records containing the same or similar keywords according to the keywords contained in the pdf document name specifically comprises:
taking one or more types of character strings contained in the pdf file name as one or more keywords, and searching the server for records containing the same or similar keywords;
further, extracting one or more keywords from the name of the folder in which the pdf document is located, the folder being named according to a preset specification, screening the obtained records with them, and using the screening result as the record content finally presented in the record list;
the history record also includes the folder name and/or path name of the corresponding file.
Preferably, the folder name includes one or more of a patent application number, a patent name, the round number of the review opinion response, an agency case number, and an enterprise case number.
Preferably, after the first user opens one or more pdf documents as a comparison document and completes the composition of content in a corresponding opinion statement reply document, the method further comprises:
selecting, on the pdf document operation interface, an operation of loading an opinion statement reply document;
obtaining, through semantic recognition, the paragraph descriptions in the opinion statement reply document that relate to the current pdf document, and locating the corresponding described content in the current pdf document;
determining whether marked content already exists at the located position; if so, skipping to locate the next described position; if not, marking it in a default marking mode; the default marking mode comprises one or more of a box mark, a horizontal line mark, a wavy line mark and an annotation box mark;
the marked content newly generated from the opinion statement reply document is also added to the first user's record for the current pdf document.
Preferably, when the content of each record stored in the server includes matched case information, the method further includes:
the server receives a closed-case list imported by an operator, wherein the closed-case list comprises authorized case information and/or rejected case information;
the server searches the locally stored records according to the keywords contained in the case information in the closed-case list;
and when the case associated with a record has been closed, the server clears the corresponding one or more records from its local storage.
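The cleanup described above can be sketched as follows. This is a minimal illustration only: the record layout (a dict with a `case_number` field), the function name `purge_closed_cases`, and the exact-match comparison are assumptions made for the example, whereas the text itself speaks more generally of searching by keywords contained in the case information.

```python
from typing import Dict, Iterable, List, Tuple


def purge_closed_cases(
    records: List[Dict[str, str]], closed_cases: Iterable[str]
) -> Tuple[List[Dict[str, str]], int]:
    """Drop records whose case appears in the imported list of closed cases.

    `records` and the `case_number` field are hypothetical; a real server
    would query its own storage instead of filtering an in-memory list.
    """
    # Normalize so that "ag2022-0153 " and "AG2022-0153" compare equal.
    closed = {case.strip().upper() for case in closed_cases}
    kept = [r for r in records if r["case_number"].strip().upper() not in closed]
    return kept, len(records) - len(kept)


if __name__ == "__main__":
    stored = [
        {"case_number": "AG2022-0153", "document": "CN112819673A"},
        {"case_number": "AG2022-0207", "document": "CN112819673A"},
    ]
    remaining, removed = purge_closed_cases(stored, ["ag2022-0153"])
    print(removed, [r["case_number"] for r in remaining])
```

Matching on a single normalized case number keeps the sketch short; a keyword search over several case fields would replace the set lookup.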
In a second aspect, the present invention further provides an apparatus for efficiently browsing pdf documents, which is used to implement the method for efficiently browsing pdf documents in the first aspect, and the apparatus includes:
at least one processor; and a memory communicatively coupled to the at least one processor; wherein the memory stores instructions executable by the at least one processor for performing the method of the first aspect as applied to efficient browsing of pdf documents.
In a third aspect, the present invention also provides a non-transitory computer storage medium storing computer-executable instructions for execution by one or more processors for performing the method of the first aspect as applied to efficient browsing of pdf documents.
The invention exploits the fact that, in the patent field, patent documents downloaded from official or third-party platforms are highly consistent. Using the patent name and/or publication number in the downloaded file name as the index, it can reliably associate a patent document with the marks that other agents made in the same comparison file during earlier responses. The corresponding records stored on the server are then projected onto the comparison file downloaded by the current agent, so that the reading focus of earlier agents is presented effectively to the agent taking over midway. This approach also keeps the storage space used on the server to a minimum and leaves room for subsequent optimization.
The preferred scheme of the invention further exploits the writing format of opinion statements. Even if an earlier agent habitually worked on printed paper, the reasoned arguments in the statement still cite the paragraph numbers of the comparison file. The preferred scheme therefore extracts, by semantic learning, the paragraph descriptions in the opinion statement that relate to the comparison file and generates the corresponding records from them. This covers agents whose working habits the basic scheme would otherwise miss, and makes the technical scheme highly practical.
Drawings
In order to more clearly illustrate the technical solutions of the embodiments of the present invention, the drawings required to be used in the embodiments of the present invention will be briefly described below. It is obvious that the drawings described below are only some embodiments of the invention, and that for a person skilled in the art, other drawings can be derived from them without inventive effort.
Fig. 1 is a flow chart of an efficient browsing method applied to pdf documents according to an embodiment of the present invention;
fig. 2 is a schematic flow chart of another efficient browsing method applied to pdf documents according to an embodiment of the present invention;
fig. 3 is a schematic diagram illustrating an interface effect of an efficient browsing method applied to a pdf document according to an embodiment of the present invention;
fig. 4 is a schematic diagram illustrating an interface effect of an efficient browsing method applied to a pdf document according to an embodiment of the present invention;
fig. 5 is a flowchart of a further method for efficiently browsing pdf documents according to an embodiment of the present invention;
fig. 6 is a flowchart of a further method for efficiently browsing pdf documents according to an embodiment of the present invention;
fig. 7 is a schematic structural diagram of an efficient browsing apparatus for pdf documents according to an embodiment of the present invention.
Detailed Description
In order to make the objects, technical solutions and advantages of the present invention more apparent, the present invention is further described in detail below with reference to the accompanying drawings and embodiments. It should be understood that the specific embodiments described herein are merely illustrative of the invention and do not limit the invention.
After in-depth study of the intellectual property field, the applicant confirmed that published patent documents that the public can download from official or third-party channels are highly consistent in content. In particular, consider the scenario in which a pdf patent document is downloaded and handled as a comparison file: during analysis of the review opinion and of the patent, the downloaded comparison file is marked extensively.
However, in the prior art, and especially in patent agencies, agent departures and temporary reassignments make it very likely that the first-round and second-round review opinion responses for one patent are handled by different agents; where a third round exists, all three responses may be completed by different agents. The documents available to an agent who takes over midway are mostly the amendment comparison pages and the opinion statement backed up on the agency system. The matching comparison file, whether because of agency rules or because of limited server storage, is in practice often not preserved. The agent taking over must then download the comparison file and read it from scratch, whereas the work would be far more efficient if the comparison file could be browsed on the basis of the marks made for the previous response. This is the prior-art problem scenario that the invention is intended to solve.
In addition, the technical features involved in the embodiments of the present invention described below may be combined with each other as long as they do not conflict with each other.
Example 1:
embodiment 1 of the present invention provides an efficient browsing method applied to pdf documents, as shown in fig. 1, including:
in step 201, when the first user logs in the application, the default account login process is completed by using the account and the password of the first user historical login.
Because the application scenario targeted by the embodiment of the invention is a specific one such as a patent agency, the account and password can be registered uniformly by the agency according to each agent's information. Even if an agent later leaves the agency, and whether the agency's local server or a server provided by the pdf reader vendor is used, the account information still makes it possible to confirm quickly who the historical agent was. With the technical scheme of the invention, when a record on the server side is loaded, its reference value can be prejudged from the matched historical account information; for example, the more capable the historical agent, the higher the reference value of the loaded record, and conversely the more the receiving agent has to rely on their own reading.
In step 202, when the first user opens the pdf document, the application obtains the pdf document name, and searches the server for a record containing the same or similar keywords according to the keywords contained in the pdf document name.
For a typical patent document, possible pdf document names include, for example, "comparison file 1CN 112819673A", "CN 112819673A", "CN 202110195736.9 based on an information platform of a cloud six-terminal framework", or "comparison file 1CN202110195736.9 based on an information platform of a cloud six-terminal framework". Once the first user's own edits are taken into account, the name can take many forms. However, for the server to match the corresponding record when the pdf document is opened, the file name must contain a unique identifier of the pdf document, such as the publication number "CN 112819673A" or the application number "CN 202110195736.9" mentioned above. In the following embodiments of the present invention, the publication number is used as the valid unique identifier. As an alternative, a multi-element combined search under a naming rule may be adopted to further improve accuracy; this is not described in detail here.
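As an illustration of pulling the unique identifier out of such file names, the sketch below uses two regular expressions. The patterns and the function name `extract_identifiers` are assumptions inferred only from the example names above (a nine-digit CN publication number followed by a kind code, and a twelve-digit CN application number followed by a check character); they are not a complete description of real numbering formats.

```python
import re
from typing import Dict

# Hypothetical patterns inferred from the example file names in the text.
PUBLICATION_RE = re.compile(r"CN\s?(\d{9})\s?([A-Z]\d?)")
APPLICATION_RE = re.compile(r"CN\s?(\d{12}\.[\dX])")


def extract_identifiers(file_name: str) -> Dict[str, str]:
    """Return the normalized identifiers found in a pdf file name."""
    found: Dict[str, str] = {}
    publication = PUBLICATION_RE.search(file_name)
    if publication:
        # Remove optional spaces so "CN 112819673A" and "CN112819673A" match.
        found["publication_number"] = "CN" + publication.group(1) + publication.group(2)
    application = APPLICATION_RE.search(file_name)
    if application:
        found["application_number"] = "CN" + application.group(1)
    return found


if __name__ == "__main__":
    print(extract_identifiers("comparison file 1CN 112819673A.pdf"))
    print(extract_identifiers("CN 202110195736.9 based on an information platform.pdf"))
```

Normalizing away the optional space means that a name edited by the user and the default downloaded name map to the same search key.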
The record is composed of one or more items of pdf document page number, mark type, mark start coordinate value and mark end coordinate value, and the corresponding record is also associated with account information and time of a mark user. The account information is preferably formed by a full spelling of the name of the agent, and the corresponding time is the time when the corresponding record is completed, not the time uploaded to the server.
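One possible in-memory shape for such a record is sketched below. The class and field names (`Mark`, `Record`, `document_key`, `completed_at`) and the sample account are illustrative assumptions; the text fixes only which pieces of information a record carries.

```python
from dataclasses import dataclass, field
from datetime import datetime
from typing import List, Tuple

Point = Tuple[float, float]


@dataclass(frozen=True)
class Mark:
    page: int        # pdf document page number
    mark_type: str   # e.g. "box", "underline", "wavy", "annotation"
    start: Point     # mark start coordinate value
    end: Point       # mark end coordinate value


@dataclass
class Record:
    document_key: str        # unique identifier, e.g. the publication number
    account: str             # account information of the marking user
    completed_at: datetime   # when the record was completed, not uploaded
    marks: List[Mark] = field(default_factory=list)

    def marks_on_page(self, page: int) -> List[Mark]:
        """Return the marks to overlay when the given page is rendered."""
        return [m for m in self.marks if m.page == page]


if __name__ == "__main__":
    record = Record(
        document_key="CN112819673A",
        account="zhangsan",  # hypothetical account name
        completed_at=datetime(2022, 4, 26, 9, 30),
        marks=[
            Mark(3, "box", (50.0, 100.0), (500.0, 140.0)),
            Mark(5, "underline", (50.0, 300.0), (420.0, 300.0)),
        ],
    )
    print(record.marks_on_page(3))
```

Because a record holds only page numbers, types and coordinates, it stays small compared with a marked-up copy of the pdf itself.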
In step 203, after the application acquires one or more records, a record list is generated in an application window interface for an operator to select one or more of the records.
In the specific presentation process, the account information, the time information, the patent name, the publication number, and other information may be presented together, or if the content is more, the information may be presented by a dragging bar or a window shrinking/expanding manner, which is not described herein in detail.
In step 204, after the first user selects one record each time, the markup content is loaded into the document opened by the current application according to the page number, the markup type, the markup start coordinate value and the markup end coordinate value of the pdf document under the corresponding record.
The embodiment of the invention exploits the fact that, in the patent field, patent documents downloaded from official or third-party platforms are highly consistent. Using the patent name and/or publication number in the downloaded file name as the index, it can reliably associate a patent document with the marks that other agents made in the same comparison file during earlier responses. The corresponding records stored on the server are then projected onto the comparison file downloaded by the current agent, so that the reading focus of earlier agents is presented effectively to the agent taking over midway. This approach also keeps the storage space used on the server to a minimum and leaves room for subsequent optimization.
For completeness, steps 201 to 204 cover the opening stage; in practice a saving and closing stage is also involved, so, as shown in fig. 2, the method further includes:
in step 205, while the first user saves the modifications to the current pdf document, the corresponding relevant tags existing in the current document are logged locally; and synchronizing the records to a server according to a pre-specified synchronization mode.
The pre-specified synchronization mode may be: synchronizing to the server as soon as the terminal comes online; uploading at a fixed time, such as 10 o'clock every Monday; or synchronizing when the user actively triggers it. This is not limited herein.
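The three synchronization modes can be sketched as a single decision function. The names `SyncMode` and `should_sync`, the parameters, and the default of Monday at hour 10 are assumptions made for this example rather than details given in the text.

```python
from datetime import datetime
from enum import Enum


class SyncMode(Enum):
    ON_CONNECT = "on_connect"  # synchronize as soon as the terminal is online
    WEEKLY = "weekly"          # synchronize at a fixed weekly time
    MANUAL = "manual"          # synchronize only when the user asks


def should_sync(
    mode: SyncMode,
    now: datetime,
    online: bool,
    has_pending_log: bool,
    user_requested: bool = False,
    weekday: int = 0,  # 0 = Monday, as returned by datetime.weekday()
    hour: int = 10,
) -> bool:
    """Decide whether the locally logged records should be uploaded now."""
    if not online or not has_pending_log:
        return False
    if mode is SyncMode.ON_CONNECT:
        return True
    if mode is SyncMode.WEEKLY:
        return now.weekday() == weekday and now.hour == hour
    return user_requested


if __name__ == "__main__":
    monday_morning = datetime(2022, 4, 25, 10, 5)  # 25 April 2022 was a Monday
    print(should_sync(SyncMode.WEEKLY, monday_morning, online=True, has_pending_log=True))
```

Checking `has_pending_log` first reflects that marks are logged locally on save, so there is nothing to upload until a save has happened.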
In step 206, if the first user has executed the saving with the tag added after loading its own history record, the server replaces the history record under the same pdf document name of the user in an overlay manner.
In step 207, if the first user further includes records of other accounts in the process of loading the records, the server further analyzes the newly added items in the records saved by the current user and the record items of the historical other accounts, and performs record storage in a manner of combining the newly added items and the historical other account information.
That is, a mapping approach is adopted: recorded content that duplicates another account's record is replaced by a mapping to the corresponding account information. This works because, in the implementation of the technical solution of the embodiment of the present invention, the record content under the account information referenced by the mapping is also loaded in step 203. Therefore, even though the mapping approach described in step 207 is used to simplify storage on the server side, the final loading on the first user's side incurs no delay that any user can perceive.
In combination with the embodiment of the present invention, after the first user selects the server-side records, at least two presentation choices are provided:
First selection mode: when the first user selects a plurality of records and further chooses merged presentation, the marked content is loaded into the document opened by the current application according to the page number, the mark type, the mark start coordinate value and the mark end coordinate value of the pdf document under each record. Fig. 3 schematically shows pdf document mark 1, pdf document mark 2, pdf document mark 3 and pdf document mark 4, each drawn in a different form. To make the illustration of the second mode convenient and representative, these four marks are attributed to 4 different historical users. Real cases vary more; in particular, the record associated with each historical user usually contains multiple marks rather than the single mark object shown in fig. 3. In practice, usually no more than three users are associated with the same document, which is determined by the usual number of review rounds.
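Steps 206 and 207 can be sketched together. This is a simplified illustration under stated assumptions: the server store is a plain dict keyed by document and account, marks are hashable tuples, and the function names `save_record` and `resolve` are invented for the example. Marks that the current user deleted from an inherited record are not tracked.

```python
from typing import Dict, List, Optional, Sequence, Set, Tuple

Mark = Tuple[int, str, Tuple[float, float], Tuple[float, float]]
Store = Dict[Tuple[str, str], dict]


def resolve(store: Store, doc: str, account: str, seen: Optional[Set[str]] = None) -> List[Mark]:
    """Expand a stored record into its full mark list by following mappings."""
    seen = set() if seen is None else seen
    if account in seen:
        return []  # guard against accounts that reference each other
    seen.add(account)
    entry = store.get((doc, account))
    if entry is None:
        return []
    marks = list(entry["marks"])
    for other in entry["refs"]:
        for mark in resolve(store, doc, other, seen):
            if mark not in marks:
                marks.append(mark)
    return marks


def save_record(store: Store, doc: str, account: str,
                saved_marks: Sequence[Mark], loaded_accounts: Sequence[str] = ()) -> None:
    """Overwrite the user's own record, storing inherited marks as mappings."""
    others = sorted(a for a in set(loaded_accounts)
                    if a != account and (doc, a) in store)
    inherited: Set[Mark] = set()
    for other in others:
        inherited.update(resolve(store, doc, other, {account}))
    own = [m for m in saved_marks if m not in inherited]
    # Step 206: the previous record of this account is replaced outright.
    # Step 207: marks owned by other accounts are kept only as references.
    store[(doc, account)] = {"marks": own, "refs": others}


if __name__ == "__main__":
    store: Store = {}
    m1 = (1, "box", (0.0, 0.0), (10.0, 10.0))
    m2 = (2, "underline", (0.0, 5.0), (10.0, 5.0))
    save_record(store, "CN112819673A", "zhangsan", [m1])
    save_record(store, "CN112819673A", "lisi", [m1, m2], ["zhangsan"])
    print(store[("CN112819673A", "lisi")])
```

Storing a reference instead of a second copy of `m1` is what keeps the server-side record small, while `resolve` rebuilds the full mark list at load time.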
And a second selection mode: when a first user selects a plurality of records and further selects to independently present, according to the page number, the mark type, the mark start coordinate value and the mark end coordinate value of the pdf document under the corresponding record, making a corresponding number of pdf document copies according to the number of the records, and loading the mark content under each record into the corresponding pdf document copy. As shown in FIG. 4, taking the situation described in FIG. 3 above as an example, the corresponding presented effect graph is represented as 4 application windows in FIG. 4, wherein, except that the first application window is presented as "CN 112819673A", the other three windows are respectively presented as "CN 112819673A-copy 1", "CN 112819673A-copy 2" and "CN 112819673A-copy 3", and the mark objects in the corresponding windows are also described above and correspond to the historical users.
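The two presentation modes can be sketched as a function that plans which windows to open and which marks each one receives. The function name `plan_windows` and the dict-based record layout are assumptions for the example; the window titles follow the "-copy N" naming shown in fig. 4.

```python
from typing import Dict, List, Tuple


def plan_windows(doc_name: str, records: List[Dict], merged: bool) -> List[Tuple[str, list]]:
    """Return (window title, marks) pairs for the selected records."""
    if merged:
        # Merged presentation: every selected record is drawn onto one document.
        combined = [mark for record in records for mark in record["marks"]]
        return [(doc_name, combined)]
    windows = []
    for index, record in enumerate(records):
        # Independent presentation: the first record uses the opened document,
        # each further record gets its own copy.
        title = doc_name if index == 0 else f"{doc_name}-copy {index}"
        windows.append((title, list(record["marks"])))
    return windows


if __name__ == "__main__":
    selected = [{"account": f"user{i}", "marks": [f"mark {i}"]} for i in range(1, 5)]
    for title, marks in plan_windows("CN 112819673A", selected, merged=False):
        print(title, marks)
```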
In the implementation of the embodiment of the invention, the pdf file name comprises the default name given when the pdf document is downloaded from a specified website, and any keywords the user has added to the default name; searching the server for records containing the same or similar keywords according to the keywords contained in the pdf document name specifically comprises:
taking one or more types of character strings contained in the pdf file name as one or more keywords, and searching the server for records containing the same or similar keywords;
further, extracting one or more keywords from the name of the folder in which the pdf document is located, the folder being named according to a preset specification, screening the obtained records with them, and using the screening result as the record content finally presented in the record list;
the history record also includes the folder name and/or path name of the corresponding file.
The folder name contains one or more of a patent application number, a patent name, the round number of the review opinion response, an agency case number and an enterprise case number.
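The folder-based screening can be sketched as follows. The underscore-separated naming convention, the sample folder names, and the rule that a record is kept when its stored folder name shares at least one keyword with the current folder are all assumptions for illustration; the text leaves the preset naming specification open.

```python
from typing import Dict, List


def folder_keywords(folder_name: str, separator: str = "_") -> List[str]:
    """Split a folder name, named under an assumed convention, into keywords."""
    return [part.strip() for part in folder_name.split(separator) if part.strip()]


def screen_records(records: List[Dict[str, str]], folder_name: str) -> List[Dict[str, str]]:
    """Keep records whose stored folder name shares a keyword with this folder."""
    keywords = folder_keywords(folder_name)
    return [
        record for record in records
        if any(keyword in record.get("folder_name", "") for keyword in keywords)
    ]


if __name__ == "__main__":
    found = [
        {"account": "zhangsan", "folder_name": "CN202210443657.X_round1_AG2022-0153"},
        {"account": "wangwu", "folder_name": "CN202011111111.1_round1_AG2021-0042"},
    ]
    current_folder = "CN202210443657.X_AG2022-0153"
    print([r["account"] for r in screen_records(found, current_folder)])
```

Screening by folder keywords separates records made for different cases that happen to cite the same comparison file.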
Some agents habitually print the comparison file and mark the paper copy, and some may not mark at all. For these cases, the embodiment of the present invention further provides a feasible compensating scheme that collects and stores the points the agent actually paid attention to. Specifically, after the first user opens one or more pdf documents as comparison files and finishes writing the corresponding opinion statement reply document, as shown in fig. 5, the method further includes:
in step 301, on the pdf document operation interface, an operation of loading an opinion statement reply document is selected.
In the most intuitive example, a "load mark" function button arranged in a "toolbar" triggers a pop-up sub-window, and then a file path where the comment statement to be loaded is located can be selected through the sub-window, and after the loading is confirmed by clicking, the step 302 is entered.
In step 302, the paragraph description content in the opinion statement reply document that relates to the current pdf document is obtained through semantic recognition, and the corresponding description content position is located in the current pdf document.
This preferred scheme was arrived at after a close study of agencies' content specifications for examination opinion responses. To keep a response logically rigorous and clear, the specifications of reputable agencies require that the response cite the paragraph numbers of the comparison document, and this is what makes steps 301 to 303 of the preferred implementation technically feasible.
In step 303, it is determined whether the located description content position already has mark content; if so, that position is skipped and the next description position is located; if not, the position is marked using a default marking mode.
The default marking mode comprises one or more of a box mark, a horizontal line mark, a wavy line mark and an annotation box mark; the mark content newly generated from the opinion statement reply document is also added to the first user's record for the current pdf.
The preferred embodiment of the present invention thus takes advantage of the writing format of the opinion statement. Even if a previous agent worked from printed paper, the arguments in the opinion statement are still made with reference to the paragraph numbers of the comparison document. This makes it possible, in the preferred embodiment, to extract from the opinion statement, based on semantic learning, the paragraph description content that relates to the comparison document and to generate the corresponding records, which makes up for the shortcoming of the technical solution for agents with those operating habits and yields a highly practical technical solution.
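Steps 302 and 303 can be illustrated with the following minimal sketch. It rests on several assumptions that are not part of the original text: the reply document is available as plain text, it cites comparison-document paragraphs in the form "[0035]", and a simple regular expression stands in for the semantic recognition described above. The `Mark` and `PdfRecord` types, their field names and the "box" default are hypothetical.

```python
import re
from dataclasses import dataclass, field


@dataclass
class Mark:
    paragraph: str   # paragraph number in the comparison document, e.g. "0035"
    mark_type: str   # e.g. "box", "underline", "wavy", "note"
    source: str      # "user" for manual marks, "reply" for marks derived from the reply


@dataclass
class PdfRecord:
    """Hypothetical per-user record for one pdf document."""
    marks: list[Mark] = field(default_factory=list)

    def has_mark(self, paragraph: str) -> bool:
        return any(m.paragraph == paragraph for m in self.marks)


# Assumption: paragraphs are cited as four digits in square brackets.
PARAGRAPH_REF = re.compile(r"\[(\d{4})\]")


def load_reply_marks(reply_text: str, pdf_paragraphs: set[str],
                     record: PdfRecord, default_type: str = "box") -> list[Mark]:
    """Add a default mark for every cited paragraph that is not yet marked."""
    added = []
    # dict.fromkeys removes repeated citations while keeping their order.
    for para in dict.fromkeys(PARAGRAPH_REF.findall(reply_text)):
        if para not in pdf_paragraphs:
            continue  # the cited paragraph cannot be located in this pdf
        if record.has_mark(para):
            continue  # step 303: already marked, move on to the next position
        mark = Mark(para, default_type, "reply")
        record.marks.append(mark)  # the new mark also joins the user's record
        added.append(mark)
    return added
```

A real implementation would have to map each cited passage to page coordinates in the pdf rather than to a paragraph number alone; the sketch only shows the skip-if-marked, otherwise-mark-by-default control flow.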
With reference to the embodiment of the present invention, in order to better manage the storage resources in the server, when the content of each record stored in the server includes the matched case information, as shown in fig. 6, the method further includes:
In step 401, the server receives an outcome list imported by the operator, where the outcome list includes authorized case information and/or rejected case information.
The outcome list may be a grant/rejection list compiled by the agency every month or every quarter. Once the information contained in the list is confirmed, the server can sort through the locally stored records against specific columns of the list; for example, a typical grant/rejection list will contain columns for one or more of a patent publication number and/or application number, an agency case number, and a patent name.
In the actual operation process, the operator is usually a manager of the server, and may be a person in charge of a patent group or a process manager in the agency.
In step 402, the server retrieves locally stored records based on keywords contained in case information in the outcome list.
The keywords are one or more of the above-mentioned patent publication number and/or application number, agency case number, and patent name.
In step 403, when the case associated with a record has been closed, the corresponding one or more records stored locally on the server are cleared.
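Steps 401 to 403 amount to matching stored records against the keywords of the outcome list and clearing the matches. The following is a minimal sketch under stated assumptions: the `CaseInfo` and `StoredRecord` types, their field names and the sample values used with them are hypothetical, and an in-memory list stands in for the server's storage.

```python
from dataclasses import dataclass


@dataclass(frozen=True)
class CaseInfo:
    """Hypothetical case information; any field may be left empty."""
    application_number: str = ""
    agency_case_number: str = ""
    patent_name: str = ""

    def keywords(self) -> set[str]:
        values = (self.application_number, self.agency_case_number, self.patent_name)
        return {v.strip().lower() for v in values if v.strip()}


@dataclass
class StoredRecord:
    record_id: int
    case: CaseInfo


def purge_closed_cases(stored: list[StoredRecord],
                       outcome_list: list[CaseInfo]) -> tuple[list[StoredRecord], list[int]]:
    """Return the records to keep and the ids of the records cleared."""
    # Step 402: collect the keywords of every granted or rejected case.
    closed: set[str] = set()
    for case in outcome_list:
        closed |= case.keywords()

    # Step 403: a record is cleared when any of its case keywords is in the outcome list.
    kept = [r for r in stored if not (r.case.keywords() & closed)]
    removed = [r.record_id for r in stored if r.case.keywords() & closed]
    return kept, removed
```

Matching on the patent name alone may be ambiguous when two cases share a title, so a deployment would probably prefer the application number or agency case number where the list provides them.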
This completes the scheme in which records stored by the server are loaded into a pdf document opened locally on the first user's terminal, so that when an agent takes over a response task midway, the browsing records of the previous agent are quickly inherited. Moreover, once the corresponding patent case is closed by grant or rejection, the storage space in the server can be released quickly and accurately without adding to the existing workload, which closes the logical loop of the scheme.
Example 2:
Fig. 7 is a schematic diagram of the architecture of a device for efficiently browsing pdf documents according to an embodiment of the present invention. The device of the present embodiment includes one or more processors 21 and a memory 22. In fig. 7, one processor 21 is taken as an example.
The processor 21 and the memory 22 may be connected by a bus or other means, and fig. 7 illustrates the connection by a bus as an example.
The memory 22, as a non-volatile computer-readable storage medium, can be used to store non-volatile software programs and non-volatile computer-executable programs, such as those implementing the method for efficiently browsing pdf documents in embodiment 1. The processor 21 performs the method for efficiently browsing pdf documents by running the non-volatile software programs and instructions stored in the memory 22.
The memory 22 may include high speed random access memory and may also include non-volatile memory, such as at least one magnetic disk storage device, flash memory device, or other non-volatile solid state storage device. In some embodiments, the memory 22 may optionally include memory located remotely from the processor 21, and these remote memories may be connected to the processor 21 via a network. Examples of such networks include, but are not limited to, the internet, intranets, local area networks, mobile communication networks, and combinations thereof.
The program instructions/modules are stored in the memory 22 and when executed by the one or more processors 21, perform the method for efficiently browsing pdf documents in embodiment 1 described above, for example, perform the steps shown in fig. 1-2 and fig. 5-6 described above.
It should be noted that, since the information interaction and execution processes between the modules and units in the above apparatus and system are based on the same concept as the method embodiment of the present invention, reference may be made to the description of the method embodiment for specific details, which are not repeated here.
Those of ordinary skill in the art will appreciate that all or part of the steps of the various methods of the embodiments may be performed by associated hardware as instructed by a program, which may be stored on a computer-readable storage medium, which may include: Read Only Memory (ROM), Random Access Memory (RAM), magnetic or optical disks, and the like.
The above description is only for the purpose of illustrating the preferred embodiments of the present invention and is not to be construed as limiting the invention, and any modifications, equivalents and improvements made within the spirit and principle of the present invention are intended to be included within the scope of the present invention.

Claims (8)

1. A method for efficiently browsing pdf documents is characterized by comprising the following steps:
when a first user logs in to an application, a default account login process is completed by using the account and password from the first user's previous login;
when a first user opens a pdf document, an application acquires the pdf document name, and searches a record containing the same or similar keywords in a server according to the keywords contained in the pdf document name;
the record comprises one or more of pdf document page number, mark type, mark start coordinate value and mark end coordinate value, and the corresponding record is also associated with account information and time of a mark user;
after the application acquires one or more records, generating a record list in an application window interface for an operator to select one or more records;
and after the first user selects one record, loading the marked content into the document opened by the current application according to the page number, the mark type, the mark start coordinate value and the mark end coordinate value of the pdf document under the corresponding record.
2. The method of claim 1, further comprising:
when the first user saves the modification of the current pdf document, the corresponding relevant mark existing in the current document is recorded locally in a log mode; synchronizing the records to a server according to a preassigned synchronization mode;
if the first user performs the storage under the condition that the mark is newly added after loading the history record of the first user, the server replaces the history record of the user under the same pdf document name in a covering mode;
if the first user also includes records of other account numbers in the process of loading the records, the server further analyzes newly added items in the records stored by the current user and record items of historical other account numbers, and performs record storage in a mode of combining the newly added items and historical other account number information.
3. The method of claim 1, wherein when the first user selects a plurality of records and further selects a combined presentation, the markup content is loaded into a document currently opened by the application according to the page number, the markup type, the markup start coordinate value and the markup end coordinate value of the pdf document under the corresponding record;
when a first user selects a plurality of records and further selects individual presentation, according to the page number, mark type, mark start coordinate value and mark end coordinate value of the pdf document under the corresponding record, making a corresponding number of pdf document copies according to the number of records, and loading the mark content under each record into the corresponding pdf document copy.
4. The method as claimed in claim 1, wherein the pdf file name comprises a default name included when downloading the pdf document from a specified website, and the user adds a keyword to the default name; searching a record containing the same or similar keywords in a server according to the keywords contained in the pdf document name, which specifically comprises:
searching the server for records containing the same or similar keywords, using one or more types of character strings contained in the pdf file name as the one or more keywords;
further, one or more keywords are extracted from the name of the folder in which the pdf document is located, the folder being named according to a preset specification; the retrieved records are screened with these keywords, and the screening result is used as the record content finally presented in the record list;
the history record also includes the folder name and/or path name of the corresponding file.
5. The method as recited in claim 4, wherein the folder name comprises one or more of a patent application number, a patent name, the round of the examination opinion response, an agency case number, and an enterprise case number.
6. The method as claimed in any one of claims 1 to 5, wherein after the first user opens one or more pdf documents as comparison documents and completes the content composition of the corresponding opinion statement reply document, the method further comprises:
selecting an operation of loading an opinion statement reply document on a pdf document operation interface;
obtaining paragraph description contents related to the current pdf document in an opinion statement reply document through semantic recognition, and positioning the corresponding description contents in the current pdf document;
determining whether the located corresponding description content position has mark content, and if so, skipping it and locating the next description position; if not, marking it using a default marking mode; the default marking mode comprises one or more of a box mark, a horizontal line mark, a wavy line mark and an annotation box mark;
the mark content newly generated from the opinion statement reply document is also added to the first user's record for the current pdf.
7. The method as claimed in any one of claims 1 to 5, wherein when the content of each record stored in the server contains the matched case information, the method further comprises:
the server receives an outcome list imported by an operator, wherein the outcome list comprises authorized case information and/or rejected case information;
the server searches the locally stored records according to the keywords contained in the case information in the outcome list;
and clearing the corresponding one or more records stored locally on the server when the case associated with the corresponding record has been closed.
8. An apparatus for efficiently browsing pdf documents, the apparatus comprising:
at least one processor; and a memory communicatively coupled to the at least one processor; wherein the memory stores instructions executable by the at least one processor, the instructions, when executed by the at least one processor, performing the method for efficiently browsing pdf documents of any one of claims 1-7.
CN202210443657.XA 2022-04-26 2022-04-26 Method and device for efficiently browsing pdf document Active CN115048339B (en)

Priority Applications (1)

Application Number Priority Date Filing Date Title
CN202210443657.XA CN115048339B (en) 2022-04-26 2022-04-26 Method and device for efficiently browsing pdf document

Publications (2)

Publication Number Publication Date
CN115048339A true CN115048339A (en) 2022-09-13
CN115048339B CN115048339B (en) 2023-03-21

Family

ID=83157784

Family Applications (1)

Application Number Title Priority Date Filing Date
CN202210443657.XA Active CN115048339B (en) 2022-04-26 2022-04-26 Method and device for efficiently browsing pdf document

Country Status (1)

Country Link
CN (1) CN115048339B (en)

Citations (7)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
US20090282074A1 (en) * 2008-05-07 2009-11-12 Anand Balaji Ramakrishnan Document Creator
CN101739415A (en) * 2008-11-25 2010-06-16 华中师范大学 Browser-oriented webpage labeling system
CN102819531A (en) * 2011-06-10 2012-12-12 北大方正集团有限公司 Cloud reading service system, cloud reading service method and device
CN106294290A (en) * 2015-06-05 2017-01-04 腾讯科技(深圳)有限公司 A kind of method and apparatus showing document
CN110162619A (en) * 2019-05-27 2019-08-23 上海吉江数据技术有限公司 Online comparison reading system, method and device
CN112487766A (en) * 2020-12-10 2021-03-12 北京明略软件系统有限公司 Document labeling method and system and computer equipment
CN112989766A (en) * 2021-05-11 2021-06-18 金锐同创(北京)科技股份有限公司 Method and device for processing document labeling information and terminal equipment

Also Published As

Publication number Publication date
CN115048339B (en) 2023-03-21

Legal Events

Date Code Title Description
PB01 Publication
SE01 Entry into force of request for substantive examination
GR01 Patent grant