CN115048339B - Method and device for efficiently browsing pdf document - Google Patents

Method and device for efficiently browsing pdf document Download PDF

Info

Publication number
CN115048339B
CN115048339B CN202210443657.XA CN202210443657A CN115048339B CN 115048339 B CN115048339 B CN 115048339B CN 202210443657 A CN202210443657 A CN 202210443657A CN 115048339 B CN115048339 B CN 115048339B
Authority
CN
China
Prior art keywords
mark
record
pdf
document
pdf document
Prior art date
Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
Active
Application number
CN202210443657.XA
Other languages
Chinese (zh)
Other versions
CN115048339A (en
Inventor
林礼挺
Current Assignee (The listed assignees may be inaccurate. Google has not performed a legal analysis and makes no representation or warranty as to the accuracy of the list.)
Wuhan Feiyu Technology Co ltd
Original Assignee
Wuhan Feiyu Technology Co ltd
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
Filing date
Publication date
Application filed by Wuhan Feiyu Technology Co ltd filed Critical Wuhan Feiyu Technology Co ltd
Priority to CN202210443657.XA priority Critical patent/CN115048339B/en
Publication of CN115048339A publication Critical patent/CN115048339A/en
Application granted granted Critical
Publication of CN115048339B publication Critical patent/CN115048339B/en
Active legal-status Critical Current
Anticipated expiration legal-status Critical

Links

Images

Classifications

    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06FELECTRIC DIGITAL DATA PROCESSING
    • G06F16/00Information retrieval; Database structures therefor; File system structures therefor
    • G06F16/10File systems; File servers
    • G06F16/14Details of searching files based on file metadata
    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06FELECTRIC DIGITAL DATA PROCESSING
    • G06F16/00Information retrieval; Database structures therefor; File system structures therefor
    • G06F16/10File systems; File servers
    • G06F16/13File access structures, e.g. distributed indices
    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06FELECTRIC DIGITAL DATA PROCESSING
    • G06F16/00Information retrieval; Database structures therefor; File system structures therefor
    • G06F16/10File systems; File servers
    • G06F16/17Details of further file system functions
    • G06F16/1727Details of free space management performed by the file system
    • HELECTRICITY
    • H04ELECTRIC COMMUNICATION TECHNIQUE
    • H04LTRANSMISSION OF DIGITAL INFORMATION, e.g. TELEGRAPHIC COMMUNICATION
    • H04L63/00Network architectures or network communication protocols for network security
    • H04L63/08Network architectures or network communication protocols for network security for authentication of entities
    • H04L63/083Network architectures or network communication protocols for network security for authentication of entities using passwords

Abstract

The invention relates to the technical field of computer document application, and provides a method and a device for efficiently browsing pdf documents. When a first user logs in an application, a default account login process is completed by using the account and the password of the first user historical login; when a first user opens a pdf document, an application acquires the pdf document name, and searches a record containing the same or similar keywords in a server according to the keywords contained in the pdf document name; after the application acquires one or more records, generating a record list in an application window interface for an operator to select one or more records; and after the first user selects one record, loading the marked content into the document opened by the current application according to the page number, the mark type, the mark start coordinate value and the mark end coordinate value of the pdf document under the corresponding record. The method and the device increase the inheritance of pdf document browsing and improve the efficiency of pdf document browsing.

Description

Method and device for efficiently browsing pdf document
Technical Field
The invention relates to the technical field of computer document application, in particular to a method and a device for efficiently browsing pdf documents.
Background
PDF is short for Portable Document Format, meaning "Portable Document Format", and is a file Format developed by Adobe Systems for exchanging files in a manner unrelated to application programs, operating Systems, and hardware. The PDF file is based on a PostScript language image model, and accurate colors and accurate printing effects can be guaranteed regardless of the printer, i.e., the PDF faithfully reproduces each character, color, and image of the original.
PDF is popular with various platforms due to the fact that the PDF is provided with a watermark, compared with word documents, the PDF is free of edibility, and the PDF is mostly in a secondary format as a storage mode of public information documents, for example, relevant management departments mostly disclose public information documents such as public patent documents, examination opinion documents and the like in a PDF or picture mode, wherein the PDF has extremely high picture conversion compatibility, namely, original pictures can be converted into a complete PDF document.
In many existing pdf reading tools, the pdf marking mode is performed in an independent overlay mode, which is equivalent to additionally loading on the pdf document content, and finally forming a marking effect. This is a necessary function for browsing related documents, especially in the case of a large document size and a new key model in that place, however, in the prior art, the browsing manner of pdf documents is mostly based on the personal browsing of users, which not only causes the problem of low efficiency, but also fails to bring into play the real efficient browsing experience of pdf reading tools in the specific application scene field.
In view of the above, overcoming the drawbacks of the prior art is an urgent problem in the art.
Disclosure of Invention
The technical problem to be solved by the invention is that in the field of similar intellectual property rights, public published patent documents which can be downloaded from official or third-party channels keep high consistency in content; the present invention particularly considers an application scenario in which the pdf patent document is downloaded and processed as a comparison document, and in this case, in cooperation with review opinion analysis and the patent analysis process of the invention, the pdf patent document downloaded correspondingly and used as the comparison document is marked with a lot of contents. However, in the prior art, especially in the agent scene, it is very likely that the response of a patent will be due to the deputy and temporary arrangement of the agents, and the first round of review opinion response and the second round of review opinion response will be processed by different agents, even when the third round of review opinion may exist, the three review opinions may be completed by different agents. At this time, most of documents that can be acquired by the intermediately-received agent are the modification comparison page and the opinion statement backed up on the agent system, but the matched comparison files may be different due to regulations or due to the storage resources of the server, and are not effectively stored in the actual situation, and for the intermediately-involved agent, the intermediately-involved agent needs to download the comparison files and browse the comparison files on a zero basis. This is the prior art problem scenario that the present invention is intended to solve.
The technical problem to be further solved by the invention is how to transfer the 'mark' information of some agents who like printing comparison files to the server provided by the invention as much as possible according to the operation habits of different people.
The invention adopts the following technical scheme:
in a first aspect, the present invention provides a method for efficiently browsing pdf documents, including:
when a first user logs in an application, a default account login process is completed by using the account and the password of the first user historical login;
when a first user opens a pdf document, an application acquires the pdf document name, and searches a record containing the same or similar keywords in a server according to the keywords contained in the pdf document name;
the record is composed of one or more items of pdf document page number, mark type, mark start coordinate value and mark end coordinate value, and the corresponding record is also associated with account information and time of a mark user;
after the application acquires one or more records, generating a record list in an application window interface for an operator to select one or more records;
and after the first user selects one record, loading the marked content into the document opened by the current application according to the page number, the mark type, the mark start coordinate value and the mark end coordinate value of the pdf document under the corresponding record.
Preferably, the method further comprises:
when the first user saves the modification to the current pdf document, the corresponding relevant tags existing in the current document will be logged locally; synchronizing the records to a server according to a preassigned synchronization mode;
if the first user performs the storage under the condition that the mark is newly added after loading the history record of the first user, the server replaces the history record of the user under the same pdf document name in a covering mode;
if the first user also includes records of other account numbers in the process of loading the records, the server further analyzes newly added items in the records stored by the current user and record items of historical other account numbers, and performs record storage in a mode of combining the newly added items and historical other account number information.
Preferably, when the first user selects a plurality of records and further selects and combines the presentation, loading the marked content into a document opened by the current application according to the page number, the mark type, the mark start coordinate value and the mark end coordinate value of the pdf document under the corresponding records;
when a first user selects a plurality of records and further selects to independently present, according to the page number, the mark type, the mark start coordinate value and the mark end coordinate value of the pdf document under the corresponding record, making a corresponding number of pdf document copies according to the number of the records, and loading the mark content under each record into the corresponding pdf document copy.
Preferably, the pdf file name includes a default name included when downloading the pdf document from a specified website, and the user adds a keyword to the default name; searching a record containing the same or similar keywords in a server according to the keywords contained in the pdf document name, which specifically comprises:
searching for records containing the same or similar keywords in a server by using one or more types of character strings contained in a pdf file name as one or more keywords and using the keywords contained in the pdf file name;
further, one or more keywords are extracted from the folder name named according to the preset specification where the pdf document is located, the obtained records are screened, and the screening result is used as the record content finally presented in the record list;
the history record also includes the folder name and/or path name of the corresponding file.
Preferably, the folder name includes one or more of a patent application number, a patent name, a several rounds of review comments reply, an agency case number, and an enterprise case number.
Preferably, after the first user opens one or more pdf documents as a comparison document and completes the composition of content in a corresponding opinion statement reply document, the method further comprises:
selecting the operation of loading the opinion statement to reply the document on a dpf document operation interface;
obtaining paragraph description contents related to the current pdf document in an opinion statement reply document through semantic recognition, and positioning the corresponding description contents in the current pdf document;
determining whether the located corresponding description content position has the mark content, and skipping to locate the next description position if the mark content exists; if not, adopting a default marking mode to mark; the default marking mode comprises one or more of a square frame mark, a transverse line mark, a wavy line mark and an annotation frame mark;
the newly generated markup content for the reply document from the opinion statement is also added to the first user's record for the current pdf.
Preferably, when the content of each record stored in the server includes matched case information, the method further includes:
the server receives an end-to-end list imported by an operator, wherein the end-to-end list comprises authorized case information and/or rejected case information;
the server searches the locally stored records according to the keywords contained in the case information in the case list;
and clearing one or more records locally corresponding to the server when the case associated with the corresponding record is already put on a table.
In a second aspect, the present invention further provides an apparatus for efficiently browsing pdf documents, which is used to implement the method for efficiently browsing pdf documents in the first aspect, and the apparatus includes:
at least one processor; and a memory communicatively coupled to the at least one processor; wherein the memory stores instructions executable by the at least one processor for performing the method of the first aspect as applied to efficient browsing of pdf documents.
In a third aspect, the present invention also provides a non-volatile computer storage medium storing computer-executable instructions for execution by one or more processors for performing the method for efficiently browsing pdf documents as described in the first aspect.
The invention catches the characteristic that the patent documents downloaded by official or third party platforms in the patent field have high consistency, takes the patent names and/or publication numbers in the file names downloaded by the official or third party platforms as an index basis, and can establish the association relationship with the marked content in the same comparison file marked by other agents in the historical reply with high consistency; therefore, the corresponding records stored in the server are projected into the comparison file downloaded by the current agent, so that the reading emphasis of the historical agent can be effectively presented to the agent of the middle-end, and the mode not only can maximally compress the storage space in the server, but also can form the possibility of subsequent optimization space.
In the preferred scheme of the invention, the writing format in the corresponding opinion statement book is further captured, even if the operation habit of a historical agent is carried out by printing a paper piece, the statement of the reasonable contents in the statement book in a way of matching paragraph numbers in a comparison file can still be observed, so that the possibility of generating the related records is generated by extracting the description contents of the related paragraphs in the opinion statement book based on semantic learning in the preferred scheme of the invention, thereby filling the defects of the technical scheme provided by the invention in a specific operation habit agent and forming a high-practicability technical scheme.
Drawings
In order to more clearly illustrate the technical solutions of the embodiments of the present invention, the drawings required to be used in the embodiments of the present invention will be briefly described below. It is obvious that the drawings described below are only some embodiments of the invention, and that for a person skilled in the art, other drawings can be derived from them without inventive effort.
Fig. 1 is a flow chart of an efficient browsing method applied to pdf documents according to an embodiment of the present invention;
fig. 2 is a schematic flow chart of another efficient browsing method applied to pdf documents according to an embodiment of the present invention;
fig. 3 is a schematic diagram illustrating an interface effect of an efficient browsing method applied to a pdf document according to an embodiment of the present invention;
fig. 4 is a schematic diagram illustrating an interface effect of an efficient browsing method applied to a pdf document according to an embodiment of the present invention;
fig. 5 is a flowchart of a further method for efficiently browsing pdf documents according to an embodiment of the present invention;
fig. 6 is a flowchart of a further method for efficiently browsing pdf documents according to an embodiment of the present invention;
fig. 7 is a schematic structural diagram of an efficient browsing apparatus for pdf documents according to an embodiment of the present invention.
Detailed Description
In order to make the objects, technical solutions and advantages of the present invention more apparent, the present invention is described in further detail below with reference to the accompanying drawings and embodiments. It should be understood that the specific embodiments described herein are merely illustrative of the invention and do not limit the invention.
After intensively studying some characteristics of the intellectual property field, the applicant confirms that the public can download published patent documents from official or third party channels, and the contents of the published patent documents are highly consistent; in particular, an application scenario is considered in which the pdf patent document is downloaded and processed as a comparison document, and in this case, in cooperation with review opinion analysis and the patent analysis process of the pdf patent document, the pdf patent document corresponding to the comparison document downloaded and processed as the comparison document is marked with a lot of contents.
However, in the prior art, especially in the agent scene, it is very likely that the response of a patent will be due to the deputy and temporary arrangement of the agents, and the first round of review opinion response and the second round of review opinion response will be processed by different agents, even when the third round of review opinion may exist, the three review opinions may be completed by different agents. At this time, most of documents which can be acquired by the intermediately received agent are the modified comparison page and the comment statement which are backed up on the agent system, but the matched comparison file may be different due to regulations or due to the storage resource of the server, and is not effectively stored in the actual situation, at this time, for the intermediately involved agent who replies, the intermediately involved agent needs to download the comparison file and browse the comparison file from the zero basis, whereas for the efficient browsing manner, if browsing of the comparison file can be performed on the basis of the mark of the previous reply, the effort is increased. This is the prior art problem scenario that the present invention is intended to solve.
In addition, the technical features involved in the embodiments of the present invention described below may be combined with each other as long as they do not conflict with each other.
Example 1:
embodiment 1 of the present invention provides an efficient browsing method applied to pdf documents, as shown in fig. 1, including:
in step 201, when the first user logs in the application, the default account login process is completed by using the account and the password of the first user historical login.
Because the application scenario targeted by the embodiment of the invention is a specific scenario similar to that of an agent, the account and the password can be uniformly applied for registration by the agent according to the information of the agent, and the corresponding information can quickly confirm who the historical agent is based on the account information even if the corresponding agent leaves the work and a previous server provided by a local server or a pdf reader of the agent is used subsequently. By the technical scheme of the invention, when the record at the server side is loaded, the degree of reference can be prejudged according to the matched historical account information, for example, the stronger the corresponding historical agent capability is, the higher the degree of reference of the loaded record is, and the less the record is, the more the receiving agent is.
In step 202, when the first user opens the pdf document, the application obtains the pdf document name, and searches the server for a record containing the same or similar keywords according to the keywords contained in the pdf document name.
As a typical patent document, as a constitution of possible pdf document names, for example, "comparison document 1 CN112819673A", or "CN112819673A", or "CN202110195736.9 based on an information platform of a cloud six-terminal framework", or "comparison document 1CN202110195736.9 based on an information platform of a cloud six-terminal framework"; the format of the pdf document can be various rich combinations after possible editing operations of the first user are introduced, however, in order to complete the opening of the pdf document and match the corresponding record on the server side, it is necessary that the corresponding file name contains the unique identifier of the pdf document, for example, the above-mentioned publication number "CN112819673A" or application number "CN202110195736.9", in the subsequent embodiments of the present invention, the publication number is set as a valid unique identifier for explanation. However, as an alternative, a multi-element combination search under a naming rule may also be adopted, so as to further improve the accuracy, and redundant description is not repeated here.
The record is composed of one or more items of pdf document page number, mark type, mark start coordinate value and mark end coordinate value, and the corresponding record is also associated with account information and time of a mark user. The account information is preferably formed by a full spelling of the name of the agent, and the corresponding time is the time when the corresponding record is completed, not the time uploaded to the server.
In step 203, after the application acquires one or more records, a record list is generated in an application window interface for an operator to select one or more records.
In the specific presentation process, the account information, the time information, the patent name, the publication number, and other information may be presented together, or if the content is more, the information may be presented by a dragging bar or a window shrinking/expanding manner, which is not described herein in detail.
In step 204, after the first user selects one record each time, the markup content is loaded into the document opened by the current application according to the page number, the markup type, the markup start coordinate value and the markup end coordinate value of the pdf document under the corresponding record.
The embodiment of the invention captures the characteristic that the patent documents downloaded by official or third-party platforms in the patent field have high consistency, and the patent names and/or publication numbers in the downloaded file names are used as the index basis, so that the association relation between the patent documents and the marked contents in the same comparison file marked by other agents in the historical response can be established with high consistency; therefore, the corresponding records stored in the server are projected into the comparison file downloaded by the current agent, so that the reading focus of the historical agent can be effectively presented to the agent of the middle-end, and the mode not only can maximally compress the storage space in the server, but also can form the possibility of subsequent optimization space.
In consideration of the completeness of the technical solution of the embodiment of the present invention, the step 201 to the step 204 represent an opening operation link, and in practical cases, it is also determined that a saving closing link is involved, so as shown in fig. 2, the method further includes:
in step 205, when the first user saves the modifications to the current pdf document, the corresponding relevant tags existing in the current document will be logged locally; and synchronizing the records to a server according to a pre-specified synchronization mode.
A preassigned synchronization mode comprises that the terminal is synchronized to the server at the first time after online networking; or a fixed uploading time can be set, such as 10 monday of each week; or may be performed by a user's active operation, which is not limited herein.
In step 206, if the first user has executed the saving with the tag added after loading its own history record, the server replaces the history record under the same pdf document name of the user in an overlay manner.
In step 207, if the first user further includes records of other accounts in the process of loading the records, the server further analyzes the newly added items in the records saved by the current user and the record items of the historical other accounts, and performs record storage in a manner of combining the newly added items and the historical other account information.
Namely, a mapping mode is adopted, and repeated recorded contents are mapped and replaced in a corresponding account information mode; this is achieved because, in the implementation process of the technical solution of the embodiment of the present invention, the record content under the account information corresponding to the mapping relationship is also loaded in step 203, so that even if the mapping manner described in step 207 is adopted to simplify the storage at the server side, no delay that can be perceived by any user is provided for the final loading at the first user side.
In combination with the embodiment of the present invention, after the first user selects the server-side record, at least two choices are provided for a possible long-line mode:
the selection mode is as follows: when a first user selects a plurality of records and further selects and combines the records to present, loading the marked content into a document opened by the current application according to the page number, the mark type, the mark start coordinate value and the mark end coordinate value of the pdf document under the corresponding records; as shown in fig. 3, pdf document markup 1, pdf document markup 2, pdf document markup 3 and pdf document markup 4 are schematically shown, which are represented in different forms, and for convenience and typicality of the second mode, the pdf document markup 1, pdf document markup 2, pdf document markup 3 and pdf document markup 4 are assigned to 4 different history users, and the situation is more variable in the actual situation, and more, there are usually multiple markup in the record associated with each history user, rather than the schematic one markup object shown in fig. 3; in practice, there are usually no more than three users under the same document that may be associated, as determined by the number of routine rounds of review.
The second selection mode: when a first user selects a plurality of records and further selects to independently present, according to the page number, the mark type, the mark start coordinate value and the mark end coordinate value of the pdf document under the corresponding record, making a corresponding number of pdf document copies according to the number of the records, and loading the mark content under each record into the corresponding pdf document copy. As shown in fig. 4, taking the case described in the above fig. 3 as an example, the effect diagram presented correspondingly is represented as 4 application windows in fig. 4, wherein, except that the first application window presents "CN112819673A", the other three windows respectively present "CN 112819673A-copy 1", "CN 112819673A-copy 2" and "CN 112819673A-copy 3", and the mark objects in the respective windows are also as described above and correspond to the respective historical users.
In the implementation process of the embodiment of the invention, the pdf file name comprises a default name contained when the pdf document is downloaded from a specified website, and a keyword newly added to the default name by a user; searching a record containing the same or similar keywords in a server according to the keywords contained in the pdf document name, which specifically comprises:
taking one or more types of character strings contained in a pdf file name as one or more keywords for retrieval, and searching records containing the same or similar keywords in a server by using the keywords contained in the pdf file name;
further, one or more keywords are extracted from the folder name named according to the preset specification where the pdf document is located, the obtained records are screened, and the screening result is used as the record content finally presented in the record list;
the history record also includes the folder name and/or path name of the corresponding file.
The name of the folder contains one or more of a patent application number, a patent name, a few rounds of examination and comment responses, an agency case number and an enterprise case number.
In view of the working habits of some agents as the habits of marking on paper contrast documents after printing contrast documents and the possible habits of some agents as not marking, the embodiment of the present invention further provides a feasible compensation scheme in the implementation process, which is to arrange and store the attention points actually related to the agents, specifically, after the first user opens one or more pdf documents as contrast documents and completes the content writing in the corresponding comment statement reply document, as shown in fig. 5, the method further includes:
in step 301, at the dpf document operation interface, an operation of loading an opinion statement reply document is selected.
In the most intuitive example, a "load mark" function button arranged in a "toolbar" triggers a pop-up sub-window, and then a file path where the comment statement to be loaded is located can be selected through the sub-window, and after the loading is confirmed by clicking, the step 302 is entered.
In step 302, paragraph descriptors in the opinion statement reply document regarding the current pdf document are obtained via semantic recognition and located in the current pdf document to the corresponding descriptor locations.
The preferred scheme can be provided, and is obtained after the examination opinions of the agency are deeply researched to reply to the content specification; in order to achieve the logical tightness and clarity of the review comment response, the contents of the response indicate that the paragraph numbers in the comparison file are necessarily required in the specification of the regular agency, and therefore, the technical feasibility of the implementation of steps 301 to 303 in the optimized implementation scheme is brought.
In step 303, determining whether the located corresponding description content position has a mark content, and if so, skipping to locate the next description position; if not, marking by adopting a default marking mode.
The default marking mode comprises one or more of a square frame mark, a transverse line mark, a wavy line mark and an annotation frame mark; the newly generated markup content for the reply document from the opinion statement is also added to the first user's record for the current pdf.
In the preferred embodiment of the present invention, the writing format in the corresponding opinion statement book is further captured, even if the operation habit of the historical agent is performed by printing paper, the historical agent still sees the statement in the statement book with rational data content in the way of matching the paragraph number in the comparison file, so that the possibility of extracting the paragraph description content related to the comparison in the opinion statement book based on semantic learning in the preferred embodiment of the present invention and generating the related record is generated, thereby filling up the defects of the technical scheme provided by the present invention in the specific operation habit agent, and forming a technical scheme with high practicability.
In combination with the embodiment of the present invention, in order to better manage the storage resources in the server, when the content of each record stored in the server includes the matched case information, as shown in fig. 6, the method further includes:
in step 401, the server receives an end list imported by the operator, where the end list includes authorized case information and/or rejected case information.
The road plan list can be an authorization/rejection list which is arranged by the agency every month or every quarter, and the server can be used for combing the locally stored records according to the confirmation of the information contained in the list; for example, a column in a typical authorization/rejection list will necessarily contain one or more of a patent publication number and/or application number, an agency's content number, and a patent name.
In the actual operation process, the operator is usually a manager of the server, and may be a person in charge of a patent group or a process manager in the agency.
In step 402, the server retrieves locally stored records based on keywords contained in case information in the outcome list.
The keywords are one or more of the above-mentioned patent publication numbers and/or application numbers, the contents of the attorney docket numbers, and the patent names.
In step 403, when the case associated with the corresponding record has been finalized, one or more records locally corresponding to the server are cleared.
Therefore, how the record stored by the server is loaded into the pdf document opened locally from the first user terminal side is formed, and the browsing record of the history agent is quickly inherited when the similar midway reply receiving task is realized; moreover, after the corresponding patent case authorizer finalizes the case, the storage space in the server can be quickly and accurately released without increasing the existing workload, and the logic closed loop realized by one scheme is completed.
Example 2:
fig. 7 is a schematic diagram of an architecture of an efficient pdf document browsing device according to an embodiment of the present invention. The device for efficiently browsing pdf documents of the present embodiment includes one or more processors 21 and a memory 22. In fig. 7, one processor 21 is taken as an example.
The processor 21 and the memory 22 may be connected by a bus or other means, and fig. 7 illustrates the connection by a bus as an example.
The memory 22, which is a non-volatile computer-readable storage medium, can be used to store non-volatile software programs and non-volatile computer-executable programs, such as the method applied to efficiently browse pdf documents in embodiment 1. The processor 21 executes the efficient browsing method applied to pdf documents by running non-volatile software programs and instructions stored in the memory 22.
The memory 22 may include high speed random access memory and may also include non-volatile memory, such as at least one magnetic disk storage device, flash memory device, or other non-volatile solid state storage device. In some embodiments, the memory 22 may optionally include memory located remotely from the processor 21, and these remote memories may be connected to the processor 21 via a network. Examples of such networks include, but are not limited to, the internet, intranets, local area networks, mobile communication networks, and combinations thereof.
The program instructions/modules are stored in the memory 22 and when executed by the one or more processors 21, perform the method for efficiently browsing pdf documents in embodiment 1 described above, for example, perform the steps shown in fig. 1-2 and fig. 5-6 described above.
It should be noted that, for the information interaction, execution process and other contents between the modules and units in the apparatus and system, the specific contents may refer to the description in the embodiment of the method of the present invention because the same concept is used as the embodiment of the processing method of the present invention, and are not described herein again.
Those of ordinary skill in the art will appreciate that all or part of the steps of the various methods of the embodiments may be implemented by associated hardware as instructed by a program, which may be stored on a computer-readable storage medium, which may include: a Read Only Memory (ROM), a Random Access Memory (RAM), a magnetic or optical disk, and the like.
The above description is only for the purpose of illustrating the preferred embodiments of the present invention and is not to be construed as limiting the invention, and any modifications, equivalents and improvements made within the spirit and principle of the present invention are intended to be included within the scope of the present invention.

Claims (7)

1. A method for efficiently browsing pdf documents is characterized by comprising the following steps:
when a first user logs in an application, a default account login process is completed by using the account and the password of the first user historical login;
when a first user opens a pdf document, an application acquires a pdf document name, and searches records containing the same or similar keywords in a server according to keywords contained in the pdf document name;
the record is composed of one or more items of pdf document page number, mark type, mark start coordinate value and mark end coordinate value, and the corresponding record is also associated with account information and time of a mark user;
after the application acquires one or more records, generating a record list in an application window interface for an operator to select one or more records;
after a first user selects one record, loading the marked content into a document opened by the current application according to the page number, the mark type, the mark start coordinate value and the mark end coordinate value of the pdf document under the corresponding record;
after the first user opens one or more pdf documents as a comparison document and completes the composition of content in the corresponding opinion statement reply document, the method further comprises:
selecting the operation of loading the opinion statement reply document on a pdf document operation interface;
obtaining paragraph description contents related to the current pdf document in an opinion statement reply document through semantic recognition, and positioning the corresponding description contents in the current pdf document;
determining whether the located corresponding description content position has the mark content, and skipping to locate the next description position if the mark content exists; if not, adopting a default marking mode to mark; the default marking mode comprises one or more of a square mark, a transverse line mark, a wavy line mark and an annotation frame mark;
wherein newly generated markup content responsive to the opinion statement reply document is also added to the first user's record for the current pdf.
2. The method of claim 1, further comprising:
when the first user saves the modification of the current pdf document, the corresponding relevant mark existing in the current document is recorded locally in a log mode; synchronizing the records to a server according to a preassigned synchronization mode;
if the first user loads the history record of the first user and performs the storage under the condition of adding a mark, the server replaces the history record of the first user under the same pdf document name in a covering mode;
if the first user also includes records of other account numbers in the process of loading the records, the server further analyzes newly added items in the records stored by the current user and record items of historical other account numbers, and performs record storage in a mode of combining the newly added items and historical other account number information.
3. The method as claimed in claim 1, wherein when the first user selects a plurality of records and further selects a combined presentation, the markup content is loaded into a document opened by the current application according to the page number, the markup type, the markup start coordinate value and the markup end coordinate value of the pdf document under the corresponding record;
when a first user selects a plurality of records and further selects to independently present, according to the page number, the mark type, the mark start coordinate value and the mark end coordinate value of the pdf document under the corresponding record, making a corresponding number of pdf document copies according to the number of the records, and loading the mark content under each record into the corresponding pdf document copy.
4. The method as claimed in claim 1, wherein the pdf file name comprises a default name included when downloading the pdf document from a specified website, and the user adds a keyword to the default name; searching records containing the same or similar keywords in the server according to the keywords contained in the pdf document name, which specifically comprises:
taking one or more types of character strings contained in the pdf file name as one or more keywords, and searching records containing the same or similar keywords in a server;
further, one or more keywords are extracted from the folder name named according to the preset specification where the pdf document is located, the obtained records are screened, and the screening result is used as the record content finally presented in the record list;
the history record also includes the folder name and/or path name of the corresponding file.
5. The method as recited in claim 4, wherein the folder name comprises one or more of a patent application number, a patent name, a review comment reply of several rounds, a case number of an agency, and a case number of an enterprise.
6. The method as claimed in any one of claims 1 to 5, wherein when the content of each record stored in the server contains the matched case information, the method further comprises:
the server receives an end-to-end list imported by an operator, wherein the end-to-end list comprises authorized case information and/or rejected case information;
the server searches the locally stored records according to the keywords contained in the case information in the case list;
and clearing one or more records locally corresponding to the server when the case associated with the corresponding record is already put on a table.
7. An apparatus for efficiently browsing pdf documents, the apparatus comprising:
at least one processor; and a memory communicatively coupled to the at least one processor; wherein the memory stores instructions executable by the at least one processor for performing the method of any of claims 1-6 applied to pdf document efficient browsing.
CN202210443657.XA 2022-04-26 2022-04-26 Method and device for efficiently browsing pdf document Active CN115048339B (en)

Priority Applications (1)

Application Number Priority Date Filing Date Title
CN202210443657.XA CN115048339B (en) 2022-04-26 2022-04-26 Method and device for efficiently browsing pdf document

Applications Claiming Priority (1)

Application Number Priority Date Filing Date Title
CN202210443657.XA CN115048339B (en) 2022-04-26 2022-04-26 Method and device for efficiently browsing pdf document

Publications (2)

Publication Number Publication Date
CN115048339A CN115048339A (en) 2022-09-13
CN115048339B true CN115048339B (en) 2023-03-21

Family

ID=83157784

Family Applications (1)

Application Number Title Priority Date Filing Date
CN202210443657.XA Active CN115048339B (en) 2022-04-26 2022-04-26 Method and device for efficiently browsing pdf document

Country Status (1)

Country Link
CN (1) CN115048339B (en)

Citations (1)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN110162619A (en) * 2019-05-27 2019-08-23 上海吉江数据技术有限公司 Online comparison reading system, method and device

Family Cites Families (6)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
US20090282074A1 (en) * 2008-05-07 2009-11-12 Anand Balaji Ramakrishnan Document Creator
CN101739415A (en) * 2008-11-25 2010-06-16 华中师范大学 Browser-oriented webpage labeling system
CN102819531B (en) * 2011-06-10 2016-03-09 北大方正集团有限公司 A kind of cloud reading service system, cloud reading service method and apparatus
CN106294290A (en) * 2015-06-05 2017-01-04 腾讯科技(深圳)有限公司 A kind of method and apparatus showing document
CN112487766A (en) * 2020-12-10 2021-03-12 北京明略软件系统有限公司 Document labeling method and system and computer equipment
CN112989766B (en) * 2021-05-11 2021-08-03 金锐同创(北京)科技股份有限公司 Method and device for processing document labeling information and terminal equipment

Patent Citations (1)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN110162619A (en) * 2019-05-27 2019-08-23 上海吉江数据技术有限公司 Online comparison reading system, method and device

Also Published As

Publication number Publication date
CN115048339A (en) 2022-09-13

Similar Documents

Publication Publication Date Title
US8037046B2 (en) Collecting and presenting temporal-based action information
US6161124A (en) Method and system for preparing and registering homepages, interactive input apparatus for multimedia information, and recording medium including interactive input programs of the multimedia information
WO2019200783A1 (en) Method for data crawling in page containing dynamic image or table, device, terminal, and storage medium
KR101653268B1 (en) Processing method of tagged information and the client-server system for the same
JP5023715B2 (en) Information processing system, information processing apparatus, and program
US20100274714A1 (en) Sharing of presets for visual effects or other computer-implemented effects
US20240020270A1 (en) Efficient similarity detection
US20090293059A1 (en) Automatically connecting items of workflow in a computer program
CN112583918B (en) Intranet and extranet document interaction system, method and storage medium
US20060173755A1 (en) Catalog management apparatus, catalog generation method and catalog retrieval method
WO2004012103A2 (en) Content management system
CN113221535B (en) Information processing method, device, computer equipment and storage medium
CN115048339B (en) Method and device for efficiently browsing pdf document
CN104050207B (en) Information processing unit and file management system
CN113268232B (en) Page skin generation method and device and computer readable storage medium
KR101248186B1 (en) System for generating blog using each content in search result page and method thereof
US20070233818A1 (en) Recording medium storing input/output screen generation program, and method for suppressing an unreasonable screen shift
WO2023236257A1 (en) Document search platform, search method and apparatus, electronic device, and storage medium
CN108536553A (en) Backup data store method for determining position, device, equipment and system
CN112417295A (en) Education cloud information pushing method, storage medium and system
CN111625508A (en) Information processing method and device
JP2004062216A (en) Method and device for data filing, storage medium, and program
CN116382596B (en) Space-time big data storage method and system based on distributed technology
US20230205812A1 (en) Ai-powered raw file management
US20230067956A1 (en) Multiple product identification assistance in an electronic marketplace application

Legal Events

Date Code Title Description
PB01 Publication
PB01 Publication
SE01 Entry into force of request for substantive examination
SE01 Entry into force of request for substantive examination
GR01 Patent grant
GR01 Patent grant