CN113807072A - Method and system for quickly identifying difference before and after revision of online approval document - Google Patents
Method and system for quickly identifying difference before and after revision of online approval document Download PDFInfo
- Publication number
- CN113807072A CN113807072A CN202010535450.6A CN202010535450A CN113807072A CN 113807072 A CN113807072 A CN 113807072A CN 202010535450 A CN202010535450 A CN 202010535450A CN 113807072 A CN113807072 A CN 113807072A
- Authority
- CN
- China
- Prior art keywords
- document
- difference
- revision
- version document
- previous version
- Prior art date
- Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
- Pending
Links
Images
Classifications
-
- G—PHYSICS
- G06—COMPUTING; CALCULATING OR COUNTING
- G06F—ELECTRIC DIGITAL DATA PROCESSING
- G06F40/00—Handling natural language data
- G06F40/10—Text processing
- G06F40/194—Calculation of difference between files
-
- G—PHYSICS
- G06—COMPUTING; CALCULATING OR COUNTING
- G06F—ELECTRIC DIGITAL DATA PROCESSING
- G06F40/00—Handling natural language data
- G06F40/10—Text processing
- G06F40/166—Editing, e.g. inserting or deleting
- G06F40/169—Annotation, e.g. comment data or footnotes
-
- G—PHYSICS
- G06—COMPUTING; CALCULATING OR COUNTING
- G06F—ELECTRIC DIGITAL DATA PROCESSING
- G06F40/00—Handling natural language data
- G06F40/10—Text processing
- G06F40/197—Version control
-
- G—PHYSICS
- G06—COMPUTING; CALCULATING OR COUNTING
- G06F—ELECTRIC DIGITAL DATA PROCESSING
- G06F40/00—Handling natural language data
- G06F40/20—Natural language analysis
- G06F40/279—Recognition of textual entities
Landscapes
- Engineering & Computer Science (AREA)
- Theoretical Computer Science (AREA)
- Health & Medical Sciences (AREA)
- Artificial Intelligence (AREA)
- Audiology, Speech & Language Pathology (AREA)
- Computational Linguistics (AREA)
- General Health & Medical Sciences (AREA)
- Physics & Mathematics (AREA)
- General Engineering & Computer Science (AREA)
- General Physics & Mathematics (AREA)
- Document Processing Apparatus (AREA)
- Information Retrieval, Db Structures And Fs Structures Therefor (AREA)
Abstract
The invention discloses a method for quickly identifying differences before and after revision of an online approval document, which comprises the following steps: step S1, the revision monitoring unit monitors the revision state of the document in real time, when the user performs revision operation on the document, the revision monitoring unit automatically generates a copy according to the document and takes the copy as a prior-version document, and after the user performs revision operation, the revised document is taken as a current-version document; step S2, when the user uploads the current version document to the system, the difference calculating unit calculates the difference rate between the current version document and the previous version document; step S3, when the user calls the current version document again, the system displays the difference rate; in step S4, a difference marking unit is used to mark the difference between the current version document and the previous version document. The method can quickly, efficiently and accurately monitor and identify the revision difference of the online text, and well meets the requirements of efficient and high-precision document inspection.
Description
Technical Field
The invention relates to a document revision identification method, in particular to a method and a system for quickly identifying differences before and after revision of an online approval document.
Background
In the daily work process, when enterprises and public institutions handle various services on line, the traditional method usually involves the approval work such as manual writing of materials, and the like.
Disclosure of Invention
The invention aims to solve the technical problem of providing a quick identification method and a quick identification system which can quickly, efficiently and accurately monitor and identify revision differences of online texts so as to meet the requirements of efficient and high-precision document inspection, aiming at the defects of the prior art.
In order to solve the technical problems, the invention adopts the following technical scheme.
A method for quickly identifying differences before and after revision of an online approval document is realized based on a system, wherein the system comprises a revision monitoring unit, a difference calculating unit and a difference marking unit, and the method comprises the following steps: step S1, when the user carries out the task of on-line examination and approval of the document through the system, the revision monitoring unit monitors the revision state of the document in real time, when the user carries out revision operation on the document, the revision monitoring unit automatically generates a copy according to the document and takes the copy as a previous version document, and after the user carries out revision operation, the revised document is taken as a current version document; step S2, when the user uploads the current version document to the system, the difference calculating unit obtains the previous version document generated in step S1, and calculates the difference rate between the current version document and the previous version document; step S3, when the user calls the current version document again, the system displays the difference rate; step S4, when the user enters a difference analysis instruction into the system, the difference marking unit is used to mark the part of the current version document that has a difference from the previous version document.
Preferably, in the step S1, when the revision monitoring unit generates a previous version document, a version number is set for the previous version document according to a time stamp.
Preferably, in the step S2, after the difference calculating unit obtains the previous version document, the previous version document is converted into a plain text intermediate format document, and then the difference rate is calculated.
Preferably, in the step S2, the difference calculation unit converts the previous version document into a plain text intermediate format document using an iText, PdfBox, or POI tool.
Preferably, in step S2, the difference calculating unit calculates the similarity between the current version document and the previous version document based on an edit distance algorithm in a text similarity algorithm, and obtains the difference rate according to the similarity.
Preferably, in step S4, the process of indicating the portion with the difference includes: step S40, recording the current version document as a first text file, and recording a previous version document corresponding to the current version document as a second text file according to the version number; step S41, comparing the difference between the first text file and the second text file line by line to generate a document difference table; step S42, marking and displaying the difference between the first text file and the second text file.
Preferably, in step S41, the difference between the first text file and the second text file is compared line by using a preset diff-utils tool, and a document difference table is generated according to a result returned by the diff-utils tool.
Preferably, in step S42, the difference portion between the first text file and the second text file, which includes new lines, deleted lines, and in-line character modifications, is marked with different colors.
Preferably, in step S42, the difference between the first text file and the second text file is displayed by using an HTML format file.
A quick difference identification system before and after revision of an online approval document is characterized by comprising a revision monitoring unit, a difference calculating unit and a difference marking unit, wherein: the revision monitoring unit is used for monitoring the revision state of the document in real time when a user performs an online document approval task through the system, automatically generating a copy according to the document and using the copy as a previous version document when the user performs revision operation on the document, and using the modified document as a current version document after the user performs revision operation; the difference calculating unit is used for acquiring a previous version document generated by the revision monitoring unit when a user uploads the current version document to a system, and calculating the difference rate between the current version document and the previous version document; the difference marking unit is used for marking the part of the current version document and the previous version document which have the difference when a user inputs a difference analysis instruction into the system.
The invention discloses a method for quickly identifying differences before and after revision of an online approval document, which comprises the steps of utilizing a revision monitoring unit to monitor the revision operation of the document in real time, automatically generating a copy of the document as a previous version document when the document is monitored to be revised, taking the revised document as a current version document after a user executes the revision operation, automatically generating the difference rate of the current version document and the previous version document when the user uploads the current version document, displaying the difference rate by a system when the user uses the current version document again, and obviously marking character parts with differences according to an instruction input by the user, thereby realizing the automatic identification and display of the revision position of the document The online text revision difference is accurately monitored and identified, and the high-efficiency and high-precision document inspection requirement is well met.
Drawings
FIG. 1 is a flow chart of a method for rapidly identifying differences before and after revision of an online approval document according to the present invention;
FIG. 2 is a block diagram of the system for rapidly identifying differences before and after revision of an online approval document according to the present invention.
Detailed Description
The invention is described in more detail below with reference to the figures and examples.
The invention discloses a method for quickly identifying differences before and after revision of an online approval document, which is realized based on a system as shown in a combined figure 1 and a figure 2, wherein the system comprises a revision monitoring unit 1, a difference calculating unit 2 and a difference marking unit 3, and the method comprises the following steps:
step S1, when the user carries out the task of on-line examining and approving the document through the system, the revision monitoring unit 1 monitors the revision state of the document in real time, when the user carries out revision operation on the document, the revision monitoring unit 1 automatically generates a copy according to the document and takes the copy as a previous version document, and after the user carries out revision operation, the revised document is taken as a current version document;
step S2, when the user uploads the current version document to the system, the difference calculating unit 2 obtains the previous version document generated in step S1, and calculates the difference rate between the current version document and the previous version document;
step S3, when the user calls the current version document again, the system displays the difference rate;
step S4, when the user enters a difference analysis instruction into the system, the difference marking unit 3 is used to mark the part of the current version document that has a difference from the previous version document.
In the method, the revision monitoring unit 1 is used for monitoring the document revision operation in real time, when the document is monitored to be revised, a copy of the document is automatically generated and used as a previous version document, after the revision operation is executed by a user, the revised document is used as a current version document, when the user uploads the current version document, the difference rate between the current version document and the previous version document is automatically generated, when the user uses the current version document again, the system displays the difference rate, and simultaneously, the character part with difference is obviously marked according to an instruction input by the user, so that the automatic identification and display of the revision position of the document are realized, compared with the prior art, the invention can quickly, efficiently and accurately monitor and identify the revision difference of online texts, the requirement of high-efficiency and high-precision document inspection is well met.
In order to distinguish between documents of different versions and documents of previous versions, in this embodiment, in step S1, when the revision monitoring unit 1 generates a document of a previous version, a version number is set for the document of the previous version according to a timestamp.
The mode of setting the version number according to the timestamp is beneficial to quickly numbering the documents of the prior versions generated at different time points, and is also beneficial to quickly finding the document of the prior version closest to the generation time of the current document version in the subsequent processing process, so that the accuracy and the reliability of the identification process are improved.
In a preferable mode, in step S2, after the difference calculation unit 2 obtains the document of the previous version, the document of the previous version is converted into a plain text intermediate format document, and then the difference rate is calculated.
Further, in the step S2, the difference calculation unit 2 converts the previous version document into a plain-text intermediate format document using an iText, PdfBox, or POI tool.
In order to calculate the difference rate more accurately, in step S2 of this embodiment, the difference calculating unit 2 calculates the similarity between the current version document and the previous version document based on an edit distance algorithm in a text similarity algorithm, and obtains the difference rate according to the similarity.
In a preferable mode, the step S4, the process of indicating the portion having the difference includes:
step S40, recording the current version document as a first text file, and recording a previous version document corresponding to the current version document as a second text file according to the version number;
step S41, comparing the difference between the first text file and the second text file line by line to generate a document difference table;
step S42, marking and displaying the difference between the first text file and the second text file.
The embodiment preferably adopts a mode of generating the difference table firstly and then marking the revised position specially according to the difference, which is helpful for the user to intuitively master the revised position of the front and back text documents.
In order to generate the document difference table more accurately, in step S41 of this embodiment, the difference between the first text file and the second text file is compared line by using a preset diff-utils tool, and the document difference table is generated according to a result returned by the diff-utils tool.
Regarding the indication manner, in the step S42 of this embodiment, the difference portion including the new line addition, the new line deletion, and the intra-line character modification in the first text file and the second text file is indicated by using different colors.
Further, in step S42, the difference between the first text file and the second text file is displayed by using an HTML format file.
In order to better describe the technical solution of the present invention, the present invention further discloses a system for rapidly identifying differences before and after revision of an online approval document, please refer to fig. 2, wherein the system comprises a revision monitoring unit 1, a difference calculating unit 2 and a difference marking unit 3, wherein:
the revision monitoring unit 1 is used for monitoring the revision state of the document in real time when a user performs an online document approval task through the system, automatically generating a copy according to the document and using the copy as a previous version document when the user performs revision operation on the document, and using the modified document as a current version document after the user performs revision operation;
the difference calculating unit 2 is used for acquiring the previous version document generated by the revision monitoring unit 1 when the user uploads the current version document to the system, and calculating the difference rate between the current version document and the previous version document;
the difference marking unit 3 is configured to mark a part of the current version document that has a difference from the previous version document when a user inputs a difference analysis instruction into the system.
The invention discloses a method and a system for quickly identifying differences before and after revision of an online approval document, which can refer to the following embodiments in the practical application process:
step 4, for the documents with the displayed differences, the approver can further check the specific differences between the document of the current version and the document of the previous or selected version;
and 5, calculating and displaying document differences in real time when the system receives the user difference analysis instruction.
The specific processing method of the step 5 comprises the following steps: the current document is read as "text file 1" using document processing techniques (e.g., using PDFBOX, iText, POI). Reading the last version as a text 2, comparing the difference of the text file 1 or the text file 2 line by adopting a diff-files tool of google, returning a result according to the diff-files, generating a document difference table, displaying the difference of two documents at the position by adopting an HTML format, and marking the characters which are newly added, deleted and modified in line by different colors.
Compared with the prior art, the method and the system for rapidly identifying the difference before and after revision of the online approval document have the advantages that the historical version can be automatically generated before the content of the document changes, the similarity of the document is automatically calculated when the system monitors that the document is updated so as to be preliminarily judged by an approver, and on the basis, the specific difference before and after revision of the document is rapidly output by a text difference comparison technology so as to be clearly checked and judged in detail by the approver.
The above description is only for the purpose of illustrating the preferred embodiments of the present invention and is not to be construed as limiting the invention, and any modifications, equivalents or improvements made within the technical scope of the present invention should be included in the scope of the present invention.
Claims (10)
1. A method for quickly identifying differences before and after revision of an online approval document is characterized in that the method is realized based on a system, the system comprises a revision monitoring unit (1), a difference calculating unit (2) and a difference marking unit (3), and the method comprises the following steps:
step S1, when the user carries out the task of on-line examination and approval of the document through the system, the revision monitoring unit (1) monitors the revision state of the document in real time, when the user carries out revision operation on the document, the revision monitoring unit (1) automatically generates a copy according to the document and takes the copy as a previous version document, and after the user carries out revision operation, the revised document is taken as a current version document;
step S2, when the user uploads the current version document to the system, the difference calculating unit (2) acquires the previous version document generated in the step S1, and calculates the difference rate of the current version document and the previous version document;
step S3, when the user calls the current version document again, the system displays the difference rate;
and step S4, when the user inputs the difference analysis instruction into the system, the difference marking unit (3) is used for marking the part of the current version document and the previous version document which have difference.
2. The method for rapidly recognizing the difference before and after revision of an on-line approval document according to claim 1, wherein in the step S1, when the revision monitoring unit (1) generates a previous version document, a version number is set for the previous version document according to a time stamp.
3. The method for rapidly recognizing difference before and after revision of an on-line approval document according to claim 1, wherein in the step S2, after the difference calculation unit (2) acquires the previous version document, the previous version document is first converted into a plain text intermediate format document, and then the difference rate is calculated.
4. The method for rapidly recognizing the difference before and after revision of the on-line approval document according to claim 3, wherein in the step S2, the difference calculation unit (2) converts the previous version document into a plain text intermediate format document using an iText, PdfBox or POI tool.
5. The method for rapidly identifying differences before and after revision of an online approval document according to claim 1, wherein in step S2, the difference calculation unit (2) calculates the similarity between the current version document and the previous version document based on an edit distance algorithm in a text similarity algorithm, and obtains the difference rate from the similarity.
6. The method for rapidly identifying differences before and after revision of an online approval document according to claim 1, wherein the step S4 of marking the parts having the differences comprises:
step S40, recording the current version document as a first text file, and recording a previous version document corresponding to the current version document as a second text file according to the version number;
step S41, comparing the difference between the first text file and the second text file line by line to generate a document difference table;
step S42, marking and displaying the difference between the first text file and the second text file.
7. The method for rapidly identifying the difference before and after revision of the online approved document according to claim 6, wherein in step S41, the difference between the first text file and the second text file is compared line by using a preset diff-utils tool, and a document difference table is generated according to the result returned by the diff-utils tool.
8. The method for rapidly identifying the difference between the revised before and after the online approval document according to claim 7, wherein in step S42, the difference between the first text file and the second text file containing the new line addition, the new line deletion and the intra-line character modification is marked by different colors.
9. The method for rapidly identifying the difference between the revised before and after approval document according to claim 8, wherein the difference between the first text file and the second text file is displayed by using an HTML format file in step S42.
10. A quick difference identification system before and after revision of an online approval document is characterized in that the system comprises a revision monitoring unit (1), a difference calculating unit (2) and a difference marking unit (3), wherein:
the revision monitoring unit (1) is used for monitoring the revision state of the document in real time when a user performs an online document examination and approval task through the system, automatically generating a copy according to the document and using the copy as a previous version document when the user performs revision operation on the document, and using the modified document as a current version document after the user performs revision operation;
the difference calculating unit (2) is used for acquiring a previous version document generated by the revision monitoring unit (1) when a user uploads the current version document to a system, and calculating the difference rate of the current version document and the previous version document;
the difference marking unit (3) is used for marking the part of the current version document which has the difference with the previous version document when a user inputs a difference analysis instruction into the system.
Priority Applications (1)
Application Number | Priority Date | Filing Date | Title |
---|---|---|---|
CN202010535450.6A CN113807072A (en) | 2020-06-12 | 2020-06-12 | Method and system for quickly identifying difference before and after revision of online approval document |
Applications Claiming Priority (1)
Application Number | Priority Date | Filing Date | Title |
---|---|---|---|
CN202010535450.6A CN113807072A (en) | 2020-06-12 | 2020-06-12 | Method and system for quickly identifying difference before and after revision of online approval document |
Publications (1)
Publication Number | Publication Date |
---|---|
CN113807072A true CN113807072A (en) | 2021-12-17 |
Family
ID=78892124
Family Applications (1)
Application Number | Title | Priority Date | Filing Date |
---|---|---|---|
CN202010535450.6A Pending CN113807072A (en) | 2020-06-12 | 2020-06-12 | Method and system for quickly identifying difference before and after revision of online approval document |
Country Status (1)
Country | Link |
---|---|
CN (1) | CN113807072A (en) |
Citations (7)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
US20020002567A1 (en) * | 2000-06-30 | 2002-01-03 | Yukie Kanie | Method and system for managing documents |
CN105589838A (en) * | 2015-12-24 | 2016-05-18 | 中国电子科技集团公司第三十三研究所 | Electronic official document trace reserving method based on file comparison |
CN106202136A (en) * | 2014-12-05 | 2016-12-07 | 成都彩程软件设计有限公司 | Document management method |
CN107977342A (en) * | 2016-10-25 | 2018-05-01 | 阿里巴巴集团控股有限公司 | A kind of document control methods and device |
CN108733456A (en) * | 2017-04-18 | 2018-11-02 | 北大方正集团有限公司 | A kind of revision method and system based on online editing application |
CN109740124A (en) * | 2018-12-25 | 2019-05-10 | 东软集团股份有限公司 | Difference output method, device, storage medium and the electronic equipment of document comparison |
CN110991163A (en) * | 2019-11-29 | 2020-04-10 | 达而观信息科技(上海)有限公司 | Document comparison analysis method and device, electronic equipment and storage medium |
-
2020
- 2020-06-12 CN CN202010535450.6A patent/CN113807072A/en active Pending
Patent Citations (7)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
US20020002567A1 (en) * | 2000-06-30 | 2002-01-03 | Yukie Kanie | Method and system for managing documents |
CN106202136A (en) * | 2014-12-05 | 2016-12-07 | 成都彩程软件设计有限公司 | Document management method |
CN105589838A (en) * | 2015-12-24 | 2016-05-18 | 中国电子科技集团公司第三十三研究所 | Electronic official document trace reserving method based on file comparison |
CN107977342A (en) * | 2016-10-25 | 2018-05-01 | 阿里巴巴集团控股有限公司 | A kind of document control methods and device |
CN108733456A (en) * | 2017-04-18 | 2018-11-02 | 北大方正集团有限公司 | A kind of revision method and system based on online editing application |
CN109740124A (en) * | 2018-12-25 | 2019-05-10 | 东软集团股份有限公司 | Difference output method, device, storage medium and the electronic equipment of document comparison |
CN110991163A (en) * | 2019-11-29 | 2020-04-10 | 达而观信息科技(上海)有限公司 | Document comparison analysis method and device, electronic equipment and storage medium |
Similar Documents
Publication | Publication Date | Title |
---|---|---|
WO2022027943A1 (en) | Water pollution tracing system and method based on pollution factor source apportionment | |
JP2020201772A (en) | Attitude analysis program and attitude analyzer | |
CN108334484B (en) | Data entry method and device | |
CN112579524B (en) | Method and device for conveniently tracing circuit board processing information, electronic equipment and storage medium | |
CN109933803B (en) | Idiom information display method, idiom information display device, electronic equipment and storage medium | |
CN110931097A (en) | Processing and analyzing system for inspection report | |
CN113823418A (en) | Epidemiology traceability system and method based on GIS geographic information technology | |
CN115859935A (en) | Data analysis report template generation system and method based on index library | |
CN113807072A (en) | Method and system for quickly identifying difference before and after revision of online approval document | |
CN113902402A (en) | Document auxiliary filling method, system, storage medium and device based on AR technology | |
CN110751994B (en) | Differential display method and system for medical record template | |
WO2019193923A1 (en) | Image processing device, image processing method and recording medium | |
CN115423421A (en) | Method and device for automatically auditing process of inspection report, electronic equipment and medium | |
CN115587098A (en) | Method and system for intelligently identifying chart data | |
CN115097974A (en) | Intelligent auxiliary consultation system and method for BIM (building information modeling) | |
CN111783211B (en) | Automatic generation method and generation system for laminated plan of completion monomer building | |
CN110751995B (en) | Electronic medical record defect labeling and scoring method | |
JPH10111876A (en) | Information retrieval device | |
CN112598503A (en) | OCR recognition system and method based on credit investigation recognition | |
CN113887758B (en) | Visual inspection system and method for nuclear power station equipment | |
US8595706B2 (en) | Control chart technique for analyzing software engineering activities affected by intrinsic process variation and statistical fluctuations | |
CN113326206B (en) | Test method, apparatus, storage medium and program product for data processing system | |
JP2020108598A (en) | Identification system | |
KR102273398B1 (en) | Apparatus for processing data and method thereof | |
CN112541085B (en) | Method for structuring questionnaire, apparatus for structuring questionnaire, and storage medium |
Legal Events
Date | Code | Title | Description |
---|---|---|---|
PB01 | Publication | ||
PB01 | Publication | ||
SE01 | Entry into force of request for substantive examination | ||
SE01 | Entry into force of request for substantive examination | ||
RJ01 | Rejection of invention patent application after publication | ||
RJ01 | Rejection of invention patent application after publication |
Application publication date: 20211217 |