CN112464907A - Document processing system and method - Google Patents

Document processing system and method Download PDF

Info

Publication number
CN112464907A
CN112464907A CN202011496519.5A CN202011496519A CN112464907A CN 112464907 A CN112464907 A CN 112464907A CN 202011496519 A CN202011496519 A CN 202011496519A CN 112464907 A CN112464907 A CN 112464907A
Authority
CN
China
Prior art keywords
document
module
template
information data
adjustment
Prior art date
Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
Pending
Application number
CN202011496519.5A
Other languages
Chinese (zh)
Inventor
吴裕宙
翁校新
邓汉荣
李韵诗
Current Assignee (The listed assignees may be inaccurate. Google has not performed a legal analysis and makes no representation or warranty as to the accuracy of the list.)
Guangdong Power Grid Co Ltd
Dongguan Power Supply Bureau of Guangdong Power Grid Co Ltd
Original Assignee
Guangdong Power Grid Co Ltd
Dongguan Power Supply Bureau of Guangdong Power Grid Co Ltd
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
Filing date
Publication date
Application filed by Guangdong Power Grid Co Ltd, Dongguan Power Supply Bureau of Guangdong Power Grid Co Ltd filed Critical Guangdong Power Grid Co Ltd
Priority to CN202011496519.5A priority Critical patent/CN112464907A/en
Publication of CN112464907A publication Critical patent/CN112464907A/en
Pending legal-status Critical Current

Links

Images

Classifications

    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06VIMAGE OR VIDEO RECOGNITION OR UNDERSTANDING
    • G06V30/00Character recognition; Recognising digital ink; Document-oriented image-based pattern recognition
    • G06V30/40Document-oriented image-based pattern recognition
    • G06V30/41Analysis of document content
    • G06V30/416Extracting the logical structure, e.g. chapters, sections or page numbers; Identifying elements of the document, e.g. authors
    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06FELECTRIC DIGITAL DATA PROCESSING
    • G06F16/00Information retrieval; Database structures therefor; File system structures therefor
    • G06F16/10File systems; File servers
    • G06F16/16File or folder operations, e.g. details of user interfaces specifically adapted to file systems
    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06FELECTRIC DIGITAL DATA PROCESSING
    • G06F18/00Pattern recognition
    • G06F18/20Analysing
    • G06F18/23Clustering techniques
    • G06F18/232Non-hierarchical techniques
    • G06F18/2321Non-hierarchical techniques using statistics or function optimisation, e.g. modelling of probability density functions
    • G06F18/23213Non-hierarchical techniques using statistics or function optimisation, e.g. modelling of probability density functions with fixed number of clusters, e.g. K-means clustering
    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06FELECTRIC DIGITAL DATA PROCESSING
    • G06F18/00Pattern recognition
    • G06F18/20Analysing
    • G06F18/24Classification techniques
    • G06F18/241Classification techniques relating to the classification model, e.g. parametric or non-parametric approaches
    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06FELECTRIC DIGITAL DATA PROCESSING
    • G06F40/00Handling natural language data
    • G06F40/10Text processing
    • G06F40/103Formatting, i.e. changing of presentation of documents
    • G06F40/109Font handling; Temporal or kinetic typography
    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06FELECTRIC DIGITAL DATA PROCESSING
    • G06F40/00Handling natural language data
    • G06F40/10Text processing
    • G06F40/166Editing, e.g. inserting or deleting
    • G06F40/186Templates
    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06VIMAGE OR VIDEO RECOGNITION OR UNDERSTANDING
    • G06V10/00Arrangements for image or video recognition or understanding
    • G06V10/40Extraction of image or video features
    • G06V10/44Local feature extraction by analysis of parts of the pattern, e.g. by detecting edges, contours, loops, corners, strokes or intersections; Connectivity analysis, e.g. of connected components
    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06VIMAGE OR VIDEO RECOGNITION OR UNDERSTANDING
    • G06V10/00Arrangements for image or video recognition or understanding
    • G06V10/40Extraction of image or video features
    • G06V10/56Extraction of image or video features relating to colour
    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06VIMAGE OR VIDEO RECOGNITION OR UNDERSTANDING
    • G06V30/00Character recognition; Recognising digital ink; Document-oriented image-based pattern recognition
    • G06V30/10Character recognition
    • G06V30/14Image acquisition
    • G06V30/146Aligning or centring of the image pick-up or image-field
    • G06V30/1475Inclination or skew detection or correction of characters or of image to be recognised
    • G06V30/1478Inclination or skew detection or correction of characters or of image to be recognised of characters or characters lines
    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06VIMAGE OR VIDEO RECOGNITION OR UNDERSTANDING
    • G06V30/00Character recognition; Recognising digital ink; Document-oriented image-based pattern recognition
    • G06V30/10Character recognition

Landscapes

  • Engineering & Computer Science (AREA)
  • Theoretical Computer Science (AREA)
  • Physics & Mathematics (AREA)
  • General Physics & Mathematics (AREA)
  • Data Mining & Analysis (AREA)
  • Computer Vision & Pattern Recognition (AREA)
  • Artificial Intelligence (AREA)
  • General Engineering & Computer Science (AREA)
  • Multimedia (AREA)
  • Bioinformatics & Computational Biology (AREA)
  • General Health & Medical Sciences (AREA)
  • Evolutionary Biology (AREA)
  • Bioinformatics & Cheminformatics (AREA)
  • Evolutionary Computation (AREA)
  • Life Sciences & Earth Sciences (AREA)
  • Health & Medical Sciences (AREA)
  • Audiology, Speech & Language Pathology (AREA)
  • Computational Linguistics (AREA)
  • Probability & Statistics with Applications (AREA)
  • Human Computer Interaction (AREA)
  • Databases & Information Systems (AREA)
  • Processing Or Creating Images (AREA)
  • Document Processing Apparatus (AREA)

Abstract

The invention discloses a document processing system and a document processing method. The document processing system includes: the device comprises an image acquisition module, an operation module, a processing module, a display module, a template adjusting module and an editing module; the image acquisition module is used for scanning the document to obtain document content; the processing module is used for extracting first document information data of the document content; the display module is used for displaying the first document information data; the editing module is used for obtaining second document information data; the template adjusting module is used for performing style adjustment on the document content according to the classification result to generate a template adjusting document; the operation module is used for filing the template adjustment document. The embodiment of the invention realizes the identification and acquisition of the document and the correction of the document information, thereby improving the document archiving accuracy. Meanwhile, the template style adjustment of the document is realized, and the use convenience of a user is improved.

Description

Document processing system and method
Technical Field
The embodiment of the invention relates to a document processing technology, in particular to a document processing system and a document processing method.
Background
With the development of the Internet, various network data transmission activities are increasing. Electronic documents are various in types, wide in range and large in quantity, the workload of document management is large, and the document processing process is complicated. Therefore, the processing and storage management requirements for electronic documents are also relatively increasing.
The collection and arrangement process of the documents requires a large amount of work, and the classification of the documents is prone to errors in the filing process of the documents.
Disclosure of Invention
The invention provides a document processing system and a document processing method, which realize document identification acquisition and document information correction so as to improve document archiving accuracy. Meanwhile, the template style adjustment of the document is realized, and the use convenience of a user is improved.
In a first aspect, an embodiment of the present invention provides a document processing system, including: the device comprises an image acquisition module, an operation module, a processing module, a display module, a template adjusting module and an editing module;
the image acquisition module is connected with the processing module and used for scanning a document to obtain document content and sending the document content to the processing module;
the processing module is connected with the display module and the editing module, and is used for extracting first document information data of the document content and sending the extracted first document information data to the editing module and the display module; wherein the first document information data comprises a title, a document drawing and a multi-frequency word;
the display module is used for displaying the first document information data;
the editing module is connected with the operation module and used for receiving correction information input by a user, correcting the first document information data according to the correction information to obtain second document information data, and sending the second document information data to the operation module;
the operation module is connected with the template adjusting module and used for classifying the document contents according to the received second document information data and sending the classification result to the template adjusting module;
the template adjusting module is used for performing style adjustment on the document content according to the classification result to generate a template adjusting document and sending the template adjusting document to the operation module; the style adjustment comprises font and layout adjustment;
the operation module is used for filing the template adjustment document.
Optionally, the template adjusting module is further configured to receive a template adjusting instruction input by a user, and perform style adjustment on the document content according to the template adjusting instruction.
Optionally, the document processing system further includes: a storage module; the storage module is connected with the operation module, and the operation module is also used for storing the template adjustment document to the storage module.
Optionally, the operation module further includes a catalog generation unit, where the catalog generation unit is configured to generate a catalog of the template adjustment document, and the catalog includes a primary title, a secondary title, and a tertiary title; the primary title comprises a category of the template adjustment document, the secondary title comprises a title of the template adjustment document, and the tertiary mark comprises keywords of the template adjustment document, wherein the keywords comprise subtitles and multi-frequency words.
Optionally, the operation module further includes a recording unit, and the recording unit is configured to store the directory.
Optionally, the operation module further includes a document calling unit;
the document calling unit is used for calling corresponding contents in the template adjustment document according to the directory contents clicked by the user and sending the corresponding contents to the display module;
the display panel is used for displaying the corresponding content.
Optionally, the document invoking unit is further configured to invoke the corresponding content in the template adjustment document according to a search term input by a user, and send the corresponding content to the display module.
In a second aspect, an embodiment of the present invention provides a document processing method, which is executed by a document processing system, where the document processing system includes an image acquisition module, a processing module, a display module, a template adjustment module, and an editing module; the image acquisition module is connected with the processing module; the processing module is connected with the display module and the editing module; the editing module is connected with the operation module; the operation module is connected with the template adjusting module;
the method comprises the following steps: the image acquisition module scans a document to obtain document content and sends the document content to the processing module;
the processing module extracts first document information data from the document content and sends the extracted first document information data to the editing module and the display module;
the display module displays the first document information data;
the editing module receives correction information input by a user, corrects the first document information data according to the correction information to obtain second document information data, and sends the second document information data to the operation module;
the operation module classifies the document contents according to the received second document information data and sends a classification result to the template adjusting module;
the template adjusting module performs style adjustment on the document content according to the classification result to generate a template adjusting document, and sends the template adjusting document to the operation module;
and the operation module is used for filing the template adjustment document.
Optionally, the template adjusting module receives a template adjusting instruction input by a user, and performs style adjustment on the document content according to the template adjusting instruction.
Optionally, the operation module further includes a catalog generation unit;
the method comprises the following steps: a catalog generation unit generates a catalog of the template adjustment document; the directory comprises a first-level title, a second-level title and a third-level title; the primary title comprises a category of the template adjustment document, the secondary title comprises a title of the template adjustment document, and the tertiary mark comprises keywords of the template adjustment document, wherein the keywords comprise subtitles and multi-frequency words.
The invention provides a document processing system and a document processing method, which realize document identification acquisition and document information correction through an image acquisition module, an operation module, a processing module, a display module, a template adjustment module and an editing module so as to improve document filing accuracy. Meanwhile, the template style adjustment of the document is realized, and the use convenience of a user is improved.
Drawings
FIG. 1 is a schematic structural diagram of a document processing system according to an embodiment of the present invention.
FIG. 2 is a schematic diagram of a document processing system according to an embodiment of the present invention.
FIG. 3 provides a schematic diagram of another document processing system according to an embodiment of the present invention.
Detailed Description
In order to make the objects, technical solutions and advantages of the embodiments of the present invention clearer, the technical solutions in the embodiments of the present invention will be clearly and completely described below with reference to the drawings in the embodiments of the present invention, and it is obvious that the described embodiments are some, but not all, embodiments of the present invention. All other embodiments, which can be derived by a person skilled in the art from the embodiments given herein without making any creative effort, shall fall within the protection scope of the present invention.
Fig. 1 is a schematic structural diagram of a document processing system according to an embodiment of the present invention, and referring to fig. 1, the document processing system includes: the image processing system comprises an image acquisition module 110, an operation module 150, a processing module 120, a display module 130, a template adjustment module 160 and an editing module 140;
the image acquisition module 110 is connected to the processing module 120, and is configured to scan a document to obtain document content, and send the document content to the processing module 120;
the processing module 120 is connected to the display module 130 and the editing module 140, and the processing module 120 is configured to extract first document information data from document content and send the extracted first document information data to the editing module 140 and the display module 130; the first document information data comprises a title, a document drawing and multi-frequency words;
the display module 130 is used for displaying the first document information data;
the editing module 140 is connected to the operation module 150, and is configured to receive correction information input by a user, correct the first document information data according to the correction information to obtain second document information data, and send the second document information data to the operation module 150;
the operation module 150 is connected to the template adjustment module 160, and is configured to classify the document content according to the received second document information data, and send the classification result to the template adjustment module 160;
the template adjusting module 160 is configured to perform style adjustment on the document content according to the classification result to generate a template adjusting document, and send the template adjusting document to the operation module 150; the style adjustment comprises font and format adjustment;
the operation module 150 is used for filing the template adjustment document.
Specifically, the image obtaining module 110 is connected to the processing module 120, and the image obtaining module 110 performs identification scanning on a document in need, where the document may include a paper document and an electronic document. The electronic documents include text documents, image documents, and the like, and can be used in formats such as doc documents, ppt slides, pdf portable documents, jpg pictures, and the like. The document content is obtained after scanning, wherein the document content may include text content and picture content, and the document content is sent to the processing module 120. The processing module 120 is connected with the display module 130 and the editing module 140, and the processing module 120 adopts an image retrieval technology, an image retrieval based on document contents and an image retrieval based on texts; the gray feature extraction, the color feature extraction, the texture feature extraction, the shape feature extraction, the point-based gradient feature extraction and the like of the image are taken as the basis of image retrieval; by preprocessing the language character image, filtering the image, performing binarization processing and correcting characters, the blurred part in the image is removed, and a clear and clean gray-scale image with high contrast ratio is obtained. By performing binarization processing on the image, namely: the character part in the image is '0', the rest part is '1', the extraction of the characters is convenient, and the threshold value T can be obtained by adopting an Otsu method to carry out binarization processing on the image.
Suppose that the original image is s, the threshold value is T, and the binarized image is
s(i,j)=
(1)1,s(i,j)>T;
(2)0,s(i,j)<T;
The obtained picture content is subjected to detection of an inclination angle, and if the inclination occurs, the text image is corrected by a parallelogram method (PCP). The processing module extracts gray level features, color features, texture features, shape features and gradient features based on points of the image; and simultaneously extracting text characteristic information to generate first document information data, wherein the first document information data comprises a title, a document drawing and multi-frequency words. The display module 130 may be a touch display, and may perform human-computer interaction. The display module 130 may display the generated first document information data, and meanwhile, the user may generate second document information data by performing a proofreading and modification on the title, the document drawing, and the multi-frequency word in the first document information data through the editing module 140, and the accuracy of document archiving is improved by performing the proofreading and modification on the first document information data. The calculation module 150 may cluster the information characteristics of the second document information data according to the second document information data, and further analyze the specific cluster set in a centralized manner. Collections of classes, made up of similar objects, are grouped together with the purpose of collecting data for classification on a similar basis. Wherein the partition method in the cluster analysis adopts a k-means cluster analysis algorithm to accept the input quantity as k; the n data objects are then divided into k clusters so that the obtained clusters satisfy: the similarity of objects in the same cluster is higher; while the object similarity in different clusters is smaller. The operation module 150 is connected to the template adjustment module 160, the operation module 150 sends the classification result to the template adjustment module 160, and the module adjustment module 160 may further include a template library, and may perform style adjustment on document content according to the classification result matching templates in the template library, where the style adjustment includes content such as font size, color, picture size, format, and the like. The template adjustment document is generated after the document content style is adjusted, and the template adjustment document is filed by the operation module 150. The user can perform query retrieval through the second document information data or the document contents according to the archived contents.
The embodiment realizes document identification acquisition and document information correction through the image acquisition module, the operation module, the processing module, the display module, the template adjustment module and the editing module, thereby improving the document filing accuracy. Meanwhile, template style adjustment of the document is realized, and the use convenience of a user and the standardization of the document filing are improved. Optionally, the template adjusting module 160 is further configured to receive a template adjusting instruction input by a user, and perform style adjustment on the document content according to the template adjusting instruction.
Specifically, the user may connect to an external device through a human-machine interface, i.e., an input/output interface, in the system or may edit and modify the document through the display module 130 by using the template adjustment module. Such as font type, picture type, editing and modifying. And editing the document style according to the user-defined requirement.
Fig. 2 is a schematic structural diagram of another document processing system according to an embodiment of the present invention, and referring to fig. 2, based on the above embodiment, optionally, the document processing system further includes:
the storage module 210 is connected to the operation module 150, and the operation module 150 is further configured to store the template adjustment document in the storage module 210.
Specifically, the template adjustment document generated by the template adjustment module 160 is stored in the storage module 210 for the convenience of the user to backup, or copy and copy by using an external storage device.
FIG. 3 is a schematic diagram of another document processing system according to an embodiment of the present invention, referring to FIG. 3
Based on the above embodiment, optionally, the operation module 150 further includes a catalog generating unit 310, where the catalog generating unit 310 is configured to generate a catalog of the template adjustment document, and the catalog includes a first-level title, a second-level title, and a third-level title; the first-level title comprises the category of the template adjustment document, the second-level title comprises the title of the template adjustment document, and the third-level mark comprises keywords of the template adjustment document, wherein the keywords comprise subtitles and multi-frequency words.
Specifically, the catalog generation unit 310 generates a primary title according to the filing category of the template adjustment document, the title of the template adjustment document is a secondary title, and the keywords of the template adjustment document include subtitles and multi-frequency words as tertiary titles and document contents. For example, a fiber laser document, the catalog generates three levels of catalogs: the photoelectric technology-multimode laser based on double-clad optical fiber-double-clad optical fiber and laser device is convenient for users to inquire document contents according to catalogues and keywords.
Optionally, the operation module 150 further includes a recording unit 320, and the recording unit 320 is configured to store the directory.
Specifically, the recording unit 320 stores the generated catalog, and the user can correct the catalog to modify the catalog, thereby implementing update and maintenance of the document catalog.
Optionally, the operation module 150 further includes a document calling unit 330;
the document calling unit 330 is configured to call a template to adjust corresponding content in the document according to the directory content clicked by the user, and send the corresponding content to the display module 130;
the display module 130 is used for displaying the corresponding content.
Specifically, the user can quickly find the required text through the generated catalog, and the searching convenience is improved.
Optionally, the document invoking unit is further configured to invoke a template according to a search term input by a user to adjust corresponding content in the document, and send the corresponding content to the display module.
Specifically, the user can input the required search term, and the document directory generated by the embodiment includes the contents of categories, titles, keywords and the like, so that the document calling unit can quickly locate the relevant document according to the search term and the directory, and the search speed is improved.
The embodiment of the invention provides a document processing method, which is executed by a document processing system, wherein the document processing system comprises an image acquisition module, an operation module, a processing module, a display module, a template adjustment module and an editing module; the image acquisition module is connected with the processing module; the processing module is connected with the display module and the editing module; the editing module is connected with the operation module; the operation module is connected with the template adjusting module;
the method comprises the following steps: the image acquisition module scans the document to obtain document content and sends the document content to the processing module;
the processing module extracts first document information data from the document content and sends the extracted first document information data to the editing module and the display module;
the display module displays the first document information data;
the editing module receives correction information input by a user, corrects the first document information data according to the correction information to obtain second document information data, and sends the second document information data to the operation module;
the operation module classifies the document contents according to the received second document information data and sends a classification result to the template adjusting module;
the template adjusting module performs style adjustment on the document content according to the classification result to generate a template adjusting document, and sends the template adjusting document to the operation module;
and the operation module is used for filing the template adjustment document.
Optionally, the method further includes receiving, by the template adjustment module, a template adjustment instruction input by a user, and performing style adjustment on the document content according to the template adjustment instruction.
Optionally, the operation module further includes a catalog generation unit;
the method further comprises the following steps: the catalog generation unit generates a catalog of the template adjustment document; the directory comprises a first-level title, a second-level title and a third-level title; the first-level title comprises the category of the template adjustment document, the second-level title comprises the title of the template adjustment document, and the third-level mark comprises keywords of the template adjustment document, wherein the keywords comprise subtitles and multi-frequency words.
Optionally, the operation module further includes a document calling unit;
the method further comprises the following steps: and the document calling unit calls the template to adjust the corresponding content in the document according to the directory content clicked by the user.
The document processing method provided by the embodiment of the invention and the document processing system provided by any embodiment of the invention belong to the same inventive concept, have corresponding beneficial effects, and the detailed technical details which are not detailed in the embodiment of the invention are shown in the document processing system provided by any embodiment of the invention.
Finally, it should be noted that: the above examples are only intended to illustrate the technical solution of the present invention, but not to limit it; although the present invention has been described in detail with reference to the foregoing embodiments, it will be understood by those of ordinary skill in the art that: the technical solutions described in the foregoing embodiments may still be modified, or some technical features may be equivalently replaced; and such modifications or substitutions do not depart from the spirit and scope of the corresponding technical solutions of the embodiments of the present invention.

Claims (10)

1. A document processing system, comprising: the device comprises an image acquisition module, an operation module, a processing module, a display module, a template adjusting module and an editing module;
the image acquisition module is connected with the processing module and used for scanning a document to obtain document content and sending the document content to the processing module;
the processing module is connected with the display module and the editing module, and is used for extracting first document information data of the document content and sending the extracted first document information data to the editing module and the display module; wherein the first document information data comprises a title, a document drawing and a multi-frequency word;
the display module is used for displaying the first document information data;
the editing module is connected with the operation module and used for receiving correction information input by a user, correcting the first document information data according to the correction information to obtain second document information data, and sending the second document information data to the operation module;
the operation module is connected with the template adjusting module and used for classifying the document contents according to the received second document information data and sending the classification result to the template adjusting module;
the template adjusting module is used for performing style adjustment on the document content according to the classification result to generate a template adjusting document and sending the template adjusting document to the operation module; the style adjustment comprises font and layout adjustment;
the operation module is used for filing the template adjustment document.
2. The document processing system of claim 1,
the template adjusting module is also used for receiving a template adjusting instruction input by a user and adjusting the style of the document content according to the template adjusting instruction.
3. The document processing system of claim 1, further comprising: a storage module; the storage module is connected with the operation module, and the operation module is also used for storing the template adjustment document to the storage module.
4. The document processing system of claim 1, wherein:
the operation module further comprises a catalog generation unit, wherein the catalog generation unit is used for generating a catalog of the template adjustment document, and the catalog comprises a first-level title, a second-level title and a third-level title; the primary title comprises a category of the template adjustment document, the secondary title comprises a title of the template adjustment document, and the tertiary mark comprises keywords of the template adjustment document, wherein the keywords comprise subtitles and multi-frequency words.
5. The document processing system of claim 4, wherein:
the operation module further comprises a recording unit, and the recording unit is used for storing the directory.
6. The document processing system of claim 4, wherein:
the operation module also comprises a document calling unit;
the document calling unit is used for calling corresponding contents in the template adjustment document according to the directory contents clicked by the user and sending the corresponding contents to the display module;
the display panel is used for displaying the corresponding content.
7. The document processing system of claim 6, wherein:
the document calling unit is also used for calling the corresponding content in the template adjusting document according to the search words input by the user and sending the corresponding content to the display module.
8. A document processing method executed by a document processing system, characterized by: the document processing system comprises an image acquisition module, an operation module, a processing module, a display module, a template adjusting module and an editing module; the image acquisition module is connected with the processing module; the processing module is connected with the display module and the editing module; the editing module is connected with the operation module; the operation module is connected with the template adjusting module;
the method comprises the following steps:
the image acquisition module scans a document to obtain document content and sends the document content to the processing module;
the processing module extracts first document information data from the document content and sends the extracted first document information data to the editing module and the display module;
the display module displays the first document information data;
the editing module receives correction information input by a user, corrects the first document information data according to the correction information to obtain second document information data, and sends the second document information data to the operation module;
the operation module classifies the document contents according to the received second document information data and sends a classification result to the template adjusting module;
the template adjusting module performs style adjustment on the document content according to the classification result to generate a template adjusting document, and sends the template adjusting document to the operation module;
and the operation module is used for filing the template adjustment document.
9. The document processing method according to claim 8, further comprising:
and the template adjusting module receives a template adjusting instruction input by a user and performs style adjustment on the document content according to the template adjusting instruction.
10. The document processing method according to claim 8, wherein: the operation module also comprises a catalog generation unit;
the method further comprises the following steps: a catalog generation unit generates a catalog of the template adjustment document; the directory comprises a first-level title, a second-level title and a third-level title; the primary title comprises a category of the template adjustment document, the secondary title comprises a title of the template adjustment document, and the tertiary mark comprises keywords of the template adjustment document, wherein the keywords comprise subtitles and multi-frequency words.
CN202011496519.5A 2020-12-17 2020-12-17 Document processing system and method Pending CN112464907A (en)

Priority Applications (1)

Application Number Priority Date Filing Date Title
CN202011496519.5A CN112464907A (en) 2020-12-17 2020-12-17 Document processing system and method

Applications Claiming Priority (1)

Application Number Priority Date Filing Date Title
CN202011496519.5A CN112464907A (en) 2020-12-17 2020-12-17 Document processing system and method

Publications (1)

Publication Number Publication Date
CN112464907A true CN112464907A (en) 2021-03-09

Family

ID=74802910

Family Applications (1)

Application Number Title Priority Date Filing Date
CN202011496519.5A Pending CN112464907A (en) 2020-12-17 2020-12-17 Document processing system and method

Country Status (1)

Country Link
CN (1) CN112464907A (en)

Cited By (2)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN113792659A (en) * 2021-09-15 2021-12-14 上海金仕达软件科技有限公司 Document identification method and device and electronic equipment
CN117421487A (en) * 2023-12-19 2024-01-19 西安康奈网络科技有限公司 Multiple network information screening management system based on artificial intelligence

Citations (6)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN101046808A (en) * 2006-03-31 2007-10-03 株式会社理光 File process system and method
CN110135264A (en) * 2019-04-16 2019-08-16 深圳壹账通智能科技有限公司 Data entry method, device, computer equipment and storage medium
CN110889309A (en) * 2018-09-07 2020-03-17 上海怀若智能科技有限公司 Financial document classification management system and method
CN110956016A (en) * 2018-09-25 2020-04-03 珠海金山办公软件有限公司 Document content format adjusting method and device and electronic equipment
CN111079511A (en) * 2019-10-25 2020-04-28 湖北富瑞尔科技有限公司 Document automatic classification and optical character recognition method and system based on deep learning
CN111126952A (en) * 2019-12-16 2020-05-08 深圳供电局有限公司 Electronic file filing processing system and method

Patent Citations (6)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN101046808A (en) * 2006-03-31 2007-10-03 株式会社理光 File process system and method
CN110889309A (en) * 2018-09-07 2020-03-17 上海怀若智能科技有限公司 Financial document classification management system and method
CN110956016A (en) * 2018-09-25 2020-04-03 珠海金山办公软件有限公司 Document content format adjusting method and device and electronic equipment
CN110135264A (en) * 2019-04-16 2019-08-16 深圳壹账通智能科技有限公司 Data entry method, device, computer equipment and storage medium
CN111079511A (en) * 2019-10-25 2020-04-28 湖北富瑞尔科技有限公司 Document automatic classification and optical character recognition method and system based on deep learning
CN111126952A (en) * 2019-12-16 2020-05-08 深圳供电局有限公司 Electronic file filing processing system and method

Cited By (4)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN113792659A (en) * 2021-09-15 2021-12-14 上海金仕达软件科技有限公司 Document identification method and device and electronic equipment
CN113792659B (en) * 2021-09-15 2024-04-05 上海金仕达软件科技股份有限公司 Document identification method and device and electronic equipment
CN117421487A (en) * 2023-12-19 2024-01-19 西安康奈网络科技有限公司 Multiple network information screening management system based on artificial intelligence
CN117421487B (en) * 2023-12-19 2024-03-08 西安康奈网络科技有限公司 Multiple network information screening management system based on artificial intelligence

Similar Documents

Publication Publication Date Title
EP0539106B1 (en) Electronic information delivery system
US6243501B1 (en) Adaptive recognition of documents using layout attributes
JP3289968B2 (en) Apparatus and method for electronic document processing
US8532384B2 (en) Method of retrieving information from a digital image
JP5095534B2 (en) System and method for generating a junction
US6178417B1 (en) Method and means of matching documents based on text genre
US6621941B1 (en) System of indexing a two dimensional pattern in a document drawing
US6321232B1 (en) Method for creating a geometric hash tree in a document processing system
CN108197119A (en) The archives of paper quality digitizing solution of knowledge based collection of illustrative plates
CN114021543B (en) Document comparison analysis method and system based on table structure analysis
CN113780229A (en) Text recognition method and device
CN112464907A (en) Document processing system and method
CN115828874A (en) Industry table digital processing method based on image recognition technology
CN109271616B (en) Intelligent extraction method based on bibliographic characteristic value of standard literature
CN111860524A (en) Intelligent classification device and method for digital files
CN115830620B (en) Archive text data processing method and system based on OCR
CN115774805B (en) File intelligent query method and system based on digital processing
CN1336604A (en) Method and system of digitizing ancient Chinese books and automatizing the content search
CN112036330A (en) Text recognition method, text recognition device and readable storage medium
CN117095419A (en) PDF document data processing and information extracting device and method
CN117076455A (en) Intelligent identification-based policy structured storage method, medium and system
CN101872344A (en) Control method for image scanning
CN116343210A (en) File digitization management method and device
CN107491814B (en) Construction method of process case layered knowledge model for knowledge push
CN113806368A (en) System and method for identifying document and automatically establishing database

Legal Events

Date Code Title Description
PB01 Publication
PB01 Publication
SE01 Entry into force of request for substantive examination
SE01 Entry into force of request for substantive examination
RJ01 Rejection of invention patent application after publication
RJ01 Rejection of invention patent application after publication

Application publication date: 20210309