CN115731559A - Electronic file generation management method and device and computer equipment - Google Patents

Electronic file generation management method and device and computer equipment Download PDF

Info

Publication number
CN115731559A
CN115731559A CN202211090553.1A CN202211090553A CN115731559A CN 115731559 A CN115731559 A CN 115731559A CN 202211090553 A CN202211090553 A CN 202211090553A CN 115731559 A CN115731559 A CN 115731559A
Authority
CN
China
Prior art keywords
file
electronic
double
cataloguing
text
Prior art date
Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
Pending
Application number
CN202211090553.1A
Other languages
Chinese (zh)
Inventor
王荣刚
Current Assignee (The listed assignees may be inaccurate. Google has not performed a legal analysis and makes no representation or warranty as to the accuracy of the list.)
Beijing Hanwang Yingyan Technology Co ltd
Original Assignee
Beijing Hanwang Yingyan Technology Co ltd
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
Filing date
Publication date
Application filed by Beijing Hanwang Yingyan Technology Co ltd filed Critical Beijing Hanwang Yingyan Technology Co ltd
Priority to CN202211090553.1A priority Critical patent/CN115731559A/en
Publication of CN115731559A publication Critical patent/CN115731559A/en
Pending legal-status Critical Current

Links

Images

Classifications

    • YGENERAL TAGGING OF NEW TECHNOLOGICAL DEVELOPMENTS; GENERAL TAGGING OF CROSS-SECTIONAL TECHNOLOGIES SPANNING OVER SEVERAL SECTIONS OF THE IPC; TECHNICAL SUBJECTS COVERED BY FORMER USPC CROSS-REFERENCE ART COLLECTIONS [XRACs] AND DIGESTS
    • Y02TECHNOLOGIES OR APPLICATIONS FOR MITIGATION OR ADAPTATION AGAINST CLIMATE CHANGE
    • Y02PCLIMATE CHANGE MITIGATION TECHNOLOGIES IN THE PRODUCTION OR PROCESSING OF GOODS
    • Y02P90/00Enabling technologies with a potential contribution to greenhouse gas [GHG] emissions mitigation
    • Y02P90/30Computing systems specially adapted for manufacturing

Landscapes

  • Information Retrieval, Db Structures And Fs Structures Therefor (AREA)

Abstract

The application discloses a method, a device and computer equipment for generating and managing electronic files, wherein the method comprises the following steps: receiving a paper file, and sending the paper file to a processing workshop; carrying out image processing on the electronic picture, and carrying out OCR character recognition on the processed electronic picture to obtain a JSON file and a double-layer PDF file of corresponding text contents; cataloguing the double-layer PDF file to obtain a target category; performing cataloging on the double-layer PDF file through a natural language processing technology, and hanging the double-layer PDF file into an electronic file management system; performing format reduction on the JSON file of the text content, and combining cataloguing information to obtain a text volume; and extracting elements and/or analyzing cases of the text file to obtain a data file. The application provides a set of complete workflow that electronic file generated, has realized the paperless of file high-efficiently to save the paper material through intelligent cabinet, make the paper material no longer circulate, ensure the safety and the completeness of paper material in the at utmost.

Description

Electronic file generation management method and device and computer equipment
Technical Field
The present application relates to the field of file technologies, and in particular, to a method, an apparatus, and a computer device for generating and managing electronic files.
Background
The synchronous work of litigation files of the national court along with cases leads to further increase of the internal organization pressure of the court. The coexistence of the paper files and the electronic files causes the disordered use of the carrier in the transition stage, and the utilization ratio of the paper files is still very high due to the historical working habits of judges, so that the integral utilization efficiency of the electronic files cannot reach the expectation all the time, and the paper files lack unified management in turn to cause the events of lost files and missing files. The core of the synchronous generation of the electronic file with the case is the electronic file, but the electronic file still needs to be managed due to the characteristics of the data carrier of the paper file.
The electronic examination paper file cannot be formed in real time due to untimely flow and unclear flow nodes in the traditional mode, so that a judge cannot read the electronic examination paper at the electronic end in the judging or application process, and the judge further conflicts the electronic examination paper and continuously uses the paper examination paper, so that the paper examination paper cannot be electronized in time.
The traditional electronic file generation mainly adopts camera shooting or scanner application software to generate corresponding pictures of paper files, then the generated pictures are subjected to image processing through an image processing tool, the processed pictures are identified through identification software, then different folders are created for the pictures of different directories for classification after identification, and finally a corresponding system is opened to transmit the classified pictures to the corresponding directories.
Litigation files of a court are in a multi-book multi-page text form, the traditional file synchronous generation mostly adopts a picture default ordering mode, and the mode can cause the situation that the logic of litigation file synchronous generation materials is disordered, so that the marking difficulty on a judge line is increased.
The comprehensive contents can find that the synchronous generation work of the litigation files along with the case is exposed to a lot of troublesome problems after the development of years, so that an all-around and integrated comprehensive solution needs to be provided for the synchronous work of the litigation files along with the case of the court, and powerful support is provided for the electronic files and the deep application of the court.
Disclosure of Invention
Based on the above technical problem, an electronic file generation management method, an electronic file generation management device and a computer device are provided, which have solved the problems of low working efficiency of litigation file synchronization with case and lack of unified management of paper files in turn in the prior art.
An electronic volume generation management method includes:
s1, receiving a paper file, and sending the paper file to a processing workshop, wherein the processing workshop is used for scanning the paper file to generate a corresponding electronic picture and pushing the paper file to an intelligent storage cabinet;
s2, carrying out image processing on the electronic picture, and carrying out OCR character recognition on the processed electronic picture to obtain a corresponding text content JSON file and a double-layer PDF file;
s3, cataloguing the double-layer PDF file, and verifying a cataloguing result to obtain a target category;
s4, cataloguing the double-layer PDF file through a natural language processing technology, and hanging the double-layer PDF file into an electronic file management system;
s5, performing format reduction on the text content JSON file, performing content verification, and then combining cataloguing information to obtain a text file;
and S6, performing element extraction and/or case analysis on the text file to obtain a data file.
Optionally, the method further comprises:
receiving an electronic material and judging the type of the electronic material;
when the electronic material is an electronic picture, executing the steps S2-S6;
when the electronic material is a plain text, generating a corresponding double-layer PDF file according to the plain text, and executing the steps S3-S4; obtaining a text file according to the plain text, and executing the step S6;
when the electronic material is an audio/video, processing the electronic material, converting the electronic material into a plain text, generating a corresponding double-layer PDF file according to the plain text, and executing steps S3-S4; and obtaining a text file according to the plain text, and executing the step S6.
Optionally, the sources of the paper portfolio include window receipts, judge moves, party mailings, and attorney mailings.
Further optionally, the sources of electronic material include window receipts, corporate moves, party mailings, attorney mailings, online filing systems, and judgment systems.
Further optionally, the cataloging the dual-layer PDF file by the natural language processing technology specifically includes:
analyzing the double-layer PDF file, and performing semantic analysis on text content by using a theme analysis model;
comparing the analyzed semantics with a semantic library, and judging whether the analyzed semantics exist in the semantic library;
if the semantic library has the analyzed semantics, directly carrying out similarity detection, and if the semantic library does not have the analyzed semantics, updating the semantic library and then carrying out similarity detection;
integrating files of the same type in the double-layer PDF file according to the semantic and similarity detection result obtained by analysis;
performing feature recognition on the double-layer PDF file;
and determining the category of the double-layer PDF file according to the target category in the feature identification result and the cataloguing result, and finishing the cataloguing of the double-layer PDF file.
Further optionally, the topic analysis model is a latent dirichlet distribution model.
Optionally, the paper file is scanned by a browser-side scanner and/or a camera to generate a corresponding electronic picture.
In a second aspect, an electronic portfolio generation management apparatus includes:
the processing workshop is used for scanning the paper files to generate corresponding electronic pictures and pushing the paper files to the intelligent storage cabinet;
the image processing module is used for carrying out image processing on the electronic picture and carrying out OCR character recognition on the processed electronic picture to obtain a corresponding text content JSON file and a double-layer PDF file;
the cataloguing module is used for cataloguing the double-layer PDF file and verifying the cataloguing result to obtain a target category;
the cataloguing module is used for cataloguing the double-layer PDF file through a natural language processing technology and hanging the double-layer PDF file into an electronic file management system;
the format reduction module is used for carrying out format reduction on the text content JSON file, carrying out content verification and then combining cataloguing information to obtain a text file;
and the data file generation module is used for performing element extraction and/or case analysis on the text file to obtain the data file.
In a third aspect, a computer device comprises a memory storing a computer program and a processor implementing the steps of the method of the first aspect when the processor executes the computer program.
In a fourth aspect, a computer-readable storage medium has stored thereon a computer program which, when being executed by a processor, carries out the steps of the method of the first aspect.
The invention has at least the following beneficial effects:
the embodiment of the application provides a set of complete working flow for generating the electronic files, the online electronic generation work is realized through scanning, the electronic pictures are processed through the image processing capacity, the electronic pictures are identified and classified through the integrated identification and cataloguing capacity, and the different types of pictures are automatically matched into the corresponding electronic file catalogues through the natural semantic processing technology, so that the paperless electronic files are efficiently realized; simultaneously this application is saved the paper material through intelligent cabinet, makes the paper material no longer circulate, ensures paper material's safety and completeness to the at utmost.
Meanwhile, the electronic material can be generated synchronously and converted, and a solid foundation is laid for deep application of the electronic file.
Drawings
FIG. 1 is a schematic flow chart illustrating a method for managing generation of electronic files according to an embodiment of the present application;
FIG. 2 is a detailed flowchart illustrating a method for managing generation of electronic files according to an embodiment of the present application;
FIG. 3 is a diagram illustrating an application of an electronic file according to an embodiment of the present application;
FIG. 4 is a schematic flow chart illustrating a specific objective in one embodiment of the present application;
FIG. 5 is a schematic flow chart of a material receipt management method according to an embodiment of the present application;
FIG. 6 is a block diagram illustrating a module architecture of an electronic file generation management apparatus according to an embodiment of the present application;
fig. 7 is an internal structural diagram of a computer device according to an embodiment of the present application.
Detailed Description
In order to make the objects, technical solutions and advantages of the present application more clearly understood, the present application is further described in detail below with reference to the accompanying drawings and embodiments. It should be understood that the specific embodiments described herein are merely illustrative of and not restrictive on the broad application.
In one embodiment, as shown in fig. 1, there is provided an electronic portfolio generation management method, including the steps of:
s1, receiving the paper file, and sending the paper file to a processing workshop, wherein the processing workshop is used for scanning the paper file to generate a corresponding electronic picture and pushing the paper file to an intelligent storage cabinet.
Sources of paper documents include window receipts, judge moves, party mailings, and lawyer mailings, among others. After the paper file is stored in the intelligent storage cabinet, a judge can borrow and return the paper file in the intelligent storage cabinet after initiating a borrowing application and passing through identity authentication.
Meanwhile, a paper file filing application can be initiated to the intelligent storage cabinet when the case is reported, and then the staff can take out, bind and file the paper files in the intelligent storage cabinet.
In addition, the paper file is scanned by a browser-side scanner and/or a camera to generate a corresponding electronic picture.
And S2, carrying out image processing on the electronic picture, and carrying out OCR character recognition on the processed electronic picture to obtain a corresponding text content JSON file and a double-layer PDF file.
And S3, cataloguing the double-layer PDF file, and verifying a cataloguing result to obtain a target category.
S4, performing cataloguing on the double-layer PDF file through a natural language processing technology (NLP), and hanging the double-layer PDF file into an electronic file management system; and the cataloging is to match files of different types in the double-layer PDF file into the corresponding electronic file catalogue.
And S5, performing format reduction on the JSON file of the text content, verifying the content, and then combining the cataloguing information to obtain the text file.
And S6, performing element extraction and/or case analysis on the text file to obtain a data file.
Further, the method further comprises:
(1) Receiving the electronic material and judging the type of the electronic material; sources of electronic material include window receipts, judge moves, party mailings, attorney mailings, online filing systems, and judgment systems, among others.
(2) When the electronic material is an electronic picture, executing the steps S2-S6;
(3) When the electronic material is a plain text, generating a corresponding double-layer PDF file according to the plain text, and executing the steps S3-S4; obtaining a text file according to the plain text, and executing the step S6;
(4) When the electronic material is an audio/video, processing the electronic material, converting the electronic material into a plain text, generating a corresponding double-layer PDF file according to the plain text, and executing the steps S3-S4; and obtaining the text file according to the plain text, and executing the step S6.
A more detailed flow diagram of the method is shown in FIG. 2, and a diagram of an application of a corresponding electronic portfolio is shown in FIG. 3.
Further, as shown in fig. 4, the process of cataloguing the dual-layer PDF file by using a natural language processing technology specifically includes:
(1) Analyzing the double-layer PDF file, and performing semantic analysis on text content by using a theme analysis model;
wherein, the topic analysis model can be but is not limited to a hidden Dirichlet distribution model; a Topic analysis Model (Topic Model) is a technology for counting and clustering implicit semantic structures of documents in an unsupervised learning manner so as to mine the semantic structures included in texts; latent Dirichlet Allocation (LDA) is a commonly used topic model calculation method;
(2) Comparing the analyzed semantics with a semantic library, and judging whether the analyzed semantics exist in the semantic library;
(3) If the semantic library has the analyzed semantics, directly carrying out similarity detection, and if the semantic library does not have the analyzed semantics, updating the semantic library and then carrying out similarity detection;
the text similarity is mainly related to the commonalities and the differences among the texts, and the greater the commonalities are, the smaller the differences are, the higher the similarity is; the smaller the commonality is, the larger the difference is, the lower the similarity is; the case of the maximum similarity is that the texts are completely the same;
methods for similarity calculation fall into four broad categories: string-based methods, corpus-based methods, knowledge-based methods, and other methods; the method based on the character strings is based on the matching degree of the character strings and takes the co-occurrence and the repetition degree of the character strings as the measurement standard of the similarity; the method based on the corpus is to calculate the similarity of texts by using information acquired from the corpus; the knowledge-based method is to calculate the similarity of texts by using a knowledge base with a standard organization system;
(4) Integrating files of the same type in the double-layer PDF files according to the semantic and similarity detection results obtained by analysis;
(5) Carrying out feature identification on the double-layer PDF file;
(6) Determining the category of the double-layer PDF file according to the target category in the feature recognition result and the cataloguing result, and completing the cataloguing of the double-layer PDF file;
text classification, namely, according to a document classification system of a specific industry, automatically reading the content of a document by a computer and belonging the content to a technical system of a corresponding category; the typical treatment process can be divided into training and running; the computer reads documents of various categories in advance and extracts features, supervised learning training is completed, and the content of a new document is identified and classified in an operation stage.
In addition, the application also provides a material receiving management method, the flow of the method is shown in fig. 5, the paper materials are prevented from circulating by establishing a mechanism of uniformly receiving, transferring and centrally keeping the paper materials, the storage equipment is managed on line by using the internet of things technology, and the safety and the integrity of the paper materials are guaranteed to the greatest extent.
The embodiment of the application provides a set of complete workflow for generating the electronic file, and the on-line electronic generation work is realized through embedding of a browser-side scanner and a camera; processing the electronic picture through image processing capacity; identifying and classifying the electronic pictures through the integrated identification cataloguing capability; the natural semantic processing technology is used for automatically matching the pictures with different types of purposes into the corresponding electronic file catalogue, so that paperless files are efficiently realized, and the generation time of the electronic files is reduced by 50%; simultaneously this application is saved the paper material through intelligent cabinet, makes the paper material no longer circulate, ensures paper material's safety and completeness to the at utmost.
The method and the device have the advantages that all types of materials are manufactured and processed in a unified mode, synchronous generation and circulation of electronic materials are guaranteed, positioning and retrieval are convenient and rapid, material contents are utilized visually, and a solid foundation is laid for deep application of electronic files.
It should be understood that although the various steps in the flow charts of fig. 1-5 are shown in order as indicated by the arrows, the steps are not necessarily performed in order as indicated by the arrows. The steps are not performed in the exact order shown and described, and may be performed in other orders, unless explicitly stated otherwise. Moreover, at least some of the steps in fig. 1-5 may include multiple steps or multiple stages, which are not necessarily performed at the same time, but may be performed at different times, which are not necessarily performed in sequence, but may be performed in turn or alternately with other steps or at least some of the other steps.
In one embodiment, as shown in FIG. 6, there is provided an electronic portfolio generation management apparatus comprising the following modules:
the paper file receiving module 601 is used for receiving paper files and sending the paper files to a processing workshop, and the processing workshop is used for scanning the paper files to generate corresponding electronic pictures and pushing the paper files to the intelligent storage cabinet;
the image processing module 602 is configured to perform image processing on the electronic picture, and perform OCR character recognition on the processed electronic picture to obtain a corresponding text content JSON file and a double-layer PDF file;
the cataloguing module 603 is used for cataloguing the double-layer PDF file and verifying the cataloguing result to obtain a target category;
the cataloging module 604 is used for cataloging the double-layer PDF file through a natural language processing technology and hanging the double-layer PDF file into the electronic file management system;
the format reduction module 605 is configured to perform format reduction on the text content JSON file, perform content verification, and then obtain a text volume by combining with the cataloguing information;
the data file generation module 606 is configured to perform element extraction and/or case analysis on the text file to obtain the data file.
For specific limitations of an electronic file generation management apparatus, refer to the above limitations of an electronic file generation management method, which are not described herein again. The modules in the electronic file generation management device can be wholly or partially realized by software, hardware and a combination thereof. The modules can be embedded in a hardware form or independent from a processor in the computer device, and can also be stored in a memory in the computer device in a software form, so that the processor can call and execute operations corresponding to the modules.
In one embodiment, a computer device is provided, which may be a terminal, and its internal structure diagram may be as shown in fig. 7. The computer device includes a processor, a memory, a communication interface, a display screen, and an input device connected by a system bus. Wherein the processor of the computer device is configured to provide computing and control capabilities. The memory of the computer device comprises a nonvolatile storage medium and an internal memory. The non-volatile storage medium stores an operating system and a computer program. The internal memory provides an environment for the operation of an operating system and computer programs in the non-volatile storage medium. The communication interface of the computer device is used for carrying out wired or wireless communication with an external terminal, and the wireless communication can be realized through WIFI, an operator network, NFC (near field communication) or other technologies. The computer program is executed by a processor to implement an electronic portfolio generation management method. The display screen of the computer equipment can be a liquid crystal display screen or an electronic ink display screen, and the input device of the computer equipment can be a touch layer covered on the display screen, a key, a track ball or a touch pad arranged on a shell of the computer equipment, an external keyboard, a touch pad or a mouse and the like.
Those skilled in the art will appreciate that the architecture shown in fig. 7 is merely a block diagram of some of the structures associated with the disclosed aspects and is not intended to limit the computing devices to which the disclosed aspects apply, as particular computing devices may include more or less components than those shown, or may combine certain components, or have a different arrangement of components.
In one embodiment, a computer device is provided, which includes a memory and a processor, wherein the memory stores a computer program, and all or part of the procedures in the method of the above embodiment are involved.
In one embodiment, a computer-readable storage medium having a computer program stored thereon is provided, which relates to all or part of the processes of the above-described embodiment methods.
In one embodiment, there is also provided a task allocation system, including:
the planning work acquisition module is used for acquiring planning works of all types from each service system through a data interface;
the standard quantization module is used for setting a unified standard for all different planned work services and determining key elements of single work;
the intelligent dispatching module is used for intelligently dispatching all the planned works after the system acquires the current value dispatching personnel information;
and the monitoring alarm module is used for monitoring the working quantity and the current working quantity which are responsible for each dispatcher in real time, and sending an alarm to remind of dispatching the work after the work exceeds a certain quantity or the uneven degree reaches a certain degree.
The work of development and plan completion is output to the standard quantification module, and the plan work in execution ending in the improper value is transmitted to the administrator for distribution according to the workload. And after determining key elements of single work, the standard quantization module analyzes the weight occupied by each element, quantizes the workload of the single work and counts the equivalent weight of each work, wherein the key elements comprise work content data, operation quantity data, operation time data, work unit data and scheduling work peak period condition data. In the process of intelligently allocating all the planning works by the intelligent allocation module, the planning works of the same procedure are allocated to the same staff; there are no multiple jobs started by the same employee at the same time; the workload difference between different employees does not exceed 10%.
The task allocation system can automatically and uniformly allocate the workload of the staff, can be used by being matched with the electronic file generation management method, integrates the independent work of each previous link in the management system, and realizes all online delivery management. Through the change of the workflow, the working efficiency of a front-line employee generated by the electronic file is improved by 80%, and the error rate is reduced by 30%. The electronic file generation time is reduced by 50%.
It will be understood by those skilled in the art that all or part of the processes of the methods of the embodiments described above can be implemented by hardware instructions of a computer program, which can be stored in a non-volatile computer-readable storage medium, and when executed, can include the processes of the embodiments of the methods described above. Any reference to memory, storage, database or other medium used in the embodiments provided herein can include at least one of non-volatile and volatile memory. Non-volatile Memory may include Read-Only Memory (ROM), magnetic tape, floppy disk, flash Memory, optical storage, or the like. Volatile Memory can include Random Access Memory (RAM) or external cache Memory. By way of illustration and not limitation, RAM can take many forms, such as Static Random Access Memory (SRAM) or Dynamic Random Access Memory (DRAM), among others.
The technical features of the above embodiments can be arbitrarily combined, and for the sake of brevity, all possible combinations of the technical features in the above embodiments are not described, but should be considered as the scope of the present specification as long as there is no contradiction between the combinations of the technical features.
The above-mentioned embodiments only express several embodiments of the present application, and the description thereof is more specific and detailed, but not construed as limiting the scope of the invention. It should be noted that, for a person skilled in the art, several variations and modifications can be made without departing from the concept of the present application, which falls within the scope of protection of the present application. Therefore, the protection scope of the present patent shall be subject to the appended claims.

Claims (10)

1. An electronic portfolio generation management method, comprising:
the method comprises the following steps that S1, paper files are received and sent to a processing workshop, and the processing workshop is used for scanning the paper files to generate corresponding electronic pictures and pushing the paper files to an intelligent storage cabinet;
s2, carrying out image processing on the electronic picture, and carrying out OCR character recognition on the processed electronic picture to obtain a corresponding text content JSON file and a double-layer PDF file;
s3, cataloguing the double-layer PDF file, and verifying a cataloguing result to obtain a target category;
s4, cataloguing the double-layer PDF file through a natural language processing technology, and hanging the double-layer PDF file into an electronic file management system;
s5, performing format reduction on the text content JSON file, performing content verification, and then combining cataloguing information to obtain a text file;
and S6, performing element extraction and/or case analysis on the text file to obtain a data file.
2. The electronic portfolio generation management method of claim 1, further comprising:
receiving an electronic material and judging the type of the electronic material;
when the electronic material is an electronic picture, executing the steps S2-S6;
when the electronic material is a plain text, generating a corresponding double-layer PDF file according to the plain text, and executing the steps S3-S4; obtaining a text file according to the plain text, and executing the step S6;
when the electronic material is an audio/video, processing the electronic material, converting the electronic material into a plain text, generating a corresponding double-layer PDF file according to the plain text, and executing the steps S3-S4; and obtaining a text file according to the plain text, and executing the step S6.
3. The electronic portfolio generation management method of claim 1, wherein the sources of the paper portfolio include window receipts, judge moves, party mailings, and attorney mailings.
4. The electronic portfolio generation management method of claim 2, wherein the sources of electronic material include window receipts, corporate moves, party mailings, attorney mailings, online filing systems, and trial systems.
5. The electronic portfolio generation and management method of claim 2, wherein the cataloging the dual-layer PDF file via natural language processing techniques specifically comprises:
analyzing the double-layer PDF file, and performing semantic analysis on text content by using a theme analysis model;
comparing the analyzed semantics with a semantic library, and judging whether the analyzed semantics exist in the semantic library;
if the semantic library has the analyzed semantics, directly carrying out similarity detection, and if the semantic library does not have the analyzed semantics, updating the semantic library and then carrying out similarity detection;
integrating files of the same type in the double-layer PDF file according to the semantic and similarity detection result obtained by analysis;
performing feature recognition on the double-layer PDF file;
and determining the category of the double-layer PDF file according to the target category in the feature identification result and the cataloguing result, and finishing the cataloguing of the double-layer PDF file.
6. The electronic portfolio generation management method of claim 5, wherein the topic analysis model is a latent Dirichlet distribution model.
7. The method as claimed in claim 1, wherein the paper file is scanned by a browser-side scanner and/or a camera to generate a corresponding electronic picture.
8. An electronic volume generation management apparatus, comprising:
the processing workshop is used for scanning the paper files to generate corresponding electronic pictures and pushing the paper files to the intelligent storage cabinet;
the image processing module is used for carrying out image processing on the electronic picture and carrying out OCR character recognition on the processed electronic picture to obtain a corresponding text content JSON file and a double-layer PDF file;
the cataloguing module is used for cataloguing the double-layer PDF file and verifying the cataloguing result to obtain a target category;
the cataloguing module is used for cataloguing the double-layer PDF file through a natural language processing technology and hanging the double-layer PDF file into an electronic file management system;
the format reduction module is used for carrying out format reduction on the text content JSON file, carrying out content verification and then combining cataloguing information to obtain a text file;
and the data file generation module is used for performing element extraction and/or case analysis on the text file to obtain the data file.
9. A computer device comprising a memory and a processor, the memory storing a computer program, characterized in that the processor, when executing the computer program, implements the steps of the method of any of claims 1 to 7.
10. A computer-readable storage medium, on which a computer program is stored, which, when being executed by a processor, carries out the steps of the method of any one of claims 1 to 7.
CN202211090553.1A 2022-09-07 2022-09-07 Electronic file generation management method and device and computer equipment Pending CN115731559A (en)

Priority Applications (1)

Application Number Priority Date Filing Date Title
CN202211090553.1A CN115731559A (en) 2022-09-07 2022-09-07 Electronic file generation management method and device and computer equipment

Applications Claiming Priority (1)

Application Number Priority Date Filing Date Title
CN202211090553.1A CN115731559A (en) 2022-09-07 2022-09-07 Electronic file generation management method and device and computer equipment

Publications (1)

Publication Number Publication Date
CN115731559A true CN115731559A (en) 2023-03-03

Family

ID=85293184

Family Applications (1)

Application Number Title Priority Date Filing Date
CN202211090553.1A Pending CN115731559A (en) 2022-09-07 2022-09-07 Electronic file generation management method and device and computer equipment

Country Status (1)

Country Link
CN (1) CN115731559A (en)

Cited By (1)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN116363667A (en) * 2023-04-26 2023-06-30 公安部信息通信中心 Aggregation file theme identification and classification system

Cited By (2)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN116363667A (en) * 2023-04-26 2023-06-30 公安部信息通信中心 Aggregation file theme identification and classification system
CN116363667B (en) * 2023-04-26 2023-10-13 公安部信息通信中心 Aggregation file theme identification and classification system

Similar Documents

Publication Publication Date Title
CN109947789B (en) Method, device, computer equipment and storage medium for processing data of multiple databases
CN108089843B (en) Intelligent bank enterprise-level demand management system
WO2020134991A1 (en) Automatic input method for paper form, apparatus , and computer device and storage medium
KR20200039035A (en) Learning dataset providing system based on artificial intelligence
CN110866110A (en) Conference summary generation method, device, equipment and medium based on artificial intelligence
CN112732949B (en) Service data labeling method and device, computer equipment and storage medium
CN111738701A (en) Performance analysis method and device, computer equipment and storage medium
CN107291949A (en) Information search method and device
CN115526605A (en) Enterprise internal control management-based approval method and system
CN115731559A (en) Electronic file generation management method and device and computer equipment
Weinzierl et al. Predictive business process monitoring with context information from documents
CN117094729A (en) Request processing method, device, computer equipment and storage medium
KR102496620B1 (en) AI-based search function and OCR electronic research note management system
CN116755688A (en) Component processing method, device, computer equipment and storage medium
CN115690821A (en) Intelligent electronic file cataloging method and computer equipment
TWI793432B (en) Document management method and system for engineering project
CN114037154A (en) Method and system for predicting scientific and technological achievement number and theme based on attention characteristics
CN114298460A (en) Material work order assignment processing method, device, equipment and storage medium
DE102021129862A1 (en) Method and device for identifying hardware performance count events for detecting and classifying malware or workload using artificial intelligence
CN113656579A (en) Text classification method, device, equipment and medium
KR101109425B1 (en) System of managing documents
CN111640491A (en) System and method for self-service uploading and information management of medical detection documents
CN113723093B (en) Personnel management policy recommendation method and device, computer equipment and storage medium
Grandi et al. Management of the citizen’s digital identity and access to multi-version norm texts on the semantic web
CN117389607A (en) Signboard configuration method and device, computer equipment and storage medium

Legal Events

Date Code Title Description
PB01 Publication
PB01 Publication
SE01 Entry into force of request for substantive examination
SE01 Entry into force of request for substantive examination