WO2024055862A1 - Document review method and apparatus for implementing IA by combining RPA and AI, and electronic device


Info

Publication number
WO2024055862A1
Authority
WO
WIPO (PCT)
Prior art keywords
reviewed
document
documents
review
target
Prior art date
Application number
PCT/CN2023/116767
Other languages
French (fr)
Chinese (zh)
Inventor
黄伟
Original Assignee
北京来也网络科技有限公司
来也科技(北京)有限公司
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
Filing date
Publication date
Application filed by 北京来也网络科技有限公司, 来也科技(北京)有限公司
Publication of WO2024055862A1

Classifications

    • G PHYSICS
    • G06 COMPUTING; CALCULATING OR COUNTING
    • G06Q INFORMATION AND COMMUNICATION TECHNOLOGY [ICT] SPECIALLY ADAPTED FOR ADMINISTRATIVE, COMMERCIAL, FINANCIAL, MANAGERIAL OR SUPERVISORY PURPOSES; SYSTEMS OR METHODS SPECIALLY ADAPTED FOR ADMINISTRATIVE, COMMERCIAL, FINANCIAL, MANAGERIAL OR SUPERVISORY PURPOSES, NOT OTHERWISE PROVIDED FOR
    • G06Q10/00 Administration; Management
    • G06Q10/10 Office automation; Time management
    • G06Q10/103 Workflow collaboration or project management
    • G PHYSICS
    • G06 COMPUTING; CALCULATING OR COUNTING
    • G06F ELECTRIC DIGITAL DATA PROCESSING
    • G06F16/00 Information retrieval; Database structures therefor; File system structures therefor
    • G06F16/30 Information retrieval; Database structures therefor; File system structures therefor of unstructured textual data
    • G06F16/35 Clustering; Classification
    • G06F16/353 Clustering; Classification into predefined classes
    • G PHYSICS
    • G06 COMPUTING; CALCULATING OR COUNTING
    • G06F ELECTRIC DIGITAL DATA PROCESSING
    • G06F16/00 Information retrieval; Database structures therefor; File system structures therefor
    • G06F16/30 Information retrieval; Database structures therefor; File system structures therefor of unstructured textual data
    • G06F16/36 Creation of semantic tools, e.g. ontology or thesauri
    • G06F16/367 Ontology
    • G PHYSICS
    • G06 COMPUTING; CALCULATING OR COUNTING
    • G06V IMAGE OR VIDEO RECOGNITION OR UNDERSTANDING
    • G06V30/00 Character recognition; Recognising digital ink; Document-oriented image-based pattern recognition
    • G06V30/40 Document-oriented image-based pattern recognition
    • G06V30/41 Analysis of document content
    • G06V30/413 Classification of content, e.g. text, photographs or tables

Definitions

  • the present disclosure relates to the technical fields of robotic process automation and artificial intelligence, and specifically relates to a document review method and device, electronic equipment, computer-readable storage media, computer program products and computer programs that combine RPA and AI to implement IA.
  • Robotic Process Automation (RPA) uses specific "robot software" to simulate human operations on a computer and automatically execute process tasks according to rules.
  • AI Artificial Intelligence
  • Intelligent Automation (IA) is a general term for a series of technologies ranging from robotic process automation to artificial intelligence. It combines RPA with AI technologies such as Optical Character Recognition (OCR), Intelligent Character Recognition (ICR), Process Mining, Deep Learning (DL), Machine Learning (ML), Natural Language Processing (NLP), Automatic Speech Recognition (ASR), Text To Speech (TTS) speech synthesis, and Computer Vision (CV) to create end-to-end business processes that can think, learn and adapt, covering the entire process from process discovery and process automation to automatic and continuous data collection, understanding the meaning of data, and using data to manage and optimize business processes.
  • documents submitted by users need to be reviewed.
  • pharmaceutical companies can submit relevant application documents to the Food and Drug Administration.
  • the approval department of the Food and Drug Administration will review the documents submitted by the pharmaceutical company. If the review passes, the corresponding certificate will be issued; if the review fails, the pharmaceutical company will be notified to modify the documents.
  • document review is usually performed manually, which not only has high labor costs but also has low efficiency. How to efficiently review documents with lower labor costs has become an urgent problem that needs to be solved.
  • Embodiments of the present disclosure provide a document review method and device, electronic equipment, computer-readable storage media, computer program products, and computer programs that combine RPA and AI to implement IA, to solve the technical problems of high labor cost and low efficiency of document review methods in the related art.
  • the embodiment of the first aspect of the present disclosure provides a document review method that combines RPA and AI to implement IA, including: obtaining at least one document to be reviewed corresponding to the target business matter; reviewing each document to be reviewed based on AI technology to determine whether each document to be reviewed has problems of multiple preset types; and, based on determining that each document to be reviewed has a problem of at least one preset type, generating modification suggestion information corresponding to the document to be reviewed.
  • the preset types include an information completion type; reviewing each document to be reviewed based on AI technology to determine whether each document to be reviewed has problems of multiple preset types includes: obtaining the identification of each document to be reviewed; based on the target business matter, querying the pre-created knowledge graph corresponding to the target business matter to obtain the identification of at least one target document required by the target business matter; and comparing the identification of each document to be reviewed with the identification of each target document to determine whether the documents to be reviewed are complete.
  • reviewing each document to be reviewed based on AI technology to determine whether each document to be reviewed has problems of multiple preset types also includes: performing text recognition on each document to be reviewed based on optical character recognition (OCR) technology to obtain the text information contained in each document to be reviewed; performing information extraction on the text information contained in each document to be reviewed to obtain the fields to be reviewed and the corresponding field values contained in each piece of text information; based on the target business matter, querying the knowledge graph to obtain the required fields in each target document; and determining whether, among the fields to be reviewed contained in each piece of text information, there is a target field that is consistent with a required field in the corresponding target document, and whether the target field has a corresponding field value, to determine whether the information in each document to be reviewed is complete.
  • OCR optical character recognition
  • reviewing each document to be reviewed based on AI technology to determine whether each document to be reviewed has problems of multiple preset types also includes: obtaining the same fields to be reviewed in all the text information; and, when corresponding field values exist for the same fields to be reviewed, comparing the field values corresponding to the same fields to be reviewed to determine whether the information in the documents to be reviewed is consistent.
  • the preset types include a process specification type; reviewing each document to be reviewed based on AI technology to determine whether each document to be reviewed has problems of multiple preset types includes: performing text recognition on each document to be reviewed based on OCR technology to obtain the text information contained in each document to be reviewed; based on the target business matter, querying the pre-created knowledge graph corresponding to the target business matter to obtain the process specification corresponding to the target business matter; and, based on each document to be reviewed and the text information it contains, determining whether each document to be reviewed meets the process specification.
  • the preset types include a writing specification type; reviewing each document to be reviewed based on AI technology to determine whether each document to be reviewed has problems of multiple preset types includes: performing text recognition on each document to be reviewed based on OCR technology to obtain the text information contained in each document to be reviewed; and inputting the text information contained in each document to be reviewed into a pre-trained language model, so as to determine through the language model whether each document to be reviewed has a writing specification type problem.
  • the document review method for implementing IA by combining RPA and AI also includes: sending each document to be reviewed to a manual review platform based on determining that each document to be reviewed does not have multiple preset types of problems.
  • the document review method that combines RPA and AI to implement IA also includes: calling a robotic process automation (RPA) robot to access the business system to obtain the contact information of the provider of each document to be reviewed; and using the RPA robot to feed back the review results of each document to be reviewed to the corresponding provider through the contact information.
  • the target business matter is a pharmaceutical registration matter.
  • the embodiment of the second aspect of the present disclosure provides a document review device that combines RPA and AI to implement IA, including: an acquisition module for acquiring at least one document to be reviewed corresponding to a target business matter; a review module for reviewing each document to be reviewed based on AI technology to determine whether each document to be reviewed has problems of multiple preset types; and a generation module for generating modification suggestion information corresponding to the document to be reviewed based on determining that each document to be reviewed has a problem of at least one preset type.
  • the preset types include an information completion type; the review module is used to: obtain the identification of each document to be reviewed; based on the target business matter, query the pre-created knowledge graph corresponding to the target business matter to obtain the identification of at least one target document required by the target business matter; and compare the identification of each document to be reviewed with the identification of each target document to determine whether the documents to be reviewed are complete.
  • the review module is also used to: perform text recognition on each document to be reviewed based on optical character recognition (OCR) technology to obtain the text information contained in each document to be reviewed; perform information extraction on the text information contained in each document to be reviewed to obtain the fields to be reviewed and the corresponding field values contained in each piece of text information; query the knowledge graph based on the target business matter to obtain the required fields in each target document; and determine whether, among the fields to be reviewed contained in each piece of text information, there is a target field that is consistent with a required field in the corresponding target document, and whether the target field has a corresponding field value, to determine whether the information in each document to be reviewed is complete.
  • the review module is also used to: obtain the same fields to be reviewed in all the text information; and, when corresponding field values exist for the same fields to be reviewed, compare the field values corresponding to the same fields to be reviewed to determine whether the information in the documents to be reviewed is consistent.
  • the preset types include a process specification type; the review module is also used to: perform text recognition on each document to be reviewed based on OCR technology to obtain the text information contained in each document to be reviewed; based on the target business matter, query the pre-created knowledge graph corresponding to the target business matter to obtain the process specification corresponding to the target business matter; and, based on each document to be reviewed and the text information it contains, determine whether each document to be reviewed meets the process specification.
  • the preset types include a writing specification type; the review module is also used to: perform text recognition on each document to be reviewed based on OCR technology to obtain the text information contained in each document to be reviewed; and input the text information contained in each document to be reviewed into a pre-trained language model, so as to determine through the language model whether each document to be reviewed has a writing specification type problem.
  • the document review device that combines RPA and AI to implement IA also includes: a first sending module, configured to send each document to be reviewed to a manual review platform based on determining that each document to be reviewed does not have problems of the multiple preset types.
  • the document review device that combines RPA and AI to implement IA also includes: a calling module for calling a robotic process automation (RPA) robot to access the business system to obtain the contact information of the provider of each document to be reviewed; and a second sending module for using the RPA robot to feed back the review results of each document to be reviewed to the corresponding provider through the contact information.
  • the target business matter is a pharmaceutical registration matter.
  • the embodiment of the third aspect of the present disclosure provides an electronic device, including a memory, a processor, and a computer program stored in the memory and executable on the processor. When the processor executes the computer program, the method described in the above embodiments of the present disclosure is implemented.
  • the embodiment of the fourth aspect of the present disclosure provides a computer-readable storage medium on which a computer program is stored. When the computer program is executed by a processor, the method described in the above embodiments of the present disclosure is implemented.
  • the fifth aspect embodiment of the present disclosure proposes a computer program product, including a computer program that implements the method described in the above embodiments of the present disclosure when executed by a processor.
  • the embodiment of the sixth aspect of the present disclosure provides a computer program. The computer program includes computer program code. When the computer program code is run on a computer, the computer executes the method described in the above embodiments of the present disclosure.
  • each document to be reviewed is reviewed based on AI technology to determine whether each document to be reviewed has problems of multiple preset types, and when it is determined that each document to be reviewed has a problem of at least one preset type, the modification suggestion information corresponding to the document to be reviewed is generated. Automatic review of the documents to be reviewed corresponding to the target business matter is thus realized based on AI technology, which reduces the labor cost required for document review and improves the efficiency of document review.
  • modification suggestions can be provided to the provider of the document to be reviewed, making it convenient for the provider to revise the document to be reviewed.
  • The embodiments of the present disclosure can also combine RPA and AI to implement IA, obtain the contact information of the provider, and automatically feed back the review results of each document to be reviewed to the corresponding provider, thereby further reducing the labor cost required to feed back the review results.
  • Figure 1 is a schematic flowchart of a document review method for implementing IA by combining RPA and AI according to the first embodiment of the present disclosure
  • Figure 2 is a schematic flowchart of a document review method for implementing IA by combining RPA and AI according to the second embodiment of the present disclosure
  • Figure 3 is a schematic flowchart of a document review method for implementing IA by combining RPA and AI according to the third embodiment of the present disclosure
  • Figure 4 is a schematic flowchart of a document review method for implementing IA by combining RPA and AI according to the fourth embodiment of the present disclosure
  • Figure 5 is a schematic flowchart of a document review method for implementing IA by combining RPA and AI according to the fifth embodiment of the present disclosure
  • Figure 6 is a schematic structural diagram of a document review device that combines RPA and AI to implement IA according to the sixth embodiment of the present disclosure
  • FIG. 7 is a block diagram of an electronic device used to implement a document review method for implementing IA by combining RPA and AI according to an embodiment of the present disclosure.
  • Embodiments of the present disclosure provide a document review method and device, electronic equipment, computer-readable storage media, computer program products, and computer programs that combine RPA and AI to implement IA.
  • the method includes: obtaining at least one document to be reviewed corresponding to the target business matter; reviewing each document to be reviewed based on AI technology to determine whether each document to be reviewed has problems of multiple preset types; and, based on determining that each document to be reviewed has a problem of at least one preset type, generating modification suggestion information corresponding to the document to be reviewed.
  • the document review method, device, electronic device, and storage medium provided by the embodiments of the present disclosure that combine RPA and AI to implement IA can be applied to any field that requires document review, such as the medical field and the judicial field. The embodiments of the present disclosure do not limit this.
  • Each embodiment of the present disclosure is explained by taking the medical field as an example.
  • RPA robot refers to a software robot that can combine AI technology and RPA technology to automatically handle online business.
  • RPA robots have two characteristics: “connector” and “non-intrusion”. By simulating human operation methods, they can extract, integrate and connect data from different systems in a non-intrusive way without changing the information system.
  • both "field” and “field value” are fragments composed of a single character or multiple consecutive characters.
  • "field" can be understood as the key of an attribute item
  • "field value" can be understood as the value of that attribute item
  • the fields and corresponding field values together form a piece of structured data.
  • “Zhang San” is the field value corresponding to the field "Name”
  • “Name” and “Zhang San” form a piece of structured data.
  • documents to be reviewed refers to documents received by the approval department for handling a certain business.
  • "target business matter" refers to the business being handled.
  • "Provider" refers to the party that submits documents to be reviewed to the approval department. The provider can be an individual or an enterprise, and the embodiment of the present disclosure does not limit this.
  • pharmaceutical companies can submit applications to the Food and Drug Administration and submit relevant application documents.
  • the drug registration matters are the target business matters
  • the documents submitted by the pharmaceutical company when applying for drug registration matters are the documents to be reviewed corresponding to the target business matter.
  • the pharmaceutical company is the provider of the documents to be reviewed.
  • target document refers to the document required to successfully handle the target business matter, that is, the document required by the target business matter.
  • preset type refers to the type of problems that may exist in the preset documents to be reviewed.
  • problems belonging to the information completion type may include, for example: incomplete documents to be reviewed, incomplete information in a document to be reviewed, and inconsistent information in the documents to be reviewed.
  • Incomplete documents to be reviewed: for example, the target documents required by the target business matter include document A, document B and document C, but the documents to be reviewed only include document A and document B.
  • Incomplete information in a document to be reviewed: for example, document A required by the target business matter requires field a and the corresponding field value, but document A among the documents to be reviewed does not include field a and its corresponding field value, or includes field a but not the corresponding field value.
  • Inconsistent information in the documents to be reviewed: for example, the documents to be reviewed include document A, document B and document C, where both document A and document B include field a and the corresponding field value, but the field value corresponding to field a in document A is different from the field value corresponding to field a in document B.
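  • As an illustration of the incomplete-information and inconsistent-information cases above, here is a minimal Python sketch. It assumes the extracted fields are already available as plain dictionaries; the function names `find_field_problems` and `find_inconsistent_fields` and the data layout are hypothetical and not taken from the disclosure.

```python
from collections import defaultdict
from typing import Dict, List

# Assumed layout (hypothetical): {document id: {field: field value}}
Extracted = Dict[str, Dict[str, str]]


def find_field_problems(extracted: Extracted,
                        required_fields: Dict[str, List[str]]) -> List[str]:
    """Report required fields that are absent or have no field value."""
    problems = []
    for doc_id, fields in required_fields.items():
        doc = extracted.get(doc_id)
        if doc is None:
            continue  # a missing document is a separate, document-level check
        for field in fields:
            if field not in doc:
                problems.append(f"{doc_id}: field '{field}' is missing")
            elif not doc[field].strip():
                problems.append(f"{doc_id}: field '{field}' has no value")
    return problems


def find_inconsistent_fields(extracted: Extracted) -> Dict[str, Dict[str, str]]:
    """Return fields whose non-empty values differ between documents."""
    values_by_field = defaultdict(dict)
    for doc_id, doc in extracted.items():
        for field, value in doc.items():
            if value.strip():
                values_by_field[field][doc_id] = value.strip()
    return {field: values for field, values in values_by_field.items()
            if len(set(values.values())) > 1}
```

  • Calling `find_inconsistent_fields` on documents A and B that both carry field a with different values returns that field together with the conflicting values per document, which could then be turned into modification suggestion information.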
  • the "process specification type" means that the document to be reviewed does not meet the process specification corresponding to the target business matter.
  • the process specifications corresponding to the target business matters are the specifications that should be followed in the handling process of the target business matters.
  • the process specifications corresponding to the medical device registration matter include: if a new mandatory standard or national standard is released and implemented within the validity period of the medical device registration certificate, and the changes made to the registered product in order to comply with the new mandatory standard or national standard require change registration, the registrant should first go through the change registration procedures and obtain the change registration (filing) document approved by the original approval department before submitting an application for registration renewal.
  • For example, suppose that within the validity period of the medical device registration certificate of the registrant (i.e., the provider of the documents to be reviewed in this disclosed embodiment), a new mandatory standard is released and implemented, and the changes made to the medical device in order to comply with the new mandatory standard require change registration. If the documents to be reviewed submitted by the registrant do not include the change registration (filing) document approved by the original approval department, the documents to be reviewed have a process specification type problem.
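  • The medical device example above can be read as a simple rule. The following Python sketch hard-codes that one rule for illustration only; in the disclosure the process specification is obtained from the knowledge graph, and the identifier `CHANGE_REGISTRATION_DOC`, the function name and the boolean inputs are all assumptions made here.

```python
from typing import List, Set

# Hypothetical identifier for the approved change registration (filing) document.
CHANGE_REGISTRATION_DOC = "change registration (filing) document"


def check_renewal_process(submitted_ids: Set[str],
                          new_standard_released: bool,
                          change_registration_required: bool) -> List[str]:
    """Return process specification problems for a registration renewal."""
    problems = []
    # The rule only applies when a new standard took effect during the
    # certificate's validity period and the resulting product changes
    # require change registration.
    if new_standard_released and change_registration_required:
        if CHANGE_REGISTRATION_DOC not in submitted_ids:
            problems.append(
                "The change registration (filing) document approved by the "
                "original approval department is missing"
            )
    return problems
```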
  • the "writing specification type" means that the document to be reviewed has typographical errors, English translation errors, irregular use of professional terms and other writing format problems.
  • OCR (Optical Character Recognition)
  • information extraction refers to structuring the information contained in the text into a table-like organizational form.
  • information extraction can include named entity recognition and relationship extraction.
  • Named entity recognition that is, identifying various named entities in a piece of text.
  • the named entities that need to be recognized usually include person names, place names, organization names, drugs, time, etc., which can be set according to different application scenarios.
  • the named entities that need to be identified can include medical nouns, normative nouns, the address of the registrant, the name of the registrant, the name of the agent acting for pharmaceutical registration matters, etc.
  • the purpose of relationship extraction is to identify target relationships in text entities and extract semantic relationships between entities by identifying the relationships between entities. Relation extraction can be achieved through sequence annotation, classification, dependency syntax analysis, semantic dependency analysis and other technologies.
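  • To make the idea of structuring text into fields and field values concrete, here is a deliberately simplified Python sketch. It swaps in a regular expression over "Field: value" lines in place of the named entity recognition and relation extraction techniques named above, so it only illustrates the shape of the output; the function name `extract_fields` is hypothetical.

```python
import re
from typing import Dict

# Matches lines such as "Name: Zhang San"; accepts ASCII and full-width colons.
FIELD_PATTERN = re.compile(r"^\s*([^:：\n]+?)\s*[:：]\s*(.*?)\s*$")


def extract_fields(text: str) -> Dict[str, str]:
    """Turn OCR text into {field: field value}; lines without a colon are skipped."""
    fields = {}
    for line in text.splitlines():
        match = FIELD_PATTERN.match(line)
        if match:
            fields[match.group(1)] = match.group(2)
    return fields
```

  • A field that appears without a value (for example a line reading only "Address:") is kept with an empty string, so a later completeness check can distinguish a missing field from a field with no field value.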
  • a "language model” is any machine model used to determine whether a document to be reviewed has a writing specification type problem, such as a neural network model.
  • the language model can be obtained in advance by training on training samples.
  • “manual review platform” refers to a platform that can review documents manually, such as a human-machine collaboration platform.
  • business system refers to the online system used by the approval department to handle business matters, such as the management system of the Food and Drug Administration.
  • Figure 1 is a flow chart of a document review method for implementing IA by combining RPA and AI according to the first embodiment of the present disclosure. As shown in Figure 1, the method may include steps 101 to 103.
  • Step 101 Obtain at least one document to be reviewed corresponding to the target business matter.
  • the document review method that combines RPA and AI to implement IA in the embodiment of the present disclosure can be executed by a document review device that combines RPA and AI to implement IA.
  • For ease of description, the document review device that combines RPA and AI to implement IA will be referred to below as the document review device.
  • the document review device can be implemented by software and/or hardware, and the document review device can be an electronic device, or can also be configured in an electronic device to realize automatic review of documents, thereby reducing the labor costs required for document review, Improve the efficiency of document review.
  • the electronic device may include but is not limited to a terminal device, a server, etc., and this embodiment does not specifically limit the electronic device.
  • the documents to be reviewed corresponding to the target business matter may include one document or multiple documents, and this disclosure does not limit this.
  • the document review device can provide an upload interface, so that the provider can upload the documents required to handle the target business matter through the upload interface.
  • In this way, the document review device can obtain at least one document to be reviewed corresponding to the target business matter.
  • Step 102 Review each document to be reviewed based on AI technology to determine whether each document to be reviewed contains multiple preset types of problems.
  • multiple preset types corresponding to the target business matter can be determined in advance by summarizing the problems that frequently occur in documents to be reviewed during the processing of the target business matter, and classifying these problems to obtain the multiple preset types.
  • the document review device can review each document to be reviewed one by one, and determine for each preset type whether each document to be reviewed has problems of the preset type.
  • multiple preset types may include, for example, information completion type, process specification type, writing specification type, etc.
  • Step 103 When it is determined that each document to be reviewed has at least one preset type of problem, generate modification suggestion information corresponding to the document to be reviewed.
  • the document review device may generate modification suggestion information corresponding to the document to be reviewed based on preset types of problems existing in the document to be reviewed.
  • For example, if the document review device determines that the documents to be reviewed have an information completion type problem, specifically that document A required by the target business matter is missing from the documents to be reviewed submitted by the provider, then the document review device can generate the modification suggestion information "Supplementary Document A required".
  • the document review method provided by the embodiment of the present disclosure, which combines RPA and AI to implement IA, obtains at least one document to be reviewed corresponding to the target business matter; reviews each document to be reviewed based on AI technology to determine whether each document to be reviewed has problems of multiple preset types; and, when it is determined that each document to be reviewed has a problem of at least one preset type, generates modification suggestion information corresponding to the document to be reviewed. As a result, it is possible to automatically review documents to be reviewed corresponding to target business matters based on AI technology, reducing the labor costs required for document review and improving the efficiency of document review.
  • modification suggestions can be provided to the provider of the document to be reviewed, making it convenient for the provider to revise the document to be reviewed.
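  • Steps 101 to 103 can be sketched as a small pipeline. This is an illustrative Python outline under stated assumptions, not the disclosed implementation: each preset type is represented by a hypothetical checker function that returns suggestion strings, and `review` and `completeness_checker` are names invented here.

```python
from typing import Callable, Dict, List

# A checker takes the submitted document identifiers and returns the
# modification suggestions for one preset type (empty list: no problem).
Checker = Callable[[List[str]], List[str]]


def review(documents: List[str], checkers: Dict[str, Checker]) -> Dict[str, object]:
    """Run every preset-type checker (step 102) and collect suggestions (step 103)."""
    problems = {}
    for preset_type, checker in checkers.items():
        found = checker(documents)
        if found:
            problems[preset_type] = found
    suggestions = [item for found in problems.values() for item in found]
    return {"has_problems": bool(problems),
            "problems": problems,
            "suggestions": suggestions}


def completeness_checker(documents: List[str]) -> List[str]:
    """Toy information completion check against an assumed required list."""
    required = ["Document A", "Document B"]
    return [f"Supplementary {name} required"
            for name in required if name not in documents]


# Step 101 is assumed to have produced this list of uploaded documents.
result = review(["Document B"], {"information completion": completeness_checker})
```

  • Keeping one checker per preset type means a process specification or writing specification check can be added to the `checkers` mapping without changing `review` itself.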
  • the preset types may include information completion types.
  • each document to be reviewed is reviewed based on AI technology to determine whether there is an information completion type problem in each document to be reviewed.
  • Figure 2 is a flow chart of a document review method for implementing IA by combining RPA and AI according to the second embodiment of the present disclosure.
  • the document review method for implementing IA by combining RPA and AI may include steps 201 to 211.
  • Step 201 Obtain at least one document to be reviewed corresponding to the target business matter.
  • For the specific implementation process and principle of step 201, reference can be made to the description of the above embodiments, which will not be repeated here.
  • problems belonging to the information completion type may include: incomplete documents to be reviewed.
  • After the document review device obtains each document to be reviewed, it can determine whether there is an information completion type problem in each document to be reviewed through the following steps 202-205.
  • Step 202 Obtain the identification of each document to be reviewed.
  • this ID is used to uniquely identify the document to be reviewed.
  • the identification of the document to be reviewed may be the file name of the document to be reviewed, or the number corresponding to the document to be reviewed, etc. This disclosure does not limit this.
  • Step 203 Based on the target business matter, query the knowledge graph corresponding to the pre-created target business matter to obtain the identification of at least one target document required by the target business matter.
  • a knowledge graph corresponding to the business matter can be created in advance based on the documents required to handle the business matter.
  • the knowledge graph may include, for example, a first node corresponding to the business matter, and a second node corresponding to the documents required to handle the business matter, and the first node and the second node are connected through edges. Therefore, the document review device can query the pre-created knowledge graph corresponding to the target business matter based on the target business matter, determine the second node in the knowledge graph that is connected through an edge to the first node corresponding to the target business matter, and determine the identifier of the document corresponding to the second node as the identifier of at least one target document required by the target business matter.
  • Step 204 Compare the identification of each document to be reviewed with the identification of each target document to determine whether each document to be reviewed is complete.
  • Step 205 When it is determined that each document to be reviewed is incomplete, it is determined that each document to be reviewed has an information completion type problem.
  • If, for the identification of each target document, there is a document to be reviewed whose identification is the same, it can be determined that the documents to be reviewed are complete.
  • If, for the identification of at least one target document, there is no document to be reviewed with the same identification, it can be determined that the documents to be reviewed are incomplete, and further that the documents to be reviewed have an information completion type problem.
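  • As an illustration only, steps 202 to 205 can be sketched in Python as follows. The dictionary-based graph, the business matter name, the document identifications, and the function names are all assumptions made for this sketch; the present disclosure does not specify how the knowledge graph is stored or queried.

```python
# Hypothetical knowledge graph stored as adjacency sets: each business matter
# (first node) maps to the identifications of its required documents (second nodes).
KNOWLEDGE_GRAPH = {
    "medical_device_registration": {
        "application_form",
        "registration_certificate",
        "product_test_report",
    },
}


def required_document_ids(graph, business_matter):
    """Step 203: follow edges from the business-matter node to its document nodes."""
    return set(graph.get(business_matter, set()))


def find_missing_documents(graph, business_matter, submitted_ids):
    """Step 204: return required identifications with no matching submitted document."""
    required = required_document_ids(graph, business_matter)
    return sorted(required - set(submitted_ids))


def has_completion_problem(graph, business_matter, submitted_ids):
    """Step 205: the document set is incomplete if any required document is missing."""
    return bool(find_missing_documents(graph, business_matter, submitted_ids))


# Example: one of the three required documents has not been submitted.
submitted = ["application_form", "registration_certificate"]
missing = find_missing_documents(
    KNOWLEDGE_GRAPH, "medical_device_registration", submitted
)
```

  • In this sketch, the list of missing identifications can also serve as the basis for the modification suggestion information generated in step 211.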
  • Problems of the information completion type may also include: incomplete information in a document to be reviewed.
  • Accordingly, the document review device can also determine whether each document to be reviewed has an information completion type problem through the following steps 206 to 210.
  • Step 206 Based on the optical character recognition OCR technology, perform text recognition on each document to be reviewed to obtain the text information contained in each document to be reviewed.
  • Step 207 Extract text information contained in each document to be reviewed to obtain fields to be reviewed and corresponding field values contained in each text information.
  • Step 208 Based on the target business matters, query the knowledge graph to obtain the required fields in each target document.
  • When the knowledge graph corresponding to a business matter is created, it can also be created based on the fields that each document required to handle the business matter needs to contain.
  • That is, in addition to the first node corresponding to the business matter and the second nodes corresponding to the documents required to handle the business matter, the knowledge graph also includes third nodes corresponding to the fields that each document needs to contain, and each third node is connected to the corresponding second node by an edge. Therefore, based on the target business matter, the document review device can query the pre-created knowledge graph corresponding to the target business matter, determine the third nodes connected to each second node by an edge, and take the fields corresponding to those third nodes as the required fields of the corresponding target document (that is, the target document corresponding to the second node to which the third nodes are connected).
  • Step 209 Determine whether the fields to be reviewed contained in each piece of text information include a target field consistent with each required field of the corresponding target document, and whether each target field has a corresponding field value, so as to determine whether the information in each document to be reviewed is complete.
  • Step 210 When it is determined that the information in each document to be reviewed is incomplete, it is determined that each document to be reviewed has an information completion type problem.
  • When the document review device determines that, for every required field of each target document, the fields to be reviewed in the text information of the corresponding document to be reviewed include a target field consistent with that required field, and every target field has a corresponding field value, it can be determined that the information in each document to be reviewed is complete.
  • When the document review device determines that, for at least one required field of a target document, the fields to be reviewed in the text information of the corresponding document to be reviewed include no target field consistent with that required field, it can be determined that the information in the documents to be reviewed is incomplete, and further that the documents to be reviewed have an information completion type problem.
  • Note that a field in a document to be reviewed may have no corresponding field value. In that case, when the text information contained in the document is extracted, the field to be reviewed is obtained without a field value, that is, no field value exists for that field to be reviewed.
  • When the document review device determines that, for at least one required field of a target document, the fields to be reviewed in the text information of the corresponding document to be reviewed include a consistent target field but that target field has no corresponding field value, it can be determined that the information in the documents to be reviewed is incomplete, and further that the documents to be reviewed have an information completion type problem.
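  • The checks of steps 208 to 210 can be illustrated with the following Python sketch. It assumes that OCR and field extraction (steps 206 and 207) have already produced a mapping from each document identification to its field/value pairs; the field names, the mapping layout, and the function name are hypothetical.

```python
# Hypothetical third-node data: the required fields of each target document.
REQUIRED_FIELDS = {
    "application_form": ["applicant_name", "product_name", "application_date"],
}


def find_incomplete_fields(required_fields, extracted):
    """Return (document, field, reason) for each required field that is absent or empty.

    `extracted` maps a document identification to the {field: value} pairs
    obtained from its OCR text (steps 206 and 207, not shown here).
    """
    problems = []
    for doc_id, fields in required_fields.items():
        doc_fields = extracted.get(doc_id, {})
        for name in fields:
            if name not in doc_fields:
                # No target field consistent with the required field.
                problems.append((doc_id, name, "missing field"))
                continue
            value = doc_fields[name]
            if value is None or not str(value).strip():
                # The target field exists but has no field value.
                problems.append((doc_id, name, "missing value"))
    return problems


# Example: one field is blank and another was not found at all.
extracted_fields = {
    "application_form": {
        "applicant_name": "Example Pharma Co.",
        "product_name": "  ",
    },
}
field_problems = find_incomplete_fields(REQUIRED_FIELDS, extracted_fields)
```

  • The sketch distinguishes the two cases described above (no consistent target field, and a target field without a field value) so that different modification suggestions could be generated for each.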
  • Problems of the information completion type may also include: inconsistent information across the documents to be reviewed.
  • Accordingly, the document review device can also determine whether the documents to be reviewed have an information completion type problem by comparing, across the text information of all documents to be reviewed, the field values corresponding to the same field to be reviewed.
  • When the document review device determines that, in all text information, the field values corresponding to the same field to be reviewed are the same, it can be determined that the information in the documents to be reviewed is consistent.
  • When the document review device determines that, in all text information, at least one field to be reviewed has different field values, it can be determined that the information in the documents to be reviewed is inconsistent, and further that the documents to be reviewed have an information completion type problem.
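  • A minimal Python sketch of this consistency comparison is given below. It assumes string field values that are compared after trimming surrounding whitespace; that normalization, the data layout, and the function name are assumptions of the sketch rather than part of the disclosure.

```python
from collections import defaultdict


def find_inconsistent_fields(extracted):
    """Return {field: {value: [document ids]}} for fields whose values differ.

    `extracted` maps a document identification to its {field: value} pairs.
    """
    values_by_field = defaultdict(lambda: defaultdict(list))
    for doc_id, fields in extracted.items():
        for name, value in fields.items():
            # Group documents by the (trimmed) value they give for each field.
            values_by_field[name][value.strip()].append(doc_id)
    return {
        name: dict(values)
        for name, values in values_by_field.items()
        if len(values) > 1  # more than one distinct value means inconsistency
    }


# Example: two documents disagree on the product name only.
extracted_texts = {
    "application_form": {
        "product_name": "Device X",
        "applicant_name": "Example Co.",
    },
    "registration_certificate": {
        "product_name": "Device Y",
        "applicant_name": "Example Co.",
    },
}
inconsistencies = find_inconsistent_fields(extracted_texts)
```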
  • Step 211 After it is determined that each document to be reviewed has an information completion type problem, modification suggestion information corresponding to the document to be reviewed is generated.
  • When the document review device determines that the documents to be reviewed have none of the above problems, it can determine that they have no information completion type problem. When the document review device determines that the documents to be reviewed have at least one of the above problems, it can determine that they have an information completion type problem, and then generate modification suggestion information corresponding to the documents to be reviewed.
  • The document review method for implementing IA by combining RPA and AI provided by the embodiments of the present disclosure automatically reviews, based on AI technology, whether the documents to be reviewed corresponding to the target business matter have information completion type problems, thereby reducing the labor costs required for document review and improving the efficiency of document review.
  • In addition, when an information completion type problem exists, modification suggestions can be provided to the provider of the documents to be reviewed, making it convenient for the provider to complete or modify the information in the documents to be reviewed.
  • The preset type may include a process specification type.
  • The following explains the process of reviewing each document to be reviewed based on AI technology to determine whether each document to be reviewed has a process specification type problem.
  • Figure 3 is a flow chart of a document review method for implementing IA by combining RPA and AI according to the third embodiment of the present disclosure. As shown in Figure 3, the document review method for implementing IA by combining RPA and AI may include steps 301 to 305.
  • Step 301 Obtain at least one document to be reviewed corresponding to the target business matter.
  • Step 302 Based on OCR technology, text recognition is performed on each document to be reviewed to obtain text information contained in each document to be reviewed.
  • Step 303 Based on the target business item, query the pre-created knowledge graph corresponding to the target business item to obtain the process specification corresponding to the target business item.
  • When a knowledge graph corresponding to a business matter is created, it can also be created according to the specifications that should be followed in handling the business matter.
  • That is, the knowledge graph may also include a fourth node corresponding to the specifications that should be followed in handling the business matter, and the fourth node is connected to the first node by an edge. Therefore, based on the target business matter, the document review device can query the pre-created knowledge graph corresponding to the target business matter, determine the fourth node that is connected by an edge to the first node corresponding to the target business matter, and take the specifications corresponding to the fourth node as the process specification corresponding to the target business matter.
  • Step 304 Based on each document to be reviewed and the text information contained therein, determine whether each document to be reviewed satisfies the process specification.
  • For example, suppose the process specification corresponding to a medical device registration matter includes: if a new mandatory standard or national standard is released and implemented within the validity period of the medical device registration certificate, and the changes made to the registered product to comply with the new mandatory standard or national standard require change registration, the registrant should first go through the change registration procedures and obtain the change registration (filing) document approved by the original approval department before submitting an application for registration renewal. The documents to be reviewed include the medical device registration certificate.
  • The document review device can determine the validity period of the medical device registration certificate based on the text information contained in the certificate, query whether any new mandatory standards or national standards were released and implemented within the validity period, and determine whether the changes made to the registered product to comply with those standards require change registration. If so, the document review device can check whether the documents to be reviewed submitted by the registrant include the change registration (filing) document approved by the original approval department. If it is not included, the document review device can determine that the documents to be reviewed do not meet the process specification corresponding to the medical device registration matter. If it is included, the document review device can determine that the documents to be reviewed meet the process specification corresponding to the medical device registration matter.
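  • The rule in this example can be expressed as a small Python check, shown here only as a sketch. It assumes that the validity period has already been extracted from the certificate text and that records of newly implemented standards are available as (implementation date, requires change registration) pairs; how those records are obtained, the document identification "change_registration_document", and the function name are hypothetical.

```python
from datetime import date


def renewal_meets_process_spec(valid_from, valid_until, standards, submitted_ids):
    """Check the renewal rule from the medical device example.

    `standards` is a list of (implementation_date, requires_change_registration)
    pairs; how such records are looked up is outside this sketch.
    """
    change_needed = any(
        valid_from <= implemented <= valid_until and requires_change
        for implemented, requires_change in standards
    )
    if not change_needed:
        # No relevant new standard, so the rule imposes no extra document.
        return True
    # A change registration (filing) document must be among the submissions.
    return "change_registration_document" in submitted_ids


# Example: a standard requiring change registration took effect in the validity period.
new_standards = [(date(2022, 6, 1), True)]
meets_spec = renewal_meets_process_spec(
    date(2020, 1, 1), date(2024, 12, 31), new_standards, ["registration_certificate"]
)
```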
  • Step 305 When each document to be reviewed does not meet the process specification, it is determined that each document to be reviewed has a process specification type problem, and modification suggestion information corresponding to the document to be reviewed is generated.
  • The document review device may generate modification suggestion information corresponding to the document to be reviewed based on the process specification type problem existing in the document to be reviewed. For example, continuing the above example, the document review device can generate the modification suggestion information "Please go to the original approval department to register the change, and after obtaining the change registration (filing) document approved by the original approval department, submit an application for registration renewal."
  • The document review method for implementing IA by combining RPA and AI provided by the embodiments of the present disclosure automatically reviews, based on AI technology, whether the documents to be reviewed corresponding to the target business matter have process specification type problems, thereby reducing the labor costs required for document review and improving the efficiency of document review.
  • In addition, when a process specification type problem exists, modification suggestions can be provided to the provider of the document to be reviewed, making it convenient for the provider to modify the document to be reviewed.
  • The preset type may include a writing standard type.
  • The following explains the process of reviewing each document to be reviewed based on AI technology to determine whether each document to be reviewed has a writing standard type problem.
  • Figure 4 is a flow chart of a document review method for implementing IA by combining RPA and AI according to the fourth embodiment of the present disclosure.
  • As shown in Figure 4, the document review method for implementing IA by combining RPA and AI may include steps 401 to 404.
  • Step 401 Obtain at least one document to be reviewed corresponding to the target business matter.
  • Step 402 Based on OCR technology, text recognition is performed on each document to be reviewed to obtain text information contained in each document to be reviewed.
  • Step 403 Input the text information contained in each document to be reviewed into a pre-trained language model, so as to use the language model to determine whether each document to be reviewed has problems with writing standards.
  • A language model can be generated in advance through pre-training.
  • The input of the language model is text information, and the output is the writing standard type problems existing in the text information together with the corresponding confidence levels.
  • Accordingly, the text information contained in each document to be reviewed can be input into the pre-trained language model, so that the language model predicts the writing standard type problems existing in each piece of text information and determines the corresponding confidence levels. The document review device can then determine, based on the predicted problems and their confidence levels, whether each document to be reviewed has a writing standard type problem.
  • The confidence threshold can be set as needed, for example to 0.7 or 0.8, and this disclosure does not limit this.
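  • How the confidence levels and the confidence threshold might be combined is sketched below in Python. The disclosure does not state the comparison rule, so keeping predictions whose confidence is greater than or equal to the threshold is an assumption; `stub_model` is a stand-in with made-up outputs, not the pre-trained language model itself.

```python
from typing import Callable, List, Tuple

Prediction = Tuple[str, float]  # (problem description, confidence level)


def writing_problems(
    text: str,
    model: Callable[[str], List[Prediction]],
    threshold: float = 0.8,
) -> List[str]:
    """Keep the predicted problems whose confidence reaches the threshold (assumed rule)."""
    return [problem for problem, confidence in model(text) if confidence >= threshold]


def stub_model(text: str) -> List[Prediction]:
    # Stand-in for the pre-trained language model; these outputs are made up.
    return [
        ("incorrect English translation of word x", 0.92),
        ("inconsistent unit notation", 0.41),
    ]


# Example: only the high-confidence prediction is treated as a problem.
reported = writing_problems("text of document A", stub_model, threshold=0.8)
```

  • Under this assumed rule, a higher threshold reports fewer problems and lowers the chance of a false report, at the cost of possibly missing real ones.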
  • Step 404 When it is determined that each document to be reviewed has a writing standard type problem, modification suggestion information corresponding to the document to be reviewed is generated.
  • The document review device may generate modification suggestion information corresponding to the document to be reviewed based on the writing standard type problem existing in the document to be reviewed. For example, assuming that the document to be reviewed has the writing standard type problem that the English translation of a certain Chinese word x in document A is incorrect, the document review device can generate the modification suggestion information "Please modify the English translation of word x."
  • The document review method for implementing IA by combining RPA and AI provided by the embodiments of the present disclosure automatically reviews, based on AI technology, whether the documents to be reviewed corresponding to the target business matter have writing standard type problems, thereby reducing the labor costs required for document review and improving the efficiency of document review.
  • In addition, when a writing standard type problem exists, modification suggestions can be provided to the provider of the document to be reviewed, making it convenient for the provider to modify the document to be reviewed.
  • Figure 5 is a flow chart of a document review method that combines RPA and AI to implement IA according to the fifth embodiment of the present disclosure. As shown in Figure 5, the method may include steps 501 to 506.
  • Step 501 Obtain at least one document to be reviewed corresponding to the target business matter.
  • Step 502 Review each document to be reviewed based on AI technology to determine whether there are multiple preset types of problems in each document to be reviewed.
  • The multiple preset types include the information completion type, the process specification type, and the writing standard type.
  • Step 503 When it is determined that each document to be reviewed has at least one preset type of problem, generate modification suggestion information corresponding to the document to be reviewed.
  • Step 504 When it is determined that each document to be reviewed does not have multiple preset types of problems, the document to be reviewed is sent to the manual review platform.
  • The document review method that combines RPA and AI to implement IA provided by the embodiments of the present disclosure can be applied to the pre-review process that precedes manual review of documents.
  • When the document review device determines that the documents to be reviewed have none of the multiple preset types of problems, it can determine that they have passed the pre-review, and each document to be reviewed can then be sent to the manual review platform for further manual review.
  • Because the documents are sent to the manual review platform only after the document review device has automatically reviewed them and determined that they have none of the multiple preset types of problems, the labor costs required to review the documents are reduced, the number of interactions between the provider of the documents and the approval department is reduced, and the efficiency of handling the target business matter is improved.
  • For example, the document review method that combines RPA and AI to implement IA can be used to pre-review registration application documents submitted by pharmaceutical companies.
  • When the document review device determines that the registration application documents have a problem of at least one of the information completion type, the process specification type, and the writing standard type, corresponding modification suggestion information can be generated so that the pharmaceutical company can modify the documents based on the modification suggestion information.
  • When the registration application documents have none of these types of problems, each document to be reviewed can be sent to the manual review platform so that the approval department can review it further.
  • As a result, after the pharmaceutical company submits the registration application documents, timely feedback can be provided to it, shortening the pharmaceutical registration application cycle and improving registration application efficiency.
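  • The routing logic of steps 502 to 504 can be sketched in Python as follows. The check functions are hypothetical stand-ins for the three preset-type reviews, and the returned status strings are assumptions of this sketch; the disclosure does not prescribe a data format.

```python
def pre_review(documents, checks):
    """Run every preset-type check, then return suggestions or route to manual review.

    `checks` maps a preset type name to a function that takes the documents
    and returns a list of modification suggestions (empty if no problem).
    """
    suggestions = {}
    for problem_type, check in checks.items():
        found = check(documents)
        if found:
            suggestions[problem_type] = found
    if suggestions:
        # Step 503: at least one preset type of problem exists.
        return {"status": "needs_modification", "suggestions": suggestions}
    # Step 504: no problem of any preset type, so forward to manual review.
    return {"status": "sent_to_manual_review", "suggestions": {}}


# Hypothetical stand-ins for the three reviews described in the embodiments.
example_checks = {
    "information_completion": lambda docs: (
        [] if "application_form" in docs else ["Please provide application_form."]
    ),
    "process_specification": lambda docs: [],
    "writing_standard": lambda docs: [],
}

passed = pre_review({"application_form": "recognized text"}, example_checks)
failed = pre_review({}, example_checks)
```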
  • Step 505 Call the robotic process automation RPA robot to access the business system to obtain the contact information of the provider of each document to be reviewed.
  • The contact information of the provider can be a phone number, an email address, etc., and this disclosure does not limit this.
  • The business system stores the contact information of the providers of the documents to be reviewed, and the document review device can obtain this contact information from the business system through background data access.
  • The document review device can also call the RPA robot to access the business system through web page access to obtain the contact information of the provider of each document to be reviewed.
  • A web page refers to a file on the World Wide Web organized in HTML (Hyper Text Markup Language) format.
  • Step 506 Use the RPA robot to feed back the review results of each document to be reviewed to the corresponding provider through contact information.
  • The review results may indicate that each document to be reviewed has passed the review, or that it has failed the review, in which case they may also include the reasons for the failure, the modification suggestion information, etc.
  • The RPA robot can be used to feed back the review results to the corresponding provider through the provider's contact information, so that the provider can obtain the review results of the documents to be reviewed in a timely manner.
  • By calling the RPA robot to access the business system, linkage between the document review device and the business system can be realized.
  • The RPA robot is used to obtain the contact information of each provider and to feed back the review results of each document to be reviewed to the corresponding provider through that contact information.
  • In this way, RPA and AI are combined to implement IA: the contact information of each provider is obtained automatically and the review results of each document to be reviewed are automatically fed back to the corresponding provider, further reducing the labor costs required to feed back the review results.
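  • Steps 505 and 506 can be illustrated with the following Python sketch. `StubRpaRobot` is a hypothetical stand-in that looks up contacts in a dictionary and records outgoing messages; a real RPA robot would operate the business system and a messaging channel, whose interfaces the disclosure does not specify.

```python
class StubRpaRobot:
    """Stand-in for an RPA robot; a real robot would operate the business system."""

    def __init__(self, contacts):
        self._contacts = contacts  # provider id -> contact information
        self.sent = []             # (contact, message) pairs, recorded for inspection

    def fetch_contact(self, provider_id):
        # Step 505: obtain the provider's contact information.
        return self._contacts.get(provider_id)

    def send(self, contact, message):
        # Step 506: feed the review result back through the contact information.
        self.sent.append((contact, message))


def feed_back_results(robot, results):
    """Send each review result to its provider; return providers that had no contact.

    `results` maps a provider id to the review result text.
    """
    unreachable = []
    for provider_id, message in results.items():
        contact = robot.fetch_contact(provider_id)
        if contact is None:
            unreachable.append(provider_id)
            continue
        robot.send(contact, message)
    return unreachable


# Example: one provider has contact information on record, the other does not.
robot = StubRpaRobot({"provider-1": "reviewer@example.com"})
not_notified = feed_back_results(
    robot,
    {"provider-1": "Passed pre-review.", "provider-2": "Failed: missing document."},
)
```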
  • embodiments of the present disclosure also propose a document review device that combines RPA and AI to implement IA.
  • Figure 6 is a schematic structural diagram of a document review device that combines RPA and AI to implement IA according to the sixth embodiment of the present disclosure.
  • As shown in Figure 6, the document review device 600 that combines RPA and AI to implement IA includes: an acquisition module 610, a review module 620, and a generation module 630.
  • the acquisition module 610 is used to acquire at least one document to be reviewed corresponding to the target business matter.
  • the review module 620 is used to review each document to be reviewed based on AI technology to determine whether there are multiple preset types of problems in each document to be reviewed.
  • the generation module 630 is configured to generate modification suggestion information corresponding to the document to be reviewed when it is determined that each document to be reviewed has at least one preset type of problem.
  • The document review device 600 that combines RPA and AI to implement IA in the embodiment of the present disclosure can execute the document review method that combines RPA and AI to implement IA provided in the above embodiments.
  • The document review device 600 can be implemented by software and/or hardware.
  • The document review device 600 can be an electronic device, or can be configured in an electronic device, to implement automatic review of documents, thereby reducing the labor costs required for document review and improving the efficiency of document review.
  • The electronic device may include but is not limited to a terminal device, a server, etc., and this embodiment does not specifically limit the electronic device.
  • The preset type includes an information completion type; the review module 620 is used to:
  • The review module 620 is also used to:
  • The review module 620 is also used to:
  • compare the field values corresponding to the same field to be reviewed to determine whether the information in each document to be reviewed is consistent.
  • The preset type includes a process specification type; the review module 620 is also used to:
  • The preset type includes a writing standard type; the review module 620 is also used to:
  • input the text information contained in each document to be reviewed into the pre-trained language model, so as to determine, through the language model, whether each document to be reviewed has a writing standard type problem.
  • the document review device 600 that combines RPA and AI to implement IA also includes:
  • the first sending module is used to send each document to be reviewed to the manual review platform when it is determined that each document to be reviewed does not have multiple preset types of problems.
  • the document review device 600 that combines RPA and AI to implement IA also includes:
  • the calling module is used to call the robotic process automation RPA robot to access the business system to obtain the contact information of the provider of each document to be reviewed;
  • the second sending module is used to feed back, by means of the RPA robot, the review results of each document to be reviewed to the corresponding provider through the contact information.
  • the target business matter is a pharmaceutical registration matter.
  • The document review device that combines RPA and AI to implement IA in the embodiment of the present disclosure obtains at least one document to be reviewed corresponding to the target business matter; reviews each document to be reviewed based on AI technology to determine whether each document to be reviewed has problems of multiple preset types; and, when it is determined that each document to be reviewed has a problem of at least one preset type, generates modification suggestion information corresponding to the document to be reviewed. As a result, documents to be reviewed corresponding to target business matters can be reviewed automatically based on AI technology, reducing the labor costs required for document review and improving the efficiency of document review.
  • In addition, when a problem exists, modification suggestions can be provided to the provider of the document to be reviewed, making it convenient for the provider to modify the document to be reviewed.
  • Embodiments of the present disclosure also provide an electronic device, including a memory, a processor, and a computer program stored in the memory and executable on the processor.
  • When the processor executes the computer program, the document review method for implementing IA by combining RPA and AI as described in any of the foregoing method embodiments is implemented.
  • Embodiments of the present disclosure also provide a computer-readable storage medium on which a computer program is stored.
  • When the computer program is executed by a processor, the document review method for implementing IA by combining RPA and AI as described in any of the foregoing method embodiments is implemented.
  • Embodiments of the present disclosure also provide a computer program product.
  • When the instructions in the computer program product are executed by a processor, the document review method for implementing IA by combining RPA and AI as described in any of the foregoing method embodiments is implemented.
  • Embodiments of the present disclosure also provide a computer program that includes computer program code.
  • When the computer program code is run on a computer, it causes the computer to perform the document review method for implementing IA by combining RPA and AI described in any embodiment of the present disclosure.
  • FIG. 7 illustrates a block diagram of an exemplary electronic device suitable for implementing embodiments of the present disclosure.
  • the electronic device 10 shown in FIG. 7 is only an example and should not bring any limitations to the functions and scope of use of the embodiments of the present disclosure.
  • electronic device 10 is embodied in the form of a general computing device.
  • the components of electronic device 10 may include, but are not limited to: one or more processors or processing units 16, system memory 28, and a bus 18 connecting various system components, including memory 28 and processing unit 16.
  • Bus 18 represents one or more of several types of bus structures, including a memory bus or memory controller, a peripheral bus, an accelerated graphics port, and a processor or local bus using any of a variety of bus architectures.
  • For example, these architectures include but are not limited to the Industry Standard Architecture (hereinafter referred to as: ISA) bus, the Micro Channel Architecture (hereinafter referred to as: MCA) bus, the enhanced ISA bus, the Video Electronics Standards Association (hereinafter referred to as: VESA) local bus, and the Peripheral Component Interconnect (hereinafter referred to as: PCI) bus.
  • electronic device 10 includes a variety of computer system readable media. These media may be any available media that can be accessed by electronic device 10, including volatile and nonvolatile media, removable and non-removable media.
  • the memory 28 may include computer system-readable media in the form of volatile memory, such as random access memory (Random Access Memory; hereinafter referred to as: RAM) 30 and/or cache memory 32 .
  • Other removable/non-removable, volatile/non-volatile computer system storage media may further be included.
  • storage system 34 may be used to read and write to non-removable, non-volatile magnetic media (not shown in Figure 7, commonly referred to as a "hard drive").
  • A magnetic disk drive may be provided for reading from and writing to a removable non-volatile magnetic disk (e.g., a "floppy disk"), and an optical disc drive may be provided for reading from and writing to a removable non-volatile optical disc (e.g., a Compact Disc Read Only Memory (hereinafter referred to as: CD-ROM), a Digital Video Disc Read Only Memory (hereinafter referred to as: DVD-ROM), or other optical media).
  • each drive may be connected to bus 18 through one or more data media interfaces.
  • Memory 28 may include at least one program product having a set (eg, at least one) of program modules configured to perform the functions of embodiments of the present disclosure.
  • A program/utility 40 having a set of (at least one) program modules 42 may be stored, for example, in memory 28; each of these program modules, or some combination of them, may include an implementation of a network environment.
  • Program modules 42 generally perform functions and/or methods in the embodiments described in this disclosure.
  • Electronic device 10 may also communicate with one or more external devices 14 (e.g., keyboard, pointing device, display 24, etc.), with one or more devices that enable a user to interact with electronic device 10, and/or with any device (e.g., network card, modem, etc.) that enables electronic device 10 to communicate with one or more other computing devices. This communication may occur through input/output (I/O) interface 22.
  • In addition, the electronic device 10 can also communicate with one or more networks (such as a local area network (hereinafter referred to as: LAN), a wide area network (hereinafter referred to as: WAN), and/or a public network such as the Internet) through the network adapter 20.
  • Network adapter 20 communicates with other modules of electronic device 10 via bus 18.
  • It should be understood that, although not shown in Figure 7, other hardware and/or software modules may be used in conjunction with electronic device 10, including but not limited to: microcode, device drivers, redundant processing units, external disk drive arrays, RAID systems, tape drives, and data backup storage systems.
  • the processing unit 16 executes programs stored in the memory 28 to perform various functional applications and data processing, such as implementing the methods mentioned in the previous embodiments.
  • The terms "first" and "second" are used for descriptive purposes only and cannot be understood as indicating or implying relative importance or implicitly indicating the quantity of the indicated technical features. Therefore, features defined as "first" and "second" may explicitly or implicitly include at least one of these features. In the description of the embodiments of the present disclosure, "plurality" means at least two, such as two or three, unless otherwise explicitly and specifically limited.
  • A "computer-readable medium" may be any device that can contain, store, communicate, propagate, or transport a program for use by or in connection with an instruction execution system, apparatus, or device.
  • A non-exhaustive list of computer-readable media includes the following: an electrical connection with one or more wires (electronic device), a portable computer diskette (magnetic device), random access memory (RAM), read-only memory (ROM), erasable programmable read-only memory (EPROM or flash memory), fiber optic devices, and portable compact disc read-only memory (CD-ROM).
  • The computer-readable medium may even be paper or another suitable medium on which the program is printed, since the paper or other medium may be optically scanned, for example, and then edited, interpreted, or otherwise processed in a suitable manner as necessary to obtain the program electronically, which is then stored in computer memory.
  • Various parts of the present disclosure may be implemented in hardware, software, firmware, or combinations thereof.
  • Various steps or methods may be implemented in software or firmware stored in a memory and executed by a suitable instruction execution system.
  • For example, if implemented in hardware, as in another embodiment, they can be implemented by any one or a combination of the following technologies known in the art: discrete logic circuits having logic gates for implementing logic functions on data signals, application-specific integrated circuits having suitable combinational logic gates, programmable gate arrays (PGA), field-programmable gate arrays (FPGA), etc.
  • The program can be stored in a computer-readable storage medium.
  • Each functional unit in the various embodiments of the present disclosure may be integrated into one processing module, each unit may exist physically alone, or two or more units may be integrated into one module.
  • The above integrated modules can be implemented in the form of hardware or of software function modules. If an integrated module is implemented in the form of a software function module and sold or used as an independent product, it can also be stored in a computer-readable storage medium.
  • The storage media mentioned above can be read-only memory, magnetic disks, optical disks, etc.

Landscapes

  • Engineering & Computer Science (AREA)
  • Theoretical Computer Science (AREA)
  • General Physics & Mathematics (AREA)
  • Business, Economics & Management (AREA)
  • Physics & Mathematics (AREA)
  • Human Resources & Organizations (AREA)
  • Strategic Management (AREA)
  • Data Mining & Analysis (AREA)
  • Entrepreneurship & Innovation (AREA)
  • General Engineering & Computer Science (AREA)
  • Databases & Information Systems (AREA)
  • Computer Vision & Pattern Recognition (AREA)
  • Economics (AREA)
  • General Business, Economics & Management (AREA)
  • Tourism & Hospitality (AREA)
  • Artificial Intelligence (AREA)
  • Quality & Reliability (AREA)
  • Multimedia (AREA)
  • Operations Research (AREA)
  • Marketing (AREA)
  • Life Sciences & Earth Sciences (AREA)
  • Animal Behavior & Ethology (AREA)
  • Computational Linguistics (AREA)
  • Management, Administration, Business Operations System, And Electronic Commerce (AREA)

Abstract

The present application relates to the technical field of robotic process automation (RPA) and artificial intelligence (AI), and relates to a document review method and apparatus for implementing IA by combining RPA and AI, and an electronic device. The method comprises: acquiring at least one document to be reviewed corresponding to a target service item; reviewing each said document on the basis of an AI technology, to determine whether a plurality of preset types of problems exist in each said document; and when it is determined that at least one preset type of problem exists in each said document, generating modification suggestion information corresponding to said document. Thus, documents to be reviewed corresponding to the target service item are automatically reviewed on the basis of the AI technology, thereby reducing the labor costs required for document review and improving the efficiency of document review. According to the present application, contact information of a provider can also be acquired through the IA implemented by combining RPA and AI, and the review result of each document to be reviewed is automatically fed back to the corresponding provider, thereby further reducing the labor costs required for feedback of the review result.

Description

Document review method and apparatus for implementing IA by combining RPA and AI, and electronic device
Cross-Reference to Related Applications
This disclosure claims priority to Chinese Patent Application No. 2022111101693, filed in China on September 13, 2022, the entire content of which is incorporated herein by reference.
Technical Field
The present disclosure relates to the technical fields of robotic process automation and artificial intelligence, and specifically to a document review method and apparatus for implementing IA by combining RPA and AI, an electronic device, a computer-readable storage medium, a computer program product, and a computer program.
Background
Robotic process automation (RPA) uses dedicated "robot software" to simulate human operations on a computer and automatically execute process tasks according to rules.
Artificial intelligence (AI) is a technical science that studies and develops theories, methods, technologies, and application systems for simulating, extending, and expanding human intelligence.
Intelligent automation (IA) is a general term for a range of technologies spanning robotic process automation to artificial intelligence. It combines RPA with multiple AI technologies, such as optical character recognition (OCR), intelligent character recognition (ICR), process mining, deep learning (DL), machine learning (ML), natural language processing (NLP), automatic speech recognition (ASR), text-to-speech (TTS), and computer vision (CV), to create end-to-end business processes that can think, learn, and adapt. It covers the entire journey from process discovery and process automation to automatic and continuous data collection, understanding the meaning of the data, and using the data to manage and optimize business processes.
In many business scenarios, documents submitted by users need to be reviewed. For example, to handle pharmaceutical registration matters such as medical device registration and drug registration, a pharmaceutical company can submit the relevant application documents to the Food and Drug Administration. The approval department of the Food and Drug Administration reviews the submitted documents, issues the corresponding certificate when the review passes, and notifies the pharmaceutical company to revise the documents when the review fails. In the related art, document review is usually performed manually, which incurs high labor costs and is inefficient. How to review documents efficiently at lower labor cost has become an urgent problem to be solved.
Summary
Embodiments of the present disclosure provide a document review method and apparatus for implementing IA by combining RPA and AI, an electronic device, a computer-readable storage medium, a computer program product, and a computer program, to solve the technical problem that document review methods in the related art have high labor costs and low efficiency.
An embodiment of the first aspect of the present disclosure provides a document review method for implementing IA by combining RPA and AI, including: obtaining at least one document to be reviewed corresponding to a target business matter; reviewing each document to be reviewed based on AI technology to determine whether each document to be reviewed has problems of multiple preset types; and, based on determining that each document to be reviewed has a problem of at least one preset type, generating modification suggestion information corresponding to the document to be reviewed.
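The three steps of the first aspect can be pictured as a small pipeline. The following is only a minimal sketch under stated assumptions: the names `Document`, `review`, `suggestions`, and `empty_text_check` are hypothetical and do not come from the disclosure, and a trivial rule stands in for the AI-based checks.

```python
from dataclasses import dataclass
from typing import Callable


@dataclass
class Document:
    """Hypothetical container for one document to be reviewed."""
    doc_id: str
    text: str


# Assumption: each preset-type check takes all submitted documents and
# returns a list of human-readable problem descriptions (empty if none).
Check = Callable[[list[Document]], list[str]]


def review(documents: list[Document], checks: dict[str, Check]) -> dict[str, list[str]]:
    """Run every preset-type check and collect the problems found, keyed by type."""
    problems: dict[str, list[str]] = {}
    for preset_type, check in checks.items():
        found = check(documents)
        if found:
            problems[preset_type] = found
    return problems


def suggestions(problems: dict[str, list[str]]) -> list[str]:
    """Turn detected problems into modification suggestion strings."""
    return [
        f"[{preset_type}] please fix: {problem}"
        for preset_type, found in problems.items()
        for problem in found
    ]


def empty_text_check(documents: list[Document]) -> list[str]:
    """Toy rule standing in for an AI-based check: flag documents with no text."""
    return [f"{d.doc_id} has no text" for d in documents if not d.text.strip()]
```

In this sketch, an empty result from `review` would correspond to the case in which no preset-type problem is found; what happens next (for example, forwarding to manual review) is left to the caller.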
In some embodiments, the preset types include an information completion type; reviewing each document to be reviewed based on AI technology to determine whether each document to be reviewed has problems of multiple preset types includes: obtaining an identifier of each document to be reviewed; based on the target business matter, querying a pre-created knowledge graph corresponding to the target business matter to obtain an identifier of at least one target document required by the target business matter; and comparing the identifier of each document to be reviewed with the identifier of each target document to determine whether the documents to be reviewed are complete.
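The identifier comparison described above amounts to a set difference. A minimal sketch follows, assuming the knowledge graph lookup can be reduced to a mapping from business matter to required document identifiers; the mapping `REQUIRED_DOCUMENTS`, its contents, and the function name are illustrative assumptions, not part of the disclosure.

```python
# Hypothetical stand-in for the knowledge graph query:
# business matter -> identifiers of the target documents it requires.
REQUIRED_DOCUMENTS: dict[str, set[str]] = {
    "drug_registration": {"application_form", "product_manual", "test_report"},
}


def missing_documents(business_matter: str, submitted_ids: set[str]) -> set[str]:
    """Return the required document identifiers that were not submitted."""
    required = REQUIRED_DOCUMENTS[business_matter]
    return required - submitted_ids
```

An empty result means the set of submitted documents is complete for that business matter.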
In some embodiments, reviewing each document to be reviewed based on AI technology to determine whether each document to be reviewed has problems of multiple preset types further includes: performing text recognition on each document to be reviewed based on optical character recognition (OCR) technology to obtain the text information contained in each document to be reviewed; performing information extraction on the text information contained in each document to be reviewed to obtain the fields to be reviewed and the corresponding field values contained in each piece of text information; based on the target business matter, querying the knowledge graph to obtain the fields required in each target document; and determining whether the fields to be reviewed contained in each piece of text information include a target field consistent with a field required in the corresponding target document, and determining whether the target field has a corresponding field value, so as to determine whether the information in each document to be reviewed is complete.
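The field-level completeness check has two parts: the required field must be present, and it must carry a value. A minimal sketch follows, assuming OCR and information extraction have already produced a field-to-value dictionary; `REQUIRED_FIELDS` is a hypothetical stand-in for the knowledge graph query, and the field names are invented for illustration.

```python
# Hypothetical stand-in for the knowledge graph query:
# target document identifier -> fields that document is required to contain.
REQUIRED_FIELDS: dict[str, list[str]] = {
    "application_form": ["name", "drug_name", "dosage"],
}


def incomplete_fields(doc_id: str, extracted: dict[str, str]) -> dict[str, str]:
    """Return {field: reason} for each required field that is absent or has no value.

    `extracted` is assumed to be the output of OCR plus information extraction.
    """
    problems: dict[str, str] = {}
    for name in REQUIRED_FIELDS[doc_id]:
        if name not in extracted:
            problems[name] = "field missing"
        elif not extracted[name].strip():
            problems[name] = "field value missing"
    return problems
```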
In some embodiments, reviewing each document to be reviewed based on AI technology to determine whether each document to be reviewed has problems of multiple preset types further includes: obtaining the identical fields to be reviewed across all of the text information; and, based on the identical fields to be reviewed having corresponding field values, comparing the field values corresponding to the identical fields to be reviewed to determine whether the information in the documents to be reviewed is consistent.
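The cross-document consistency check can be sketched as grouping values by field name and flagging any field whose values differ. This is an illustrative sketch only: the function name and data layout are assumptions, and exact string comparison stands in for whatever matching the actual system uses.

```python
from collections import defaultdict


def inconsistent_fields(
    extracted_by_doc: dict[str, dict[str, str]],
) -> dict[str, dict[str, str]]:
    """Find fields that occur in several documents with differing values.

    `extracted_by_doc` maps a document identifier to its extracted
    {field: field value} pairs. Returns {field: {doc_id: value}} for each
    field whose non-empty values disagree across documents.
    """
    values: dict[str, dict[str, str]] = defaultdict(dict)
    for doc_id, fields in extracted_by_doc.items():
        for name, value in fields.items():
            if value.strip():  # only compare fields that actually have a value
                values[name][doc_id] = value.strip()
    return {
        name: by_doc
        for name, by_doc in values.items()
        if len(by_doc) > 1 and len(set(by_doc.values())) > 1
    }
```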
In some embodiments, the preset types include a process specification type; reviewing each document to be reviewed based on AI technology to determine whether each document to be reviewed has problems of multiple preset types includes: performing text recognition on each document to be reviewed based on OCR technology to obtain the text information contained in each document to be reviewed; based on the target business matter, querying a pre-created knowledge graph corresponding to the target business matter to obtain the process specification corresponding to the target business matter; and, based on each document to be reviewed and the text information it contains, determining whether each document to be reviewed satisfies the process specification.
In some embodiments, the preset types include a writing specification type; reviewing each document to be reviewed based on AI technology to determine whether each document to be reviewed has problems of multiple preset types includes: performing text recognition on each document to be reviewed based on OCR technology to obtain the text information contained in each document to be reviewed; and inputting the text information contained in each document to be reviewed into a pre-trained language model, so as to determine, through the language model, whether each document to be reviewed has a problem of the writing specification type.
In some embodiments, the document review method for implementing IA by combining RPA and AI further includes: based on determining that each document to be reviewed does not have problems of the multiple preset types, sending each document to be reviewed to a manual review platform.
In some embodiments, the document review method for implementing IA by combining RPA and AI further includes: invoking a robotic process automation (RPA) robot to access a business system to obtain the contact information of the provider of each document to be reviewed; and using the RPA robot to feed back, through the contact information, the review result of each document to be reviewed to the corresponding provider.
In some embodiments, the target business matter is a pharmaceutical registration matter.
An embodiment of the second aspect of the present disclosure provides a document review apparatus for implementing IA by combining RPA and AI, including: an acquisition module configured to obtain at least one document to be reviewed corresponding to a target business matter; a review module configured to review each document to be reviewed based on AI technology to determine whether each document to be reviewed has problems of multiple preset types; and a generation module configured to generate, when it is determined that each document to be reviewed has a problem of at least one preset type, modification suggestion information corresponding to the document to be reviewed.
In some embodiments, the preset types include an information completion type; the review module is configured to: obtain an identifier of each document to be reviewed; based on the target business matter, query a pre-created knowledge graph corresponding to the target business matter to obtain an identifier of at least one target document required by the target business matter; and compare the identifier of each document to be reviewed with the identifier of each target document to determine whether the documents to be reviewed are complete.
In some embodiments, the review module is further configured to: perform text recognition on each document to be reviewed based on optical character recognition (OCR) technology to obtain the text information contained in each document to be reviewed; perform information extraction on the text information contained in each document to be reviewed to obtain the fields to be reviewed and the corresponding field values contained in each piece of text information; based on the target business matter, query the knowledge graph to obtain the fields required in each target document; and determine whether the fields to be reviewed contained in each piece of text information include a target field consistent with a field required in the corresponding target document, and determine whether the target field has a corresponding field value, so as to determine whether the information in each document to be reviewed is complete.
In some embodiments, the review module is further configured to: obtain the identical fields to be reviewed across all of the text information; and, when the identical fields to be reviewed have corresponding field values, compare the field values corresponding to the identical fields to be reviewed to determine whether the information in the documents to be reviewed is consistent.
In some embodiments, the preset types include a process specification type; the review module is further configured to: perform text recognition on each document to be reviewed based on OCR technology to obtain the text information contained in each document to be reviewed; based on the target business matter, query a pre-created knowledge graph corresponding to the target business matter to obtain the process specification corresponding to the target business matter; and, based on each document to be reviewed and the text information it contains, determine whether each document to be reviewed satisfies the process specification.
In some embodiments, the preset types include a writing specification type; the review module is further configured to: perform text recognition on each document to be reviewed based on OCR technology to obtain the text information contained in each document to be reviewed; and input the text information contained in each document to be reviewed into a pre-trained language model, so as to determine, through the language model, whether each document to be reviewed has a problem of the writing specification type.
In some embodiments, the document review apparatus for implementing IA by combining RPA and AI further includes: a first sending module configured to send each document to be reviewed to a manual review platform based on determining that each document to be reviewed does not have problems of the multiple preset types.
In some embodiments, the document review apparatus for implementing IA by combining RPA and AI further includes: a calling module configured to invoke a robotic process automation (RPA) robot to access a business system to obtain the contact information of the provider of each document to be reviewed; and a second sending module configured to use the RPA robot to feed back, through the contact information, the review result of each document to be reviewed to the corresponding provider.
In some embodiments, the target business matter is a pharmaceutical registration matter.
An embodiment of the third aspect of the present disclosure provides an electronic device, including a memory, a processor, and a computer program stored in the memory and executable on the processor; when the processor executes the computer program, the method described in the above embodiments of the present disclosure is implemented.
An embodiment of the fourth aspect of the present disclosure provides a computer-readable storage medium on which a computer program is stored; when the computer program is executed by a processor, the method described in the above embodiments of the present disclosure is implemented.
An embodiment of the fifth aspect of the present disclosure provides a computer program product, including a computer program that, when executed by a processor, implements the method described in the above embodiments of the present disclosure.
An embodiment of the sixth aspect of the present disclosure provides a computer program including computer program code that, when run on a computer, causes the computer to execute the method described in the above embodiments of the present disclosure.
The technical solutions provided by the embodiments of the present disclosure may include the following beneficial effects:
By obtaining at least one document to be reviewed corresponding to the target business matter, reviewing each document to be reviewed based on AI technology to determine whether each document to be reviewed has problems of multiple preset types, and generating modification suggestion information corresponding to the document to be reviewed when it is determined that each document to be reviewed has a problem of at least one preset type, the documents to be reviewed corresponding to the target business matter are reviewed automatically based on AI technology, which reduces the labor cost required for document review and improves its efficiency. In addition, generating modification suggestion information corresponding to the document to be reviewed when a problem of at least one preset type is found gives the provider of the document modification suggestions and makes it easier for the provider to revise the document. Embodiments of the present disclosure can also obtain the provider's contact information through IA implemented by combining RPA and AI, and automatically feed the review result of each document to be reviewed back to the corresponding provider, thereby further reducing the labor cost required for feeding back review results.
Additional aspects and advantages of embodiments of the present disclosure will be set forth in part in the description that follows, and in part will become apparent from that description or be learned through practice of the present disclosure.
Brief Description of the Drawings
In the drawings, unless otherwise specified, the same reference numerals denote the same or similar parts or elements throughout the several figures. The drawings are not necessarily drawn to scale. It should be understood that these drawings depict only some embodiments disclosed in accordance with the present disclosure and should not be considered as limiting the scope of the present disclosure.
Figure 1 is a schematic flowchart of a document review method for implementing IA by combining RPA and AI according to a first embodiment of the present disclosure;
Figure 2 is a schematic flowchart of a document review method for implementing IA by combining RPA and AI according to a second embodiment of the present disclosure;
Figure 3 is a schematic flowchart of a document review method for implementing IA by combining RPA and AI according to a third embodiment of the present disclosure;
Figure 4 is a schematic flowchart of a document review method for implementing IA by combining RPA and AI according to a fourth embodiment of the present disclosure;
Figure 5 is a schematic flowchart of a document review method for implementing IA by combining RPA and AI according to a fifth embodiment of the present disclosure;
Figure 6 is a schematic structural diagram of a document review apparatus for implementing IA by combining RPA and AI according to a sixth embodiment of the present disclosure;
Figure 7 is a block diagram of an electronic device used to implement the document review method for implementing IA by combining RPA and AI according to an embodiment of the present disclosure.
Detailed Description
Embodiments of the present disclosure are described in detail below, and examples of the embodiments are illustrated in the accompanying drawings, in which the same or similar reference numerals throughout denote the same or similar elements or elements having the same or similar functions. The embodiments described below with reference to the accompanying drawings are exemplary, are intended only to explain the present disclosure, and are not to be construed as limiting the present disclosure.
These and other aspects of the embodiments of the present disclosure will become apparent with reference to the following description and accompanying drawings. In the description and drawings, some specific implementations of the embodiments of the present disclosure are disclosed in detail to indicate some of the ways in which the principles of the embodiments may be practiced, but it should be understood that the scope of the embodiments of the present disclosure is not limited thereby. On the contrary, the embodiments of the present disclosure include all changes, modifications, and equivalents falling within the spirit and scope of the appended claims.
It should be noted that, in the technical solution of the present disclosure, the acquisition, storage, and application of the data involved all comply with relevant laws and regulations and do not violate public order and good customs.
Embodiments of the present disclosure provide a document review method and apparatus for implementing IA by combining RPA and AI, an electronic device, a computer-readable storage medium, a computer program product, and a computer program. The method includes: obtaining at least one document to be reviewed corresponding to a target business matter; reviewing each document to be reviewed based on AI technology to determine whether each document to be reviewed has problems of multiple preset types; and, based on determining that each document to be reviewed has a problem of at least one preset type, generating modification suggestion information corresponding to the document to be reviewed. In this way, the documents to be reviewed corresponding to the target business matter are reviewed automatically based on AI technology, which reduces the labor cost required for document review and improves the efficiency of document review.
The document review method and apparatus for implementing IA by combining RPA and AI, the electronic device, and the storage medium provided by the embodiments of the present disclosure can be applied to any field that requires document review, such as the medical field and the judicial field, and the embodiments of the present disclosure do not limit this. The embodiments of the present disclosure are described using the medical field as an example.
To describe the embodiments of the present invention clearly, the technical terms involved in the embodiments of the present invention are explained first.
In the description of the embodiments of the present disclosure, the term "plurality" means two or more.
In the description of the embodiments of the present disclosure, an "RPA robot" is a software robot that can combine AI technology and RPA technology to handle online business automatically. RPA robots have two characteristics, "connector" and "non-intrusive": by simulating the way a human operates, and without modifying the information systems, they extract, integrate, and connect data from different systems in a non-intrusive manner.
In the description of the embodiments of the present disclosure, a "field" and a "field value" are both fragments composed of a single character or multiple consecutive characters. A "field" can be understood as an attribute key and a "field value" as an attribute value; there is a correspondence between a field and its field value, and a field together with its corresponding field value forms one piece of structured data. For example, "Zhang San" is the field value corresponding to the field "Name", and "Name" and "Zhang San" form one piece of structured data.
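The field and field value relationship maps naturally onto key-value pairs. As a purely illustrative sketch, the hypothetical function below splits recognized text lines of the form `key: value` into a dictionary; the disclosure does not specify this format, and a real information extraction step would more likely use a trained model than a colon split.

```python
def parse_fields(text: str) -> dict[str, str]:
    """Collect 'field: field value' lines from recognized text into a dictionary.

    Lines without a colon are treated as free text and skipped.
    """
    record: dict[str, str] = {}
    for line in text.splitlines():
        if ":" not in line:
            continue
        key, value = line.split(":", 1)  # split on the first colon only
        record[key.strip()] = value.strip()
    return record
```

Here the pair `"Name"` and `"Zhang San"` would form one piece of structured data, matching the example in the definition above.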
In the description of the embodiments of the present disclosure, a "document to be reviewed" is a document received by an approval department for handling a certain business matter. Correspondingly, the "target business matter" is that business matter. The "provider" is the party that submits the documents to be reviewed to the approval department; the provider may be an individual, an enterprise, or the like, and the embodiments of the present disclosure place no limitation on this.
For example, to handle pharmaceutical registration matters such as medical device registration, drug registration, registration of a change in administration route, or registration of a change in drug dosage, a pharmaceutical enterprise can submit an application to the drug regulatory authority together with the relevant application documents. Assuming that a pharmaceutical enterprise submits a drug registration application and the relevant application documents to the drug regulatory authority, the drug registration matter is the target business matter, the documents submitted by the pharmaceutical enterprise when applying are the documents to be reviewed corresponding to the target business matter, and the pharmaceutical enterprise is the provider of the documents to be reviewed.
In the description of the embodiments of the present disclosure, a "target document" is a document needed to handle the target business matter successfully, that is, a document required by the target business matter.
In the description of the embodiments of the present disclosure, a "preset type" is a preconfigured type to which a problem that may exist in the documents to be reviewed belongs.
In the description of the embodiments of the present disclosure, the "information completion type" means that the set of documents to be reviewed is incomplete, or that the information in a document to be reviewed is incomplete, or that the information in the documents to be reviewed is inconsistent, so that supplementation or modification is needed. Problems of the information completion type may include, for example, the following. The documents to be reviewed are incomplete: for example, the target documents required by the target business matter include document A, document B, and document C, but the documents to be reviewed include only document A and document B. The information in a document to be reviewed is incomplete: for example, document A required by the target business matter must contain field a and a corresponding field value, and the documents to be reviewed include document A, but that document A either does not contain field a and its field value, or contains field a without a corresponding field value. The information in the documents to be reviewed is inconsistent: for example, the documents to be reviewed include document A, document B, and document C, where document A and document B both contain field a and a corresponding field value, but the field value of field a in document A differs from the field value of field a in document B.
In the description of the embodiments of the present disclosure, the "process specification type" means that the documents to be reviewed do not satisfy the process specification corresponding to the target business matter. The process specification corresponding to the target business matter is the set of rules that should be followed when the target business matter is handled. For example, taking medical device registration as the target business matter, assume that the corresponding process specification includes the following: if a new mandatory standard or national standard is issued and implemented within the validity period of the medical device registration certificate, and the changes made to the registered product to comply with the new mandatory standard or national reference standard are ones for which change registration is required, the registrant shall first complete the change registration procedure and obtain the change registration (filing) document approved by the original approval department before applying for renewal of registration. In that case, where a new mandatory standard is issued and implemented within the validity period of the medical device registration certificate of the registrant (that is, the provider of the documents to be reviewed in the embodiments of the present disclosure), and the changes made to the medical device to comply with the new mandatory standard are ones for which change registration is required, if the documents to be reviewed submitted by the registrant do not include the change registration (filing) document approved by the original approval department, the documents to be reviewed have a problem of the process specification type.
In the description of the embodiments of the present disclosure, the "writing specification type" means that the documents to be reviewed contain problems of writing style and format, such as typos, English translation errors, or non-standard use of technical terms.
In the description of the embodiments of the present disclosure, "OCR (Optical Character Recognition)" refers to the process in which an electronic device examines characters printed on paper, determines their shapes by detecting patterns of dark and light, and then translates the shapes into computer text using a character recognition method. In other words, for printed characters, the text of a paper document is optically converted into a black-and-white bitmap image file, and recognition software converts the text in the image into a text format that word processing software can further edit.
In the description of the embodiments of the present disclosure, "information extraction" means structuring the information contained in text into a table-like form. Information extraction may include named entity recognition and relation extraction. Named entity recognition identifies the various named entities in a piece of text. The named entities to be recognized usually include person names, place names, organization names, drugs, times, and so on, and can be configured according to the application scenario. For example, for pharmaceutical registration matters, the named entities to be recognized may include medical terms, regulatory terms, the registrant's address, the registrant's name, and the name of the agent handling the pharmaceutical registration matter. Relation extraction aims to identify the target relations among the entities in the text, that is, to extract the semantic relations between entities. Relation extraction can be implemented with techniques such as sequence labeling, classification, dependency parsing, and semantic dependency analysis.
In the description of the embodiments of the present disclosure, a "language model" is any machine model, such as a neural network model, used to determine whether a document to be reviewed has a problem of the writing specification type. The language model can be obtained in advance by training on training samples.
In the description of the embodiments of the present disclosure, a "manual review platform" is a platform on which documents can be reviewed by people, such as a human-machine collaboration platform.
In the description of the embodiments of the present disclosure, a "business system" is the online system that an approval department uses to handle business matters, such as the management system of the drug regulatory authority.
The document review method and apparatus, electronic device, computer-readable storage medium, computer program product, and computer program that combine RPA and AI to implement IA according to the embodiments of the present disclosure are described below with reference to the accompanying drawings.
First, the document review method combining RPA and AI to implement IA in the embodiments of the present disclosure is described with reference to the accompanying drawings.
Figure 1 is a flowchart of the document review method combining RPA and AI to implement IA according to the first embodiment of the present disclosure. As shown in Figure 1, the method may include steps 101 to 103.
Step 101: obtain at least one document to be reviewed corresponding to the target business matter.
It should be noted that the document review method combining RPA and AI to implement IA in the embodiments of the present disclosure can be executed by a document review apparatus combining RPA and AI to implement IA, hereinafter referred to as the document review apparatus. The document review apparatus can be implemented in software and/or hardware. It can be an electronic device, or it can be configured in an electronic device, to review documents automatically, thereby reducing the labor cost of document review and improving its efficiency. The electronic device may include, but is not limited to, a terminal device, a server, and the like; this embodiment places no specific limitation on the electronic device.
The documents to be reviewed corresponding to the target business matter may include one document or multiple documents; the present disclosure places no limitation on this.
In some embodiments, the document review apparatus can provide an upload interface through which the provider uploads the documents needed to handle the target business matter. The document review apparatus thereby obtains at least one document to be reviewed corresponding to the target business matter.
Step 102: review each document to be reviewed based on AI technology to determine whether the documents to be reviewed have problems of multiple preset types.
In some embodiments, the multiple preset types corresponding to the target business matter can be determined in advance as follows: summarize the problems that frequently occur in documents to be reviewed when the target business matter is handled, and classify these problems to obtain the multiple preset types. The document review apparatus can then review the documents to be reviewed one by one and, for each preset type, determine whether the documents to be reviewed have a problem of that preset type.
The multiple preset types may include, for example, the information completion type, the process specification type, and the writing specification type.
Different target business matters can correspond to different preset types.
Step 103: when it is determined that the documents to be reviewed have a problem of at least one preset type, generate modification suggestion information corresponding to the documents to be reviewed.
In some embodiments, the document review apparatus can generate the modification suggestion information corresponding to the documents to be reviewed according to the preset-type problems found in them.
For example, assume the document review apparatus determines that the documents to be reviewed have a problem of the information completion type, specifically that document A, which the target business matter requires, is missing from the documents submitted by the provider. The document review apparatus can then generate the modification suggestion information "Document A needs to be supplemented".
The document review method combining RPA and AI to implement IA provided by the embodiments of the present disclosure obtains at least one document to be reviewed corresponding to the target business matter; reviews each document to be reviewed based on AI technology to determine whether the documents to be reviewed have problems of multiple preset types; and, when it is determined that the documents to be reviewed have a problem of at least one preset type, generates modification suggestion information corresponding to the documents to be reviewed. The documents to be reviewed corresponding to the target business matter are thus reviewed automatically based on AI technology, which reduces the labor cost of document review and improves its efficiency. In addition, generating modification suggestion information when a problem of at least one preset type is found gives the provider of the documents concrete suggestions and makes it easier for the provider to revise the documents.
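The flow of steps 101 to 103 can be illustrated with a minimal Python sketch. Everything named here (`Document`, `review`, `missing_document_a`, and the idea of one checker function per preset type) is a hypothetical illustration and does not come from the disclosure; in particular, the AI-based review of step 102 is replaced by a plain function so that the example stays self-contained.

```python
from dataclasses import dataclass
from typing import Callable, Dict, List


@dataclass
class Document:
    """A document to be reviewed (hypothetical representation)."""
    doc_id: str      # identifier, e.g. a file name
    text: str = ""   # text content; in practice this would come from OCR


# A checker inspects all submitted documents and returns suggestion strings.
Checker = Callable[[List[Document]], List[str]]


def review(documents: List[Document],
           checkers: Dict[str, Checker]) -> Dict[str, List[str]]:
    """Run one checker per preset type (step 102) and collect the
    modification suggestions for every type that has a problem (step 103)."""
    suggestions: Dict[str, List[str]] = {}
    for preset_type, checker in checkers.items():
        problems = checker(documents)
        if problems:
            suggestions[preset_type] = problems
    return suggestions


def missing_document_a(documents: List[Document]) -> List[str]:
    """Stand-in checker: flags the case where Document A was not submitted."""
    ids = {d.doc_id for d in documents}
    return [] if "Document A" in ids else ["Document A needs to be supplemented"]


# Step 101: the documents received from the provider.
docs = [Document("Document B"), Document("Document C")]
result = review(docs, {"information completion": missing_document_a})
```

With the sample input, `result` maps the information completion type to the single suggestion about document A; an empty dictionary would mean that no preset-type problem was found.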
In some embodiments, the preset types may include the information completion type. With reference to Figure 2, the following describes the process, in the embodiments of the present disclosure, of reviewing each document to be reviewed based on AI technology to determine whether the documents to be reviewed have a problem of the information completion type.
Figure 2 is a flowchart of the document review method combining RPA and AI to implement IA according to the second embodiment of the present disclosure. As shown in Figure 2, the method may include steps 201 to 211.
Step 201: obtain at least one document to be reviewed corresponding to the target business matter.
For the specific implementation and principle of step 201, refer to the description of the above embodiment; details are not repeated here.
In some embodiments, problems of the information completion type may include: the documents to be reviewed are incomplete. Correspondingly, after obtaining the documents to be reviewed, the document review apparatus can determine whether they have a problem of the information completion type through steps 202 to 205 below.
Step 202: obtain the identifier of each document to be reviewed.
The identifier uniquely identifies a document to be reviewed. It may be the file name of the document to be reviewed, a number assigned to the document, or the like; the present disclosure places no limitation on this.
Step 203: based on the target business matter, query the pre-created knowledge graph corresponding to the target business matter to obtain the identifier of at least one target document required by the target business matter.
In some embodiments, for any business matter, a knowledge graph corresponding to the business matter can be created in advance according to the documents needed to handle it. The knowledge graph may include, for example, a first node corresponding to the business matter and second nodes corresponding to the documents needed to handle it, with the first node connected to each second node by an edge. The document review apparatus can therefore query the pre-created knowledge graph corresponding to the target business matter, determine the second nodes connected by an edge to the first node corresponding to the target business matter, and take the identifiers of the documents corresponding to those second nodes as the identifiers of the at least one target document required by the target business matter.
Step 204: compare the identifiers of the documents to be reviewed with the identifiers of the target documents to determine whether the documents to be reviewed are complete.
Step 205: when it is determined that the documents to be reviewed are incomplete, determine that the documents to be reviewed have a problem of the information completion type.
In some embodiments, if for every target document identifier there is an identical identifier among the documents to be reviewed, the documents to be reviewed can be determined to be complete. If for at least one target document identifier there is no identical identifier among the documents to be reviewed, the documents to be reviewed can be determined to be incomplete, and it can further be determined that they have a problem of the information completion type.
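The identifier comparison of steps 202 to 205 amounts to a set difference. The sketch below assumes, purely for illustration, that the knowledge graph lookup of step 203 has already been reduced to a dictionary from business matter to required document identifiers; `KNOWLEDGE_GRAPH` and `find_missing_documents` are hypothetical names.

```python
from typing import Dict, Iterable, List

# Hypothetical result of the knowledge graph query in step 203:
# business matter -> identifiers of the target documents it requires.
KNOWLEDGE_GRAPH: Dict[str, List[str]] = {
    "drug registration": ["Document A", "Document B", "Document C"],
}


def find_missing_documents(graph: Dict[str, List[str]],
                           matter: str,
                           submitted_ids: Iterable[str]) -> List[str]:
    """Return the target document identifiers that have no identical
    identifier among the submitted documents (step 204)."""
    required = set(graph.get(matter, []))
    return sorted(required - set(submitted_ids))


missing = find_missing_documents(
    KNOWLEDGE_GRAPH, "drug registration", ["Document A", "Document B"]
)
# Step 205: a non-empty result indicates an information completion problem.
has_completion_problem = bool(missing)
```

A non-empty `missing` list can be turned directly into suggestions of the form "Document C needs to be supplemented".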
In some embodiments, problems of the information completion type may also include: the information in a document to be reviewed is incomplete. Correspondingly, after step 205, the document review apparatus can also determine whether the documents to be reviewed have a problem of the information completion type through steps 206 to 210 below.
Step 206: perform text recognition on each document to be reviewed based on optical character recognition (OCR) technology to obtain the text information contained in each document to be reviewed.
Step 207: perform information extraction on the text information contained in each document to be reviewed to obtain the fields to be reviewed and the corresponding field values contained in each piece of text information.
Step 208: based on the target business matter, query the knowledge graph to obtain the fields required in each target document.
In some embodiments, for any business matter, the corresponding knowledge graph can also be created according to the fields that each document needed for the business matter must contain. Correspondingly, in addition to the first node corresponding to the business matter and the second nodes corresponding to the documents needed to handle it, the knowledge graph includes third nodes corresponding to the fields that each document must contain, with each third node connected to its second node by an edge. The document review apparatus can therefore query the pre-created knowledge graph corresponding to the target business matter, determine the third nodes connected by an edge to each second node, and take the field corresponding to each third node as a field required in the corresponding target document (that is, the target document corresponding to the second node to which the third node is connected).
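The three-level graph described above (business matter, required documents, required fields) can be sketched as a small adjacency structure. This is only an illustrative assumption about how such a graph might be stored: the `KnowledgeGraph` class, the `(kind, name)` tuple encoding of nodes, and the sample field names are all hypothetical, and the disclosure does not prescribe any particular graph store.

```python
from collections import defaultdict
from typing import Dict, Set, Tuple

Node = Tuple[str, str]  # (kind, name), e.g. ("document", "Document A")


class KnowledgeGraph:
    """Minimal directed graph keyed by (kind, name) nodes."""

    def __init__(self) -> None:
        self._edges: Dict[Node, Set[Node]] = defaultdict(set)

    def add_edge(self, source: Node, target: Node) -> None:
        self._edges[source].add(target)

    def neighbors(self, node: Node, kind: str) -> Set[Node]:
        """Nodes of the given kind connected to `node` by an edge."""
        return {n for n in self._edges.get(node, set()) if n[0] == kind}


def required_fields(graph: KnowledgeGraph, matter: str) -> Dict[str, Set[str]]:
    """Step 208: map each target document to the fields it must contain."""
    result: Dict[str, Set[str]] = {}
    for doc_node in graph.neighbors(("matter", matter), "document"):
        result[doc_node[1]] = {f[1] for f in graph.neighbors(doc_node, "field")}
    return result


graph = KnowledgeGraph()
matter_node = ("matter", "drug registration")             # first node
graph.add_edge(matter_node, ("document", "Document A"))   # second nodes
graph.add_edge(matter_node, ("document", "Document B"))
graph.add_edge(("document", "Document A"), ("field", "registrant name"))  # third nodes
graph.add_edge(("document", "Document A"), ("field", "registrant address"))
graph.add_edge(("document", "Document B"), ("field", "registrant name"))

fields = required_fields(graph, "drug registration")
```

The same `neighbors` lookup with kind `"document"` yields the target document identifiers needed in step 203, so one structure can serve both queries.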
Step 209: determine whether, among the fields to be reviewed contained in each piece of text information, there is a target field that matches each field required in the corresponding target document, and determine whether each target field has a corresponding field value, so as to determine whether the information in each document to be reviewed is complete.
Step 210: when it is determined that the information in the documents to be reviewed is incomplete, determine that the documents to be reviewed have a problem of the information completion type.
In some embodiments, if for every field required in every target document the document review apparatus determines that the fields to be reviewed in the text information of the corresponding document contain a matching target field, and every target field has a corresponding field value, the information in the documents to be reviewed can be determined to be complete.
In some embodiments, if for at least one field required in a target document the document review apparatus determines that the fields to be reviewed in the text information of the corresponding document contain no matching target field, the information in the documents to be reviewed can be determined to be incomplete, and it can further be determined that the documents to be reviewed have a problem of the information completion type.
In some embodiments, a field in a document to be reviewed may have no corresponding field value. In that case, information extraction on the text information of the document may fail to obtain a field value for that field to be reviewed; that is, the field to be reviewed has no corresponding field value. If for at least one field required in a target document the document review apparatus determines that the fields to be reviewed in the text information of the corresponding document contain a matching target field, but that target field has no corresponding field value, the information in the documents to be reviewed can be determined to be incomplete, and it can further be determined that the documents to be reviewed have a problem of the information completion type.
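The two failure cases of steps 209 and 210 (required field absent, or present without a value) can be sketched as follows. The input shapes are assumptions made for illustration: `required` maps each target document to its required fields, and `extracted` maps each submitted document to the field/value pairs produced by information extraction, with `None` or an empty string standing for a missing value. The function name `find_incomplete_fields` is hypothetical.

```python
from typing import Dict, List, Optional, Set, Tuple


def find_incomplete_fields(
    required: Dict[str, Set[str]],
    extracted: Dict[str, Dict[str, Optional[str]]],
) -> List[Tuple[str, str, str]]:
    """Return (document, field, reason) for every required field that is
    either absent from the extracted fields or present without a value."""
    problems: List[Tuple[str, str, str]] = []
    for doc_id in sorted(required):
        found = extracted.get(doc_id, {})
        for name in sorted(required[doc_id]):
            if name not in found:
                problems.append((doc_id, name, "field missing"))
            elif not found[name]:
                problems.append((doc_id, name, "field value missing"))
    return problems


required = {"Document A": {"registrant name", "registrant address"}}
extracted = {"Document A": {"registrant name": None}}
problems = find_incomplete_fields(required, extracted)
```

An empty result corresponds to the case in which the information in the documents to be reviewed is complete.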
In some embodiments, problems of the information completion type may also include: the information in the documents to be reviewed is inconsistent. Correspondingly, after step 210, the document review apparatus can also determine whether the documents to be reviewed have a problem of the information completion type through the following steps:
obtain the fields to be reviewed that are the same across all the text information;
where the same fields to be reviewed have corresponding field values, compare the field values of each same field to determine whether the information in the documents to be reviewed is consistent;
when it is determined that the information in the documents to be reviewed is inconsistent, determine that the documents to be reviewed have a problem of the information completion type.
In some embodiments, if the document review apparatus determines that, across all the text information, the field values of every same field to be reviewed are identical, the information in the documents to be reviewed can be determined to be consistent. If the document review apparatus determines that, across all the text information, the field values of at least one same field to be reviewed differ, the information in the documents to be reviewed can be determined to be inconsistent, and it can further be determined that the documents to be reviewed have a problem of the information completion type.
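The consistency comparison can be sketched by grouping extracted values per field name and reporting every field whose values differ between documents. As before, the input shape (document identifier mapped to field/value pairs) and the name `find_inconsistent_fields` are assumptions for illustration; fields without a value are skipped here because they are already covered by the completeness check.

```python
from collections import defaultdict
from typing import Dict, Optional


def find_inconsistent_fields(
    extracted: Dict[str, Dict[str, Optional[str]]],
) -> Dict[str, Dict[str, str]]:
    """Return, for every field whose values differ across documents,
    the value found in each document."""
    values_by_field: Dict[str, Dict[str, str]] = defaultdict(dict)
    for doc_id, fields in extracted.items():
        for name, value in fields.items():
            if value:  # compare only fields that actually have a value
                values_by_field[name][doc_id] = value
    return {
        name: per_doc
        for name, per_doc in values_by_field.items()
        if len(set(per_doc.values())) > 1
    }


extracted = {
    "Document A": {"registrant name": "Zhang San", "address": "Beijing"},
    "Document B": {"registrant name": "Li Si", "address": "Beijing"},
    "Document C": {"product name": "Device X"},
}
inconsistent = find_inconsistent_fields(extracted)
```

This sketch compares values by exact string equality; a real system would likely normalize whitespace or formatting first, which is outside what the disclosure specifies.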
Step 211: when it is determined that the documents to be reviewed have a problem of the information completion type, generate modification suggestion information corresponding to the documents to be reviewed.
In some embodiments, when the document review apparatus determines that the documents to be reviewed have none of the above problems, it can determine that they have no problem of the information completion type. When the document review apparatus determines that the documents to be reviewed have at least one of the above problems, it can determine that they have a problem of the information completion type and can then generate the corresponding modification suggestion information.
In summary, the document review method combining RPA and AI to implement IA provided by the embodiments of the present disclosure automatically checks, based on AI technology, whether the documents to be reviewed corresponding to the target business matter have a problem of the information completion type, which reduces the labor cost of document review and improves its efficiency. In addition, generating modification suggestion information when such a problem is found gives the provider of the documents concrete suggestions and makes it easier for the provider to supplement or correct the information in the documents.
In some embodiments, the preset types may include the process specification type. With reference to Figure 3, the following describes the process, in the embodiments of the present disclosure, of reviewing each document to be reviewed based on AI technology to determine whether the documents to be reviewed have a problem of the process specification type.
Figure 3 is a flowchart of the document review method combining RPA and AI to implement IA according to the third embodiment of the present disclosure. As shown in Figure 3, the method may include steps 301 to 305.
Step 301: obtain at least one document to be reviewed corresponding to the target business matter.
Step 302: perform text recognition on each document to be reviewed based on OCR technology to obtain the text information contained in each document to be reviewed.
For the specific implementation and principles of steps 301 and 302, refer to the description of the above embodiments; details are not repeated here.
步骤303,基于目标业务事项,查询预先创建的目标业务事项对应的知识图谱,以获取目标业务事项对应的流程规范。Step 303: Based on the target business item, query the pre-created knowledge graph corresponding to the target business item to obtain the process specification corresponding to the target business item.
在一些实施例中,针对任意业务事项,在创建该业务事项对应的知识图谱时,可以根据该业务事项的办理流程中应该遵守的规范,来创建知识图谱。相应的,知识图谱中除了包括该业务事项对应的第一节点,还可以包括该业务事项的办理流程中应该遵守的规范对应的第四节点,第四节点与第一节点通过边连接。从而文档审核装置可以基于目标业务事项,查询预先创建的目标业务事项对应的知识图谱,确定知识图谱中通过边与目标业务事项对应的第一节点连接的第四节点,并将该第四节点对应的规范,确定为目标业务事项对应的流程规范。In some embodiments, for any business matter, when creating a knowledge graph corresponding to the business matter, the knowledge graph can be created according to the specifications that should be followed in the handling process of the business matter. Correspondingly, in addition to the first node corresponding to the business matter, the knowledge graph may also include a fourth node corresponding to the specifications that should be followed in the handling process of the business matter. The fourth node is connected to the first node through an edge. Therefore, the document review device can query the pre-created knowledge graph corresponding to the target business matter based on the target business matter, determine the fourth node in the knowledge graph that is connected to the first node corresponding to the target business matter through an edge, and associate the fourth node with the first node corresponding to the target business matter. The specifications are determined as process specifications corresponding to the target business matters.
Step 304: Based on each document to be reviewed and the text information it contains, determine whether the documents to be reviewed satisfy the process specification.
For example, suppose the target business matter is a medical device registration matter, and the corresponding process specification includes the following: if a new mandatory standard or national standard is released and implemented within the validity period of the medical device registration certificate, and the changes made to the registered product to comply with the new mandatory standard or national standard are ones for which change registration is required, the registrant shall first complete the change registration procedures and obtain the change registration (filing) document approved by the original approval department before submitting an application for registration renewal. Suppose also that the documents to be reviewed include the medical device registration certificate.
The document review device can then determine the validity period of the medical device registration certificate from the text information the certificate contains, query whether a new mandatory standard or national standard was released and implemented within that validity period, and determine whether the changes made to the registered product to comply with the new mandatory standard or national standard are ones for which change registration is required. If so, the document review device can check whether the documents to be reviewed submitted by the registrant include the change registration (filing) document approved by the original approval department. If that document is not included, the document review device can determine that the documents to be reviewed do not satisfy the process specification corresponding to the medical device registration matter. If it is included, the document review device can determine that the documents to be reviewed satisfy that process specification.
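The check in this example can be written down as a small rule function. This is a minimal sketch under stated assumptions: the `Standard` record, the `requires_change_registration` flag (assumed to come from some external lookup), and the document type label `change_registration_approval` are all hypothetical names introduced for illustration.

```python
from dataclasses import dataclass
from datetime import date


@dataclass
class Standard:
    name: str
    effective: date
    # Assumed to be known from an external lookup: whether complying with
    # this standard changes the product in a way that needs change registration.
    requires_change_registration: bool


def meets_renewal_spec(valid_from, valid_until, standards, submitted_doc_types):
    """True if the submission satisfies the renewal process specification."""
    triggered = any(
        valid_from <= s.effective <= valid_until and s.requires_change_registration
        for s in standards
    )
    if not triggered:
        return True  # no relevant new standard, so nothing extra is required
    return "change_registration_approval" in submitted_doc_types


standards = [Standard("hypothetical standard A", date(2022, 6, 1), True)]
valid_from, valid_until = date(2020, 1, 1), date(2024, 12, 31)

without_approval = meets_renewal_spec(
    valid_from, valid_until, standards, {"registration_certificate"})
with_approval = meets_renewal_spec(
    valid_from, valid_until, standards,
    {"registration_certificate", "change_registration_approval"})
```

In practice the validity period would be extracted from the OCR text of the registration certificate rather than passed in directly.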
Step 305: If the documents to be reviewed do not satisfy the process specification, determine that they have a problem of the process specification type, and generate modification suggestion information corresponding to the documents to be reviewed.
In some embodiments, the document review device may generate the modification suggestion information according to the process-specification-type problem found in the documents to be reviewed. Continuing the above example, the document review device can generate the modification suggestion information "Please complete change registration with the original approval department, and submit the application for registration renewal after obtaining the change registration (filing) document approved by the original approval department."
In summary, the document review method for implementing IA by combining RPA and AI provided by the embodiments of the present disclosure automatically reviews, based on AI technology, whether the documents to be reviewed corresponding to the target business matter have a problem of the process specification type, thereby reducing the labor cost required for document review and improving its efficiency. In addition, generating modification suggestion information when a process-specification-type problem is found gives the provider of the documents to be reviewed concrete suggestions and makes it easier for the provider to modify the documents.
In some embodiments, the preset types may include a writing-standard type. The following describes, with reference to Figure 4, the process in the embodiments of the present disclosure of reviewing each document to be reviewed based on AI technology to determine whether it has a problem of the writing-standard type.
Figure 4 is a flow chart of a document review method for implementing IA by combining RPA and AI according to the fourth embodiment of the present disclosure. As shown in Figure 4, the method may include steps 401 to 404.
Step 401: Obtain at least one document to be reviewed corresponding to the target business matter.
Step 402: Based on OCR technology, perform text recognition on each document to be reviewed to obtain the text information contained in each document to be reviewed.
For the specific implementation and principles of steps 401-402, refer to the description of the above embodiments; they are not repeated here.
Step 403: Input the text information contained in each document to be reviewed into a pre-trained language model, and use the language model to determine whether each document to be reviewed has a problem of the writing-standard type.
In some embodiments, a language model can be trained in advance. The input of the language model is text information, and its output is the writing-standard-type problems present in that text information together with their confidence levels. The text information contained in each document to be reviewed can therefore be input into the pre-trained language model, which predicts the writing-standard-type problems present in each piece of text information and determines the corresponding confidence levels. Based on the predicted problems and their confidence levels, the document review device can determine whether each document to be reviewed has a problem of the writing-standard type. For example, a confidence threshold can be set, and when the confidence level of writing-standard-type problem 1 in the text information contained in a document to be reviewed is greater than the confidence threshold, that document is determined to have writing-standard-type problem 1. The confidence threshold can be set as needed, for example to 0.7 or 0.8, and the present disclosure does not limit it.
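The thresholding step can be sketched as follows. The `PredictedIssue` record and the sample predictions are hypothetical; only the rule itself (keep a problem when its confidence is strictly greater than the threshold) comes from the description above, and the language model is not modeled here.

```python
from typing import NamedTuple


class PredictedIssue(NamedTuple):
    description: str
    confidence: float


def confirmed_issues(predictions, threshold=0.8):
    """Keep only predictions whose confidence is greater than the threshold."""
    return [p for p in predictions if p.confidence > threshold]


# Hypothetical language-model output for one document.
predictions = [
    PredictedIssue("English translation of term x is incorrect", 0.93),
    PredictedIssue("Inconsistent heading numbering", 0.41),
]
issues = confirmed_issues(predictions)
has_writing_problem = bool(issues)
```

A lower threshold flags more candidate problems at the cost of more false positives, which is presumably why the threshold is left configurable.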
Step 404: If it is determined that a document to be reviewed has a problem of the writing-standard type, generate modification suggestion information corresponding to the document to be reviewed.
In some embodiments, the document review device may generate the modification suggestion information according to the writing-standard-type problem found in the document to be reviewed. For example, if the problem is that the English translation of a Chinese word x in document A is wrong, the document review device can generate the modification suggestion information "Please correct the English translation of word x."
In summary, the document review method for implementing IA by combining RPA and AI provided by the embodiments of the present disclosure automatically reviews, based on AI technology, whether the documents to be reviewed corresponding to the target business matter have a problem of the writing-standard type, thereby reducing the labor cost required for document review and improving its efficiency. In addition, generating modification suggestion information when a writing-standard-type problem is found gives the provider of the documents to be reviewed concrete suggestions and makes it easier for the provider to modify the documents.
The document review method for implementing IA by combining RPA and AI provided by the embodiments of the present disclosure is further described below with reference to Figure 5. Figure 5 is a flow chart of the method according to the fifth embodiment of the present disclosure. As shown in Figure 5, the method may include steps 501 to 506.
Step 501: Obtain at least one document to be reviewed corresponding to the target business matter.
Step 502: Review each document to be reviewed based on AI technology to determine whether it has problems of multiple preset types, where the multiple preset types include the information completion type, the process specification type, and the writing-standard type.
Step 503: If it is determined that the documents to be reviewed have a problem of at least one preset type, generate modification suggestion information corresponding to the documents to be reviewed.
For the specific implementation and principles of steps 501-503, refer to the description of the above embodiments; they are not repeated here.
Step 504: If it is determined that the documents to be reviewed have no problems of any of the multiple preset types, send the documents to be reviewed to the manual review platform.
In some embodiments, the document review method for implementing IA by combining RPA and AI provided by the embodiments of the present disclosure can be applied to the pre-review performed before documents are reviewed manually. Accordingly, when the document review device determines that the documents to be reviewed have no problems of any of the multiple preset types, it can determine that the documents have passed pre-review and send them to the manual review platform for further manual review.
Because the document review device reviews the documents automatically and sends them to the manual review platform only after determining that they have no problems of any of the preset types, the labor cost of reviewing the documents is reduced, the number of interactions between the provider of the documents and the approval department is reduced, and the target business matter is handled more efficiently.
For example, for pharmaceutical registration matters, the method provided by the embodiments of the present disclosure can be used to pre-review the registration application documents submitted by a pharmaceutical company. If the document review device determines that the registration application documents have a problem of at least one of the information completion type, the process specification type, and the writing-standard type, it can generate corresponding modification suggestion information so that the pharmaceutical company can modify the documents accordingly. If the document review device determines that the registration application documents have no problems of any of these types, it can send the documents to the manual review platform so that the approval department can review them further. In this way, the pharmaceutical company receives timely feedback after submitting its registration application documents, which shortens the pharmaceutical registration application cycle and improves the efficiency of registration applications.
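The routing decision of steps 503 and 504 can be summarized in a few lines. This is only a sketch: the `IssueType` enum, the route labels `return_to_provider` and `manual_review`, and the suggestion format are assumptions made for illustration.

```python
from enum import Enum


class IssueType(Enum):
    INFO_COMPLETION = "information completion"
    PROCESS_SPEC = "process specification"
    WRITING_STANDARD = "writing standard"


def route(issues_by_type):
    """Decide where a submission goes after automatic pre-review.

    issues_by_type maps an IssueType to a list of problem descriptions.
    Returns (destination, suggestions).
    """
    suggestions = [
        f"[{issue_type.value}] {message}"
        for issue_type in IssueType  # fixed order: definition order of the enum
        for message in issues_by_type.get(issue_type, [])
    ]
    if suggestions:
        return "return_to_provider", suggestions
    return "manual_review", []


clean = route({})
flagged = route({
    IssueType.WRITING_STANDARD: ["Please correct the English translation of word x"],
    IssueType.INFO_COMPLETION: ["Field 'applicant' has no value"],
})
```

Only a submission with no problems of any preset type is forwarded to manual review; everything else goes back to the provider with suggestions attached.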
Step 505: Call a robotic process automation (RPA) robot to access the business system to obtain the contact information of the provider of the documents to be reviewed.
The provider's contact information may be a phone number, an email address, or the like, and the present disclosure does not limit it.
In some embodiments, the business system stores the contact information of the provider of each document to be reviewed, and the document review device can obtain that contact information from the business system through back-end data access.
In some embodiments, the document review device can also call the RPA robot to access the business system through its web pages to obtain the contact information of the provider of each document to be reviewed. A web page is a file on the World Wide Web organized in HTML (Hyper Text Markup Language) format.
Step 506: Use the RPA robot to feed back the review results of the documents to be reviewed to the corresponding provider through the contact information.
The review results may indicate that the documents to be reviewed passed the review, or that they failed the review, together with the reasons for the failure and the modification suggestion information.
In some embodiments, after the document review device finishes reviewing the documents provided by a given provider, it can use the RPA robot to feed back the review results to that provider through the provider's contact information, so that the provider obtains the review results in a timely manner.
Thus, calling the RPA robot to obtain the provider's contact information from the business system links the document review device with the business system. By using the RPA robot to obtain the provider's contact information and to feed back the review results of the documents to be reviewed to the corresponding provider through that contact information, IA is implemented by combining RPA and AI: the contact information is obtained and the review results are fed back automatically, which further reduces the labor cost required to feed back review results.
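Steps 505 and 506 can be sketched with the RPA robot abstracted behind two callables. Both `lookup_contact` and `send` are hypothetical placeholders for whatever the RPA robot actually does (for example, reading a page of the business system and sending an email); no real RPA product API is implied, and the message wording is invented for this sketch.

```python
def build_message(result):
    """Turn one review result into the text sent to the provider."""
    if result["passed"]:
        return "Review passed."
    reasons = "; ".join(result["reasons"])
    suggestions = "; ".join(result["suggestions"])
    return f"Review failed. Reasons: {reasons}. Suggestions: {suggestions}."


def feed_back_results(results, lookup_contact, send):
    """Send each provider its review result.

    lookup_contact and send are hypothetical callables standing in for the
    RPA robot's business-system lookup and its messaging action.
    """
    for provider_id, result in results.items():
        contact = lookup_contact(provider_id)
        if contact is None:
            continue  # no contact on file; a real system would log this
        send(contact, build_message(result))


# Stand-ins for the RPA robot, used only to make the sketch runnable.
contacts = {"provider-1": "applicant@example.com"}
sent = []
results = {
    "provider-1": {
        "passed": False,
        "reasons": ["change registration document missing"],
        "suggestions": ["complete change registration before applying for renewal"],
    },
    "provider-2": {"passed": True, "reasons": [], "suggestions": []},
}
feed_back_results(results, contacts.get,
                  lambda contact, text: sent.append((contact, text)))
```

Passing the lookup and send actions in as parameters keeps the review logic independent of how the robot reaches the business system.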
To implement the above embodiments, the embodiments of the present disclosure also propose a document review device for implementing IA by combining RPA and AI. Figure 6 is a schematic structural diagram of such a document review device according to the sixth embodiment of the present disclosure.
As shown in Figure 6, the document review device 600 for implementing IA by combining RPA and AI includes an acquisition module 610, a review module 620, and a generation module 630.
The acquisition module 610 is configured to obtain at least one document to be reviewed corresponding to the target business matter.
The review module 620 is configured to review each document to be reviewed based on AI technology to determine whether it has problems of multiple preset types.
The generation module 630 is configured to generate modification suggestion information corresponding to the documents to be reviewed if it is determined that they have a problem of at least one preset type.
It should be noted that the document review device 600 of the embodiments of the present disclosure can execute the document review method for implementing IA by combining RPA and AI provided in the above embodiments. The document review device 600 can be implemented in software and/or hardware, and can be an electronic device or be configured in an electronic device, so as to review documents automatically, thereby reducing the labor cost required for document review and improving its efficiency. The electronic device may include, but is not limited to, a terminal device, a server, and the like, and this embodiment does not specifically limit the electronic device.
In one embodiment of the present disclosure, the preset types include the information completion type, and the review module 620 is configured to:
obtain the identifier of each document to be reviewed;
based on the target business matter, query the pre-created knowledge graph corresponding to the target business matter to obtain the identifier of at least one target document required by the target business matter;
compare the identifiers of the documents to be reviewed with the identifiers of the target documents to determine whether the set of documents to be reviewed is complete.
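The identifier comparison above amounts to a set difference. The following is a minimal sketch; the identifier strings are hypothetical, and the required identifiers are assumed to have been read from the knowledge graph already.

```python
def missing_documents(submitted_ids, required_ids):
    """Identifiers of required target documents that were not submitted."""
    return sorted(set(required_ids) - set(submitted_ids))


required = ["application_form", "registration_certificate", "test_report"]
submitted = ["registration_certificate", "application_form"]

missing = missing_documents(submitted, required)
documents_complete = not missing
```

Returning the missing identifiers, rather than a bare yes/no, gives the generation module what it needs to name the absent documents in its modification suggestions.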
In one embodiment of the present disclosure, the review module 620 is further configured to:
perform text recognition on each document to be reviewed based on optical character recognition (OCR) technology to obtain the text information contained in each document to be reviewed;
perform information extraction on the text information contained in each document to be reviewed to obtain the fields to be reviewed contained in each piece of text information and the corresponding field values;
based on the target business matter, query the knowledge graph to obtain the fields required in each target document;
determine whether the fields to be reviewed contained in each piece of text information include a target field that matches each field required in the corresponding target document, and whether each target field has a corresponding field value, so as to determine whether the information in each document to be reviewed is complete.
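A sketch of the field-level completeness check follows, assuming OCR and information extraction have already produced a field-to-value mapping for one document. The field names and the two problem labels are hypothetical.

```python
def missing_information(extracted, required_fields):
    """Report required fields that are absent or have no value.

    extracted maps field names to values for one document;
    required_fields lists the fields the target document must contain.
    """
    problems = {}
    for name in required_fields:
        if name not in extracted:
            problems[name] = "field absent"
        else:
            value = extracted[name]
            if value is None or not str(value).strip():
                problems[name] = "value empty"
    return problems


extracted = {
    "applicant_name": "Acme Pharma",
    "product_name": "   ",
    "registration_number": None,
}
required = ["applicant_name", "product_name",
            "registration_number", "validity_period"]

problems = missing_information(extracted, required)
```

The two cases mirror the two checks in the text: whether a matching target field exists at all, and whether it carries a field value.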
In one embodiment of the present disclosure, the review module 620 is further configured to:
obtain the fields to be reviewed that are the same across all of the text information;
when the same fields to be reviewed have corresponding field values, compare those field values to determine whether the information in the documents to be reviewed is consistent.
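The cross-document consistency check can be sketched as grouping field values by field name and flagging any field whose values differ. The document identifiers and field names below are hypothetical, and exact string comparison is an assumption; a real system might normalize values first.

```python
from collections import defaultdict


def inconsistent_fields(docs):
    """Find fields that appear in several documents with different values.

    docs maps a document id to its {field name: field value} mapping.
    Fields without a value are ignored, as in the description above.
    """
    values = defaultdict(dict)  # field name -> {document id: value}
    for doc_id, fields in docs.items():
        for name, value in fields.items():
            if value:
                values[name][doc_id] = value
    return {
        name: by_doc
        for name, by_doc in values.items()
        if len(set(by_doc.values())) > 1
    }


docs = {
    "application_form": {"applicant": "Acme Pharma", "product": "Device A"},
    "certificate": {"applicant": "Acme Pharma Ltd", "product": "Device A"},
}
conflicts = inconsistent_fields(docs)
```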
In one embodiment of the present disclosure, the preset types include the process specification type, and the review module 620 is further configured to:
perform text recognition on each document to be reviewed based on OCR technology to obtain the text information contained in each document to be reviewed;
based on the target business matter, query the pre-created knowledge graph corresponding to the target business matter to obtain the process specification corresponding to the target business matter;
based on each document to be reviewed and the text information it contains, determine whether the documents to be reviewed satisfy the process specification.
In one embodiment of the present disclosure, the preset types include the writing-standard type, and the review module 620 is further configured to:
perform text recognition on each document to be reviewed based on OCR technology to obtain the text information contained in each document to be reviewed;
input the text information contained in each document to be reviewed into a pre-trained language model, and use the language model to determine whether each document to be reviewed has a problem of the writing-standard type.
In one embodiment of the present disclosure, the document review device 600 for implementing IA by combining RPA and AI further includes:
a first sending module, configured to send the documents to be reviewed to the manual review platform if it is determined that they have no problems of any of the multiple preset types.
In one embodiment of the present disclosure, the document review device 600 for implementing IA by combining RPA and AI further includes:
a calling module, configured to call a robotic process automation (RPA) robot to access the business system to obtain the contact information of the provider of the documents to be reviewed;
a second sending module, configured to use the RPA robot to feed back the review results of the documents to be reviewed to the corresponding provider through the contact information.
In one embodiment of the present disclosure, the target business matter is a pharmaceutical registration matter.
It should be noted that the foregoing explanation of the embodiments of the document review method for implementing IA by combining RPA and AI also applies to the document review device of this embodiment. Details not disclosed in the device embodiments are not described again here.
In summary, the document review device for implementing IA by combining RPA and AI of the embodiments of the present disclosure obtains at least one document to be reviewed corresponding to the target business matter; reviews each document to be reviewed based on AI technology to determine whether it has problems of multiple preset types; and, if it is determined that the documents to be reviewed have a problem of at least one preset type, generates modification suggestion information corresponding to the documents to be reviewed. The documents to be reviewed corresponding to the target business matter are thus reviewed automatically based on AI technology, which reduces the labor cost required for document review and improves its efficiency. In addition, generating modification suggestion information when a problem of at least one preset type is found gives the provider of the documents to be reviewed concrete suggestions and makes it easier for the provider to modify the documents.
To implement the above embodiments, the embodiments of the present disclosure also propose an electronic device, including a memory, a processor, and a computer program stored in the memory and executable on the processor. When the processor executes the computer program, the document review method for implementing IA by combining RPA and AI described in any of the foregoing method embodiments is implemented.
To implement the above embodiments, the embodiments of the present disclosure also propose a computer-readable storage medium on which a computer program is stored. When the computer program is executed by a processor, the document review method for implementing IA by combining RPA and AI described in any of the foregoing method embodiments is implemented.
To implement the above embodiments, the embodiments of the present disclosure also propose a computer program product. When instructions in the computer program product are executed by a processor, the document review method for implementing IA by combining RPA and AI described in any of the foregoing method embodiments is implemented.
To implement the above embodiments, the embodiments of the present disclosure also provide a computer program including computer program code. When the computer program code runs on a computer, it causes the computer to execute the document review method for implementing IA by combining RPA and AI described in any embodiment of the present disclosure.
Figure 7 shows a block diagram of an exemplary electronic device suitable for implementing embodiments of the present disclosure. The electronic device 10 shown in Figure 7 is only an example and should not impose any limitation on the functions or scope of use of the embodiments of the present disclosure.
As shown in Figure 7, the electronic device 10 takes the form of a general-purpose computing device. The components of the electronic device 10 may include, but are not limited to, one or more processors or processing units 16, a system memory 28, and a bus 18 connecting the different system components (including the memory 28 and the processing unit 16).
The bus 18 represents one or more of several types of bus structures, including a memory bus or memory controller, a peripheral bus, an accelerated graphics port, and a processor or local bus using any of a variety of bus structures. For example, these architectures include, but are not limited to, the Industry Standard Architecture (ISA) bus, the Micro Channel Architecture (MCA) bus, the enhanced ISA bus, the Video Electronics Standards Association (VESA) local bus, and the Peripheral Component Interconnection (PCI) bus.
In some embodiments, the electronic device 10 includes a variety of computer-system-readable media. These media may be any available media that can be accessed by the electronic device 10, including volatile and non-volatile media and removable and non-removable media.
The memory 28 may include computer-system-readable media in the form of volatile memory, such as random access memory (RAM) 30 and/or cache memory 32. The electronic device 10 may further include other removable/non-removable, volatile/non-volatile computer system storage media. By way of example only, the storage system 34 may be used to read from and write to non-removable, non-volatile magnetic media (not shown in Figure 7, commonly referred to as a "hard drive"). Although not shown in Figure 7, a magnetic disk drive may be provided for reading from and writing to a removable non-volatile magnetic disk (for example, a "floppy disk"), as well as an optical disk drive for reading from and writing to a removable non-volatile optical disk (for example, a Compact Disc Read Only Memory (CD-ROM), a Digital Video Disc Read Only Memory (DVD-ROM), or other optical media). In these cases, each drive may be connected to the bus 18 through one or more data media interfaces. The memory 28 may include at least one program product having a set of (for example, at least one) program modules configured to perform the functions of the embodiments of the present disclosure.
具有一组(至少一个)程序模块42的程序/实用工具40,可以存储在例如存储器28中,这样的程序模块42包括但不限于操作系统、一个或者多个应用程序、其它程序模块以及程序数据,这些示例中的每一个或某种组合中可能包括网络环境的实现。程序模块42通常执行本公开所描述的实施例中的功能和/或方法。A program/utility 40 having a set of (at least one) program modules 42, including but not limited to an operating system, one or more application programs, other program modules, and program data, may be stored, for example, in memory 28 , each of these examples or some combination may include the implementation of a network environment. Program modules 42 generally perform functions and/or methods in the embodiments described in this disclosure.
The electronic device 10 may also communicate with one or more external devices 14 (e.g., a keyboard, a pointing device, a display 24), with one or more devices that enable a user to interact with the electronic device 10, and/or with any device (e.g., a network card, a modem) that enables the electronic device 10 to communicate with one or more other computing devices. Such communication may take place through the input/output (I/O) interface 22. The electronic device 10 may also communicate with one or more networks (e.g., a local area network (LAN), a wide area network (WAN), and/or a public network such as the Internet) through the network adapter 20. As shown in FIG. 7, the network adapter 20 communicates with the other modules of the electronic device 10 via the bus 18. It should be understood that, although not shown in FIG. 7, other hardware and/or software modules may be used in conjunction with the electronic device 10, including but not limited to microcode, device drivers, redundant processing units, external disk drive arrays, RAID systems, tape drives, and data backup storage systems.
The processing unit 16 runs programs stored in the memory 28 to execute various functional applications and data processing, for example to implement the methods mentioned in the foregoing embodiments.
It should be noted that the foregoing explanations of the method and apparatus embodiments also apply to the electronic device, computer-readable storage medium, computer program product, and computer program of the above embodiments, and are not repeated here.
In the description of this specification, a description referring to the terms "one embodiment", "some embodiments", "an example", "a specific example", or "some examples" means that a specific feature, structure, material, or characteristic described in connection with that embodiment or example is included in at least one embodiment or example of the present disclosure. In this specification, schematic uses of these terms do not necessarily refer to the same embodiment or example. Moreover, the specific features, structures, materials, or characteristics described may be combined in a suitable manner in any one or more embodiments or examples. In addition, provided they do not contradict one another, those skilled in the art may combine the different embodiments or examples described in this specification and the features of those embodiments or examples.
In addition, the terms "first" and "second" are used for descriptive purposes only and are not to be understood as indicating or implying relative importance or as implicitly indicating the number of the technical features referred to. A feature qualified by "first" or "second" may therefore explicitly or implicitly include at least one such feature. In the description of the embodiments of the present disclosure, "a plurality of" means at least two, for example two or three, unless otherwise expressly and specifically limited.
Any process or method description in a flowchart, or otherwise described herein, may be understood as representing a module, segment, or portion of code that includes one or more executable instructions for implementing the steps of a custom logical function or process. The scope of the preferred embodiments of the present disclosure includes additional implementations in which functions may be performed out of the order shown or discussed, including substantially simultaneously or in the reverse order depending on the functions involved, as will be understood by those skilled in the art to which the embodiments of the present disclosure belong.
The logic and/or steps represented in a flowchart or otherwise described herein may, for example, be regarded as an ordered list of executable instructions for implementing logical functions, and may be embodied in any computer-readable medium for use by, or in combination with, an instruction execution system, apparatus, or device (such as a computer-based system, a system including a processor, or another system that can fetch instructions from an instruction execution system, apparatus, or device and execute them). For the purposes of this specification, a "computer-readable medium" may be any means that can contain, store, communicate, propagate, or transport a program for use by, or in combination with, an instruction execution system, apparatus, or device. More specific examples (a non-exhaustive list) of computer-readable media include: an electrical connection (electronic device) having one or more wires, a portable computer diskette (magnetic device), a random access memory (RAM), a read-only memory (ROM), an erasable programmable read-only memory (EPROM or flash memory), an optical fiber device, and a portable compact disc read-only memory (CD-ROM). The computer-readable medium may even be paper or another suitable medium on which the program is printed, because the program can be obtained electronically, for example by optically scanning the paper or other medium and then editing, interpreting, or otherwise processing it in a suitable manner as necessary, and then stored in a computer memory.
It should be understood that the various parts of the present disclosure may be implemented in hardware, software, firmware, or a combination thereof. In the above embodiments, multiple steps or methods may be implemented in software or firmware that is stored in a memory and executed by a suitable instruction execution system. For example, if implemented in hardware, as in another embodiment, they may be implemented by any one or a combination of the following technologies known in the art: a discrete logic circuit having logic gates for implementing logical functions on data signals, an application-specific integrated circuit having suitable combinational logic gates, a programmable gate array (PGA), a field-programmable gate array (FPGA), and so on.
Those of ordinary skill in the art will understand that all or some of the steps of the methods of the above embodiments may be completed by a program instructing the relevant hardware. The program may be stored in a computer-readable storage medium and, when executed, performs one of the steps of the method embodiments or a combination of them.
In addition, the functional units in the embodiments of the present disclosure may be integrated into one processing module, each unit may exist physically on its own, or two or more units may be integrated into one module. The integrated module may be implemented in the form of hardware or in the form of a software functional module. If the integrated module is implemented as a software functional module and sold or used as a standalone product, it may also be stored in a computer-readable storage medium.
The storage medium mentioned above may be a read-only memory, a magnetic disk, an optical disc, or the like. Although embodiments of the present disclosure have been shown and described above, it is to be understood that these embodiments are illustrative and are not to be construed as limiting the present disclosure; those of ordinary skill in the art may make changes, modifications, substitutions, and variations to them within the scope of the present disclosure.
All embodiments of the present disclosure may be carried out alone or in combination with other embodiments, and all such cases are regarded as falling within the scope of protection claimed by the present disclosure.

Claims (22)

  1. A document review method that combines robotic process automation (RPA) and artificial intelligence (AI) to implement intelligent automation (IA), wherein the method comprises:
    obtaining at least one document to be reviewed corresponding to a target business matter;
    reviewing each of the documents to be reviewed based on AI technology to determine whether each of the documents to be reviewed has problems of a plurality of preset types; and
    based on determining that each of the documents to be reviewed has a problem of at least one of the preset types, generating modification suggestion information corresponding to the document to be reviewed.
  2. The method according to claim 1, wherein the preset types include an information completion type;
    the reviewing of each of the documents to be reviewed based on AI technology to determine whether each of the documents to be reviewed has problems of a plurality of preset types comprises:
    obtaining an identifier of each of the documents to be reviewed;
    querying, based on the target business matter, a pre-created knowledge graph corresponding to the target business matter to obtain an identifier of at least one target document required by the target business matter; and
    comparing the identifier of each of the documents to be reviewed with the identifier of each target document to determine whether the documents to be reviewed are complete.
  3. The method according to claim 2, wherein the reviewing of each of the documents to be reviewed based on AI technology to determine whether each of the documents to be reviewed has problems of a plurality of preset types further comprises:
    performing text recognition on each of the documents to be reviewed based on optical character recognition (OCR) technology to obtain the text information contained in each of the documents to be reviewed;
    performing information extraction on the text information contained in each of the documents to be reviewed to obtain the fields to be reviewed contained in each piece of text information and the corresponding field values;
    querying the knowledge graph based on the target business matter to obtain the fields required in each target document; and
    determining whether the fields to be reviewed contained in each piece of text information include a target field consistent with a field required in the corresponding target document, and determining whether the target field has a corresponding field value, so as to determine whether the information in each of the documents to be reviewed is complete.
  4. The method according to claim 3, wherein the reviewing of each of the documents to be reviewed based on AI technology to determine whether each of the documents to be reviewed has problems of a plurality of preset types further comprises:
    obtaining the fields to be reviewed that are the same across all of the text information; and
    where such a same field to be reviewed has corresponding field values, comparing those field values to determine whether the information in the documents to be reviewed is consistent.
  5. The method according to any one of claims 1 to 4, wherein the preset types include a process specification type;
    the reviewing of each of the documents to be reviewed based on AI technology to determine whether each of the documents to be reviewed has problems of a plurality of preset types comprises:
    performing text recognition on each of the documents to be reviewed based on OCR technology to obtain the text information contained in each of the documents to be reviewed;
    querying, based on the target business matter, a pre-created knowledge graph corresponding to the target business matter to obtain a process specification corresponding to the target business matter; and
    determining, based on each of the documents to be reviewed and the text information it contains, whether each of the documents to be reviewed satisfies the process specification.
  6. The method according to any one of claims 1 to 5, wherein the preset types include a writing specification type;
    the reviewing of each of the documents to be reviewed based on AI technology to determine whether each of the documents to be reviewed has problems of a plurality of preset types comprises:
    performing text recognition on each of the documents to be reviewed based on OCR technology to obtain the text information contained in each of the documents to be reviewed; and
    inputting the text information contained in each of the documents to be reviewed into a pre-trained language model, so as to determine through the language model whether each of the documents to be reviewed has a problem of the writing specification type.
  7. The method according to any one of claims 1 to 6, wherein the method further comprises:
    based on determining that none of the documents to be reviewed has a problem of the plurality of preset types, sending each of the documents to be reviewed to a manual review platform.
  8. The method according to any one of claims 1 to 7, wherein the method further comprises:
    invoking an RPA robot to access a business system to obtain contact information of the provider of each of the documents to be reviewed; and
    using the RPA robot to feed back, through the contact information, the review result of each of the documents to be reviewed to the corresponding provider.
  9. The method according to any one of claims 1 to 8, wherein the target business matter is a pharmaceutical registration matter.
  10. A document review apparatus that combines RPA and AI to implement IA, wherein the apparatus comprises:
    an obtaining module, configured to obtain at least one document to be reviewed corresponding to a target business matter;
    a review module, configured to review each of the documents to be reviewed based on AI technology to determine whether each of the documents to be reviewed has problems of a plurality of preset types; and
    a generating module, configured to generate, based on determining that each of the documents to be reviewed has a problem of at least one of the preset types, modification suggestion information corresponding to the document to be reviewed.
  11. The apparatus according to claim 10, wherein the preset types include an information completion type;
    the review module is configured to:
    obtain an identifier of each of the documents to be reviewed;
    query, based on the target business matter, a pre-created knowledge graph corresponding to the target business matter to obtain an identifier of at least one target document required by the target business matter; and
    compare the identifier of each of the documents to be reviewed with the identifier of each target document to determine whether the documents to be reviewed are complete.
  12. The apparatus according to claim 11, wherein the review module is further configured to:
    perform text recognition on each of the documents to be reviewed based on optical character recognition (OCR) technology to obtain the text information contained in each of the documents to be reviewed;
    perform information extraction on the text information contained in each of the documents to be reviewed to obtain the fields to be reviewed contained in each piece of text information and the corresponding field values;
    query the knowledge graph based on the target business matter to obtain the fields required in each target document; and
    determine whether the fields to be reviewed contained in each piece of text information include a target field consistent with a field required in the corresponding target document, and determine whether the target field has a corresponding field value, so as to determine whether the information in each of the documents to be reviewed is complete.
  13. The apparatus according to claim 12, wherein the review module is further configured to: obtain the fields to be reviewed that are the same across all of the text information; and, where such a same field to be reviewed has corresponding field values, compare those field values to determine whether the information in the documents to be reviewed is consistent.
  14. The apparatus according to any one of claims 10 to 13, wherein the preset types include a process specification type; the review module is further configured to: perform text recognition on each of the documents to be reviewed based on OCR technology to obtain the text information contained in each of the documents to be reviewed; query, based on the target business matter, a pre-created knowledge graph corresponding to the target business matter to obtain a process specification corresponding to the target business matter; and determine, based on each of the documents to be reviewed and the text information it contains, whether each of the documents to be reviewed satisfies the process specification.
  15. The apparatus according to any one of claims 10 to 14, wherein the preset types include a writing specification type; the review module is further configured to: perform text recognition on each of the documents to be reviewed based on OCR technology to obtain the text information contained in each of the documents to be reviewed; and input the text information contained in each of the documents to be reviewed into a pre-trained language model, so as to determine through the language model whether each of the documents to be reviewed has a problem of the writing specification type.
  16. The apparatus according to any one of claims 10 to 15, wherein the document review apparatus that combines RPA and AI to implement IA further comprises: a first sending module, configured to send each of the documents to be reviewed to a manual review platform based on determining that none of the documents to be reviewed has a problem of the plurality of preset types.
  17. The apparatus according to any one of claims 10 to 16, wherein the document review apparatus that combines RPA and AI to implement IA further comprises: an invoking module, configured to invoke an RPA robot to access a business system to obtain contact information of the provider of each of the documents to be reviewed; and a second sending module, configured to use the RPA robot to feed back, through the contact information, the review result of each of the documents to be reviewed to the corresponding provider.
  18. The apparatus according to any one of claims 10 to 17, wherein the target business matter is a pharmaceutical registration matter.
  19. An electronic device, comprising a memory, a processor, and a computer program stored in the memory and executable on the processor, wherein the processor, when executing the computer program, implements the method according to any one of claims 1 to 9.
  20. A computer-readable storage medium having a computer program stored thereon, wherein the computer program, when executed by a processor, implements the method according to any one of claims 1 to 9.
  21. A computer program product, comprising a computer program, wherein the computer program, when executed by a processor, implements the method according to any one of claims 1 to 9.
  22. A computer program, wherein the computer program comprises computer program code which, when run on a computer, causes the computer to perform the method according to any one of claims 1 to 9.
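
The completeness and consistency checks recited in claims 2 to 4 can be illustrated with a minimal Python sketch. This is not the applicant's implementation: the knowledge graph is stood in for by a plain dictionary (`REQUIRED`), OCR and information extraction are assumed to have already produced a field-to-value mapping for each document, and every name below (`SubmittedDocument`, `missing_documents`, `missing_fields`, `inconsistent_fields`, the document and field identifiers) is hypothetical.

```python
from dataclasses import dataclass
from typing import Dict, List

# Hypothetical stand-in for the knowledge graph of one target business matter:
# each required document identifier maps to the fields that document must contain.
REQUIRED: Dict[str, List[str]] = {
    "application_form": ["applicant", "product_name"],
    "manufacturing_license": ["applicant", "license_no"],
}


@dataclass
class SubmittedDocument:
    doc_id: str             # identifier of the document to be reviewed
    fields: Dict[str, str]  # extracted field name -> value ("" if no value found)


def missing_documents(docs: List[SubmittedDocument],
                      required: Dict[str, List[str]]) -> List[str]:
    """Claim 2 style check: required document identifiers that were not submitted."""
    submitted = {d.doc_id for d in docs}
    return sorted(set(required) - submitted)


def missing_fields(docs: List[SubmittedDocument],
                   required: Dict[str, List[str]]) -> Dict[str, List[str]]:
    """Claim 3 style check: required fields that are absent or have no value."""
    problems: Dict[str, List[str]] = {}
    for d in docs:
        gaps = [f for f in required.get(d.doc_id, [])
                if not d.fields.get(f, "").strip()]
        if gaps:
            problems[d.doc_id] = gaps
    return problems


def inconsistent_fields(docs: List[SubmittedDocument]) -> Dict[str, Dict[str, str]]:
    """Claim 4 style check: fields shared across documents whose values differ."""
    seen: Dict[str, Dict[str, str]] = {}
    for d in docs:
        for name, value in d.fields.items():
            if value.strip():  # only compare fields that actually have a value
                seen.setdefault(name, {})[d.doc_id] = value.strip()
    return {name: by_doc for name, by_doc in seen.items()
            if len(set(by_doc.values())) > 1}
```

In this sketch the three functions return, respectively, the missing document identifiers, the missing fields per document, and the conflicting values per field, which is the kind of information the modification suggestions of claim 1 could be built from. Exact string comparison is a simplification chosen for brevity; the claims do not specify how field values are compared.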
PCT/CN2023/116767 2022-09-13 2023-09-04 Document review method and apparatus for implementing ia by combining rpa and ai, and electronic device WO2024055862A1 (en)

Applications Claiming Priority (2)

Application Number Priority Date Filing Date Title
CN202211110169.3A CN115511441A (en) 2022-09-13 2022-09-13 Document review method and apparatus for implementing IA (intelligent automation) by combining RPA (robotic process automation) and AI (artificial intelligence), and electronic device
CN202211110169.3 2022-09-13

Publications (1)

Publication Number Publication Date
WO2024055862A1 (en) 2024-03-21

Family

ID=84503117

Family Applications (1)

Application Number Title Priority Date Filing Date
PCT/CN2023/116767 WO2024055862A1 (en) 2022-09-13 2023-09-04 Document review method and apparatus for implementing ia by combining rpa and ai, and electronic device

Country Status (2)

Country Link
CN (1) CN115511441A (en)
WO (1) WO2024055862A1 (en)

Citations (4)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN110134800A (en) * 2019-04-17 2019-08-16 深圳壹账通智能科技有限公司 A kind of document relationships visible processing method and device
CN110852065A (en) * 2019-11-07 2020-02-28 达而观信息科技(上海)有限公司 Document auditing method, device, system, equipment and storage medium
US20200134757A1 (en) * 2018-10-30 2020-04-30 International Business Machines Corporation Extracting, deriving, and using legal matter semantics to generate e-discovery queries in an e-discovery system
CN114186019A (en) * 2021-11-03 2022-03-15 北京来也网络科技有限公司 Enterprise project auditing method and device combining RPA and AI

Also Published As

Publication number Publication date
CN115511441A (en) 2022-12-23

Legal Events

Date Code Title Description
121 Ep: the epo has been informed by wipo that ep was designated in this application

Ref document number: 23864594

Country of ref document: EP

Kind code of ref document: A1