WO2022142116A1 - 文档处理方法、装置、电子设备及存储介质 - Google Patents

文档处理方法、装置、电子设备及存储介质 Download PDF

Info

Publication number
WO2022142116A1
WO2022142116A1 PCT/CN2021/096932 CN2021096932W WO2022142116A1 WO 2022142116 A1 WO2022142116 A1 WO 2022142116A1 CN 2021096932 W CN2021096932 W CN 2021096932W WO 2022142116 A1 WO2022142116 A1 WO 2022142116A1
Authority
WO
WIPO (PCT)
Prior art keywords
document
target document
target
entity
preset
Prior art date
Application number
PCT/CN2021/096932
Other languages
English (en)
French (fr)
Inventor
王强
Original Assignee
平安科技(深圳)有限公司
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
Filing date
Publication date
Application filed by 平安科技(深圳)有限公司 filed Critical 平安科技(深圳)有限公司
Publication of WO2022142116A1 publication Critical patent/WO2022142116A1/zh

Links

Images

Classifications

    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06FELECTRIC DIGITAL DATA PROCESSING
    • G06F16/00Information retrieval; Database structures therefor; File system structures therefor
    • G06F16/30Information retrieval; Database structures therefor; File system structures therefor of unstructured textual data
    • G06F16/34Browsing; Visualisation therefor
    • G06F16/345Summarisation for human users
    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06FELECTRIC DIGITAL DATA PROCESSING
    • G06F16/00Information retrieval; Database structures therefor; File system structures therefor
    • G06F16/30Information retrieval; Database structures therefor; File system structures therefor of unstructured textual data
    • G06F16/33Querying
    • G06F16/332Query formulation
    • G06F16/3329Natural language query formulation or dialogue systems
    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06FELECTRIC DIGITAL DATA PROCESSING
    • G06F16/00Information retrieval; Database structures therefor; File system structures therefor
    • G06F16/30Information retrieval; Database structures therefor; File system structures therefor of unstructured textual data
    • G06F16/35Clustering; Classification

Definitions

  • the present application relates to the technical field of artificial intelligence, and in particular, to a document processing method, apparatus, electronic device and storage medium.
  • IP Initial Public Offering
  • relevant institutions need to review the qualifications of the applicants, and only the applicants who have passed the review are eligible to subscribe.
  • Relevant institutions need to review the relationship between the applicant and the issuing company, the relationship between the applicant and the issuing agent, etc., to determine the subscription qualification of the applicant.
  • the previous practice was to manually upload the audit materials, and then the relevant staff manually marked the relationship between the applicant and the issuing company (or issuing agent), and then marked it for the reviewers to review.
  • the inventor found that the previous practice Inefficiency, error-prone, and high labor and time costs.
  • the purpose of this application is to provide a document processing method, device, electronic device and storage medium.
  • a first aspect of the present application provides a document processing method, including:
  • target document meets the preset requirements, classify and label the target document based on the pre-trained BERT model, and output the category and label corresponding to the target document.
  • a second aspect of the present application provides a document processing device, comprising:
  • a receiving module configured to receive a target document uploaded by a user, generate a document abstract of the target document through a preset automatic abstract tool, and extract target keywords from the document abstract;
  • a judgment module configured to judge whether the target document meets a preset requirement according to the number of extracted target keywords, where the preset requirement is that the number of target keywords contained in the document is greater than a preset threshold;
  • the labeling module is used for classifying and labeling the target document based on the pre-trained BERT model if the target document meets the preset requirements, and outputting the category and label corresponding to the target document.
  • a third aspect of the present application provides an electronic device, comprising: a memory, a processor, and a computer program stored on the memory and executable on the processor, wherein the processor executes the computer program when running the computer program Do the following:
  • the preset requirement is that the quantity of the target keywords contained in the document is greater than a preset threshold
  • target document meets the preset requirements, classify and label the target document based on the pre-trained BERT model, and output the category and label corresponding to the target document.
  • a fourth aspect of the present application provides a computer-readable medium, in which computer-readable instructions are stored, and the computer-readable instructions can be executed by a processor to achieve the following operations:
  • target document meets the preset requirements, classify and label the target document based on the pre-trained BERT model, and output the category and label corresponding to the target document.
  • a fifth aspect of the present application provides a document processing device, including:
  • a receiving unit configured to receive a target document uploaded by a user, generate a document abstract of the target document through a preset automatic abstract tool, and extract target keywords from the document abstract;
  • a judgment unit configured to judge whether the target document meets a preset requirement according to the number of extracted target keywords, where the preset requirement is that the number of target keywords contained in the document is greater than a preset threshold;
  • the labeling unit is configured to classify and label the target document based on the pre-trained BERT model if the target document meets the preset requirements, and output the category and label corresponding to the target document.
  • the document processing method, device, device and storage medium receive a target document uploaded by a user, generate a document abstract of the target document through a preset automatic abstract tool, and extract the document from the document. Extract target keywords from the abstract; determine whether the target document meets the preset requirements according to the number of extracted target keywords, where the preset requirement is that the number of target keywords contained in the document is greater than a preset threshold; if the target If the document meets the preset requirements, classify and label the target document based on the pre-trained BERT model, and output the category and label corresponding to the target document.
  • the relationship between the applicant and the issuing enterprise can be automatically classified and identified according to the investment relationship certification document uploaded by the applicant, without the need for manual identification by the user, which greatly reduces the consumption of manpower and time.
  • FIG. 1 shows one of the flow charts of a document processing method provided by the present application
  • FIG. 2 shows the second flowchart of a document processing method provided by the present application
  • FIG. 3 shows the third flowchart of a document processing method provided by the present application
  • FIG. 4 shows a schematic diagram of a document processing apparatus provided by some embodiments of the present application.
  • FIG. 5 shows a schematic diagram of an electronic device provided by some embodiments of the present application.
  • FIG. 6 shows a schematic diagram of a computer-readable storage medium provided by some embodiments of the present application.
  • Embodiments of the present application provide a document processing method, device, and device, an electronic device, and a computer-readable storage medium, which are described below with reference to the accompanying drawings.
  • FIG. 1 shows a flowchart of a document processing method provided by some embodiments of the present application.
  • the document processing method may include the following steps:
  • Step S101 Receive a target document uploaded by a user, generate a document abstract of the target document through a preset automatic abstract tool, and extract target keywords from the document abstract;
  • Step S102 Judging whether the target document meets a preset requirement according to the number of extracted target keywords, where the preset requirement is that the number of target keywords contained in the document is greater than a preset threshold;
  • Step S103 If the target document meets the preset requirements, classify and label the target document based on the pre-trained BERT model, and output the category and label corresponding to the target document.
  • the user can be a subscriber who issues stocks issued by the issuing company, and the target document can be an investment relationship certification document uploaded by the subscriber.
  • the investment relationship can include the following:
  • the applicant is an employee or former employee of the issuing enterprise
  • the subscriber is the cornerstone investor of the issuing enterprise
  • the subscriber is a shareholder of the issuing company
  • the requisitioner is the shareholder of the agent (or sponsor).
  • step S101 after receiving the target document uploaded by the user, it is necessary to judge the correlation between the content of the target document and the investment relationship. If the correlation is very low, the target document uploaded by the user is considered invalid, and the user can be prompted that the target document does not meet the requirements. , need to re-upload.
  • the target keywords may be employees, shareholders, agents, etc. Specifically, it may be determined whether the target document uploaded by the user conforms to the preset value according to whether the number of target keywords contained in the document is greater than a preset threshold. Require.
  • judging whether the target document meets the preset requirements in step S101 can be specifically implemented as: generating a document abstract of the target document by using a preset automatic abstract tool, and extracting target keywords from the document abstract; in step S102 Whether the target document meets the preset requirements is determined according to the number of the extracted target keywords.
  • the target document meets the preset requirements, it means that the content of the target document has a high correlation with the investment relationship; if the target document does not meet the preset requirements, it means that the content of the target document has a low correlation with the investment relationship .
  • step S103 if the target document meets the preset requirements, then classify and label the target document based on the pre-trained BERT model, and output the corresponding category and label of the target document.
  • a pre-trained BERT model can be used to perform content reading comprehension and semantic analysis on the target document.
  • the full name of BERT is Bidirectional Encoder Representation from Transformers, which is a multi-layer bidirectional conversion encoder.
  • BERT is a pre-training language model.
  • the purpose of pre-training is to train the underlying and common partial models in downstream tasks in advance, and then use the respective sample data of downstream tasks (such as machine translation, reading comprehension, etc.) to train the respective models. model, which can greatly speed up the convergence rate.
  • the sample data in this application can be multi-party data including issuing companies, agents and applicants, as well as publicly available related data.
  • the operation of classifying and labeling the target document based on the pre-trained BERT model in step S102 can be specifically implemented as:
  • Step S201 Input the target document into a pre-trained BERT model to obtain the issuing enterprise entity, the agent entity and the purchaser entity in the target document;
  • Step S202 According to the issuing enterprise entity and the agent entity, search for the associated paragraph of the issuing enterprise entity and the associated paragraph of the agent entity in the target document;
  • Step S203 Using the pre-trained semantic analysis model, extract the investment relationship between the issuing enterprise entity and the applicant entity from the associated paragraph of the issuing enterprise entity, and extract the agent from the associated paragraph of the agent entity. the investment relationship between the entity and the applicant entity;
  • Step S204 According to the preset corresponding relationship between the investment relationship and the category, classify and label the investment relationship.
  • machine content reading comprehension and semantic analysis are performed on the investment relationship proof document to identify the entity.
  • Find the issuing company, agent, and purchaser in the entity and then find the relevant paragraph for the identified issuing company entity, analyze the relationship between entities according to semantics, identify the relationship and label it, and if it is an employee relationship, mark the document.
  • the model parameters of the initial BERT model are adjusted to obtain the BERT model.
  • the semantic analysis model is obtained by training with a related model, and details are not described here.
  • the document processing method receives a target document uploaded by a user, and determines whether the target document meets a preset requirement, where the preset requirement is that the number of target keywords contained in the document is greater than a preset threshold; if If the target document meets the preset requirements, classify and label the target document based on the pre-trained BERT model, and output the category and label corresponding to the target document.
  • the relationship between the applicant and the issuing enterprise can be automatically classified and identified according to the investment relationship certification document uploaded by the applicant, without the need for manual identification by the user, which greatly reduces the consumption of manpower and time.
  • a prompt message is sent to remind the user that the target document does not meet the requirements and needs to be re-uploaded.
  • the document processing method of the above embodiment may further include:
  • Step S301 Perform pedigree analysis on pre-acquired multi-party data including issuers, agents and subscribers, so as to verify the authenticity of the investment relationship;
  • Step S302 Update the investment relationship according to the authenticity check result, and classify and label the updated investment relationship.
  • the subscriber of A is actually a shareholder of company B, and then the issuer is held by company B, so the subscriber of A is actually the shareholder of the issuing company.
  • This relationship needs to be traced through public data. , pedigree analysis to identify. If the documents uploaded by the applicant are incomplete, some investment relationships may be intentionally missed. Therefore, it is necessary to verify the authenticity of the investment relationship through public data or data provided by the issuing company to ensure a comprehensive identification of the applicant's investment relationship.
  • pedigree analysis is as follows:
  • a Use the public data to analyze the shareholders and types of the issuing company.
  • the personal type is recorded in the data table person_inv
  • the enterprise type is recorded in the data table etp_inv (the table contains fields: company name, whether it has been verified).
  • etp_inv For the enterprises recorded in etp_inv, perform step a analysis, and continue to analyze the shareholders and types of enterprises. If it is a personal type shareholder, it is recorded in the table person_inv, and if it is an enterprise type, it is recorded in the data table etp_inv.
  • the document processing method of the above embodiment may further include:
  • Step S303 Based on the category and label corresponding to the target document, an audit report is generated according to a preset report template.
  • an audit report is generated according to a preset report template.
  • the document processing method of the above embodiment may further include: storing the audit report and the target document in a database correspondingly for subsequent use.
  • the relationship between the applicant and the issuing company is automatically classified and identified according to the investment relationship certification document uploaded by the applicant, without the need for manual identification by the user, which greatly reduces the consumption of manpower and time, and generates an audit report for the applicant for supervision. Personnel audit to improve audit efficiency.
  • a document processing method is provided, and correspondingly, the present application also provides a document processing apparatus.
  • the document processing apparatus provided by the embodiment of the present application may implement the above-mentioned document processing method, and the document processing apparatus may be implemented by software, hardware, or a combination of software and hardware.
  • the document processing apparatus may include integrated or separate functional modules or units to perform corresponding steps in the above-mentioned methods.
  • FIG. 4 shows a schematic diagram of a document processing apparatus provided by some embodiments of the present application. Since the apparatus embodiments are basically similar to the method embodiments, the description is relatively simple, and reference may be made to some descriptions of the method embodiments for related parts.
  • the device embodiments described below are merely illustrative.
  • the document processing apparatus 10 may include:
  • a receiving module 101 is configured to receive a target document uploaded by a user, generate a document abstract of the target document through a preset automatic abstract tool, and extract target keywords from the document abstract;
  • a judgment module 102 configured to judge whether the target document meets a preset requirement according to the number of extracted target keywords, where the preset requirement is that the number of target keywords contained in the document is greater than a preset threshold;
  • the labeling module 103 is configured to classify and label the target document based on the pre-trained BERT model if the target document meets the preset requirements, and output the category and label corresponding to the target document.
  • the user can be the subscriber of the issuer's issued shares
  • the target document can be the investment relationship certification document uploaded by the subscriber.
  • the investment relationship certification document represents the subscriber and the issuing company.
  • the target document uploaded by the user After receiving the target document uploaded by the user, it is necessary to judge the correlation between the content of the target document and the investment relationship. If the correlation is very low, the target document uploaded by the user is considered invalid, and the user can be prompted that the target document does not meet the requirements and needs to be re-uploaded.
  • the target keywords may be employees, shareholders, agents, etc. Specifically, it may be determined whether the target document uploaded by the user meets the preset requirements according to whether the number of target keywords contained in the document is greater than a preset threshold.
  • the target document meets the preset requirements, it means that the content of the target document has a high correlation with the investment relationship; if the target document does not meet the preset requirements, it means that the content of the target document has a low correlation with the investment relationship .
  • the document processing apparatus 10 further includes:
  • the prompt module is used for sending a prompt message if the target document does not meet the preset requirements, so as to remind the user that the target document does not meet the requirements and needs to be re-uploaded.
  • the labeling module 102 is specifically used for:
  • the issuing enterprise entity and the agency entity find the relevant paragraph of the issuing enterprise entity and the associated paragraph of the agency entity in the target document;
  • the investment relationship between the issuing enterprise entity and the purchaser entity is extracted from the associated paragraphs of the issuing enterprise entity, and the agent entity and the subscription entity are extracted from the associated paragraphs of the agent entity. investment relationships between entities;
  • the investment relationship is classified and correspondingly labeled.
  • the investment relationship includes:
  • the applicant is an employee or former employee of the issuing enterprise
  • the subscriber is the cornerstone investor of the issuing enterprise
  • the subscriber is a shareholder of the issuing company
  • the requisitioner is the shareholder of the agent.
  • machine content reading comprehension and semantic analysis are performed on the investment relationship proof document to identify the entity.
  • Find the issuing company, agent, and purchaser in the entity and then find the relevant paragraph for the identified issuing company entity, analyze the relationship between entities according to semantics, identify the relationship and label it, and if it is an employee relationship, mark the document.
  • the document processing apparatus 10 further includes:
  • the verification module is used for the labeling module to perform pedigree analysis on the pre-acquired multi-party data including the issuer, the agent and the purchaser after classifying the investment relationship and labeling the investment relationship, so as to conduct a pedigree analysis on the investment relationship.
  • Authenticity check update the investment relationship according to the authenticity check result, and classify and label the updated investment relationship.
  • the document processing apparatus further includes:
  • the report generation module is configured to generate an audit report according to a preset report template based on the category and label corresponding to the target document.
  • the subscriber of A is actually a shareholder of company B, and then the issuer is held by company B, so the subscriber of A is actually the shareholder of the issuing company.
  • This relationship needs to be traced through public data. , pedigree analysis to identify. If the documents uploaded by the applicant are incomplete, some investment relationships may be intentionally missed. Therefore, it is necessary to verify the authenticity of the investment relationship through public data or data provided by the issuing company to ensure a comprehensive identification of the applicant's investment relationship.
  • the document processing apparatus 10 further includes:
  • the report generation module is configured to generate an audit report according to a preset report template based on the category and label corresponding to the target document.
  • an audit report is generated according to a preset report template.
  • the relationship between the applicant and the issuing company is automatically classified and identified according to the investment relationship certification document uploaded by the applicant, without the need for manual identification by the user, which greatly reduces the consumption of manpower and time, and generates an audit report for the applicant for supervision. Personnel audit to improve audit efficiency.
  • the document processing apparatus 10 further includes:
  • a storage module configured to store the audit report and the target document in a database correspondingly.
  • the document processing device receives a target document uploaded by a user, and determines whether the target document meets a preset requirement, where the preset requirement is that the number of target keywords contained in the document is greater than a preset threshold; if If the target document meets the preset requirements, classify and label the target document based on the pre-trained BERT model, and output the category and label corresponding to the target document.
  • the device can automatically classify and identify the relationship between the applicant and the issuing enterprise according to the investment relationship certification document uploaded by the applicant, without requiring manual identification by the user, greatly reducing the consumption of manpower and time. And it can also automatically verify the authenticity of the investment relationship and generate an audit report to improve the audit efficiency.
  • the present application also provides a document processing apparatus.
  • the document processing device provided by the embodiment of the present application can implement the above-mentioned document processing method. Since the device embodiments are basically similar to the method embodiments, the description is relatively simple, and reference may be made to some descriptions of the method embodiments for related parts. The device embodiments described below are merely illustrative.
  • the document processing device may include:
  • a receiving unit configured to receive a target document uploaded by a user, generate a document abstract of the target document through a preset automatic abstract tool, and extract target keywords from the document abstract;
  • a judgment unit configured to judge whether the target document meets a preset requirement according to the number of extracted target keywords, where the preset requirement is that the number of target keywords contained in the document is greater than a preset threshold;
  • the labeling unit is configured to classify and label the target document based on the pre-trained BERT model if the target document meets the preset requirements, and output the category and label corresponding to the target document.
  • the user can be a subscriber who issues stocks issued by the issuing company, and the target document can be an investment relationship certification document uploaded by the subscriber.
  • the target document uploaded by the user After receiving the target document uploaded by the user, it is necessary to judge the correlation between the content of the target document and the investment relationship. If the correlation is very low, the target document uploaded by the user is considered invalid, and the user can be prompted that the target document does not meet the requirements and needs to be re-uploaded.
  • the target keywords may be employees, shareholders, agents, etc. Specifically, it may be determined whether the target document uploaded by the user meets the preset requirements according to whether the number of target keywords contained in the document is greater than a preset threshold.
  • the target document meets the preset requirements, it means that the content of the target document has a high correlation with the investment relationship; if the target document does not meet the preset requirements, it means that the content of the target document has a low correlation with the investment relationship .
  • the document processing device further includes:
  • a prompting unit configured to issue a prompt message if the target document does not meet the preset requirements to prompt the user that the target document does not meet the requirements and needs to be re-uploaded.
  • the labeling unit is specifically used for:
  • the issuing enterprise entity and the agency entity find the relevant paragraph of the issuing enterprise entity and the associated paragraph of the agency entity in the target document;
  • the investment relationship between the issuing enterprise entity and the purchaser entity is extracted from the associated paragraphs of the issuing enterprise entity, and the agent entity and the subscription entity are extracted from the associated paragraphs of the agent entity. investment relationships between entities;
  • the investment relationship is classified and correspondingly labeled.
  • the investment relationship includes:
  • the applicant is an employee or former employee of the issuing enterprise
  • the subscriber is the cornerstone investor of the issuing enterprise
  • the subscriber is a shareholder of the issuing company
  • the requisitioner is the shareholder of the agent.
  • machine content reading comprehension and semantic analysis are performed on the investment relationship proof document to identify the entity.
  • Find the issuing company, agent, and purchaser in the entity and then find the relevant paragraph for the identified issuing company entity, analyze the relationship between entities according to semantics, identify the relationship and label it, and if it is an employee relationship, mark the document.
  • the document processing device further includes:
  • the verification unit is used for the labeling unit to perform pedigree analysis on the pre-acquired multi-party data including the issuer, the agent and the purchaser after classifying the investment relationship and labeling the investment relationship, so as to carry out a pedigree analysis on the investment relationship.
  • Authenticity check update the investment relationship according to the authenticity check result, and classify and label the updated investment relationship.
  • the document processing device further includes:
  • a report generating unit configured to generate an audit report according to a preset report template based on the category and label corresponding to the target document.
  • the subscriber of A is actually a shareholder of company B, and then the issuer is held by company B, so the subscriber of A is actually the shareholder of the issuing company.
  • This relationship needs to be traced through public data. , pedigree analysis to identify. If the documents uploaded by the applicant are incomplete, some investment relationships may be intentionally missed. Therefore, it is necessary to verify the authenticity of the investment relationship through public data or data provided by the issuing company to ensure a comprehensive identification of the applicant's investment relationship.
  • the document processing device further includes:
  • a report generating unit configured to generate an audit report according to a preset report template based on the category and label corresponding to the target document.
  • an audit report is generated according to a preset report template.
  • the relationship between the applicant and the issuing company is automatically classified and identified according to the investment relationship certification document uploaded by the applicant, without the need for manual identification by the user, which greatly reduces the consumption of manpower and time, and generates an audit report for the applicant for supervision. Personnel audit to improve audit efficiency.
  • the document processing device further includes:
  • a storage unit configured to store the audit report and the target document in a database correspondingly.
  • Embodiments of the present application further provide an electronic device corresponding to the document processing method provided by the foregoing embodiments, the electronic device may be a mobile phone, a notebook computer, a tablet computer, a desktop computer, etc., to execute the above document processing method.
  • FIG. 5 shows a schematic diagram of an electronic device provided by some embodiments of the present application.
  • the electronic device 20 includes: a processor 200 , a memory 201 , a bus 202 and a communication interface 203 , the processor 200 , the communication interface 203 and the memory 201 are connected through the bus 202 ;
  • the memory 201 may include a high-speed random access memory (RAM: Random Access Memory), and may also include a non-volatile memory (non-volatile memory), such as at least one disk memory.
  • RAM Random Access Memory
  • non-volatile memory such as at least one disk memory.
  • the communication connection between the network element of the system and at least one other network element is realized through at least one communication interface 203 (which may be wired or wireless), which may use the Internet, a wide area network, a local network, a metropolitan area network, and the like.
  • the bus 202 may be an ISA bus, a PCI bus, an EISA bus, or the like.
  • the bus can be divided into an address bus, a data bus, a control bus, and the like.
  • the memory 201 is used to store a program, and the processor 200 executes the program after receiving the execution instruction.
  • the document processing method disclosed in any of the foregoing embodiments of the present application may be applied to the processor 200, Or implemented by the processor 200 .
  • the processor 200 may be an integrated circuit chip with signal processing capability. In the implementation process, each step of the above-mentioned method can be completed by an integrated logic circuit of hardware in the processor 200 or an instruction in the form of software.
  • the above-mentioned processor 200 can be a general-purpose processor, including a central processing unit (Central Processing Unit, referred to as CPU), a network processor (Network Processor, referred to as NP), etc.; it can also be a digital signal processor (DSP), an application-specific integrated circuit. (ASIC), off-the-shelf programmable gate array (FPGA) or other programmable logic device, discrete gate or transistor logic device, discrete hardware component.
  • CPU Central Processing Unit
  • NP Network Processor
  • DSP digital signal processor
  • ASIC application-specific integrated circuit
  • FPGA off-the-shelf programmable gate array
  • a general purpose processor may be a microprocessor or the processor may be any conventional processor or the like.
  • the steps of the methods disclosed in conjunction with the embodiments of the present application may be directly embodied as executed by a hardware decoding processor, or executed by a combination of hardware and software modules in the decoding processor.
  • the software module may be located in random access memory, flash memory, read-only memory, programmable read-only memory or electrically erasable programmable memory, registers and other storage media mature in the art.
  • the storage medium is located in the memory 201, and the processor 200 reads the information in the memory 201, and completes the steps of the above method in combination with its hardware.
  • the electronic device provided by the embodiment of the present application and the document processing method provided by the embodiment of the present application are based on the same inventive concept, and have the same beneficial effects as the method adopted, executed or realized.
  • Embodiments of the present application further provide a computer-readable storage medium corresponding to the document processing method provided by the foregoing embodiments, please refer to FIG. 6 , the computer-readable storage medium shown is an optical disc 30 on which a computer program is stored (ie, a program product), when the computer program is executed by the processor, it will execute the document processing method provided by any of the foregoing embodiments.
  • a computer program ie, a program product
  • the computer-readable storage medium may be non-volatile or volatile, including but not limited to phase-change memory (PRAM), static random access memory (SRAM), dynamic random access memory access memory (DRAM), other types of random access memory (RAM), read only memory (ROM), electrically erasable programmable read only memory (EEPROM), flash memory, or other optical, magnetic storage media, in This will not be repeated one by one.
  • PRAM phase-change memory
  • SRAM static random access memory
  • DRAM dynamic random access memory access memory
  • RAM random access memory
  • ROM read only memory
  • EEPROM electrically erasable programmable read only memory
  • flash memory or other optical, magnetic storage media, in This will not be repeated one by one.
  • the computer-readable storage medium provided by the above-mentioned embodiments of the present application and the document processing method provided by the embodiments of the present application are based on the same inventive concept, and have the same beneficial effects as the methods used, executed or implemented by the application programs stored therein.
  • each block in the flowchart or block diagrams may represent a module, segment, or portion of code that contains one or more functions for implementing the specified logical function(s) executable instructions.
  • the functions noted in the blocks may occur out of the order noted in the figures. For example, two blocks in succession may, in fact, be executed substantially concurrently, or the blocks may sometimes be executed in the reverse order, depending upon the functionality involved.
  • each block of the block diagrams and/or flowchart illustrations, and combinations of blocks in the block diagrams and/or flowchart illustrations can be implemented in dedicated hardware-based systems that perform the specified functions or actions , or can be implemented in a combination of dedicated hardware and computer instructions.

Landscapes

  • Engineering & Computer Science (AREA)
  • Theoretical Computer Science (AREA)
  • Physics & Mathematics (AREA)
  • Data Mining & Analysis (AREA)
  • Databases & Information Systems (AREA)
  • General Engineering & Computer Science (AREA)
  • General Physics & Mathematics (AREA)
  • Mathematical Physics (AREA)
  • Artificial Intelligence (AREA)
  • Human Computer Interaction (AREA)
  • Computational Linguistics (AREA)
  • Management, Administration, Business Operations System, And Electronic Commerce (AREA)

Abstract

一种文档处理方法、装置、电子设备及计算机可读存储介质。其中方法包括:接收用户上传的目标文档,通过预设的自动摘要工具生成所述目标文档的文档摘要,并从所述文档摘要中提取目标关键词(S101);根据提取的目标关键词的数量判断所述目标文档是否符合预设要求,所述预设要求为文档中包含的目标关键词的数量大于预设阈值(S102);若所述目标文档符合预设要求,则基于预先训练的BERT模型对所述目标文档进行分类及打标签操作,并输出所述目标文档对应的类别及标签(S103)。相较于现有技术,通过本方案,可以根据申购者上传的投资关系证明文档对申购者与发行企业的关系进行自动分类标识,无需用户手动标识,大大减少了人力和时间的消耗。

Description

文档处理方法、装置、电子设备及存储介质
本申请要求于2020年12月28日提交中国专利局、申请号为202011583169.6、申请名称为“文档处理方法、装置、电子设备及存储介质”的中国专利申请的优先权,其全部内容通过引用结合在本申请中。
技术领域
本申请涉及人工智能技术领域,具体涉及一种文档处理方法、装置、电子设备及存储介质。
背景技术
狭义的公司上市即首次公开募股Initial Public Offerings(IPO)指企业通过证券交易所首次公开向投资者增发股票,以期募集用于企业发展资金的过程。
许多新股上市以后,相关机构需要对申购者的资格进行审核,只有审核通过的申购者才具有申购资格。相关机构需要对申购者与发行企业的关系、申购者与发行代理的关系等进行审核,以判断申购者的申购资格。
对于申购者提供的投资关系证明文档,以往做法就是人工上传审核资料,然后相关工作人员手动标记申购者与发行企业(或发行代理)的关系,标记之后供审核人员进行审核,发明人发现以往做法效率低,且容易出错,人力和时间成本较高。
发明内容
本申请的目的是提供一种文档处理方法、装置、电子设备及存储介质。
本申请第一方面提供一种文档处理方法,包括:
接收用户上传的目标文档,通过预设的自动摘要工具生成所述目标文档的文档摘要,并从所述文档摘要中提取目标关键词;
根据提取的目标关键词的数量判断所述目标文档是否符合预设要求,所述预设要求为文档中包含的目标关键词的数量大于预设阈值;
若所述目标文档符合预设要求,则基于预先训练的BERT模型对所述目标文档进行分类及打标签操作,并输出所述目标文档对应的类别及标签。
本申请第二方面提供一种文档处理装置,包括:
接收模块,用于接收用户上传的目标文档,通过预设的自动摘要工具生成所述目标文档的文档摘要,并从所述文档摘要中提取目标关键词;
判断模块,用于根据提取的目标关键词的数量判断所述目标文档是否符合预设要求,所述预设要求为文档中包含的目标关键词的数量大于预设阈值;
打标签模块,用于若所述目标文档符合预设要求,则基于预先训练的BERT模型对所述目标文档进行分类及打标签操作,并输出所述目标文档对应的类别及标签。
本申请第三方面提供一种电子设备,包括:存储器、处理器及存储在所述存储器上并可在所述处理器上运行的计算机程序,其中,所述处理器运行所述计算机程序时执行以下操作:
接收用户上传的目标文档,通过预设的自动摘要工具生成所述目标文档的文档摘要,并从所述文档摘要中提取目标关键词;
根据提取的目标关键词的数量判断所述目标文档是否符合预设要求,所述 预设要求为文档中包含的目标关键词的数量大于预设阈值;
若所述目标文档符合预设要求,则基于预先训练的BERT模型对所述目标文档进行分类及打标签操作,并输出所述目标文档对应的类别及标签。
本申请第四方面提供一种计算机可读介质,其中存储有计算机可读指令,所述计算机可读指令可被处理器执行以实现以下操作:
接收用户上传的目标文档,通过预设的自动摘要工具生成所述目标文档的文档摘要,并从所述文档摘要中提取目标关键词;
根据提取的目标关键词的数量判断所述目标文档是否符合预设要求,所述预设要求为文档中包含的目标关键词的数量大于预设阈值;
若所述目标文档符合预设要求,则基于预先训练的BERT模型对所述目标文档进行分类及打标签操作,并输出所述目标文档对应的类别及标签。
本申请第五方面提供一种文档处理设备,包括:
接收单元,用于接收用户上传的目标文档,通过预设的自动摘要工具生成所述目标文档的文档摘要,并从所述文档摘要中提取目标关键词;
判断单元,用于根据提取的目标关键词的数量判断所述目标文档是否符合预设要求,所述预设要求为文档中包含的目标关键词的数量大于预设阈值;
打标签单元,用于若所述目标文档符合预设要求,则基于预先训练的BERT模型对所述目标文档进行分类及打标签操作,并输出所述目标文档对应的类别及标签。
相较于现有技术,本申请提供的文档处理方法、装置、设备及存储介质,接收用户上传的目标文档,通过预设的自动摘要工具生成所述目标文档的文档摘要,并从所述文档摘要中提取目标关键词;根据提取的目标关键词的数量判断所述目标文档是否符合预设要求,所述预设要求为文档中包含的目标关键词的数量大于预设阈值;若所述目标文档符合预设要求,则基于预先训练的BERT模型对所述目标文档进行分类及打标签操作,并输出所述目标文档对应的类别及标签。相较于现有技术,通过本方案,可以根据申购者上传的投资关系证明文档对申购者与发行企业的关系进行自动分类标识,无需用户手动标识,大大减少了人力和时间的消耗。
附图说明
通过阅读下文优选实施方式的详细描述,各种其他的优点和益处对于本领域普通技术人员将变得清楚明了。附图仅用于示出优选实施方式的目的,而并不认为是对本申请的限制。而且在整个附图中,用相同的参考符号表示相同的部件。在附图中:
图1示出了本申请所提供的一种文档处理方法的流程图之一;
图2示出了本申请所提供的一种文档处理方法的流程图之二;
图3示出了本申请所提供的一种文档处理方法的流程图之三;
图4示出了本申请的一些实施方式所提供的一种文档处理装置的示意图;
图5示出了本申请的一些实施方式所提供的一种电子设备的示意图;
图6示出了本申请的一些实施方式所提供的一种计算机可读存储介质的示意图。
具体实施方式
下面将参照附图更详细地描述本公开的示例性实施方式。虽然附图中显示 了本公开的示例性实施方式,然而应当理解,可以以各种形式实现本公开而不应被这里阐述的实施方式所限制。相反,提供这些实施方式是为了能够更透彻地理解本公开,并且能够将本公开的范围完整的传达给本领域的技术人员。
需要注意的是,除非另有说明,本申请使用的技术术语或者科学术语应当为本申请所属领域技术人员所理解的通常意义。
另外,术语“第一”和“第二”等是用于区别不同对象,而不是用于描述特定顺序。此外,术语“包括”和“具有”以及它们任何变形,意图在于覆盖不排他的包含。例如包含了一系列步骤或单元的过程、方法、系统、产品或设备没有限定于已列出的步骤或单元,而是可选地还包括没有列出的步骤或单元,或可选地还包括对于这些过程、方法、产品或设备固有的其它步骤或单元。
本申请实施例提供一种文档处理方法、设备及装置、一种电子设备以及计算机可读存储介质,下面结合附图进行说明。
请参考图1,其示出了本申请的一些实施方式所提供的一种文档处理方法的流程图,如图1所示,该文档处理方法,可以包括以下步骤:
步骤S101:接收用户上传的目标文档,通过预设的自动摘要工具生成所述目标文档的文档摘要,并从所述文档摘要中提取目标关键词;
步骤S102:根据提取的目标关键词的数量判断所述目标文档是否符合预设要求,所述预设要求为文档中包含的目标关键词的数量大于预设阈值;
步骤S103:若所述目标文档符合预设要求,则基于预先训练的BERT模型对所述目标文档进行分类及打标签操作,并输出所述目标文档对应的类别及标签。
实际应用中,用户可以是发行企业发行股票的申购者,目标文档可以是申购者上传的投资关系证明文档,投资关系证明文档是表征申购者与发行企业(或发行代理)投资关系的证明文件。
具体的,投资关系可以包括以下几种:
申购者是发行企业的雇员或者前雇员;
申购者是发行企业的基石投资者;
申购者是发行企业的股东;
申购者是代理商(或保荐商)的股东。
在步骤S101中,接收到用户上传的目标文档后,需要判断目标文档的内容与投资关系的相关性,若相关性很低,则认为用户上传的目标文档无效,可以提示用户目标文档不符合要求,需重新上传。
具体的,在步骤S101中,目标关键词可以是雇员、股东、代理商等,具体可以根据文档中包含的目标关键词的数量是否大于预设阈值,来判断用户上传的目标文档是否符合预设要求。
具体的,步骤S101中判断目标文档是否符合预设要求,可以具体实现为:通过预设的自动摘要工具生成目标文档的文档摘要,并从所述文档摘要中提取目标关键词;在步骤S102中根据提取的目标关键词的数量判断目标文档是否符合预设要求。
若所述目标文档符合预设要求,则说明目标文档的内容与投资关系的相关性较高,若所述目标文档不符合预设要求,则说明目标文档的内容与投资关系的相关性较低。
在步骤S103中,若所述目标文档符合预设要求,则基于预先训练的BERT模型对所述目标文档进行分类及打标签操作,并输出所述目标文档对应的类别 及标签。
具体的,可以采用预先训练好的BERT模型来对目标文档进行内容阅读理解和语义分析。BERT的全称是Bidirectional Encoder Representation from Transformers,即多层双向转换编码器。
BERT是一种预训练语言模型,预训练的目的就是,提前训练好下游任务中底层的、共性的部分模型,然后再用下游任务(比如机器翻译、阅读理解等)各自的样本数据来训练各自的模型,这样就可以极大地加快收敛速度。本申请中的样本数据可以是包括发行企业、代理商和申购者的多方数据,以及公开可获知的相关数据。
根据本申请的一些实施方式中,步骤S102中基于预先训练的BERT模型对所述目标文档进行分类及打标签操作,可以具体实现为:
步骤S201:将所述目标文档输入预先训练的BERT模型,得到所述目标文档中的发行企业实体、代理商实体和申购者实体;
步骤S202:根据所述发行企业实体和所述代理商实体,在所述目标文档中找寻所述发行企业实体的关联段落,及所述代理商实体的关联段落;
步骤S203:利用预先训练好的语义分析模型,从所述发行企业实体的关联段落中提取发行企业实体和申购者实体之间的投资关系,及从所述代理商实体的关联段落中提取代理商实体和申购者实体之间的投资关系;
步骤S204:根据预设的投资关系和类别的对应关系,对所述投资关系进行分类以及打上相应标签。
具体的,对投资关系证明文档进行机器内容阅读理解和语义分析,识别出实体。在实体中找到发行企业、代理商、申购者,然后对识别出来的发行企业实体,找寻相关段落,根据语义分析实体之间的关系,识别关系并打上标签,如果是雇员关系,则将文档标记雇员标签,如果是基石投资者关系,则将文档标记基石投资者标签,如果是股东关系,则将文档标记股东标签,等等。若未识别出关系,则提示用户上传文档可能不符要求,然后要求用户详细说明文档中哪些描述能说明投资关系或者补充其他文档。
本申请中,所述BERT模型的训练过程如下:
确定训练文档和初始BERT模型;
将所述训练文档输入所述初始BERT模型;
采集所述初始BERT模型的输出,得到对应于所述训练文档的训练特征表示信息;
根据所述训练特征表示信息,确定所述训练文档的预测类别;
确定所述训练文档的实际类别,并根据所述实际类别与所述预测类别,得到反馈信息;
根据所述反馈信息,调整所述初始BERT模型的模型参数,以得到所述BERT模型。
本申请中,所述语义分析模型采用相关模型训练得到,在此不做赘述。
本申请实施例提供的文档处理方法,接收用户上传的目标文档,并判断所述目标文档是否符合预设要求,所述预设要求为文档中包含的目标关键词的数量大于预设阈值;若所述目标文档符合预设要求,则基于预先训练的BERT模型对所述目标文档进行分类及打标签操作,并输出所述目标文档对应的类别及标签。相较于现有技术,通过本方案,可以根据申购者上传的投资关系证明文档对申购者与发行企业的关系进行自动分类标识,无需用户手动标识,大大减 少了人力和时间的消耗。
根据本申请的一些实施方式中,若所述目标文档不符合预设要求,则发出提示信息,以提示用户所述目标文档不符合要求,需重新上传。
根据本申请的一些实施方式中,如图2所示,上述实施例的文档处理方法还可以包括:
步骤S301:对预先获取的包括发行企业、代理商和申购者的多方数据进行谱系分析,以对所述投资关系进行真实性核查;
步骤S302:根据真实性核查结果更新所述投资关系,并对更新后的投资关系进行分类以及打上相应标签。
具体的,以识别股份代持关系为例:比如A申购者实际是B公司股东,然后通过B公司持股发行企业,所以A申购者实际是发行企业的股东,这个关系需要通过公开数据追溯查询、谱系分析才能识别出来。如果申购者上传的文档不完整,可能故意漏掉一些投资关系,所以需要再通过公开数据或者发行企业提供的数据来进行投资关系的真实性核查,确保全面的识别申购者的投资关系。
具体的,谱系分析举例如下:
a.利用公开的数据,分析发行企业股东及类型,个人类型记录在数据表person_inv中,企业类型记录在数据表etp_inv中(表包含字段为:企业名、是否已核查)。
b.对记录在etp_inv中的企业,再进行a步骤分析,继续分析企业的股东及类型。如是个人类型股东,则记录在表person_inv中,如是企业类型记录在数据表etp_inv中。
完成此步核查后,在数据表etp_inv中记录此企业已核查(记录状态)。
c.逐个遍历核查etp_inv中的每个企业股东的股东,直到都是个人股东为止。
d.然后将数据表person_inv中的个人股东,与申购者根据姓名或者身份证号进行一一遍历匹配,看是否有匹配上的,如果有则再判断已识别的申购者的投资者关系是否包含此代持投资关系,若没有包含则提示申购者还需补充提交投资关系证明文档,否则申购将不通过。
本实施方式,基于发行企业、代理商和申购者提供的股权信息、财务数据等信息以及公开可获知的数据进行投资关系的真实性核查,挖掘和正确识别出新股申购者与发行企业、代理商的特殊关系,以满足监管要求。
根据本申请的一些实施方式中,如图3所示,上述实施例的文档处理方法还可以包括:
步骤S303:基于所述目标文档对应的类别及标签,根据预设报告模板生成审核报告。
具体的,基于得到的申购者与上市企业(或代理商)的投资关系,以及真实性核查结果,根据预设报告模板生成审核报告。
根据本申请的一些实施方式中,上述实施例的文档处理方法还可以包括:将所述审核报告和所述目标文档对应存储于数据库中,以便后续使用。
本实施方式,根据申购者上传的投资关系证明文档对申购者与发行企业的关系进行自动分类标识,无需用户手动标识,大大减少了人力和时间的消耗,并且生成申购者的审核报告,供监管人员审核,以提升审核效率。
在上述的实施例中,提供了一种文档处理方法,与之相对应的,本申请还提供一种文档处理装置。本申请实施例提供的文档处理装置可以实施上述文档 处理方法,该文档处理装置可以通过软件、硬件或软硬结合的方式来实现。例如,该文档处理装置可以包括集成的或分开的功能模块或单元来执行上述各方法中的对应步骤。请参考图4,其示出了本申请的一些实施方式所提供的一种文档处理装置的示意图。由于装置实施例基本相似于方法实施例,所以描述得比较简单,相关之处参见方法实施例的部分说明即可。下述描述的装置实施例仅仅是示意性的。
如图4所示,所述文档处理装置10,可以包括:
接收模块101,用于接收用户上传的目标文档,通过预设的自动摘要工具生成所述目标文档的文档摘要,并从所述文档摘要中提取目标关键词;
判断模块102,用于根据提取的目标关键词的数量判断所述目标文档是否符合预设要求,所述预设要求为文档中包含的目标关键词的数量大于预设阈值;
打标签模块103,用于若所述目标文档符合预设要求,则基于预先训练的BERT模型对所述目标文档进行分类及打标签操作,并输出所述目标文档对应的类别及标签。
实际应用中,用户可以是发行企业发行股票的申购者,目标文档可以是申购者上传的投资关系证明文档,投资关系证明文档是表征申购者与发行企业
(或发行代理)投资关系的证明文件。
接收到用户上传的目标文档后,需要判断目标文档的内容与投资关系的相关性,若相关性很低,则认为用户上传的目标文档无效,可以提示用户目标文档不符合要求,需重新上传。
具体的,目标关键词可以是雇员、股东、代理商等,具体可以根据文档中包含的目标关键词的数量是否大于预设阈值,来判断用户上传的目标文档是否符合预设要求。
若所述目标文档符合预设要求,则说明目标文档的内容与投资关系的相关性较高,若所述目标文档不符合预设要求,则说明目标文档的内容与投资关系的相关性较低。
在本申请实施例的一些实施方式中,所述文档处理装置10,还包括:
提示模块,用于若所述目标文档不符合预设要求,则发出提示信息,以提示用户所述目标文档不符合要求,需重新上传。
在本申请实施例的一些实施方式中,所述打标签模块102,具体用于:
将所述目标文档输入预先训练的BERT模型,得到所述目标文档中的发行企业实体、代理商实体和申购者实体;
根据所述发行企业实体和所述代理商实体,在所述目标文档中找寻所述发行企业实体的关联段落,及所述代理商实体的关联段落;
利用预先训练好的语义分析模型,从所述发行企业实体的关联段落中提取发行企业实体和申购者实体之间的投资关系,及从所述代理商实体的关联段落中提取代理商实体和申购者实体之间的投资关系;
根据预设的投资关系和类别的对应关系,对所述投资关系进行分类以及打上相应标签。
在本申请实施例的一些实施方式中,所述投资关系包括:
申购者是发行企业的雇员或者前雇员;
申购者是发行企业的基石投资者;
申购者是发行企业的股东;
申购者是代理商的股东。
具体的,对投资关系证明文档进行机器内容阅读理解和语义分析,识别出实体。在实体中找到发行企业、代理商、申购者,然后对识别出来的发行企业实体,找寻相关段落,根据语义分析实体之间的关系,识别关系并打上标签,如果是雇员关系,则将文档标记雇员标签,如果是基石投资者关系,则将文档标记基石投资者标签,如果是股东关系,则将文档标记股东标签,等等。若未识别出关系,则提示用户上传文档可能不符要求,然后要求用户详细说明文档中哪些描述能说明投资关系或者补充其他文档。
在本申请实施例的一些实施方式中,所述文档处理装置10,还包括:
核查模块,用于所述打标签模块对所述投资关系进行分类以及打上相应标签之后,对预先获取的包括发行企业、代理商和申购者的多方数据进行谱系分析,以对所述投资关系进行真实性核查;根据真实性核查结果更新所述投资关系,并对更新后的投资关系进行分类以及打上相应标签。
根据本申请的一些实施方式中,所述文档处理装置,还包括:
报告生成模块,用于基于所述目标文档对应的类别及标签,根据预设报告模板生成审核报告。
具体的,以识别股份代持关系为例:比如A申购者实际是B公司股东,然后通过B公司持股发行企业,所以A申购者实际是发行企业的股东,这个关系需要通过公开数据追溯查询、谱系分析才能识别出来。如果申购者上传的文档不完整,可能故意漏掉一些投资关系,所以需要再通过公开数据或者发行企业提供的数据来进行投资关系的真实性核查,确保全面的识别申购者的投资关系。
本实施方式,基于发行企业、代理商和申购者提供的股权信息、财务数据等信息以及公开可获知的数据进行投资关系的真实性核查,挖掘和正确识别出新股申购者与发行企业、代理商的特殊关系,以满足监管要求。
在本申请实施例的一些实施方式中,所述文档处理装置10,还包括:
报告生成模块,用于基于所述目标文档对应的类别及标签,根据预设报告模板生成审核报告。
具体的,基于得到的申购者与上市企业(或代理商)的投资关系,以及真实性核查结果,根据预设报告模板生成审核报告。
本实施方式,根据申购者上传的投资关系证明文档对申购者与发行企业的关系进行自动分类标识,无需用户手动标识,大大减少了人力和时间的消耗,并且生成申购者的审核报告,供监管人员审核,以提升审核效率。
在本申请实施例的一些实施方式中,所述文档处理装置10,还包括:
存储模块,用于将所述审核报告和所述目标文档对应存储于数据库中。
本申请实施例提供的文档处理装置,接收用户上传的目标文档,并判断所述目标文档是否符合预设要求,所述预设要求为文档中包含的目标关键词的数量大于预设阈值;若所述目标文档符合预设要求,则基于预先训练的BERT模型对所述目标文档进行分类及打标签操作,并输出所述目标文档对应的类别及标签。相较于现有技术,通过本装置,可以根据申购者上传的投资关系证明文档对申购者与发行企业的关系进行自动分类标识,无需用户手动标识,大大减少了人力和时间的消耗。并且还可以自动对投资关系进行真实性核查,生成审核报告,以提高审核效率。
本申请还提供一种文档处理设备。本申请实施例提供的文档处理设备可以实施上述文档处理方法。由于设备实施例基本相似于方法实施例,所以描述得比较简单,相关之处参见方法实施例的部分说明即可。下述描述的设备实施例 仅仅是示意性的。
所述文档处理设备,可以包括:
接收单元,用于接收用户上传的目标文档,通过预设的自动摘要工具生成所述目标文档的文档摘要,并从所述文档摘要中提取目标关键词;
判断单元,用于根据提取的目标关键词的数量判断所述目标文档是否符合预设要求,所述预设要求为文档中包含的目标关键词的数量大于预设阈值;
打标签单元,用于若所述目标文档符合预设要求,则基于预先训练的BERT模型对所述目标文档进行分类及打标签操作,并输出所述目标文档对应的类别及标签。
实际应用中,用户可以是发行企业发行股票的申购者,目标文档可以是申购者上传的投资关系证明文档,投资关系证明文档是表征申购者与发行企业(或发行代理)投资关系的证明文件。
接收到用户上传的目标文档后,需要判断目标文档的内容与投资关系的相关性,若相关性很低,则认为用户上传的目标文档无效,可以提示用户目标文档不符合要求,需重新上传。
具体的,目标关键词可以是雇员、股东、代理商等,具体可以根据文档中包含的目标关键词的数量是否大于预设阈值,来判断用户上传的目标文档是否符合预设要求。
若所述目标文档符合预设要求,则说明目标文档的内容与投资关系的相关性较高,若所述目标文档不符合预设要求,则说明目标文档的内容与投资关系的相关性较低。
在本申请实施例的一些实施方式中,所述文档处理设备,还包括:
提示单元,用于若所述目标文档不符合预设要求,则发出提示信息,以提示用户所述目标文档不符合要求,需重新上传。
在本申请实施例的一些实施方式中,所述打标签单元,具体用于:
将所述目标文档输入预先训练的BERT模型,得到所述目标文档中的发行企业实体、代理商实体和申购者实体;
根据所述发行企业实体和所述代理商实体,在所述目标文档中找寻所述发行企业实体的关联段落,及所述代理商实体的关联段落;
利用预先训练好的语义分析模型,从所述发行企业实体的关联段落中提取发行企业实体和申购者实体之间的投资关系,及从所述代理商实体的关联段落中提取代理商实体和申购者实体之间的投资关系;
根据预设的投资关系和类别的对应关系,对所述投资关系进行分类以及打上相应标签。
在本申请实施例的一些实施方式中,所述投资关系包括:
申购者是发行企业的雇员或者前雇员;
申购者是发行企业的基石投资者;
申购者是发行企业的股东;
申购者是代理商的股东。
具体的,对投资关系证明文档进行机器内容阅读理解和语义分析,识别出实体。在实体中找到发行企业、代理商、申购者,然后对识别出来的发行企业实体,找寻相关段落,根据语义分析实体之间的关系,识别关系并打上标签,如果是雇员关系,则将文档标记雇员标签,如果是基石投资者关系,则将文档标记基石投资者标签,如果是股东关系,则将文档标记股东标签,等等。若未 识别出关系,则提示用户上传文档可能不符要求,然后要求用户详细说明文档中哪些描述能说明投资关系或者补充其他文档。
在本申请实施例的一些实施方式中,所述文档处理设备,还包括:
核查单元,用于所述打标签单元对所述投资关系进行分类以及打上相应标签之后,对预先获取的包括发行企业、代理商和申购者的多方数据进行谱系分析,以对所述投资关系进行真实性核查;根据真实性核查结果更新所述投资关系,并对更新后的投资关系进行分类以及打上相应标签。
根据本申请的一些实施方式中,所述文档处理设备,还包括:
报告生成单元,用于基于所述目标文档对应的类别及标签,根据预设报告模板生成审核报告。
具体的,以识别股份代持关系为例:比如A申购者实际是B公司股东,然后通过B公司持股发行企业,所以A申购者实际是发行企业的股东,这个关系需要通过公开数据追溯查询、谱系分析才能识别出来。如果申购者上传的文档不完整,可能故意漏掉一些投资关系,所以需要再通过公开数据或者发行企业提供的数据来进行投资关系的真实性核查,确保全面的识别申购者的投资关系。
本实施方式,基于发行企业、代理商和申购者提供的股权信息、财务数据等信息以及公开可获知的数据进行投资关系的真实性核查,挖掘和正确识别出新股申购者与发行企业、代理商的特殊关系,以满足监管要求。
在本申请实施例的一些实施方式中,所述文档处理设备,还包括:
报告生成单元,用于基于所述目标文档对应的类别及标签,根据预设报告模板生成审核报告。
具体的,基于得到的申购者与上市企业(或代理商)的投资关系,以及真实性核查结果,根据预设报告模板生成审核报告。
本实施方式,根据申购者上传的投资关系证明文档对申购者与发行企业的关系进行自动分类标识,无需用户手动标识,大大减少了人力和时间的消耗,并且生成申购者的审核报告,供监管人员审核,以提升审核效率。
在本申请实施例的一些实施方式中,所述文档处理设备,还包括:
存储单元,用于将所述审核报告和所述目标文档对应存储于数据库中。
本申请实施例提供的文档处理设备,与本申请前述实施例提供的文档处理方法出于相同的发明构思,具有相同的有益效果。
本申请实施方式还提供一种与前述实施方式所提供的文档处理方法对应的电子设备,所述电子设备可以是手机、笔记本电脑、平板电脑、台式机电脑等,以执行上述文档处理方法。
请参考图5,其示出了本申请的一些实施方式所提供的一种电子设备的示意图。如图5所示,所述电子设备20包括:处理器200,存储器201,总线202和通信接口203,所述处理器200、通信接口203和存储器201通过总线202连接;所述存储器201中存储有可在所述处理器200上运行的计算机程序,所述处理器200运行所述计算机程序时执行本申请前述任一实施方式所提供的文档处理方法。
其中,存储器201可能包含高速随机存取存储器(RAM:Random Access Memory),也可能还包括非不稳定的存储器(non-volatile memory),例如至少一个磁盘存储器。通过至少一个通信接口203(可以是有线或者无线)实现该系统网元与至少一个其他网元之间的通信连接,可以使用互联网、广域网、本地网、城域网等。
总线202可以是ISA总线、PCI总线或EISA总线等。所述总线可以分为地址总线、数据总线、控制总线等。其中,存储器201用于存储程序,所述处理器200在接收到执行指令后,执行所述程序,前述本申请实施例任一实施方式揭示的所述文档处理方法可以应用于处理器200中,或者由处理器200实现。
处理器200可能是一种集成电路芯片,具有信号的处理能力。在实现过程中,上述方法的各步骤可以通过处理器200中的硬件的集成逻辑电路或者软件形式的指令完成。上述的处理器200可以是通用处理器,包括中央处理器(Central Processing Unit,简称CPU)、网络处理器(Network Processor,简称NP)等;还可以是数字信号处理器(DSP)、专用集成电路(ASIC)、现成可编程门阵列(FPGA)或者其他可编程逻辑器件、分立门或者晶体管逻辑器件、分立硬件组件。可以实现或者执行本申请实施例中的公开的各方法、步骤及逻辑框图。通用处理器可以是微处理器或者该处理器也可以是任何常规的处理器等。结合本申请实施例所公开的方法的步骤可以直接体现为硬件译码处理器执行完成,或者用译码处理器中的硬件及软件模块组合执行完成。软件模块可以位于随机存储器,闪存、只读存储器,可编程只读存储器或者电可擦写可编程存储器、寄存器等本领域成熟的存储介质中。该存储介质位于存储器201,处理器200读取存储器201中的信息,结合其硬件完成上述方法的步骤。
本申请实施例提供的电子设备与本申请实施例提供的文档处理方法出于相同的发明构思,具有与其采用、运行或实现的方法相同的有益效果。
本申请实施方式还提供一种与前述实施方式所提供的文档处理方法对应的计算机可读存储介质,请参考图6,其示出的计算机可读存储介质为光盘30,其上存储有计算机程序(即程序产品),所述计算机程序在被处理器运行时,会执行前述任意实施方式所提供的文档处理方法。
需要说明的是,所述计算机可读存储介质可以是非易失性,也可以是易失性的,可以包括但不限于相变内存(PRAM)、静态随机存取存储器(SRAM)、动态随机存取存储器(DRAM)、其他类型的随机存取存储器(RAM)、只读存储器(ROM)、电可擦除可编程只读存储器(EEPROM)、快闪记忆体或其他光学、磁性存储介质,在此不再一一赘述。
本申请的上述实施例提供的计算机可读存储介质与本申请实施例提供的文档处理方法出于相同的发明构思,具有与其存储的应用程序所采用、运行或实现的方法相同的有益效果。
需要说明的是,附图中的流程图和框图显示了根据本申请的多个实施例的系统、方法和计算机程序产品的可能实现的体系架构、功能和操作。在这点上,流程图或框图中的每个方框可以代表一个模块、程序段或代码的一部分,所述模块、程序段或代码的一部分包含一个或多个用于实现规定的逻辑功能的可执行指令。也应当注意,在有些作为替换的实现中,方框中所标注的功能也可以以不同于附图中所标注的顺序发生。例如,两个连续的方框实际上可以基本并行地执行,它们有时也可以按相反的顺序执行,这依所涉及的功能而定。也要注意的是,框图和/或流程图中的每个方框、以及框图和/或流程图中的方框的组合,可以用执行规定的功能或动作的专用的基于硬件的系统来实现,或者可以用专用硬件与计算机指令的组合来实现。
最后应说明的是:以上各实施例仅用以说明本申请的技术方案,而非对其限制;尽管参照前述各实施例对本申请进行了详细的说明,本领域的普通技术人员应当理解:其依然可以对前述各实施例所记载的技术方案进行修改,或者 对其中部分或者全部技术特征进行等同替换;而这些修改或者替换,并不使相应技术方案的本质脱离本申请各实施例技术方案的范围,其均应涵盖在本申请的权利要求和说明书的范围当中。

Claims (20)

  1. 一种文档处理方法,其中,包括:
    接收用户上传的目标文档,通过预设的自动摘要工具生成所述目标文档的文档摘要,并从所述文档摘要中提取目标关键词;
    根据提取的目标关键词的数量判断所述目标文档是否符合预设要求,所述预设要求为文档中包含的目标关键词的数量大于预设阈值;
    若所述目标文档符合预设要求,则基于预先训练的BERT模型对所述目标文档进行分类及打标签操作,并输出所述目标文档对应的类别及标签。
  2. 根据权利要求1所述的方法,其中,所述方法还包括:
    若所述目标文档不符合预设要求,则发出提示信息,以提示用户所述目标文档不符合要求,需重新上传。
  3. 根据权利要求1所述的方法,其中,所述基于预先训练的BERT模型对所述目标文档进行分类及打标签操作,包括:
    将所述目标文档输入预先训练的BERT模型,得到所述目标文档中的发行企业实体、代理商实体和申购者实体;
    根据所述发行企业实体和所述代理商实体,在所述目标文档中找寻所述发行企业实体的关联段落,及所述代理商实体的关联段落;
    利用预先训练好的语义分析模型,从所述发行企业实体的关联段落中提取发行企业实体和申购者实体之间的投资关系,及从所述代理商实体的关联段落中提取代理商实体和申购者实体之间的投资关系;
    根据预设的投资关系和类别的对应关系,对所述投资关系进行分类以及打上相应标签。
  4. 根据权利要求3所述的方法,其中,所述BERT模型的训练过程包括:
    确定训练文档和初始BERT模型;
    将所述训练文档输入所述初始BERT模型;
    采集所述初始BERT模型的输出,得到对应于所述训练文档的训练特征表示信息;
    根据所述训练特征表示信息,确定所述训练文档的预测类别;
    确定所述训练文档的实际类别,并根据所述实际类别与所述预测类别,得到反馈信息;
    根据所述反馈信息,调整所述初始BERT模型的模型参数,以得到所述BERT模型。
  5. 根据权利要求3所述的方法,其中,所述对所述投资关系进行分类以及打上相应标签之后,还包括:
    对预先获取的包括发行企业、代理商和申购者的多方数据进行谱系分析,以对所述投资关系进行真实性核查;
    根据真实性核查结果更新所述投资关系,并对更新后的投资关系进行分类以及打上相应标签。
  6. 根据权利要求3所述的方法,其中,所述方法还包括:
    基于所述目标文档对应的类别及标签,根据预设报告模板生成审核报告。
  7. 根据权利要求6所述的方法,其中,所述方法还包括:
    将所述审核报告和所述目标文档对应存储于数据库中。
  8. 一种电子设备,包括:存储器、处理器及存储在所述存储器上并可在所述处理器上运行的计算机程序,其中,所述处理器运行所述计算机程序时执行以下操作:
    接收用户上传的目标文档,通过预设的自动摘要工具生成所述目标文档的文档摘要,并从所述文档摘要中提取目标关键词;
    根据提取的目标关键词的数量判断所述目标文档是否符合预设要求,所述预设要 求为文档中包含的目标关键词的数量大于预设阈值;
    若所述目标文档符合预设要求,则基于预先训练的BERT模型对所述目标文档进行分类及打标签操作,并输出所述目标文档对应的类别及标签。
  9. 根据权利要求8所述的电子设备,其中,所述处理器还用于:
    若所述目标文档不符合预设要求,则发出提示信息,以提示用户所述目标文档不符合要求,需重新上传。
  10. 根据权利要求8所述的电子设备,其中,所述处理器在基于预先训练的BERT模型对所述目标文档进行分类及打标签操作时,具体用于:
    将所述目标文档输入预先训练的BERT模型,得到所述目标文档中的发行企业实体、代理商实体和申购者实体;
    根据所述发行企业实体和所述代理商实体,在所述目标文档中找寻所述发行企业实体的关联段落,及所述代理商实体的关联段落;
    利用预先训练好的语义分析模型,从所述发行企业实体的关联段落中提取发行企业实体和申购者实体之间的投资关系,及从所述代理商实体的关联段落中提取代理商实体和申购者实体之间的投资关系;
    根据预设的投资关系和类别的对应关系,对所述投资关系进行分类以及打上相应标签。
  11. 根据权利要求10所述的电子设备,其中,所述处理器在对所述投资关系进行分类以及打上相应标签之后,还用于:
    对预先获取的包括发行企业、代理商和申购者的多方数据进行谱系分析,以对所述投资关系进行真实性核查;
    根据真实性核查结果更新所述投资关系,并对更新后的投资关系进行分类以及打上相应标签。
  12. 根据权利要求10所述的电子设备,其中,所述处理器还用于:
    基于所述目标文档对应的类别及标签,根据预设报告模板生成审核报告。
  13. 根据权利要求12所述的电子设备,其中,所述处理器还用于:
    将所述审核报告和所述目标文档对应存储于数据库中。
  14. 一种计算机可读存储介质,其中存储有计算机可读指令,所述计算机可读指令可被处理器执行以实现以下操作:
    接收用户上传的目标文档,通过预设的自动摘要工具生成所述目标文档的文档摘要,并从所述文档摘要中提取目标关键词;
    根据提取的目标关键词的数量判断所述目标文档是否符合预设要求,所述预设要求为文档中包含的目标关键词的数量大于预设阈值;
    若所述目标文档符合预设要求,则基于预先训练的BERT模型对所述目标文档进行分类及打标签操作,并输出所述目标文档对应的类别及标签。
  15. 根据权利要求14所述的存储介质,其中,所述计算机可读指令可被处理器执行以实现以下操作:
    若所述目标文档不符合预设要求,则发出提示信息,以提示用户所述目标文档不符合要求,需重新上传。
  16. 根据权利要求14所述的存储介质,其中,所述计算机可读指令可被处理器执行以实现以下操作:
    将所述目标文档输入预先训练的BERT模型,得到所述目标文档中的发行企业实体、代理商实体和申购者实体;
    根据所述发行企业实体和所述代理商实体,在所述目标文档中找寻所述发行企业 实体的关联段落,及所述代理商实体的关联段落;
    利用预先训练好的语义分析模型,从所述发行企业实体的关联段落中提取发行企业实体和申购者实体之间的投资关系,及从所述代理商实体的关联段落中提取代理商实体和申购者实体之间的投资关系;
    根据预设的投资关系和类别的对应关系,对所述投资关系进行分类以及打上相应标签。
  17. 根据权利要求16所述的存储介质,其中,所述计算机可读指令可被处理器执行以实现以下操作:
    对预先获取的包括发行企业、代理商和申购者的多方数据进行谱系分析,以对所述投资关系进行真实性核查;
    根据真实性核查结果更新所述投资关系,并对更新后的投资关系进行分类以及打上相应标签。
  18. 根据权利要求16所述的存储介质,其中,所述计算机可读指令可被处理器执行以实现以下操作:
    基于所述目标文档对应的类别及标签,根据预设报告模板生成审核报告。
  19. 根据权利要求18所述的存储介质,其中,所述计算机可读指令可被处理器执行以实现以下操作:
    将所述审核报告和所述目标文档对应存储于数据库中。
  20. 一种文档处理装置,其中,包括:
    接收模块,用于接收用户上传的目标文档,通过预设的自动摘要工具生成所述目标文档的文档摘要,并从所述文档摘要中提取目标关键词;
    判断模块,用于根据提取的目标关键词的数量判断所述目标文档是否符合预设要求,所述预设要求为文档中包含的目标关键词的数量大于预设阈值;
    打标签模块,用于若所述目标文档符合预设要求,则基于预先训练的BERT模型对所述目标文档进行分类及打标签操作,并输出所述目标文档对应的类别及标签。
PCT/CN2021/096932 2020-12-28 2021-05-28 文档处理方法、装置、电子设备及存储介质 WO2022142116A1 (zh)

Applications Claiming Priority (2)

Application Number Priority Date Filing Date Title
CN202011583169.6 2020-12-28
CN202011583169.6A CN112732897A (zh) 2020-12-28 2020-12-28 文档处理方法、装置、电子设备及存储介质

Publications (1)

Publication Number Publication Date
WO2022142116A1 true WO2022142116A1 (zh) 2022-07-07

Family

ID=75606820

Family Applications (1)

Application Number Title Priority Date Filing Date
PCT/CN2021/096932 WO2022142116A1 (zh) 2020-12-28 2021-05-28 文档处理方法、装置、电子设备及存储介质

Country Status (2)

Country Link
CN (1) CN112732897A (zh)
WO (1) WO2022142116A1 (zh)

Cited By (2)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN116561348A (zh) * 2023-07-10 2023-08-08 中国电子科技集团公司第十五研究所 一种股东增减持信息抽取处理方法和系统
CN117891447A (zh) * 2024-03-14 2024-04-16 蒲惠智造科技股份有限公司 一种企业管理软件开发方法、装置、设备及介质

Families Citing this family (5)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN112732897A (zh) * 2020-12-28 2021-04-30 平安科技(深圳)有限公司 文档处理方法、装置、电子设备及存储介质
CN113361253B (zh) * 2021-05-28 2024-04-09 北京金山数字娱乐科技有限公司 识别模型训练方法及装置
CN113505217A (zh) * 2021-07-29 2021-10-15 永道科技有限公司 基于大数据实现工程造价数据库快速形成的方法和系统
CN116797329A (zh) * 2022-04-29 2023-09-22 朱芷叶 异常数据告警方法、装置、计算机设备及存储介质
CN115827940B (zh) * 2023-02-17 2024-01-26 北京网智易通科技有限公司 电子档案的归档方法、装置、电子设备和存储介质

Citations (9)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
US20170344625A1 (en) * 2016-05-27 2017-11-30 International Business Machines Corporation Obtaining of candidates for a relationship type and its label
CN109447412A (zh) * 2018-09-26 2019-03-08 平安科技(深圳)有限公司 构建企业关系图谱的方法、装置、计算机设备和存储介质
CN109446527A (zh) * 2018-10-26 2019-03-08 广东小天才科技有限公司 一种无意义语料的分析方法及系统
US20190197105A1 (en) * 2017-12-21 2019-06-27 International Business Machines Corporation Unsupervised neural based hybrid model for sentiment analysis of web/mobile application using public data sources
US20200111023A1 (en) * 2018-10-04 2020-04-09 Accenture Global Solutions Limited Artificial intelligence (ai)-based regulatory data processing system
CN111651552A (zh) * 2020-06-08 2020-09-11 中国工商银行股份有限公司 结构化信息确定方法、装置和电子设备
CN111859922A (zh) * 2020-07-31 2020-10-30 上海银行股份有限公司 实体关系抽取技术在银行风控中的应用方法
CN111968700A (zh) * 2020-07-07 2020-11-20 南京农业大学 一种基于bert的水稻表型组学知识图谱关系提取方法及系统
CN112732897A (zh) * 2020-12-28 2021-04-30 平安科技(深圳)有限公司 文档处理方法、装置、电子设备及存储介质

Family Cites Families (7)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN104360993B (zh) * 2014-11-19 2018-03-30 广州极盛信息科技开发有限公司 一种从文本提取所需内容的方法
CN109710918A (zh) * 2018-11-26 2019-05-03 平安科技(深圳)有限公司 舆情关系识别方法、装置、计算机设备和存储介质
CN109670788A (zh) * 2018-12-13 2019-04-23 平安医疗健康管理股份有限公司 基于数据分析的医保审核方法、装置、设备和存储介质
CN110413775A (zh) * 2019-06-25 2019-11-05 北京清博大数据科技有限公司 一种数据打标签分类方法、装置、终端及存储介质
CN110427623B (zh) * 2019-07-24 2021-09-21 深圳追一科技有限公司 半结构化文档知识抽取方法、装置、电子设备及存储介质
CN110781299B (zh) * 2019-09-18 2024-03-19 平安科技(深圳)有限公司 资产信息识别方法、装置、计算机设备及存储介质
CN111737416B (zh) * 2020-06-29 2022-08-19 重庆紫光华山智安科技有限公司 案件处理模型的训练方法、案件文本处理方法及相关装置

Patent Citations (9)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
US20170344625A1 (en) * 2016-05-27 2017-11-30 International Business Machines Corporation Obtaining of candidates for a relationship type and its label
US20190197105A1 (en) * 2017-12-21 2019-06-27 International Business Machines Corporation Unsupervised neural based hybrid model for sentiment analysis of web/mobile application using public data sources
CN109447412A (zh) * 2018-09-26 2019-03-08 平安科技(深圳)有限公司 构建企业关系图谱的方法、装置、计算机设备和存储介质
US20200111023A1 (en) * 2018-10-04 2020-04-09 Accenture Global Solutions Limited Artificial intelligence (ai)-based regulatory data processing system
CN109446527A (zh) * 2018-10-26 2019-03-08 广东小天才科技有限公司 一种无意义语料的分析方法及系统
CN111651552A (zh) * 2020-06-08 2020-09-11 中国工商银行股份有限公司 结构化信息确定方法、装置和电子设备
CN111968700A (zh) * 2020-07-07 2020-11-20 南京农业大学 一种基于bert的水稻表型组学知识图谱关系提取方法及系统
CN111859922A (zh) * 2020-07-31 2020-10-30 上海银行股份有限公司 实体关系抽取技术在银行风控中的应用方法
CN112732897A (zh) * 2020-12-28 2021-04-30 平安科技(深圳)有限公司 文档处理方法、装置、电子设备及存储介质

Cited By (3)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN116561348A (zh) * 2023-07-10 2023-08-08 中国电子科技集团公司第十五研究所 一种股东增减持信息抽取处理方法和系统
CN116561348B (zh) * 2023-07-10 2023-09-19 中国电子科技集团公司第十五研究所 一种股东增减持信息抽取处理方法和系统
CN117891447A (zh) * 2024-03-14 2024-04-16 蒲惠智造科技股份有限公司 一种企业管理软件开发方法、装置、设备及介质

Also Published As

Publication number Publication date
CN112732897A (zh) 2021-04-30

Similar Documents

Publication Publication Date Title
WO2022142116A1 (zh) 文档处理方法、装置、电子设备及存储介质
Banga The green bond market: a potential source of climate finance for developing countries
Kirchmer et al. Value-Driven Robotic Process Automation (RPA) A Process-Led Approach to Fast Results at Minimal Risk
Clack et al. Smart contract templates: foundations, design landscape and research directions
Abdullah et al. A'macro'analysis of the use of XBRL
US20200050620A1 (en) Artificially intelligent system employing modularized and taxonomy-based classifications to generated and predict compliance-related content
Harris et al. An evaluation of the current state and future of XBRL and interactive data for investors and analysts
CN110383319A (zh) 大规模异构数据摄取和用户解析
AU2014343044B2 (en) Method and system for document data extraction template management
Strandvik Towards a more structured approach to quality assurance: DGT’s quality journey
JP2019133645A (ja) 構造化文書の内容をチャットベースのインタラクションに翻訳するための半自動化された方法及びシステム、プログラム
CN116150613A (zh) 信息抽取模型训练方法、信息抽取方法及装置
Amaral et al. A model-based conceptualization of requirements for compliance checking of data processing against GDPR
Frame Corporate social responsibility: A challenge for the donor community
Ashok Role of XBRL in promoting the integrated reporting in Indian scenario
Chen et al. From natural language to accounting entries using a natural language processing method
Rizkiana et al. The development of composite sentiment index in Indonesia based on the internet-available data
Hu et al. Analysts’ Green Coverage and Corporate Green Innovation in China: The Moderating Effect of Corporate Environmental Information Disclosure
CN110347921A (zh) 一种多模态数据信息的标签抽取方法及装置
Hamdan et al. The Advantages and Difficulties of Using AI and BT in the Auditing Procedures: A Literature Review
Dehdar et al. How important is institutional quality to lower energy utilization: an analysis using a global panel of stable and unstable countries
KR102642391B1 (ko) 부동산 개발 프로젝트 관리 서비스 제공 시스템
WO2019019456A1 (zh) 理赔数据处理方法、装置、计算机设备和存储介质
Shen et al. Public opinion toward social business from a social media perspective
US20230214893A1 (en) Improved system and method for automating business accounting

Legal Events

Date Code Title Description
121 Ep: the epo has been informed by wipo that ep was designated in this application

Ref document number: 21912877

Country of ref document: EP

Kind code of ref document: A1

NENP Non-entry into the national phase

Ref country code: DE

122 Ep: pct application non-entry in european phase

Ref document number: 21912877

Country of ref document: EP

Kind code of ref document: A1