CN116629230A - Document processing method, device, equipment and storage medium - Google Patents

Document processing method, device, equipment and storage medium Download PDF

Info

Publication number
CN116629230A
CN116629230A CN202310602834.9A CN202310602834A CN116629230A CN 116629230 A CN116629230 A CN 116629230A CN 202310602834 A CN202310602834 A CN 202310602834A CN 116629230 A CN116629230 A CN 116629230A
Authority
CN
China
Prior art keywords
document
bill
result
check result
checking
Prior art date
Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
Pending
Application number
CN202310602834.9A
Other languages
Chinese (zh)
Inventor
张敏华
齐蓉
钟玉兴
王虹
Current Assignee (The listed assignees may be inaccurate. Google has not performed a legal analysis and makes no representation or warranty as to the accuracy of the list.)
Industrial and Commercial Bank of China Ltd ICBC
Original Assignee
Industrial and Commercial Bank of China Ltd ICBC
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
Filing date
Publication date
Application filed by Industrial and Commercial Bank of China Ltd ICBC filed Critical Industrial and Commercial Bank of China Ltd ICBC
Priority to CN202310602834.9A priority Critical patent/CN116629230A/en
Publication of CN116629230A publication Critical patent/CN116629230A/en
Pending legal-status Critical Current

Links

Classifications

    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06FELECTRIC DIGITAL DATA PROCESSING
    • G06F40/00Handling natural language data
    • G06F40/10Text processing
    • G06F40/194Calculation of difference between files
    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06FELECTRIC DIGITAL DATA PROCESSING
    • G06F40/00Handling natural language data
    • G06F40/10Text processing
    • G06F40/103Formatting, i.e. changing of presentation of documents
    • G06F40/106Display of layout of documents; Previewing
    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06QINFORMATION AND COMMUNICATION TECHNOLOGY [ICT] SPECIALLY ADAPTED FOR ADMINISTRATIVE, COMMERCIAL, FINANCIAL, MANAGERIAL OR SUPERVISORY PURPOSES; SYSTEMS OR METHODS SPECIALLY ADAPTED FOR ADMINISTRATIVE, COMMERCIAL, FINANCIAL, MANAGERIAL OR SUPERVISORY PURPOSES, NOT OTHERWISE PROVIDED FOR
    • G06Q40/00Finance; Insurance; Tax strategies; Processing of corporate or income taxes
    • G06Q40/02Banking, e.g. interest calculation or account maintenance
    • YGENERAL TAGGING OF NEW TECHNOLOGICAL DEVELOPMENTS; GENERAL TAGGING OF CROSS-SECTIONAL TECHNOLOGIES SPANNING OVER SEVERAL SECTIONS OF THE IPC; TECHNICAL SUBJECTS COVERED BY FORMER USPC CROSS-REFERENCE ART COLLECTIONS [XRACs] AND DIGESTS
    • Y02TECHNOLOGIES OR APPLICATIONS FOR MITIGATION OR ADAPTATION AGAINST CLIMATE CHANGE
    • Y02DCLIMATE CHANGE MITIGATION TECHNOLOGIES IN INFORMATION AND COMMUNICATION TECHNOLOGIES [ICT], I.E. INFORMATION AND COMMUNICATION TECHNOLOGIES AIMING AT THE REDUCTION OF THEIR OWN ENERGY USE
    • Y02D10/00Energy efficient computing, e.g. low power processors, power management or thermal management

Landscapes

  • Engineering & Computer Science (AREA)
  • Theoretical Computer Science (AREA)
  • Physics & Mathematics (AREA)
  • Business, Economics & Management (AREA)
  • General Physics & Mathematics (AREA)
  • General Health & Medical Sciences (AREA)
  • Finance (AREA)
  • General Engineering & Computer Science (AREA)
  • Computational Linguistics (AREA)
  • Audiology, Speech & Language Pathology (AREA)
  • Artificial Intelligence (AREA)
  • Accounting & Taxation (AREA)
  • Health & Medical Sciences (AREA)
  • Development Economics (AREA)
  • Economics (AREA)
  • Marketing (AREA)
  • Strategic Management (AREA)
  • Technology Law (AREA)
  • General Business, Economics & Management (AREA)
  • Management, Administration, Business Operations System, And Electronic Commerce (AREA)

Abstract

The disclosure provides a bill processing method, device, equipment and storage medium, which can be applied to the technical field of data processing or the technical field of finance and technology. The method comprises the following steps: acquiring a bill identifier and target text information in at least one effective area from an image of a bill to be processed; retrieving a check source bill from a preset database according to the bill identification, wherein the check source bill and the bill to be processed have the same type, and check information is filled in the check source bill; checking the target text information according to the checking information to obtain checking results of the documents to be processed, wherein the checking results comprise an integrity checking result and a consistency checking result; and rendering the integrity check result and the consistency check result on a visual page, wherein the visual page is formed by a check result main display page, and an integrity check result sub-display page and a consistency check result sub-display page which are mounted under the check result main display page.

Description

Document processing method, device, equipment and storage medium
Technical Field
The present disclosure relates to the field of data processing technologies or financial technologies, and in particular, to a document processing method, device, equipment and storage medium.
Background
When a banking staff transacts international business under a documentary credit, the bill, invoice, insurance bill, boxing bill, origin certificate, checking certificate and other bills submitted by a beneficiary of the credit card or a legal holder of the bill need to be checked within a specified time so as to ensure that the bill is consistent and the bill is consistent.
In the process of implementing the inventive concept of the present disclosure, the inventor found that the following problems generally exist in the related art: the existing bill auditing is generally manually audited, the manual annotation of the bill does not accord with the point, the auditing flow is complicated, the auditing result cannot be intuitively displayed, and the bill processing efficiency and the intelligent degree are reduced.
Disclosure of Invention
In view of the foregoing, the present disclosure provides document processing methods, apparatuses, devices, storage media, and program products.
One aspect of the present disclosure provides a document processing method, including: acquiring a bill identifier and target text information in at least one effective area from an image of a bill to be processed; retrieving a check source bill from a preset database according to the bill identification, wherein the check source bill and the bill to be processed are of the same type, and check information is filled in the check source bill; checking the target text information according to the checking information to obtain a checking result of the document to be processed, wherein the checking result comprises an integrity checking result and a consistency checking result; and rendering the integrity check result and the consistency check result on a visual page, wherein the visual page is formed by a check result main display page, an integrity check result sub-display page and a consistency check result sub-display page which are mounted under the check result main display page.
According to an embodiment of the disclosure, the document to be processed has a plurality of documents, the target text information includes document titles, and each document to be processed corresponds to the document title; the step of checking the text information according to the checking information to obtain the checking result of the document to be processed comprises the following steps: generating a bill title list to be checked based on the bill titles corresponding to each bill to be processed; extracting at least one check bill title in the check information to obtain a check title list; checking the bill title list to be checked according to the check title list; and obtaining the integrity check result.
According to the embodiment of the disclosure, the to-be-processed bills have a plurality of bills, the target text information comprises bill element names and bill element value sets, each bill to be processed comprises a plurality of bill element names, and each bill element name corresponds to the bill element value set; the step of checking the text information according to the checking information to obtain the checking result of the document to be processed comprises the following steps: extracting a plurality of check document element names in the check information and a check document element value set corresponding to each of the plurality of check document element names; for each of the plurality of verification document element names: based on each checking receipt element name, searching a target receipt element name which is the same as the checking receipt element name from a plurality of receipt element names included in each receipt to be processed; checking a target document element value set corresponding to the target document element name with the checking document element value set to obtain a document element value checking result; traversing and checking each of the check document element names in a plurality of check document element names until each check document element name obtains a document element value checking result; and determining the consistency check result according to the document element value check result.
According to an embodiment of the present disclosure, the above-described collation result main presentation page has a collation result main presentation component corresponding to that created by a preset framework; the rendering the integrity check result and the consistency check result on the visual page comprises: transmitting the integrity check result and the consistency check result to the check result main display component in response to a request for acquiring the check result; binding the integrity check result and the consistency check result into a state component of the main display component; triggering a rendering component of the preset frame under the condition that the state component completes state updating based on the received integrity check result and the consistency check result; and respectively rendering the integrity check result and the consistency check result to the integrity check result sub-display page and the consistency check result sub-display page through the rendering component.
According to the embodiment of the disclosure, an integrity check result sub-display component and a consistency check result sub-display component are mounted on a virtual DOM node of the check result main display component, wherein the integrity check result sub-display component corresponds to the integrity check result sub-display page, and the consistency check result sub-display component corresponds to the consistency check result sub-display page; the rendering, by the rendering component, the integrity check result and the consistency check result to the integrity check result sub-display page and the consistency check result sub-display page respectively includes: calculating difference information of the virtual DOM nodes after the integrity check result sub-display assembly and the consistency check result sub-display assembly respectively receive the integrity check result and the consistency check result through the rendering assembly; and updating the difference information of the virtual DOM nodes to the integrity check result sub-display page and the consistency check result sub-display page respectively.
According to an embodiment of the present disclosure, the acquiring the document identifier and the target text information in the at least one effective area from the image of the document to be processed includes: based on a preset information identification method, carrying out identification processing on the image of the document to be processed to obtain the document identification and an identification image of the document to be processed, wherein the identification image comprises a plurality of identification areas; determining at least one effective area from the identification areas, wherein text information is recorded in the effective area; extracting the target text information from the text information based on the feature parameters of the document, wherein the feature parameters of the document comprise at least one of the following: document title feature parameters, document element name feature parameters, and document element value feature parameters.
According to an embodiment of the present disclosure, the above method further includes: performing key value pair conversion processing on the target text information to obtain structured target text information; checking the structured target text information according to the checking information to obtain a checking result of the document to be processed; and rendering the checking result of the bill to be processed on the visual page.
Another aspect of the present disclosure also provides a document processing apparatus, including: the acquisition module is used for acquiring the bill identification and the target text information in at least one effective area from the image of the bill to be processed; the retrieving module is used for retrieving the check source bill from a preset database according to the bill identifier, wherein the check source bill is of the same type as the bill to be processed, and check information is filled in the check source bill; the first checking module is used for checking the target text information according to the checking information to obtain a checking result of the bill to be processed, wherein the checking result comprises an integrity checking result and a consistency checking result; the first rendering module is used for rendering the integrity check result and the consistency check result on a visual page, wherein the visual page is formed by a check result main display page, an integrity check result sub-display page mounted under the check result main display page and a consistency check result sub-display page.
Another aspect of the present disclosure also provides an electronic device, including: one or more processors; and a storage device for storing one or more programs, wherein the one or more programs, when executed by the one or more processors, cause the one or more processors to perform the document processing method.
Another aspect of the present disclosure also provides a computer-readable storage medium having stored thereon executable instructions that, when executed by a processor, cause the processor to perform the document processing method described above.
Another aspect of the disclosure also provides a computer program product comprising a computer program which, when executed by a processor, implements the document processing method described above.
According to the bill processing method, the device, the equipment, the storage medium and the program product provided by the embodiment of the disclosure, the bill identification and the target text information are obtained from the image of the bill to be processed; retrieving a check source bill from a preset database according to the bill identification; checking the target text information according to the checking information on the checking source bill to obtain an integrity checking result and a consistency checking result; rendering the integrity check result and the consistency check result on a visual page. The whole process can be automatically carried out in the bill auditing process, and the auditing result is displayed, so that the problems that the auditing efficiency of the related technology is low and the auditing result cannot be intuitively evaluated are at least partially solved, and the technical effects of improving the bill processing efficiency and the intelligent degree are achieved.
Drawings
The foregoing and other objects, features and advantages of the disclosure will be more apparent from the following description of embodiments of the disclosure with reference to the accompanying drawings, in which:
FIG. 1 schematically illustrates a system architecture diagram of a document processing method and apparatus according to an embodiment of the present disclosure;
FIG. 2 schematically illustrates a flow chart of a document processing method according to an embodiment of the present disclosure;
FIG. 3 schematically illustrates a rendered visual page diagram according to an embodiment of the present disclosure;
FIG. 4 schematically illustrates a flow chart of a document processing method according to another embodiment of the present disclosure;
FIG. 5 schematically illustrates a flow chart of a document processing method according to yet another embodiment of the present disclosure;
FIG. 6 schematically illustrates a block diagram of a document processing system according to an embodiment of the present disclosure;
FIG. 7 schematically illustrates a block diagram of a document processing apparatus according to an embodiment of the present disclosure; and
fig. 8 schematically illustrates a block diagram of an electronic device adapted to implement a document processing method according to an embodiment of the disclosure.
Detailed Description
Hereinafter, embodiments of the present disclosure will be described with reference to the accompanying drawings. It should be understood that the description is only exemplary and is not intended to limit the scope of the present disclosure. In the following detailed description, for purposes of explanation, numerous specific details are set forth in order to provide a thorough understanding of the embodiments of the present disclosure. It may be evident, however, that one or more embodiments may be practiced without these specific details. In addition, in the following description, descriptions of well-known structures and techniques are omitted so as not to unnecessarily obscure the concepts of the present disclosure.
The terminology used herein is for the purpose of describing particular embodiments only and is not intended to be limiting of the disclosure. The terms "comprises," "comprising," and/or the like, as used herein, specify the presence of stated features, steps, operations, and/or components, but do not preclude the presence or addition of one or more other features, steps, operations, or components.
All terms (including technical and scientific terms) used herein have the same meaning as commonly understood by one of ordinary skill in the art unless otherwise defined. It should be noted that the terms used herein should be construed to have meanings consistent with the context of the present specification and should not be construed in an idealized or overly formal manner.
Where expressions like at least one of "A, B and C, etc. are used, the expressions should generally be interpreted in accordance with the meaning as commonly understood by those skilled in the art (e.g.," a system having at least one of A, B and C "shall include, but not be limited to, a system having a alone, B alone, C alone, a and B together, a and C together, B and C together, and/or A, B, C together, etc.).
In the technical scheme of the disclosure, the related data (such as including but not limited to personal information of a user) are collected, stored, used, processed, transmitted, provided, disclosed, applied and the like, all conform to the regulations of related laws and regulations, necessary security measures are adopted, and the public welcome is not violated.
At present, the main mode of checking the bill is to manually check the bill image, manually annotate the bill without points, the checking flow is complicated, the feedback description of the checking result is quite different, and the checking quality cannot be visually evaluated. In addition, although the existing information recognition technology can extract the content text of the document, the result information of clearly checking the document cannot be directly provided. When a banking staff examines a bill, whether all contents of the bill are consistent with each other needs to be carefully checked, and the conventional bill text recognition engine cannot provide visual result information of the checked bill which accords with the rule of checking the bill. In order to improve the quality and efficiency of checking the bill, a bill processing method needs to be provided, which is used for realizing intelligent checking of the bill, displaying the bill checking result, clearly positioning each bill element, and automatically checking whether the bill is consistent with the follow-up credit card or not, thereby improving the service processing efficiency.
In view of this, embodiments of the present disclosure provide a document processing method, apparatus, device, storage medium, and program product, which are used to improve document processing efficiency and intelligence, and further improve business processing efficiency. Specifically, the method comprises the following steps: acquiring a bill identifier and target text information in at least one effective area from an image of a bill to be processed; retrieving a check source bill from a preset database according to the bill identification, wherein the check source bill and the bill to be processed have the same type, and check information is filled in the check source bill; checking the target text information according to the checking information to obtain checking results of the documents to be processed, wherein the checking results comprise an integrity checking result and a consistency checking result; and rendering the integrity check result and the consistency check result on a visual page, wherein the visual page is formed by a check result main display page, and an integrity check result sub-display page and a consistency check result sub-display page which are mounted under the check result main display page.
It should be noted that the document processing method and apparatus determined in the embodiments of the present disclosure may be used in the field of data processing technology or the field of financial technology, and may also be used in any field other than the field of data processing technology or the field of financial technology, where the application field of the determined document processing method and apparatus in the embodiments of the present disclosure is not limited.
FIG. 1 schematically illustrates a system architecture diagram of a document processing method and apparatus according to an embodiment of the present disclosure.
As shown in fig. 1, a system architecture 100 according to this embodiment may include a first terminal device 101, a second terminal device 102, a third terminal device 103, a network 104, and a server 105. The network 104 is a medium used to provide a communication link between the first terminal device 101, the second terminal device 102, the third terminal device 103, and the server 105. The network 104 may include various connection types, such as wired, wireless communication links, or fiber optic cables, among others.
The user may interact with the server 105 via the network 104 using at least one of the first terminal device 101, the second terminal device 102, the third terminal device 103 to receive or send documents to be audited, etc. Various communication client applications, such as an application for handling international business categories, an application capable of generating a document category, a financial category application, a shopping category application, a web browser application, a search category application, an instant messaging tool, a mailbox client, social platform software, etc. (only examples) may be installed on the first terminal device 101, the second terminal device 102, the third terminal device 103.
The first terminal device 101, the second terminal device 102, the third terminal device 103 may be various electronic devices having a display screen that also support web browsing, including but not limited to smartphones, tablets, laptop and desktop computers, and the like.
The server 105 may be a server providing various services, such as a background management server (by way of example only) providing an audit of documents sent by the user using the first terminal device 101, the second terminal device 102, and the third terminal device 103. The background management server can analyze and the like the received data of the document to be checked and the like, and feed back the processing result (such as checking result, web page, information, data or the like obtained or generated according to the document to be checked) to the terminal equipment. Specifically, the server 105 may obtain a document identifier and target text information in at least one effective area from an image of a document to be processed; retrieving a check source bill from a preset database according to the bill identification, wherein the check source bill and the bill to be processed have the same type, and check information is filled in the check source bill; checking the target text information according to the checking information to obtain checking results of the documents to be processed, wherein the checking results comprise an integrity checking result and a consistency checking result; and rendering the integrity check result and the consistency check result on a visual page, wherein the visual page is formed by a check result main display page, and an integrity check result sub-display page and a consistency check result sub-display page which are mounted under the check result main display page.
It should be noted that the document processing method provided in the embodiments of the present disclosure may be generally executed by the server 105. Accordingly, the document processing apparatus provided in the embodiments of the present disclosure may be generally disposed in the server 105. The document processing method provided by the embodiment of the present disclosure may also be performed by a server or a server cluster that is different from the server 105 and is capable of communicating with the first terminal device 101, the second terminal device 102, the third terminal device 103, and/or the server 105. Accordingly, the document processing apparatus provided by the embodiments of the present disclosure may also be provided in a server or a server cluster that is different from the server 105 and is capable of communicating with the first terminal device 101, the second terminal device 102, the third terminal device 103, and/or the server 105.
It should be understood that the number of terminal devices, networks and servers in fig. 1 is merely illustrative. There may be any number of terminal devices, networks, and servers, as desired for implementation.
The document processing method of the disclosed embodiment will be described in detail with reference to fig. 2 to 6 based on the system architecture described in fig. 1.
FIG. 2 schematically illustrates a flow chart of a document processing method according to an embodiment of the present disclosure.
As shown in fig. 2, the document processing of this embodiment includes operations S210 to S240.
In operation S210, a document identifier and target text information in at least one effective area are acquired from an image of a document to be processed.
In operation S220, according to the document identifier, a check source document is retrieved from a preset database, where the check source document has the same type as the document to be processed, and check information is filled in the check source document.
In operation S230, the target text information is checked according to the check information, and a check result of the document to be processed is obtained, wherein the check result includes an integrity check result and a consistency check result.
In operation S240, the integrity check result and the consistency check result are rendered onto a visual page, wherein the visual page is formed by a check result main display page, and an integrity check result sub-display page and a consistency check result sub-display page mounted under the check result main display page.
According to embodiments of the present disclosure, the documents to be processed may include documents submitted by a credit card beneficiary or a document legal holder, invoices, security documents, case slips, origin certificates, verification certificates, and the like. Documents may refer to documents, files and certificates in international settlement whereby transactions such as payment, transportation, insurance, business, collection, etc. of international goods are processed. It can be understood that the document to be processed is not only suitable for international transaction service, but also suitable for non-international transaction service, and particularly can be adaptively adjusted according to actual needs.
According to an embodiment of the present disclosure, the image of the document to be processed may refer to an image obtained by scanning the document to be processed through an OCR recognition method, a form detection method, an image feature recognition method, and the like.
According to the embodiment of the disclosure, the bill identification is used for identifying the bill to be processed, specifically, the bill identification can be located at a certain fixed position of the whole bill, and when the bill is identified by utilizing the OCR method, the bill identification can be obtained by reading the information of the fixed position, so that the check source file can be conveniently searched for later. It will be appreciated that the fixed location of the tag may be outside the active area or within the active area.
According to embodiments of the present disclosure, an active area may be understood as an area where text information exists. The target text information may be text information that is valid among these text information, such as a transaction account, a transaction amount, a transaction channel, information of both parties to the transaction, an invoice number, and the like.
According to an embodiment of the present disclosure, the preset database is used to store a verification source document, where the verification source document may be understood as a document capable of being used as a verification standard, for example, a verification source file generated according to transaction service M information required by a credit card, and the transaction service M information required by the credit card may be used as verification information. Or generating a check source bill according to the information filled by the user in the process of handling the business or registering for the first time, and endowing the check source bill with a check bill identifier and storing the check source bill in a preset database. Wherein, the information filled by the user, verification information obtained according to the information filled by the user, and the like can be used as verification information. It will be appreciated that the verification document identification may be based on an identification of the document type, a credit card beneficiary or a document legal holder identification. The identification of the document type is, for example, a field or a number corresponding to the type of bill of lading, invoice, insurance document, packing bill, certificate of origin, certificate of inspection, or the like. The identification of the credit card beneficiary or the legal document holder is obtained according to the information filled by the credit card beneficiary or the legal document holder.
According to the embodiment of the disclosure, the check-source document and the document to be processed have the same type, which may mean that the check-source document and the document of the document to be processed have the same document type, and the filling information of the credit card beneficiary or the document legal holder is the same. Specifically, the document identifier of the document to be processed and the verification document identifier may have the same structure, so that the verification source document can be quickly found from the preset database according to the document identifier, and information verification is performed. It should be noted that, the processes of obtaining the information used for filling, the identification of the credit card beneficiary or the legal holder of the document and storing the verification source document are all carried out after the permission of the user is obtained, and the processing process of the user information in the embodiment of the disclosure accords with the regulations of the related laws and regulations, takes necessary security measures and does not violate the public order harmony.
According to embodiments of the present disclosure, the integrity check result may refer to whether the number of documents is consistent with the number of documents required by the letter of credit, if the number is consistent, the integrity is deemed to be complete, and if the number is inconsistent, the incompleteness is deemed to be incomplete. The consistency check result may refer to whether the element value corresponding to each element in each document to be processed is the same as the element value corresponding to each element in the document required by the credit card, for example, taking an element as an invoice number as an example, checking whether the invoice number in the document to be processed is the same as an invoice number filled in the document of the credit card, and the like, if the invoice number is the same, the invoice number is considered to be consistent, and if the check result of each element on the document to be processed is the same, the document to be processed is considered to be consistent with the document required by the credit card.
According to an embodiment of the present disclosure, the visualization page is used to present the integrity check result and the consistency check result. The visual page can be composed of a checking result main display page, an integrity checking result sub-display page and a consistency checking result sub-display page which are mounted under the checking result main display page. The checking result main display page can be used for displaying the current total checking result, the integrity checking result sub-display page is used for displaying the integrity checking result, and the consistency checking result sub-display page is used for displaying the consistency checking result.
According to the bill processing method, the device, the equipment, the storage medium and the program product provided by the embodiment of the disclosure, the bill identification and the target text information are obtained from the image of the bill to be processed; retrieving a check source bill from a preset database according to the bill identification; checking the target text information according to the checking information on the checking source bill to obtain an integrity checking result and a consistency checking result; rendering the integrity check result and the consistency check result on a visual page. The whole process can be automatically carried out in the bill auditing process, and the auditing result is displayed, so that the problems that the auditing efficiency of the related technology is low and the auditing result cannot be intuitively evaluated are at least partially solved, and the technical effects of improving the bill processing efficiency and the intelligent degree are achieved.
According to an embodiment of the present disclosure, operation S210 may include the following operations: based on a preset information identification method, carrying out identification processing on an image of a document to be processed to obtain a document identification and an identification image of the document to be processed, wherein the identification image comprises a plurality of identification areas; determining at least one effective area from the identification areas, wherein text information is recorded in the effective area; extracting target text information from the text information based on feature parameters of the document, wherein the feature parameters of the document comprise at least one of the following: document title feature parameters, document element name feature parameters, and document element value feature parameters.
According to an embodiment of the disclosure, the method further includes performing key value pair conversion processing on the target text information to obtain structured target text information; checking the structured target text information according to the checking information to obtain a checking result of the document to be processed; and rendering the checking result of the document to be processed on the visual page.
According to embodiments of the present disclosure, the preset information recognition method may include OCR recognition technology, form detection, image feature recognition, and the like. For example, based on OCR technology, acquiring document images to be audited, performing intelligent image recognition processing, acquiring a document title, a document element name and an element value set according to the recognized text position and the document element characteristics, converting the recognized information into structural information of key value pairs, and storing the structural information. The processing steps of operation S210 include, but are not limited to, the steps of actually and intelligently identifying the document image to be audited: OCR text detection, form detection, image feature recognition, extraction of structured information based on path analysis and semantic understanding, and the like. And outputting the structural information of the document title, the document element name and the key value pair corresponding to the element value by acquiring the document image under a set of externally input credit card items, and storing the structural information in a database of an application system after outputting the structural information.
Illustratively, the extracted information may include "document title: a bill of lading; document element field name: a shipper; document element field value: xxx ", may also include" document title: an invoice; document element field name: a principal; document element field value: xxx, xxx.
According to an embodiment of the present disclosure, operation S230 may include the following operations: generating a bill title list to be checked based on the bill titles corresponding to each bill to be processed; extracting at least one check bill title in the check information to obtain a check title list; checking the bill title list to be checked according to the checking title list; and obtaining an integrity check result.
According to the embodiment of the disclosure, the document to be processed may have a plurality of documents, the target text information may include a document title, and each document to be processed may correspond to the document title, that is, each document to be processed may have a document title. The verification information may include a plurality of verification documents, each of which may have a verification document header.
According to the embodiment of the disclosure, a bill title list is extracted from a target text information based on the extracted target text information, and then the bill title list is compared with a bill type list required by a credit card, so that an integrity checking result of the bill is obtained. If the transaction service M is required to be transacted on a follow-up ticket credit card, for example, the follow-up ticket credit card is required to have a bill, a box ticket, a sanitary card and a production place certificate, the verification title list contains the bill, the box ticket, the sanitary card and the production place certificate title, but the to-be-checked bill title list identified by the intelligent image comprises the bill, the box ticket and the production place certificate title, but no sanitary card is identified, and the integrity check result of the bill is "lack of the sanitary card bill". If the bill title list to be checked identified by the intelligent image comprises a bill of lading, a box bill, a sanitation certificate and a production place certificate bill title, the integrity checking result of the bill is 'the bill is complete'.
According to an embodiment of the present disclosure, operation S230 may further include the following operations: extracting a plurality of check document element names in the check information and a check document element value set corresponding to each check document element name in the plurality of check document element names; verifying the document element name for each of a plurality of verification document element names: based on each check document element name, searching a target document element name which is the same as the check document element name from a plurality of document element names included in each document to be processed; checking a target bill element value set corresponding to the target bill element name with the checking bill element value set to obtain a bill element value checking result; traversing and checking each check document element name in the check document element names until each check document element name obtains a document element value checking result; and determining a consistency check result according to the document element value check result.
According to the embodiment of the disclosure, the document to be processed can have a plurality of documents, the target text information can further comprise document element names and document element value sets, each document to be processed can comprise a plurality of document element names, and each document element name can correspond to a document element value set. For example, the document element name on each document may include an invoice number, a shipper, a receiver, a shipment port, an unloading port, and the like, and a specific number corresponding to the invoice number, a specific shipper information corresponding to the shipper, a specific receiver information corresponding to the receiver, a specific shipment port information corresponding to the shipment port, a specific unloading port information corresponding to the unloading port, and the like may be used as the element value set.
According to the embodiment of the disclosure, based on the extracted target text information, extracting a document element and a corresponding element value from a target text information seed, traversing the same document element of different documents to be processed by taking a checking document element name and a document element value set on a checking document as references, comparing whether the document element value set is consistent with the checking document element value set, if the document element value set is consistent with the checking document element value set, checking the element value associated with the checking document element name, otherwise, judging that the element value is inconsistent. And finally traversing all the check document element names and the document element value sets until each check document element name is associated with at least one element value auditing result. Judging whether the auditing result of each element value is consistent in comparison, if so, the consistency auditing result of the bill is consistent, otherwise, the auditing result is inconsistent.
According to the embodiment of the disclosure, the integrity auditing result and the consistency auditing result obtained by auditing a set of documents can be stored in the auditing result database, so that the integrity auditing result and the consistency auditing result can be conveniently obtained from the database in the subsequent rendering process. It can be understood that the integrity auditing result and the consistency auditing result are stored in the database, so that the abnormal condition of the bill can be conveniently repeated, and when the number of abnormal results in the integrity auditing result and the consistency auditing result reaches a certain number, the relevant business personnel and the relevant users can be notified to remind the business personnel to contact the users for confirmation so as to improve the safety of the transaction.
According to the embodiment of the disclosure, the integrity check result and the consistency check result of the bill to be checked can be automatically checked by the method, so that the working pressure of service personnel is reduced, the check error rate is reduced, the intelligent degree of bill processing is improved, and the bill processing efficiency and service processing efficiency are improved.
According to an embodiment of the present disclosure, the collation result master presentation page has a collation result master presentation component corresponding to that created by the preset framework. Operation S240 may include the following operations: in response to a request for obtaining the verification result, sending an integrity verification result and a consistency verification result to a verification result main display component; binding the integrity check result and the consistency check result into a state component of the main display component; triggering a rendering component of the preset frame under the condition that the state component completes state updating based on the received integrity check result and consistency check result; and respectively rendering the integrity check result and the consistency check result to an integrity check result sub-display page and a consistency check result sub-display page through a rendering component.
According to the embodiment of the disclosure, an integrity check result sub-display component and a consistency check result sub-display component are mounted on a virtual DOM node of a check result main display component, wherein the integrity check result sub-display component corresponds to an integrity check result sub-display page, and the consistency check result sub-display component corresponds to a consistency check result sub-display page. Rendering, by a rendering component, the integrity check result and the consistency check result to an integrity check result sub-display page and a consistency check result sub-display page respectively, including: calculating difference information of the virtual DOM nodes after the integrity check result sub-display assembly and the consistency check result sub-display assembly respectively receive the integrity check result and the consistency check result through the rendering assembly; and respectively updating the difference information of the virtual DOM nodes to an integrity check result sub-display page and a consistency check result sub-display page.
According to embodiments of the present disclosure, the preset framework may include a react framework (global wide area network development framework). The reactidom. Render based on the reacti framework may create a document verification result main presentation component and mount an integrity verification result sub-presentation component, a consistency verification result sub-presentation component, to a virtual DOM (document object model ) node of the main component.
According to the embodiment of the disclosure, based on the life cycle mechanism of the reaction, an asynchronous HTTP request is initiated in the life cycle componentDidMount of the verification result main display component, to obtain the verification result and the target text information, and to obtain the document title, the document element name and the element value set obtained in operation S210, and the integrity verification result and the consistency verification result obtained in operation S220.
According to the embodiment of the disclosure, by initiating the request for acquiring the verification result and the target text information in the life cycle mechanism, it can be indicated that a certain time is required to wait for acquiring the verification result and the target text information, and the verification result and the target text information are not fed back immediately generally because of large information quantity, and even if the feedback is performed immediately, the problem of insufficient feedback information can occur, thereby causing insufficient rendering of the result and reducing the service processing efficiency and accuracy. Therefore, a certain life cycle is required to be set, and the checking result and the target text information are fed back within a certain time, so that the problem of incomplete information feedback is avoided, and the efficiency and accuracy of service processing are improved.
According to the embodiment of the disclosure, the acquired document title, document element name and element value set, integrity check result and consistency check result are bound into the component State of the check result main display component, a rendering mechanism of the action component is triggered after the State is updated, and audit result data is rendered into the integrity check result sub-display component and the consistency check result sub-display component.
Fig. 3 schematically illustrates a rendered visual page diagram according to an embodiment of the present disclosure.
According to the embodiment of the disclosure, a compact check result main display assembly is created based on a compact development mode, a unique credit number is bound to an external state value tips of the assembly, an HTTP asynchronous request is initiated in the assembly according to the unique credit number to acquire document element structural information under a credit item output by an operation S210 and audit result information of an operation S220, a compact method is mobilized to update the assembly state after a request result is returned, a document title, a document element name, a document element value set, a consistency check result and an integrity check result are mapped to an integrity check result sub-display assembly and a consistency check result sub-display assembly, a compact rendering mechanism is triggered, differences of various virtual DOM nodes after the state of the assembly is calculated, and the changes are updated to a page. The rendering results may be as shown in fig. 3.
As shown in fig. 3, the visualization page 300 may include a collation result main presentation page 301, an integrity collation result sub-presentation page 302, and a consistency collation result sub-presentation page 303. In the integrity check result, there may be a complete document or a lack of xx documents, both of which result. In the consistency check result, there may be both a consistency and a non-consistency result. The specific consistency check result may be presented through a table in the consistency check result sub-presentation page 303.
According to the embodiment of the disclosure, the problem that the related technology cannot directly provide the clear receipt check result is at least partially solved by rendering the integrity check result and the consistency check result on the visual page.
FIG. 4 schematically illustrates a flow chart of a document processing method according to another embodiment of the present disclosure.
As shown in fig. 4, the document processing method of this embodiment may include operations S410 to S450.
In operation S410, based on the OCR recognition technology, the document image to be audited is acquired and intelligent image recognition processing is performed, and the document title, the document element name and the element value are acquired according to the recognized text position and the document element feature, and converted into structural information of key value pairs and stored.
In operation S420, comparing the document title identified in operation S410 with the document requirement of the credit card, and obtaining the integrity check result of the document; consistency comparison is carried out on the bill elements identified in the operation S410, and a consistency auditing result of the bill is obtained; and storing the integrity and consistency auditing results.
In operation S430, a main component of the document audit result is created based on the reactidom. Render of the reacti framework, and sub-components of the integrity check result, the consistency check result are mounted to virtual DOM nodes of the main component.
In operation S440, an asynchronous HTTP request is initiated in the lifecycle componentdi mount of the main component created in operation S430 based on the React lifecycle mechanism, and the document title, the document element name and element value set identified in operation S410, and the audit result stored in operation S420 are acquired.
In operation S450, the document title, the document element name and the element value set acquired in operation S440 are bound to the component State of the checking result main display component in operation S430, and after the State update is completed, the rendering mechanism of the reaction component is triggered, and the checking result data is rendered to the sub-components of the integrity checking result and the consistency checking result in operation S430.
According to the embodiment of the present disclosure, the contents of operations S410 to S450 may refer to the related contents of operations S210 to S240, and will not be described herein.
FIG. 5 schematically illustrates a flow chart of a document processing method according to yet another embodiment of the present disclosure.
As shown in fig. 5, the document processing method of this embodiment may include operations S501 to S511.
Step S501: and acquiring a document image to be audited under the credit item.
Step S502: and identifying the bill title, the bill element and the bill element value based on the OCR technology.
Step S503: and constructing and storing the structured information of the bill.
Step S504: and detecting the integrity of the bill according to the bill title list which is intelligently identified and the bill type list which is required by the credit card.
Step S505: and detecting the consistency of the bill according to the intelligently identified bill element and element value.
Step S506: the act application is initialized.
Step S507: and creating a main component of the bill auditing result based on the reactidom. Render.
Step S508: and respectively defining a real sub-component of the bill examination result and the bill element examination result, and mounting the real sub-components to a virtual DOM node of the main component.
Step S509: and initiating an asynchronous HTTP request in the life cycle componentDidMount, and acquiring the document element information and the auditing result.
Step S510: the setState method mobilizing the React binds the data element information and the audit results into the component state of the master component.
Step S511: triggering a rendering mechanism of the reaction component, rendering the bill identification information and the auditing result, and completing the display of the intelligent auditing result of the bill under the credit item.
According to the embodiment of the present disclosure, the contents of operations S501 to S511 may refer to the relevant contents of operations S210 to S240 and operations S410 to S450, which are not described herein.
FIG. 6 schematically illustrates a block diagram of a document processing system according to an embodiment of the present disclosure.
As shown in fig. 6, the document processing system 600 includes a document intelligent image recognition device 601, a document audit result detection device 602, and an audit result display device 603. The bill intelligent image recognition device 601 is connected with the bill checking result detection device 602; the bill intelligent image recognition device 601 is connected with the auditing result display device 603; the document audit result detection device 602 is connected to an audit result display device 603.
The document intelligent image recognition device 601 is configured to intelligently recognize a document image to be audited, and in actual processing, the processing steps of the document intelligent image recognition device 601 include, but are not limited to: OCR text detection, form detection, image feature recognition, extraction of structured information based on path analysis and semantic understanding, and the like. The device outputs the document title, the document element name and the key value pair structural information corresponding to the element value through acquiring the document image under a set of externally input credit card items, and outputs the structural information and stores the structural information in a database of an application system.
The document auditing result detecting device 602 is configured to extract a document title list based on the structured information output by the document intelligent image identifying device 601, and then compare the document title list with a document type list required by a credit card, to detect an integrity auditing result of a document, for example, the credit card requires that a bill of lading, a box bill, a sanitary card and a production place card must be included, but the document title list identified by the intelligent image includes a bill of lading, a box bill and a production place card, but no sanitary card is identified, and the integrity auditing result of the document is "the document lacking the production place card". Based on the document intelligent image recognition device 601, outputting document elements and element values, traversing the same document elements of different documents, comparing whether the element values are consistent, if the element values are consistent, the document element auditing result is consistent, otherwise, the document elements are inconsistent. And finally traversing the auditing results of all the element values, judging whether the element values are consistent, if so, enabling the consistency auditing results of the bill to be consistent, otherwise, enabling the bill to be inconsistent.
Audit result display means 603: based on a component development mode of the reaction, a reaction single-result examination display component is created, a credit card unique number is bound to a component external state value tips, an HTTP asynchronous request is initiated in the component according to the credit card unique number to acquire bill element structural information under a credit card item output by a bill intelligent image recognition device 601 and examination result information of a bill examination result detection device 602, after a request result is returned, a setState method of the reaction is mobilized to update the component state, the bill element structural information and the examination result information are mapped to a bill examination result and a bill element single-result examination sub-component, a reaction rendering mechanism is triggered, differences of all virtual DOM nodes after the component state is changed are calculated, and the change is updated to a page.
According to the embodiment of the present disclosure, the document processing system 600 is used for implementing operations S210 to S240, operations S410 to S450, and operations S501 to S511, which are not described herein again.
According to the embodiment of the disclosure, the display system of the intelligent single examination result is realized on the basis of OCR (optical character recognition) document images, whether each document element is consistent with a document is clearly positioned, and clear and effective intelligent examination results are provided for banking staff. And (3) the document image is processed through an OCR recognition engine to obtain document content text and position information of the text in the image, and structured information of the document is output.
The embodiment of the disclosure provides a method and a system for dynamically rendering intelligent audit results based on practice. Based on the business rules of OCR recognition document images and bank audit documents, an Internet Web mainstream development framework reaction is utilized, a componentized development mode is introduced, and audit pages of intelligent audit results are dynamically rendered through a virtual DOM rendering technology, so that the problems of low efficiency, poor audit quality and the like of manual audit documents are solved.
According to the embodiment of the disclosure, through combining the intelligent identification and the practice technology of the document images, the display of the intelligent examination results is realized, whether the document to be examined is complete or not and whether the document elements are consistent or not are clearly displayed, the requirement that whether all contents of the document are mutually consistent or not needs to be carefully checked when banking staff examine the document is met, the flow of manually examining the document can be reduced, the cost of examining the document is reduced, and the quality and the efficiency of examining the document are improved.
It should be noted that, unless there is an execution sequence between different operations or an execution sequence between different operations in technical implementation, the execution sequence between multiple operations may be different, and multiple operations may also be executed simultaneously in the embodiment of the disclosure.
Based on the bill processing method, the disclosure also provides a bill processing device. The device will be described in detail below in connection with fig. 7.
Fig. 7 schematically illustrates a block diagram of a document processing apparatus according to an embodiment of the present disclosure.
As shown in fig. 7, the document processing apparatus 700 of this embodiment includes an acquisition module 710, a calling module 720, a first collation module 730, and a first rendering module 740.
And the acquiring module 710 is configured to acquire a document identifier and target text information in at least one effective area from an image of the document to be processed.
And the retrieving module 720 is configured to retrieve a check source document from a preset database according to the document identifier, where the check source document has the same type as the document to be processed, and check information is filled in the check source document.
The first checking module 730 is configured to check the target text information according to the checking information, and obtain a checking result of the document to be processed, where the checking result includes an integrity checking result and a consistency checking result.
The first rendering module 740 is configured to render the integrity check result and the consistency check result onto a visual page, where the visual page is formed by a check result main display page, and an integrity check result sub-display page and a consistency check result sub-display page mounted under the check result main display page.
According to an embodiment of the present disclosure, the first collation module includes a generation unit, a first extraction unit, a first collation unit.
And the generation unit is used for generating a bill title list to be checked based on the bill titles corresponding to each bill to be processed.
The first extraction unit is used for extracting at least one check bill title in the check information to obtain a check title list.
The first checking unit is used for checking the bill title list to be checked according to the checking title list; and obtaining an integrity check result.
According to an embodiment of the present disclosure, the first collation module further includes a second extraction unit, a search unit, a second collation unit, a traversal unit, a first determination unit.
The second extracting unit is used for extracting a plurality of check document element names in the check information and a check document element value set corresponding to each check document element name in the plurality of check document element names.
Verifying the document element name for each of a plurality of verification document element names:
the searching unit is used for searching the target document element names which are the same as the check document element names from the document element names included in each document to be processed based on each check document element name.
The second checking unit is used for checking the target document element value set corresponding to the target document element name with the checking document element value set to obtain a document element value checking result;
the traversing unit is used for traversing and checking each check document element name in the plurality of check document element names until each check document element name obtains a document element value checking result.
And the first determining unit is used for determining a consistency check result according to the document element value check result.
According to an embodiment of the present disclosure, the first rendering module may include a transmitting unit, a binding unit, a triggering unit, and a rendering unit.
And the sending unit is used for responding to the request for acquiring the checking result and sending the integrity checking result and the consistency checking result to the checking result main display component.
And the binding unit is used for binding the integrity check result and the consistency check result into the state component of the main display component.
The triggering unit is used for triggering the rendering component of the preset frame under the condition that the state component completes state updating based on the received integrity check result and consistency check result.
The rendering unit is used for rendering the integrity check result and the consistency check result to the integrity check result sub-display page and the consistency check result sub-display page respectively through the rendering component.
According to an embodiment of the present disclosure, a rendering unit includes a calculation subunit, an update subunit.
And the calculating subunit is used for calculating the difference information of the virtual DOM nodes after the integrity check result sub-display assembly and the consistency check result sub-display assembly respectively receive the integrity check result and the consistency check result through the rendering assembly.
And the updating subunit is used for respectively updating the difference information of the virtual DOM nodes to the integrity check result sub-display page and the consistency check result sub-display page.
According to an embodiment of the present disclosure, the acquisition module may include an identification unit, a second determination unit, and a third extraction unit.
The identification unit is used for carrying out identification processing on the image of the document to be processed based on a preset information identification method to obtain a document identification and an identification image of the document to be processed, wherein the identification image comprises a plurality of identification areas.
And a second determining unit configured to determine at least one effective area from the identification areas, wherein text information is recorded in the effective area.
The third extraction unit is used for extracting target text information from the text information based on the feature parameters of the document, wherein the feature parameters of the document comprise at least one of the following: document title feature parameters, document element name feature parameters, and document element value feature parameters.
According to an embodiment of the disclosure, the document processing apparatus may further include a conversion module, a second collation module, and a second rendering module.
And the conversion module is used for carrying out key value pair conversion processing on the target text information to obtain the structured target text information.
And the second checking module is used for checking the structured target text information according to the checking information to obtain a checking result of the document to be processed.
And the second rendering module is used for rendering the checking result of the document to be processed onto the visual page.
According to embodiments of the present disclosure, any of the acquisition module 710, the retrieval module 720, the first collation module 730 and the first rendering module 740 may be incorporated in one module to be implemented, or any of the modules may be split into a plurality of modules. Alternatively, at least some of the functionality of one or more of the modules may be combined with at least some of the functionality of other modules and implemented in one module. According to embodiments of the present disclosure, at least one of the acquisition module 710, the retrieval module 720, the first collation module 730, and the first rendering module 740 may be implemented at least in part as hardware circuitry, such as a Field Programmable Gate Array (FPGA), a Programmable Logic Array (PLA), a system on a chip, a system on a substrate, a system on a package, an Application Specific Integrated Circuit (ASIC), or in hardware or firmware, such as any other reasonable way of integrating or packaging the circuitry, or in any one of or a suitable combination of three of software, hardware, and firmware. Alternatively, at least one of the acquisition module 710, the retrieval module 720, the first collation module 730 and the first rendering module 740 may be at least partially implemented as computer program modules which, when executed, may perform the respective functions.
It should be noted that, in the embodiment of the present disclosure, the document processing device portion corresponds to the document processing method portion in the embodiment of the present disclosure, and the description of the document processing device portion specifically refers to the document processing method portion and is not described herein again.
Fig. 8 schematically illustrates a block diagram of an electronic device adapted to implement a document processing method according to an embodiment of the disclosure.
As shown in fig. 8, an electronic device 800 according to an embodiment of the present disclosure includes a processor 801 that can perform various appropriate actions and processes according to a program stored in a Read Only Memory (ROM) 802 or a program loaded from a storage section 808 into a Random Access Memory (RAM) 803. The processor 801 may include, for example, a general purpose microprocessor (e.g., a CPU), an instruction set processor and/or an associated chipset and/or a special purpose microprocessor (e.g., an Application Specific Integrated Circuit (ASIC)), or the like. The processor 801 may also include on-board memory for caching purposes. The processor 801 may include a single processing unit or multiple processing units for performing the different actions of the method flows according to embodiments of the disclosure.
In the RAM 803, various programs and data required for the operation of the electronic device 800 are stored. The processor 801, the ROM 802, and the RAM 803 are connected to each other by a bus 804. The processor 801 performs various operations of the method flow according to the embodiments of the present disclosure by executing programs in the ROM 802 and/or the RAM 803. Note that the program may be stored in one or more memories other than the ROM 802 and the RAM 803. The processor 801 may also perform various operations of the method flows according to embodiments of the present disclosure by executing programs stored in the one or more memories.
According to an embodiment of the present disclosure, the electronic device 800 may also include an input/output (I/O) interface 805, the input/output (I/O) interface 805 also being connected to the bus 804. The electronic device 800 may also include one or more of the following components connected to an input/output (I/O) interface 805: an input portion 806 including a keyboard, mouse, etc.; an output portion 807 including a display such as a Cathode Ray Tube (CRT), a Liquid Crystal Display (LCD), and a speaker; a storage section 808 including a hard disk or the like; and a communication section 809 including a network interface card such as a LAN card, a modem, or the like. The communication section 809 performs communication processing via a network such as the internet. The drive 810 is also connected to an input/output (I/O) interface 805 as needed. A removable medium 811 such as a magnetic disk, an optical disk, a magneto-optical disk, a semiconductor memory, or the like is mounted on the drive 810 as needed so that a computer program read out therefrom is mounted into the storage section 808 as needed.
The present disclosure also provides a computer-readable storage medium that may be embodied in the apparatus/device/system described in the above embodiments; or may exist alone without being assembled into the apparatus/device/system. The computer-readable storage medium carries one or more programs which, when executed, implement methods in accordance with embodiments of the present disclosure.
According to embodiments of the present disclosure, the computer-readable storage medium may be a non-volatile computer-readable storage medium, which may include, for example, but is not limited to: a portable computer diskette, a hard disk, a Random Access Memory (RAM), a read-only memory (ROM), an erasable programmable read-only memory (EPROM or flash memory), a portable compact disc read-only memory (CD-ROM), an optical storage device, a magnetic storage device, or any suitable combination of the foregoing. In the context of this disclosure, a computer-readable storage medium may be any tangible medium that can contain, or store a program for use by or in connection with an instruction execution system, apparatus, or device. For example, according to embodiments of the present disclosure, the computer-readable storage medium may include ROM 802 and/or RAM 803 and/or one or more memories other than ROM 802 and RAM 803 described above.
Embodiments of the present disclosure also include a computer program product comprising a computer program containing program code for performing the methods shown in the flowcharts. The program code, when executed in a computer system, causes the computer system to implement the document processing method provided by embodiments of the present disclosure.
The above-described functions defined in the system/apparatus of the embodiments of the present disclosure are performed when the computer program is executed by the processor 801. The systems, apparatus, modules, units, etc. described above may be implemented by computer program modules according to embodiments of the disclosure.
In one embodiment, the computer program may be based on a tangible storage medium such as an optical storage device, a magnetic storage device, or the like. In another embodiment, the computer program may also be transmitted, distributed, and downloaded and installed in the form of a signal on a network medium, and/or from a removable medium 811 via a communication portion 809. The computer program may include program code that may be transmitted using any appropriate network medium, including but not limited to: wireless, wired, etc., or any suitable combination of the foregoing.
In such an embodiment, the computer program may be downloaded and installed from a network via the communication section 809, and/or installed from the removable media 811. The above-described functions defined in the system of the embodiments of the present disclosure are performed when the computer program is executed by the processor 801. The systems, devices, apparatus, modules, units, etc. described above may be implemented by computer program modules according to embodiments of the disclosure.
According to embodiments of the present disclosure, program code for performing computer programs provided by embodiments of the present disclosure may be written in any combination of one or more programming languages, and in particular, such computer programs may be implemented in high-level procedural and/or object-oriented programming languages, and/or assembly/machine languages. Programming languages include, but are not limited to, such as Java, c++, python, "C" or similar programming languages. The program code may execute entirely on the user's computing device, partly on the user's device, partly on a remote computing device, or entirely on the remote computing device or server. In the case of remote computing devices, the remote computing device may be connected to the user computing device through any kind of network, including a Local Area Network (LAN) or a Wide Area Network (WAN), or may be connected to an external computing device (e.g., connected via the Internet using an Internet service provider).
The flowcharts and block diagrams in the figures illustrate the architecture, functionality, and operation of possible implementations of systems, methods and computer program products according to various embodiments of the present disclosure. In this regard, each block in the flowchart or block diagrams may represent a module, segment, or portion of code, which comprises one or more executable instructions for implementing the specified logical function(s). It should also be noted that, in some alternative implementations, the functions noted in the block may occur out of the order noted in the figures. For example, two blocks shown in succession may, in fact, be executed substantially concurrently, or the blocks may sometimes be executed in the reverse order, depending upon the functionality involved. It will also be noted that each block of the block diagrams or flowchart illustration, and combinations of blocks in the block diagrams or flowchart illustration, can be implemented by special purpose hardware-based systems which perform the specified functions or acts, or combinations of special purpose hardware and computer instructions.
Those skilled in the art will appreciate that the features recited in the various embodiments of the disclosure and/or in the claims may be provided in a variety of combinations and/or combinations, even if such combinations or combinations are not explicitly recited in the disclosure. In particular, the features recited in the various embodiments of the present disclosure and/or the claims may be variously combined and/or combined without departing from the spirit and teachings of the present disclosure. All such combinations and/or combinations fall within the scope of the present disclosure.
The embodiments of the present disclosure are described above. However, these examples are for illustrative purposes only and are not intended to limit the scope of the present disclosure. Although the embodiments are described above separately, this does not mean that the measures in the embodiments cannot be used advantageously in combination. The scope of the disclosure is defined by the appended claims and equivalents thereof. Various alternatives and modifications can be made by those skilled in the art without departing from the scope of the disclosure, and such alternatives and modifications are intended to fall within the scope of the disclosure.

Claims (11)

1. A document processing method, comprising:
acquiring a bill identifier and target text information in at least one effective area from an image of a bill to be processed;
Retrieving a check source bill from a preset database according to the bill identification, wherein the check source bill and the bill to be processed are of the same type, and check information is filled in the check source bill;
checking the target text information according to the checking information to obtain a checking result of the document to be processed, wherein the checking result comprises an integrity checking result and a consistency checking result;
and rendering the integrity check result and the consistency check result on a visual page, wherein the visual page is formed by a check result main display page, an integrity check result sub-display page and a consistency check result sub-display page which are mounted under the check result main display page.
2. The method of claim 1, wherein the document to be processed has a plurality of documents, the target text information includes a document title, and each document to be processed corresponds to the document title;
the text information is checked according to the check information, and the checking result of the document to be processed is obtained, wherein the checking result comprises the following steps:
generating a bill title list to be checked based on the bill titles corresponding to each bill to be processed;
Extracting at least one check bill title in the check information to obtain a check title list;
checking the bill title list to be checked according to the check title list; and obtaining the integrity check result.
3. The method of claim 1, wherein the document to be processed has a plurality of documents, the target text information comprises a document element name and a document element value set, each document to be processed comprises a plurality of document element names, and each document element name corresponds to the document element value set;
the text information is checked according to the check information, and the checking result of the document to be processed is obtained, wherein the checking result comprises the following steps:
extracting a plurality of check document element names in the check information and a check document element value set corresponding to each of the plurality of check document element names;
for each of a plurality of said verification document element names:
based on each checked bill element name, searching a target bill element name which is the same as the checked bill element name from a plurality of bill element names included in each bill to be processed;
Checking a target document element value set corresponding to the target document element name with the checking document element value set to obtain a document element value checking result;
traversing and checking each check document element name in a plurality of check document element names until each check document element name obtains a document element value checking result;
and determining the consistency check result according to the document element value check result.
4. The method of claim 1, wherein the collation results master presentation page has a collation results master presentation component corresponding to that created by a preset framework;
the rendering of the integrity check result and the consistency check result on a visual page comprises the following steps:
transmitting the integrity check result, the consistency check result to the check result main display component in response to a request for acquiring the check result;
binding the integrity check result and the consistency check result into a state component of the main display component;
triggering a rendering component of the preset frame under the condition that the state component completes state updating based on the received integrity check result and the consistency check result;
And respectively rendering the integrity check result and the consistency check result to the integrity check result sub-display page and the consistency check result sub-display page through the rendering component.
5. The method of claim 4, wherein an integrity check result sub-presentation component corresponding to the integrity check result sub-presentation page and a consistency check result sub-presentation component corresponding to the consistency check result sub-presentation page are mounted at a virtual DOM node of the check result main presentation component;
the rendering, by the rendering component, the integrity check result and the consistency check result to the integrity check result sub-display page and the consistency check result sub-display page respectively includes:
calculating difference information of virtual DOM nodes after the integrity check result sub-display assembly and the consistency check result sub-display assembly respectively receive the integrity check result and the consistency check result through the rendering assembly;
and updating the difference information of the virtual DOM nodes to the integrity check result sub-display page and the consistency check result sub-display page respectively.
6. The method of claim 1, wherein the acquiring the document identifier and the target text information in the at least one effective area from the image of the document to be processed comprises:
based on a preset information identification method, carrying out identification processing on the image of the document to be processed to obtain the document identification and an identification image of the document to be processed, wherein the identification image comprises a plurality of identification areas;
determining at least one effective area from the identification areas, wherein text information is recorded in the effective area;
extracting the target text information from the text information based on the feature parameters of the document, wherein the feature parameters of the document comprise at least one of the following: document title feature parameters, document element name feature parameters, and document element value feature parameters.
7. The method of claim 1, further comprising:
performing key value pair conversion processing on the target text information to obtain structured target text information;
checking the structured target text information according to the checking information to obtain a checking result of the document to be processed;
and rendering the checking result of the document to be processed to the visual page.
8. A document processing apparatus comprising:
the acquisition module is used for acquiring the bill identification and the target text information in at least one effective area from the image of the bill to be processed;
the retrieving module is used for retrieving the check source bill from a preset database according to the bill identifier, wherein the check source bill and the bill to be processed are of the same type, and check information is filled in the check source bill;
the first checking module is used for checking the target text information according to the checking information to obtain a checking result of the document to be processed, wherein the checking result comprises an integrity checking result and a consistency checking result;
the first rendering module is used for rendering the integrity check result and the consistency check result on a visual page, wherein the visual page is formed by a check result main display page, an integrity check result sub-display page mounted under the check result main display page and a consistency check result sub-display page.
9. An electronic device, comprising:
one or more processors;
storage means for storing one or more programs,
Wherein the one or more programs, when executed by the one or more processors, cause the one or more processors to perform the method of any of claims 1-7.
10. A computer readable storage medium having stored thereon executable instructions which, when executed by a processor, cause the processor to perform the method according to any of claims 1-7.
11. A computer program product comprising a computer program which, when executed by a processor, implements the method according to any one of claims 1 to 7.
CN202310602834.9A 2023-05-25 2023-05-25 Document processing method, device, equipment and storage medium Pending CN116629230A (en)

Priority Applications (1)

Application Number Priority Date Filing Date Title
CN202310602834.9A CN116629230A (en) 2023-05-25 2023-05-25 Document processing method, device, equipment and storage medium

Applications Claiming Priority (1)

Application Number Priority Date Filing Date Title
CN202310602834.9A CN116629230A (en) 2023-05-25 2023-05-25 Document processing method, device, equipment and storage medium

Publications (1)

Publication Number Publication Date
CN116629230A true CN116629230A (en) 2023-08-22

Family

ID=87636239

Family Applications (1)

Application Number Title Priority Date Filing Date
CN202310602834.9A Pending CN116629230A (en) 2023-05-25 2023-05-25 Document processing method, device, equipment and storage medium

Country Status (1)

Country Link
CN (1) CN116629230A (en)

Similar Documents

Publication Publication Date Title
US10546351B2 (en) System and method for automatic generation of reports based on electronic documents
US8798354B1 (en) Method and system for automatic correlation of check-based payments to customer accounts and/or invoices
US11710332B2 (en) Electronic document data extraction
KR101883694B1 (en) Automation system for import-export customs procedure and adjustment
CN113205402A (en) Account checking method and device, electronic equipment and computer readable medium
CN111651552A (en) Structured information determination method and device and electronic equipment
CN113450075A (en) Work order processing method and device based on natural language technology
US20150046304A1 (en) Analysis of e-receipts for charitable donations
CN114140649A (en) Bill classification method, bill classification device, electronic apparatus, and storage medium
CN112767107A (en) Method, apparatus, device, medium and program product for detecting blacklist
CN117033431A (en) Work order processing method, device, electronic equipment and medium
CN109740130B (en) Method and device for generating file
CN115471228A (en) Financial business certificate checking method, device, equipment and storage medium
US20170083600A1 (en) Creating data objects to separately store common data included in documents
US20130300562A1 (en) Generating delivery notification
CN116629230A (en) Document processing method, device, equipment and storage medium
CN113395292B (en) Message processing method and device, electronic equipment and storage medium
US20220058341A1 (en) Semantic language feature definition language for use in fraud detection
CN114138712A (en) File processing method, processing device, electronic equipment and storage medium
US11900289B1 (en) Structuring unstructured data via optical character recognition and analysis
US20240185151A1 (en) Structuring unstructured data via optical character recognition and analysis
CN113887955B (en) Method and device for examining business document, electronic equipment and readable storage medium
US11875374B2 (en) Automated auditing and recommendation systems and methods
CN113127689B (en) Data processing method, device, equipment and storage medium
CN115689721A (en) Credit system information processing method, device, equipment and medium

Legal Events

Date Code Title Description
PB01 Publication
PB01 Publication
SE01 Entry into force of request for substantive examination
SE01 Entry into force of request for substantive examination