CN116628307A - Document detection method, device, apparatus, storage medium and program product - Google Patents

Document detection method, device, apparatus, storage medium and program product Download PDF

Info

Publication number
CN116628307A
CN116628307A CN202310604038.9A CN202310604038A CN116628307A CN 116628307 A CN116628307 A CN 116628307A CN 202310604038 A CN202310604038 A CN 202310604038A CN 116628307 A CN116628307 A CN 116628307A
Authority
CN
China
Prior art keywords
document
matching
result
check
evaluation
Prior art date
Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
Pending
Application number
CN202310604038.9A
Other languages
Chinese (zh)
Inventor
张磊
肖雷
曾凡麟
Current Assignee (The listed assignees may be inaccurate. Google has not performed a legal analysis and makes no representation or warranty as to the accuracy of the list.)
China Construction Bank Corp
CCB Finetech Co Ltd
Original Assignee
China Construction Bank Corp
CCB Finetech Co Ltd
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
Filing date
Publication date
Application filed by China Construction Bank Corp, CCB Finetech Co Ltd filed Critical China Construction Bank Corp
Priority to CN202310604038.9A priority Critical patent/CN116628307A/en
Publication of CN116628307A publication Critical patent/CN116628307A/en
Pending legal-status Critical Current

Links

Classifications

    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06FELECTRIC DIGITAL DATA PROCESSING
    • G06F16/00Information retrieval; Database structures therefor; File system structures therefor
    • G06F16/90Details of database functions independent of the retrieved data types
    • G06F16/95Retrieval from the web
    • G06F16/953Querying, e.g. by the use of web search engines
    • G06F16/9532Query formulation
    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06FELECTRIC DIGITAL DATA PROCESSING
    • G06F16/00Information retrieval; Database structures therefor; File system structures therefor
    • G06F16/90Details of database functions independent of the retrieved data types
    • G06F16/95Retrieval from the web
    • G06F16/953Querying, e.g. by the use of web search engines
    • G06F16/9538Presentation of query results
    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06QINFORMATION AND COMMUNICATION TECHNOLOGY [ICT] SPECIALLY ADAPTED FOR ADMINISTRATIVE, COMMERCIAL, FINANCIAL, MANAGERIAL OR SUPERVISORY PURPOSES; SYSTEMS OR METHODS SPECIALLY ADAPTED FOR ADMINISTRATIVE, COMMERCIAL, FINANCIAL, MANAGERIAL OR SUPERVISORY PURPOSES, NOT OTHERWISE PROVIDED FOR
    • G06Q50/00Information and communication technology [ICT] specially adapted for implementation of business processes of specific business sectors, e.g. utilities or tourism
    • G06Q50/10Services
    • G06Q50/18Legal services
    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06VIMAGE OR VIDEO RECOGNITION OR UNDERSTANDING
    • G06V30/00Character recognition; Recognising digital ink; Document-oriented image-based pattern recognition
    • G06V30/10Character recognition
    • G06V30/14Image acquisition
    • G06V30/1444Selective acquisition, locating or processing of specific regions, e.g. highlighted text, fiducial marks or predetermined fields
    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06VIMAGE OR VIDEO RECOGNITION OR UNDERSTANDING
    • G06V30/00Character recognition; Recognising digital ink; Document-oriented image-based pattern recognition
    • G06V30/40Document-oriented image-based pattern recognition
    • G06V30/41Analysis of document content

Landscapes

  • Engineering & Computer Science (AREA)
  • Theoretical Computer Science (AREA)
  • Databases & Information Systems (AREA)
  • Physics & Mathematics (AREA)
  • General Physics & Mathematics (AREA)
  • Computer Vision & Pattern Recognition (AREA)
  • Business, Economics & Management (AREA)
  • Tourism & Hospitality (AREA)
  • Multimedia (AREA)
  • General Engineering & Computer Science (AREA)
  • Data Mining & Analysis (AREA)
  • Human Resources & Organizations (AREA)
  • Economics (AREA)
  • General Health & Medical Sciences (AREA)
  • Technology Law (AREA)
  • Marketing (AREA)
  • Primary Health Care (AREA)
  • Strategic Management (AREA)
  • General Business, Economics & Management (AREA)
  • Mathematical Physics (AREA)
  • Health & Medical Sciences (AREA)
  • Artificial Intelligence (AREA)
  • Information Retrieval, Db Structures And Fs Structures Therefor (AREA)

Abstract

The application discloses a document detection method, a document detection device, a document detection apparatus, a document detection storage medium and a document detection program product. The application relates to the technical field of data processing. The method comprises the following steps: acquiring a target document and a corresponding checking and controlling instruction thereof; extracting document elements in the target document and checking and controlling fields in the checking and controlling instruction; wherein, the document elements are in one-to-one correspondence with the check and control fields; matching the document element with the check field based on element matching rules to obtain a matching result; wherein, the matching result includes: matching the document element with the check field, and not matching the document element with the check field; and determining the detection result of the target document according to the matching result. According to the technical scheme, the document elements extracted from the target document are matched with the check and control fields extracted from the check and control instruction through the element matching rule, and the detection result of the target document is determined based on the matching result, so that the efficiency and accuracy of document detection can be improved.

Description

Document detection method, device, apparatus, storage medium and program product
Technical Field
The embodiment of the application relates to the technical field of text recognition, in particular to a method, a device, equipment, a storage medium and a program product for detecting a document.
Background
In recent years, network judicial control of financial services is becoming more and more popular, and related personnel can complete online control of account numbers, individuals and enterprises through a judicial control platform. By means of the judicial check and control platform established between each authority and the bank, automatic judicial check and control can be realized without manual intervention.
Because judicial investigation and control relates to investigation and control of sensitive information such as object privacy, amount and the like. Therefore, the network judicial check and control is initiated by taking a judicial document as a basis to avoid disputes. In this regard, an effective auditing mechanism is required to audit the document content and to standardize network judicial investigation and control. After the legal inspection and control instructions initiated by the authorized authorities through the network inspection and control platform are landed, the authorized authorities carry out off-line manual inspection and control, so that the inspection of the content of the document is time-consuming and labor-consuming, low in efficiency and easy to make mistakes. With the increasing number of network judicial checks in recent years, the traditional manual checking mode cannot meet the requirements.
Disclosure of Invention
The embodiment of the application provides a method, a device, equipment, a storage medium and a program product for detecting a document, which can improve the efficiency and the accuracy of document detection.
In a first aspect, an embodiment of the present application provides a document detection method, including:
acquiring a target document and a corresponding checking and controlling instruction thereof;
extracting document elements in the target document and checking and controlling fields in the checking and controlling instruction; wherein, the document elements are in one-to-one correspondence with the check and control fields;
matching the document element with the check field based on element matching rules to obtain a matching result; wherein, the matching result includes: matching the document element with the check field, and not matching the document element with the check field;
and determining the detection result of the target document according to the matching result.
In a second aspect, an embodiment of the present application further provides a document detection apparatus, including:
the acquisition module is used for acquiring the target document and the corresponding check and control instruction thereof;
the extraction module is used for extracting document elements in the target document and check and control fields in the check and control instruction; wherein, the document elements are in one-to-one correspondence with the check and control fields;
the matching module is used for matching the document element with the check field based on element matching rules to obtain a matching result; wherein, the matching result includes: matching the document element with the check field, and not matching the document element with the check field;
and the detection result determining module is used for determining the detection result of the target document according to the matching result.
In a third aspect, an embodiment of the present application further provides an electronic device, including a memory, a processor, and a computer program stored in the memory and capable of running on the processor, where the processor implements the document detection method according to the embodiment of the present application when executing the program.
In a fourth aspect, an embodiment of the present application further provides a computer readable storage medium, where a computer program is stored, where the program is executed by a processor to implement a document detection method according to an embodiment of the present application.
In a fifth aspect, embodiments of the present application further provide a computer program product comprising a computer program which, when executed by a processor, implements a document detection method according to embodiments of the present application.
The embodiment of the application discloses a document detection method, a device, equipment, a storage medium and a program product, and a target document and a corresponding checking and controlling instruction thereof are obtained; extracting document elements in the target document and checking and controlling fields in the checking and controlling instruction; wherein, the document elements are in one-to-one correspondence with the check and control fields; matching the document element with the check field based on the element matching rule to obtain a matching result; wherein, the matching result includes: matching the document element with the check field, and not matching the document element with the check field; and determining the detection result of the target document according to the matching result. According to the document detection method provided by the embodiment of the application, the document elements extracted from the target document are matched with the check and control fields extracted from the check and control instruction through the element matching rule, and the detection result of the target document is determined based on the matching result, so that the document detection efficiency and accuracy can be improved.
Drawings
In order to more clearly illustrate the technical solutions of the embodiments of the present application, the drawings that are needed in the embodiments will be briefly described below, it being understood that the following drawings only illustrate some embodiments of the present application and therefore should not be considered as limiting the scope, and other related drawings may be obtained according to these drawings without inventive effort for a person skilled in the art.
FIG. 1 is a flowchart of a document detection method according to an embodiment of the present application;
FIG. 2 is a schematic diagram of a document detection method according to an embodiment of the present application;
fig. 3 is a schematic structural diagram of a document detection device according to an embodiment of the present application;
fig. 4 is a schematic structural diagram of an electronic device according to an embodiment of the present application.
Detailed Description
The application is described in further detail below with reference to the drawings and examples. It is to be understood that the specific embodiments described herein are merely illustrative of the application and are not limiting thereof. It should be further noted that, for convenience of description, only some, but not all of the structures related to the present application are shown in the drawings.
It should be noted that: like reference numerals and letters denote like items in the following figures, and thus once an item is defined in one figure, no further definition or explanation thereof is necessary in the following figures. Meanwhile, in the description of the present application, the terms "first", "second", and the like are used only to distinguish the description, and are not to be construed as indicating or implying relative importance. The technical scheme of the application obtains, stores, uses, processes and the like the data, which all meet the relevant regulations of national laws and regulations.
Fig. 1 is a flowchart of a document detection method provided by the embodiment of the present application, where the method is suitable for detecting whether a issued document is compliant, and the method may be performed by a document detection device, where the device may be implemented in a form of software and/or hardware, and optionally, implemented by an electronic device, where the electronic device may be a mobile terminal, a PC side, a server, or the like. As shown in fig. 1, the method specifically includes the following steps:
s110, acquiring a target document and a corresponding check and control instruction thereof.
The target document can be a judicial document of an electronic version issued to an enterprise by a related organization through a network checking and controlling platform. The checking and controlling instruction corresponds to the target document and comprises specific checking and controlling operations executed by enterprises. In this embodiment, the target document for the target enterprise and the corresponding search control instruction thereof may be downloaded from the network search control platform.
S120, extracting document elements in the target document and checking and controlling fields in the checking and controlling instruction.
Wherein, the file elements are in one-to-one correspondence with the check and control fields. In this embodiment, the target document and the search control instruction are identified mainly by a text recognition (optical character recognition, OCR) technique, so as to obtain document elements included in the target document and search control fields included in the search control instruction.
In this embodiment, the elements included in different types of documents are different, so that it is necessary to extract document elements in the target document and check and control fields in the check and control instruction according to the type information. Specifically, the process of extracting the document element in the target document and the check field in the check instruction may be: determining category information of a target document; and respectively carrying out character recognition on the target document and the checking and controlling instruction based on the category information to obtain document elements in the target document and checking and controlling fields in the checking and controlling instruction.
The category information may include, among other things, query documents, frozen documents, and no-pay documents.
If the target text is a query document, the document elements include: document type, authorized institution name, document number, initiating bank, account number, identity, stamp information, etc. Correspondingly, the checking and controlling field is the corresponding field of the document type, the authorized organ name, the document number, the initiating bank, the account number, the identity mark, the seal information and the like.
If the target document is a frozen document, the document elements include: the document type, the authority name, the document number, the originating bank, the user name, the account number, the frozen amount, the frozen start date, the frozen end date, the seal information, and the like. Correspondingly, the check and control fields are corresponding fields such as document type, authorized organ name, document number, initiating bank, family name, account number, frozen amount, frozen start date, frozen end date, seal information and the like.
If the target document is paid up, the document elements include: document type, authorized institution name, document number, initiating bank, account, stamp information, etc. Correspondingly, the check and control fields are corresponding fields such as book type, authority name, document number, initiating bank, account, seal information and the like.
Specifically, firstly, performing text recognition on a target document and a checking and controlling instruction to obtain text contents corresponding to the target document and the checking and controlling instruction respectively, and then extracting document elements and checking and controlling fields corresponding to category information from the recognized text contents.
And S130, matching the document element with the check field based on the element matching rule to obtain a matching result.
Wherein, the matching result includes: the document element is matched with the check field, and the document element is not matched with the check field.
Wherein, the element matching rules are set by the user and can be dynamically changed, namely, the element matching rules based on each time can be the same or different. And a corresponding element matching rule needs to be set for each document element. The element matching rule includes: at least one of document element processing rules, check and control field processing rules and matching rules. The document element processing rule may be understood as a rule for formatting or structuring the document element, and the check field processing rule may be understood as a rule for formatting or structuring the check field. The matching rule can be understood as a condition that the document element is required to be satisfied when matching with the check field. The matching rules may include any of the following: the document element is the same as the check field, and the document element has an inclusion relationship with the check field. Optionally, the element matching rule further includes element codes, that is, unique codes are allocated to the document elements.
Specifically, the process of matching the document element with the check field based on the element matching rule to obtain the matching result may be: carrying out structuring treatment on the document elements based on the document element treatment rules; carrying out structuring processing on the check and control field based on the check and control field processing rule; and matching the document elements after the structuring treatment with the check and control fields after the structuring treatment based on the matching rules to obtain a matching result.
The document element is structured based on the document element processing rule, which is understood as converting the document element into a data structure or a data format specified by the document element processing rule. Structuring the control field based on the control field processing rules may be understood as converting the control field into a data structure or data format specified by the control field processing rules. The process of matching the structured document element with the structured check and control field based on the matching rule may be: judging whether the structured document elements and the structured check and control fields meet the conditions specified by the matching rules, if so, matching the structured document elements and the structured check and control fields, otherwise, not matching the structured document elements and the structured check and control fields.
And S140, determining the detection result of the target document according to the matching result.
The detection result comprises two conditions of passing detection of the target document and failing detection of the target document. In this embodiment, the manner of determining the detection result of the target document according to the matching result may be: if the matching result exceeding the set proportion in the document elements is that the document elements are matched with the check and control fields, the target document is detected to pass. Or determining the detection result of the target document by adopting a preset document evaluation rule and a matching result.
Specifically, the method for determining the detection result of the target document according to the matching result may be: acquiring at least one document evaluation rule; determining an evaluation result of the at least one document evaluation rule according to the matching results corresponding to the document elements respectively contained in the at least one document evaluation rule; and determining the detection result of the target document based on the evaluation result.
The document evaluation rule includes a plurality of document elements, and the document elements included in the document evaluation rule may be dynamically set by a user, which is not limited herein. For example, assume that there are three document evaluation rules, a first document evaluation rule, a second document evaluation rule, and a third document evaluation rule, respectively. Wherein, the first document evaluation rule contains document elements: authority name, document type, document number, seal; the second document evaluation rule includes document elements of: initiating a bank, a family name and an account number; the third document evaluation rule includes document elements of: a freezing amount, a freezing start date, and a freezing end date.
Wherein, the evaluation result of the document evaluation rule comprises that the evaluation passes or the evaluation fails.
Specifically, the manner of determining the evaluation result of the at least one document evaluation rule according to the matching result corresponding to the document element included in the at least one document evaluation rule may be: if the matching result of one or more of the plurality of document elements of the document evaluation rule is that the document elements are matched with the check field, the evaluation result of the document evaluation rule is that the evaluation is passed; if the matching results of the document elements of the document evaluation rule are mismatching between the document elements and the check fields, the evaluation result of the document evaluation rule is that the evaluation is failed.
In this embodiment, as long as the matching result of one of the plurality of document elements of the document evaluation rule is that the document element and the check field are matched, the evaluation result of the document evaluation rule is that the evaluation is passed. When the matching results of the document elements of the document evaluation rule are mismatching of the document elements and the check fields, the evaluation result of the document evaluation rule is that the evaluation is not passed. For the second document evaluation rule in the above embodiment, the "initiating bank" is matched with its corresponding check field, or the "user name" is matched with its corresponding check field, or the "account number" is matched with its corresponding check field, and then the second document evaluation rule is evaluated as passing. If the initiating bank is not matched with the corresponding check field, the household name is not matched with the corresponding check field, and the account number is not matched with the corresponding check field, the second document evaluation rule is that the evaluation is not passed.
Specifically, the manner of determining the detection result of the target document based on the evaluation result may be: if the evaluation results of at least one document evaluation rule are all evaluation passing, the detection result of the target document is detection passing; if the evaluation result of the at least one document evaluation rule does not pass the evaluation, the detection result of the target document is that the detection does not pass.
In this embodiment, when the evaluation results of the evaluation rules of the plurality of documents are all evaluation passing, the detection result of the target document is detection passing. When the evaluation result of the evaluation rules of the plurality of documents fails to pass, the detection result of the target document is that the detection fails. For example, for the first document evaluation rule, the second document evaluation rule, and the third document evaluation rule in the above embodiments, if the first document evaluation rule evaluation passes, the second document evaluation rule evaluation passes, and the third document evaluation rule evaluation passes, the detection result of the target document is the detection pass. If the evaluation of the first document evaluation rule is not passed, or the evaluation of the second document evaluation rule is not passed, or the evaluation of the third document evaluation rule is not passed, the detection result of the target document is that the detection is not passed.
Optionally, after determining the detection result of the target document according to the matching result, the method further includes the following steps: determining first statistical information of the document elements and/or second statistical information of element matching rules based on the matching result; the first report is generated based on the first statistical information and/or the second report is generated based on the second statistical information.
Wherein the first statistical information includes at least one of: the total number of matching of the document elements and the matching result are the number of matching of the document elements and the check and control fields and the matching rate. The second statistical information includes at least one of: the total number of matching of the element matching rules and the matching result are the number of matching of the document elements and the check and control fields and the matching rate. In this embodiment, the matching rate of each document element can be obtained through the first report, so that analysis can be performed for the case of low matching rate, and a plurality of element matching rules for the document elements can be adjusted. The matching rate of each set element matching rule can be known through the second report, analysis can be conducted according to the condition that the matching rate is low, and a single element matching rule can be adjusted.
Optionally, after determining the detection result of the target document according to the matching result, the method further includes the following steps: determining third statistical information of the target document based on the detection result; and generating a third report based on the third statistical information.
Wherein the third statistical information includes at least one of: the total detection times and detection results of the target document are the detection passing times and detection passing rate. The matching rate of each document can be known through the third report, and corresponding analysis can be performed aiming at the condition of low matching rate.
On the basis of the above embodiment, fig. 2 is a schematic diagram of the document detection method in this embodiment, and as shown in fig. 2, the query document, the frozen document and the stop payment document are respectively subjected to text recognition to obtain the elements included in the document, namely, element 1, element 2, … … and element 6, an element matching rule is set for each element, a matching result corresponding to each element is obtained based on the element matching result, a detection result of the document is obtained based on the matching result and the document evaluation rule, and finally a statistical report is reproduced.
According to the technical scheme, the target document and the corresponding checking and controlling instruction are acquired; extracting document elements in the target document and checking and controlling fields in the checking and controlling instruction; wherein, the document elements are in one-to-one correspondence with the check and control fields; matching the document element with the check field based on the element matching rule to obtain a matching result; wherein, the matching result includes: matching the document element with the check field, and not matching the document element with the check field; and determining the detection result of the target document according to the matching result. According to the document detection method provided by the embodiment of the application, the document elements extracted from the target document are matched with the check and control fields extracted from the check and control instruction through the element matching rule, and the detection result of the target document is determined based on the matching result, so that the document detection efficiency and accuracy can be improved.
Fig. 3 is a schematic structural diagram of a document detection device according to an embodiment of the present application, as shown in fig. 3, where the device includes:
an obtaining module 310, configured to obtain a target document and a corresponding search control instruction thereof;
the extracting module 320 is configured to extract document elements in the target document and check and control fields in the check and control instruction; wherein, the document elements are in one-to-one correspondence with the check and control fields;
the matching module 330 is configured to match the document element with the search field based on an element matching rule, so as to obtain a matching result; wherein, the matching result includes: matching the document element with the check field, and not matching the document element with the check field;
the detection result determining module 340 is configured to determine a detection result of the target document according to the matching result.
Optionally, the extracting module 320 is further configured to:
determining category information of a target document;
and respectively carrying out character recognition on the target document and the checking and controlling instruction based on the category information to obtain document elements in the target document and checking and controlling fields in the checking and controlling instruction.
Optionally, the element matching rule includes: at least one of document element processing rules, check and control field processing rules and matching rules.
Optionally, the matching module 330 is further configured to:
carrying out structuring treatment on the document elements based on the document element treatment rules;
carrying out structuring processing on the check and control field based on the check and control field processing rule;
and matching the document elements after the structuring treatment with the check and control fields after the structuring treatment based on the matching rules to obtain a matching result.
Optionally, the detection result determining module 340 is further configured to:
acquiring at least one document evaluation rule; wherein the document evaluation rule includes a plurality of document elements;
determining an evaluation result of the at least one document evaluation rule according to the matching results corresponding to the document elements respectively contained in the at least one document evaluation rule;
and determining the detection result of the target document based on the evaluation result.
Optionally, the detection result determining module 340 is further configured to:
if the matching result of one or more of the plurality of document elements of the document evaluation rule is that the document elements are matched with the check field, the evaluation result of the document evaluation rule is that the evaluation is passed;
if the matching results of the document elements of the document evaluation rule are mismatching between the document elements and the check fields, the evaluation result of the document evaluation rule is that the evaluation is failed.
Optionally, the detection result determining module 340 is further configured to:
if the evaluation results of at least one document evaluation rule are all evaluation passing, the detection result of the target document is detection passing;
if the evaluation result of the at least one document evaluation rule does not pass the evaluation, the detection result of the target document is that the detection does not pass.
Optionally, the method further comprises: the first report or the second report generating module is used for:
determining first statistical information of the document elements and/or second statistical information of element matching rules based on the matching result;
the first report is generated based on the first statistical information and/or the second report is generated based on the second statistical information.
Optionally, the first statistical information includes at least one of: the total matching times and matching results of the document elements are the matching times and matching rate of the document elements and the check and control fields;
the second statistical information includes at least one of: the total number of matching of the element matching rules and the matching result are the number of matching of the document elements and the check and control fields and the matching rate.
Optionally, the method further comprises: the third report generating module is used for:
determining third statistical information of the target document based on the detection result;
and generating a third report based on the third statistical information.
Optionally, the third statistical information includes at least one of: the total detection times and detection results of the target document are the detection passing times and detection passing rate.
Fig. 4 shows a schematic diagram of the structure of an electronic device 10 that may be used to implement an embodiment of the application. Electronic devices are intended to represent various forms of digital computers, such as laptops, desktops, workstations, personal digital assistants, servers, blade servers, mainframes, and other appropriate computers. Electronic equipment may also represent various forms of mobile devices, such as personal digital processing, cellular telephones, smartphones, wearable devices (e.g., helmets, glasses, watches, etc.), and other similar computing devices. The components shown herein, their connections and relationships, and their functions, are meant to be exemplary only, and are not meant to limit implementations of the applications described and/or claimed herein.
As shown in fig. 4, the electronic device 10 includes at least one processor 11, and a memory, such as a Read Only Memory (ROM) 12, a Random Access Memory (RAM) 13, etc., communicatively connected to the at least one processor 11, in which the memory stores a computer program executable by the at least one processor, and the processor 11 may perform various appropriate actions and processes according to the computer program stored in the Read Only Memory (ROM) 12 or the computer program loaded from the storage unit 18 into the Random Access Memory (RAM) 13. In the RAM 13, various programs and data required for the operation of the electronic device 10 may also be stored. The processor 11, the ROM 12 and the RAM 13 are connected to each other via a bus 14. An input/output (I/O) interface 15 is also connected to bus 14.
Various components in the electronic device 10 are connected to the I/O interface 15, including: an input unit 16 such as a keyboard, a mouse, etc.; an output unit 17 such as various types of displays, speakers, and the like; a storage unit 18 such as a magnetic disk, an optical disk, or the like; and a communication unit 19 such as a network card, modem, wireless communication transceiver, etc. The communication unit 19 allows the electronic device 10 to exchange information/data with other devices via a computer network, such as the internet, and/or various telecommunication networks.
The processor 11 may be a variety of general and/or special purpose processing components having processing and computing capabilities. Some examples of processor 11 include, but are not limited to, a Central Processing Unit (CPU), a Graphics Processing Unit (GPU), various specialized Artificial Intelligence (AI) computing chips, various processors running machine learning model algorithms, digital Signal Processors (DSPs), and any suitable processor, controller, microcontroller, etc. The processor 11 performs the various methods and processes described above, such as the document detection method.
In some embodiments, the document detection method may be implemented as a computer program tangibly embodied on a computer-readable storage medium, such as the storage unit 18. In some embodiments, part or all of the computer program may be loaded and/or installed onto the electronic device 10 via the ROM 12 and/or the communication unit 19. When the computer program is loaded into RAM 13 and executed by processor 11, one or more steps of the document detection method described above may be performed. Alternatively, in other embodiments, the processor 11 may be configured to perform the document detection method in any other suitable way (e.g., by means of firmware).
Various implementations of the systems and techniques described here above may be implemented in digital electronic circuitry, integrated circuit systems, field Programmable Gate Arrays (FPGAs), application Specific Integrated Circuits (ASICs), application Specific Standard Products (ASSPs), systems On Chip (SOCs), load programmable logic devices (CPLDs), computer hardware, firmware, software, and/or combinations thereof. These various embodiments may include: implemented in one or more computer programs, the one or more computer programs may be executed and/or interpreted on a programmable system including at least one programmable processor, which may be a special purpose or general-purpose programmable processor, that may receive data and instructions from, and transmit data and instructions to, a storage system, at least one input device, and at least one output device.
A computer program for carrying out methods of the present application may be written in any combination of one or more programming languages. These computer programs may be provided to a processor of a general purpose computer, special purpose computer, or other programmable data processing apparatus, such that the computer programs, when executed by the processor, cause the functions/acts specified in the flowchart and/or block diagram block or blocks to be implemented. The computer program may execute entirely on the machine, partly on the machine, as a stand-alone software package, partly on the machine and partly on a remote machine or entirely on the remote machine or server.
In the context of the present application, a computer-readable storage medium may be a tangible medium that can contain, or store a computer program for use by or in connection with an instruction execution system, apparatus, or device. The computer readable storage medium may include, but is not limited to, an electronic, magnetic, optical, electromagnetic, infrared, or semiconductor system, apparatus, or device, or any suitable combination of the foregoing. Alternatively, the computer readable storage medium may be a machine readable signal medium. More specific examples of a machine-readable storage medium would include an electrical connection based on one or more wires, a portable computer diskette, a hard disk, a Random Access Memory (RAM), a read-only memory (ROM), an erasable programmable read-only memory (EPROM or flash memory), an optical fiber, a portable compact disc read-only memory (CD-ROM), an optical storage device, a magnetic storage device, or any suitable combination of the foregoing.
To provide for interaction with a user, the systems and techniques described here can be implemented on an electronic device having: a display device (e.g., a CRT (cathode ray tube) or LCD (liquid crystal display) monitor) for displaying information to a user; and a keyboard and a pointing device (e.g., a mouse or a trackball) through which a user can provide input to the electronic device. Other kinds of devices may also be used to provide for interaction with a user; for example, feedback provided to the user may be any form of sensory feedback (e.g., visual feedback, auditory feedback, or tactile feedback); and input from the user may be received in any form, including acoustic input, speech input, or tactile input.
The systems and techniques described here can be implemented in a computing system that includes a background component (e.g., as a data server), or that includes a middleware component (e.g., an application server), or that includes a front-end component (e.g., a user computer having a graphical user interface or a web browser through which a user can interact with an implementation of the systems and techniques described here), or any combination of such background, middleware, or front-end components. The components of the system can be interconnected by any form or medium of digital data communication (e.g., a communication network). Examples of communication networks include: local Area Networks (LANs), wide Area Networks (WANs), blockchain networks, and the internet.
The computing system may include clients and servers. The client and server are typically remote from each other and typically interact through a communication network. The relationship of client and server arises by virtue of computer programs running on the respective computers and having a client-server relationship to each other. The server can be a cloud server, also called a cloud computing server or a cloud host, and is a host product in a cloud computing service system, so that the defects of high management difficulty and weak service expansibility in the traditional physical hosts and VPS service are overcome.
The embodiments of the present application also provide a computer program product comprising a computer program which, when executed by a processor, implements a document detection method as provided by any of the embodiments of the present application.
Computer program product in the implementation, the computer program code for carrying out operations of the present application may be written in one or more programming languages, including an object oriented programming language such as Java, smalltalk, C ++ and conventional procedural programming languages, such as the "C" programming language or similar programming languages. The program code may execute entirely on the user's computer, partly on the user's computer, as a stand-alone software package, partly on the user's computer and partly on a remote computer or entirely on the remote computer or server. In the case of a remote computer, the remote computer may be connected to the user's computer through any kind of network, including a Local Area Network (LAN) or a Wide Area Network (WAN), or may be connected to an external computer (for example, through the Internet using an Internet service provider).
Note that the above is only a preferred embodiment of the present application and the technical principle applied. It will be understood by those skilled in the art that the present application is not limited to the particular embodiments described herein, but is capable of various obvious changes, rearrangements and substitutions as will now become apparent to those skilled in the art without departing from the scope of the application. Therefore, while the application has been described in connection with the above embodiments, the application is not limited to the embodiments, but may be embodied in many other equivalent forms without departing from the spirit or scope of the application, which is set forth in the following claims.

Claims (15)

1. A document detection method, comprising:
acquiring a target document and a corresponding checking and controlling instruction thereof;
extracting document elements in the target document and checking and controlling fields in the checking and controlling instruction; wherein, the document elements are in one-to-one correspondence with the check and control fields;
matching the document element with the check field based on element matching rules to obtain a matching result; wherein, the matching result includes: matching the document element with the check field, and not matching the document element with the check field;
and determining the detection result of the target document according to the matching result.
2. The method of claim 1, wherein extracting document elements in the target document and the audit field in the audit instruction comprises:
determining category information of the target document;
and respectively carrying out character recognition on the target document and the checking and controlling instruction based on the category information to obtain document elements in the target document and checking and controlling fields in the checking and controlling instruction.
3. The method of claim 1, wherein the element matching rule comprises: at least one of document element processing rules, check and control field processing rules and matching rules.
4. The method of claim 3, wherein matching the document element with the audit field based on element matching rules to obtain a matching result comprises:
carrying out structuring processing on the document elements based on the document element processing rules;
carrying out structuring processing on the check and control field based on the check and control field processing rule;
and matching the structured document elements with the structured check and control fields based on the matching rules to obtain a matching result.
5. The method of claim 1, wherein determining the detection result of the target document based on the matching result comprises:
acquiring at least one document evaluation rule; wherein the document evaluation rule includes a plurality of document elements;
determining an evaluation result of the at least one document evaluation rule according to the matching results corresponding to the document elements respectively contained in the at least one document evaluation rule;
and determining the detection result of the target document based on the evaluation result.
6. The method of claim 5, wherein determining the evaluation result of the at least one document evaluation rule according to the matching result corresponding to the document element included in the at least one document evaluation rule, respectively, comprises:
if the matching result of one or more of the document elements of the document evaluation rule is that the document elements are matched with the check field, the evaluation result of the document evaluation rule is that the evaluation is passed;
if the matching results of the document elements of the document evaluation rule are mismatching between the document elements and the check fields, the evaluation result of the document evaluation rule is that the evaluation is failed.
7. The method of claim 5, wherein determining the detection result of the target document based on the evaluation result comprises:
if the evaluation results of the at least one document evaluation rule are all evaluation passing, the detection result of the target document is detection passing;
if the evaluation result of the at least one document evaluation rule does not pass the evaluation, the detection result of the target document is that the detection does not pass.
8. The method according to claim 5, further comprising, after determining the detection result of the target document according to the matching result:
determining first statistical information of the document elements and/or second statistical information of the element matching rules based on the matching result;
generating a first report based on the first statistical information and/or generating a second report based on the second statistical information.
9. The method of claim 8, wherein the first statistical information comprises at least one of: the total matching times and matching results of the document elements are the matching times and matching rate of the document elements and the check and control fields;
the second statistical information includes at least one of: the total matching times and matching results of the element matching rules are the times and matching rate of matching the document elements and the check and control fields.
10. The method according to claim 5, further comprising, after determining the detection result of the target document according to the matching result:
determining third statistical information of the target document based on the detection result;
and generating a third report based on the third statistical information.
11. The method of claim 10, wherein the third statistical information comprises at least one of: the total detection times and the detection results of the target document are the detection passing times and the detection passing rate.
12. A document detection apparatus, comprising:
the acquisition module is used for acquiring the target document and the corresponding check and control instruction thereof;
the extraction module is used for extracting document elements in the target document and check and control fields in the check and control instruction; wherein, the document elements are in one-to-one correspondence with the check and control fields;
the matching module is used for matching the document element with the check field based on element matching rules to obtain a matching result; wherein, the matching result includes: matching the document element with the check field, and not matching the document element with the check field;
and the detection result determining module is used for determining the detection result of the target document according to the matching result.
13. An electronic device comprising a memory, a processor and a computer program stored on the memory and executable by the processor, wherein the processor implements the document detection method according to any one of claims 1-9 when executing the computer program.
14. A computer-readable storage medium, on which a computer program is stored, characterized in that the program, when being executed by a processor, implements the document detection method according to any one of claims 1-9.
15. A computer program product comprising a computer program, characterized in that the computer program, when executed by a processor, implements the document detection method according to any one of claims 1-9.
CN202310604038.9A 2023-05-25 2023-05-25 Document detection method, device, apparatus, storage medium and program product Pending CN116628307A (en)

Priority Applications (1)

Application Number Priority Date Filing Date Title
CN202310604038.9A CN116628307A (en) 2023-05-25 2023-05-25 Document detection method, device, apparatus, storage medium and program product

Applications Claiming Priority (1)

Application Number Priority Date Filing Date Title
CN202310604038.9A CN116628307A (en) 2023-05-25 2023-05-25 Document detection method, device, apparatus, storage medium and program product

Publications (1)

Publication Number Publication Date
CN116628307A true CN116628307A (en) 2023-08-22

Family

ID=87636254

Family Applications (1)

Application Number Title Priority Date Filing Date
CN202310604038.9A Pending CN116628307A (en) 2023-05-25 2023-05-25 Document detection method, device, apparatus, storage medium and program product

Country Status (1)

Country Link
CN (1) CN116628307A (en)

Similar Documents

Publication Publication Date Title
CN111598122B (en) Data verification method and device, electronic equipment and storage medium
CN113407610A (en) Information extraction method and device, electronic equipment and readable storage medium
CN113657395A (en) Text recognition method, and training method and device of visual feature extraction model
US20230096921A1 (en) Image recognition method and apparatus, electronic device and readable storage medium
CN114244795B (en) Information pushing method, device, equipment and medium
CN113313114B (en) Certificate information acquisition method, device, equipment and storage medium
CN113312611A (en) Password detection method, device, equipment and computer readable storage medium
CN117076610A (en) Identification method and device of data sensitive table, electronic equipment and storage medium
CN116628307A (en) Document detection method, device, apparatus, storage medium and program product
CN115665285A (en) Data processing method and device, electronic equipment and storage medium
CN113010721B (en) Picture auditing method and device, electronic equipment and storage medium
CN114924959A (en) Page testing method and device, electronic equipment and medium
CN114444087A (en) Unauthorized vulnerability detection method and device, electronic equipment and storage medium
CN114254650A (en) Information processing method, device, equipment and medium
CN113191137A (en) Operation risk obtaining method and system, electronic equipment and storage medium
CN112862305A (en) Method, device, equipment and storage medium for determining risk state of object
CN113037555A (en) Risk event marking method, risk event marking device and electronic equipment
CN116611917B (en) Financial risk early warning method, device, equipment and storage medium
CN114492409B (en) Method and device for evaluating file content, electronic equipment and program product
CN116244740B (en) Log desensitization method and device, electronic equipment and storage medium
CN113868660B (en) Training method, device and equipment for malicious software detection model
CN117522143A (en) Method, device, equipment and storage medium for determining risk level
CN114065008A (en) Method, apparatus, device and storage medium for object identification based on search
CN117011026A (en) Associated risk identification method, device, equipment, storage medium and program product
CN117010760A (en) Rank evaluation method, rank evaluation device, rank evaluation apparatus, rank evaluation program product, and storage medium

Legal Events

Date Code Title Description
PB01 Publication
PB01 Publication
SE01 Entry into force of request for substantive examination
SE01 Entry into force of request for substantive examination