WO2021054850A1 - Method and system for intelligent document processing

Info

Publication number: WO2021054850A1
Application number: PCT/RU2019/000641
Authority: WO
Inventors: Кирилл Геннадьевич ТАРАСОВ; Антон Юрьевич КОЛЕСОВ
Original assignee: Публичное Акционерное Общество "Сбербанк России"
Priority date: 2019-09-17
Filing date: 2019-09-17
Publication date: 2021-03-25

Abstract

The proposed technical solution relates, in general, to the field of image analysis, and more particularly to methods and systems for intelligently processing an electronic set of documents, for example scanned documents of bank clients. The technical result is increasing the efficiency and providing high accuracy in the detection of errors when carrying out automated intelligent document processing. The aforementioned technical result is achieved on account of an intelligent document processing method which is carried out by at least one computing device and comprises steps in which: at least one image of a document is obtained; symbols in the image of the document are identified and transformed into textual information; the document type is determined on the basis of the textual information; an entity set is extracted from the textual information while taking into account the document type; the entity set is compared with a reference entity set for such a document; and the results of the document processing are produced on the basis of the results of the comparison of the above-mentioned entity sets.

Description

METHOD AND SYSTEM FOR INTELLIGENT DOCUMENT PROCESSING

FIELD OF TECHNOLOGY

[0001] The presented technical solution relates generally to the field of image analysis, and in particular to methods and systems for intelligent processing of an electronic set of documents, for example, scanned documents of bank customers.

PRIOR ART

[0002] At present, there is a need for fast, high-quality processing of data from an electronic set of scanned documents in order to check that mandatory fields are filled in, for both structured and unstructured documents, and that signer attributes, such as a signature, are present. Various solutions for processing documents, for example those of a bank client, are known from the prior art, including ones based on ABBYY FlexiCapture software. Also known is a solution for checking a set of documents, disclosed in application US 2011134494 (A1), publ. 06/09/2011, in which a multi-page document is read and the image data of each page is checked, with certain areas of the document image being checked for the presence or absence of information. This solution is the closest analogue.

[0003] A significant drawback of the known solutions is their low efficiency in detecting errors when checking whether documents are filled in correctly: in a very large number of cases they report “there is an error” although there is none. All fields are filled in correctly, but the known solution simply cannot find them because the text is weakly structured. The known solutions also lack a mechanism for automated decision-making based on such a check.

DISCLOSURE OF THE INVENTION

[0004] The technical problem addressed by this technical solution is the creation of a new, effective, simple and reliable method for automated intelligent processing of documents of any type to check that they are filled in correctly.

[0005] The technical result is to improve efficiency and ensure high accuracy in detecting errors during automated intelligent document processing.

[0006] The specified technical result is achieved by a method for intelligent document processing, performed by at least one computing device and comprising the steps of:

- obtaining at least one image of the document;

- recognizing characters in the image of the document and converting them into text information;

- determining the type of the document on the basis of the text information;

- extracting a set of entities from the text information, taking into account the type of the document;

- comparing the set of entities with the reference set of entities for this document;

- generating the results of document processing on the basis of the results of comparing said sets of entities.
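The six steps above can be read as a linear pipeline. The following is a minimal Python sketch of that control flow only. All names (`process_document`, `recognize_text`, `classify_document`, `extract_entities`) are hypothetical and not taken from the application, and the OCR, classification and extraction stages are replaced by trivial stand-ins.

```python
from typing import Callable, Dict

# All names below are hypothetical; each stage is injected as a callable so
# that real OCR, classification and extraction components can be swapped in.
Entities = Dict[str, str]


def process_document(
    image: bytes,
    reference: Entities,
    recognize_text: Callable[[bytes], str],
    classify_document: Callable[[str], str],
    extract_entities: Callable[[str, str], Entities],
) -> dict:
    """Run the listed steps in order and return the processing results."""
    text = recognize_text(image)                 # image -> text information
    doc_type = classify_document(text)           # text -> document type
    entities = extract_entities(text, doc_type)  # type-aware entity extraction
    # Compare the extracted set with the reference set of entities.
    mismatched = sorted(
        name for name, expected in reference.items()
        if entities.get(name) != expected
    )
    # Form the processing results from the comparison.
    return {"doc_type": doc_type, "passed": not mismatched, "mismatched": mismatched}


# Trivial stand-ins used only to exercise the control flow.
result = process_document(
    image=b"full name: Ivan Ivanov",
    reference={"full_name": "Ivan Ivanov"},
    recognize_text=lambda img: img.decode("utf-8"),
    classify_document=lambda text: "ICC",
    extract_entities=lambda text, doc_type: {"full_name": text.split(": ", 1)[1]},
)
```

Passing the stages as callables is a design choice of this sketch; it keeps the order of the steps visible without committing to any particular OCR or machine learning library.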

[0007] In one of the particular examples of the implementation of the method, the document is an agreement on individual credit conditions (ICC) or a surety agreement (DP).

[0008] In another particular embodiment of the method, the following steps are additionally performed: a signer's attribute is detected in the received image of the document; the location of at least one signer's attribute on the page of the document is determined; and the results of document processing are generated taking into account the information about the location of the at least one signer's attribute on the page of the document.

[0009] In another particular embodiment of the method, a step is additionally performed in which the status of the person to whom the detected signer's attributes belong is determined.

[0010] In another particular embodiment of the method, the following steps are additionally performed: a process identifier is obtained; a set of text classification models is determined on the basis of the process identifier; the received text information is transformed into a set of vectors; and the set of vectors is processed by the previously determined set of text classification models to determine the type of the document.

[0011] In another particular embodiment of the method, the following steps are additionally performed: the set of entities is divided into simple entities, consisting of 1-3 words, and complex entities, consisting of four or more words. If, as a result of comparing said sets of entities, the threshold values of matching words for simple and complex entities are reached, reconciliation results are generated that include information on the successful completion of the data reconciliation. If said threshold values are not reached, reconciliation results are generated that include information about the entities in the set of entities that have not passed reconciliation. In this case, the results of document processing are generated taking into account the reconciliation results.

[0012] In another particular embodiment of the method, a step is additionally performed in which the scan quality of the document is determined, and the results of document processing are generated taking into account the scan quality of the document.

[0013] In another preferred embodiment of the claimed solution, an intelligent document processing system is provided comprising at least one computing device and at least one memory device containing machine-readable instructions that, when executed by at least one computing device, perform the above method.

BRIEF DESCRIPTION OF DRAWINGS

[0014] The features and advantages of the present technical solution will become apparent from the following detailed description of the invention and the accompanying drawings, in which:

[0015] FIG. 1 shows a general diagram of the interaction of the elements of an intelligent document processing system.

[0016] FIG. 2 shows an example of a scanned document.

[0017] FIG. 3 shows an example of a general view of an intelligent document processing system.

CARRYING OUT THE INVENTION

[0018] The following will describe the concepts and terms necessary to understand this technical solution.

[0019] In this technical solution, a system means, among other things, a computer system, a computer (electronic computing machine), a CNC (computer numerical control) system, a PLC (programmable logic controller), computerized control systems and any other devices capable of performing a given, well-defined sequence of operations (actions, instructions).

[0020] By a command processing device is meant an electronic unit, a computing device, or an integrated circuit (microprocessor) that executes machine instructions (programs).

[0021] A command processor reads and executes machine instructions (programs) from one or more storage devices. Data storage devices may include, but are not limited to, hard disks (HDD), flash memory, ROM (read-only memory), solid-state drives (SSD) and optical drives.

[0022] A program is a sequence of instructions for execution by a computer control device or command processing device.

[0023] Database (DB): a collection of data organized according to a conceptual structure that describes the characteristics of these data and the relationships between them, where the collection supports one or more application areas (ISO/IEC 2382:2015, 2121423 "database").

[0024] In accordance with the diagram of FIG. 1, the intelligent document processing system 10 comprises the following interconnected components: a data conversion module 11; a signature detection module 12; a data extraction module 13; a package document classification module 17; and a business rules module 18, consisting of a data reconciliation module 14, a document properties analysis module 15, a decision module 16 and a legal validity analysis module 19.

[0025] These modules can be implemented on the basis of the software and hardware of the intelligent document processing system 10, for example, on the basis of at least one computing device, in particular a microprocessor, and at least one memory device containing machine-readable instructions written in the Python programming language to implement the functions performed by the modules. For example, the data conversion module 11 may be implemented on the basis of an optical character recognition (OCR) tool. The signature detection module 12 can be implemented on the basis of a neural network of the YOLOv3 architecture, pre-trained on a typical set of signatures and seals. The package document classification module 17 may be implemented on the basis of the software and hardware of system 10, configured to represent text as vectors (e.g., TF-IDF), and may include a set of text classification models, e.g., SVM or Random Fields. The data extraction module 13 can be implemented on the basis of the software and hardware of system 10 and include a set of word2vec models for analyzing natural language semantics, a pre-trained mathematical model (Conditional Random Fields) and computational tools for natural language processing (NLP). The business rules module 18, consisting of a data reconciliation module 14, a document properties analysis module 15, a decision module 16 and a legal validity analysis module 19, can be implemented on the basis of the software and hardware of system 10, configured to perform the functions assigned to these modules below.

[0026] At the first stage of operation of the system 10, the data conversion module 11 and the signature detection module 12 receive at least one image of a document, in particular a scanned document, for example, a file in multi-page PDF, JPEG or TIFF format, or any other known format suitable for storing a scanned document image. The document image can come from an image data source 1, in particular directly from a document scanning device such as a scanner, or can be retrieved from a corresponding image database in which the document image data is stored in advance.

[0027] In addition, in accordance with a predetermined software and hardware algorithm, process identifier data from the automated system (AS) 2 of the Bank is sent to the package document classification module 17 and to the business rules module 18. The process identifier can be supplied from the AS 2 of the Bank to these modules by methods well known in the art, for example, before a document is submitted to a scanner or before a document image is extracted from a database, according to the process within which the document is checked. The process identifier data is subsequently used to determine: the set of possible document types that may appear in the document image received by the data conversion module 11; the set of entities to be extracted by module 13; and data on the location of signatures in the documents. For example, the process identifier data may indicate that two types of documents can arrive at the input of the document classification module 17: an agreement on individual credit conditions (ICC) or a surety agreement (DP), so the corresponding classifier is triggered.
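Since the process identifier selects the admissible document types, the entities to extract and the expected signature locations, one way to picture it is as a key into a configuration registry. The sketch below is an assumption about how such a registry could look: the `ProcessConfig` record, its field names, the identifier `"consumer_loan"` and the coordinate values are all hypothetical.

```python
from dataclasses import dataclass
from typing import Dict, Tuple

# Hypothetical configuration record; field names and values are illustrative.
Box = Tuple[float, float, float, float]  # (x_min, y_min, x_max, y_max), page-relative


@dataclass(frozen=True)
class ProcessConfig:
    document_types: Tuple[str, ...]    # types the classifier may return
    entities: Tuple[str, ...]          # entities the extraction module should find
    signature_regions: Dict[str, Box]  # signer status -> expected region


# Example registry keyed by process identifier (the identifier is made up).
PROCESS_REGISTRY: Dict[str, ProcessConfig] = {
    "consumer_loan": ProcessConfig(
        document_types=("ICC", "DP"),
        entities=("full_name", "passport_number", "credit_amount"),
        signature_regions={
            "client": (0.05, 0.85, 0.45, 0.95),
            "employee": (0.55, 0.85, 0.95, 0.95),
        },
    ),
}


def config_for(process_id: str) -> ProcessConfig:
    """Look up the configuration for a process identifier."""
    try:
        return PROCESS_REGISTRY[process_id]
    except KeyError:
        raise ValueError(f"unknown process identifier: {process_id!r}") from None
```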

[0028] The document whose image arrives at the data conversion module 11 can be any document that consists of at least one page, may contain signer attributes, and is filled in according to a known template. The document can be, for example, an ICC agreement signed by a bank client or a surety agreement (DP). The document may contain fields with information about the signer, for example, the signer's full name, address, card number, passport data, etc., as well as information about the terms of the agreement, for example, the credit conditions. In particular, according to the diagram shown in FIG. 2, area 101 of the document 100 may contain a field with the number of said application, area 102 a field with the name of the city, area 103 a field with the date of the application, area 104 fields with information about the signer and the credit conditions, and area 105 or 106 images of the signer's attributes, for example, an image of a signature.

[0029] The data conversion module 11 recognizes the characters in the document image and converts them into text information. In parallel, the signature detection module 12 detects the signer's attribute in the received document image and determines its location on the document page. The signer's attribute may be absent from the page; this information is also passed on according to the scheme shown in FIG. 1. For example, module 12 may determine that the image of the signer's attribute is a signature image in area 105 or 106 of the document (see FIG. 2) by automatically reporting the coordinates of the found boxes 105 and 106. Module 12 then sends the data on the location of the signer's attributes on the document page, or on their absence, to the legal validity analysis module 19.

[0030] To detect images of the signer's attributes, the well-known algorithms of a neural network of the YOLOv3 architecture are used, trained on a selected data set of signatures and seals. The architecture is disclosed, for example, in an article published on the Internet at: https://pjreddie.com/media/files/papers/YOLOv3.pdf.

[0031] If the document image contains the attributes of more than one signer, for example, an image of the signature of a Bank client and an image of the signature of a Bank employee, then the legal validity analysis module 19 may be configured to determine the status of the person to whom the detected signer attributes belong. To this end, the user of system 10 can preset in the memory of module 19 a list of person statuses together with information about where each person's signer attributes are located in the document image, selected by the process identifier whose data arrived at module 18 from the AS 2 of the Bank. The status information indicates to which person a signer's attribute belongs, for example, a client of the Bank or an employee of the Bank. For example, for the status "Bank client", the location data may indicate that the signer attributes should be located in area 105 of the document, and for the status "Bank employee", that they should be located in area 106 of the document.

[0032] Accordingly, the legal validity analysis module 19 compares the data on the location of the signer's attribute image on the document page, received from module 12, with the data stored in memory, in particular the data on the location of signer attributes for the type of process that module 19 determines from the previously obtained process identifier data. Based on the result of this comparison, that is, on the location of the signer's attribute in the image of the document page, module 19 determines the status of the person to whom the detected signer's attribute belongs. Module 19 sends the data on the status of the person and on the location of the signer attribute images on the document page to the decision module 16. If module 19 receives information that signer attributes are absent from the image, it forwards this information to module 16.
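The location-based status assignment in [0031] and [0032] can be illustrated with a simple geometric test. This is a sketch under assumptions: boxes are given as page-relative `(x_min, y_min, x_max, y_max)` tuples, a detection is assigned to the region that contains its centre, and the names `signer_status` and `regions` are hypothetical. The application does not state which geometric criterion module 19 actually uses.

```python
from typing import Dict, Optional, Tuple

Box = Tuple[float, float, float, float]  # (x_min, y_min, x_max, y_max)


def box_center(box: Box) -> Tuple[float, float]:
    """Return the centre point of a bounding box."""
    x_min, y_min, x_max, y_max = box
    return ((x_min + x_max) / 2, (y_min + y_max) / 2)


def signer_status(detected: Box, regions: Dict[str, Box]) -> Optional[str]:
    """Return the status whose expected region contains the centre of the
    detected box, or None if the detection lies outside every region."""
    cx, cy = box_center(detected)
    for status, (x_min, y_min, x_max, y_max) in regions.items():
        if x_min <= cx <= x_max and y_min <= cy <= y_max:
            return status
    return None


# Illustrative regions standing in for areas 105 and 106 of FIG. 2.
regions = {
    "client": (0.05, 0.85, 0.45, 0.95),
    "employee": (0.55, 0.85, 0.95, 0.95),
}
```

A detection for which `signer_status` returns `None` would correspond to a signature found outside every permitted area.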

[0033] As for the text information, the data conversion module 11 forwards it to the data extraction module 13 and to the package document classification module 17. Based on the process identifier data received from the AS 2, module 17 determines a set of text classification models, which can be predefined in module 17 for each type of process by the user of system 10. Module 17 then converts the received text information into a set of vectors, which is processed by the previously determined set of text classification models to determine the type of the document. Module 17 transmits the document type data to module 13, which extracts a set of entities from the text information received from module 11 in accordance with the document type. The set of entities can include name, address, card number, document date, passport data, credit conditions, etc. To extract the set of entities, module 13 tokenizes the text information and feeds the tokenized text to the input of the word2vec model set, at the output of which module 13 obtains a sequence of vectors.
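To make the "text to vectors to document type" step concrete, here is a self-contained sketch using only the Python standard library. It computes smoothed TF-IDF vectors by hand and, as a deliberately simplified stand-in for the SVM mentioned in the description, assigns the label of the most similar class prototype by cosine similarity. The training texts and all function names are invented for illustration.

```python
import math
from collections import Counter
from typing import Dict, List


def tokenize(text: str) -> List[str]:
    return text.lower().split()


def fit_idf(corpus: List[List[str]]) -> Dict[str, float]:
    """Smoothed inverse document frequency for every term in the corpus."""
    n_docs = len(corpus)
    doc_freq = Counter(term for doc in corpus for term in set(doc))
    return {t: math.log((1 + n_docs) / (1 + df)) + 1 for t, df in doc_freq.items()}


def tfidf(tokens: List[str], idf: Dict[str, float]) -> Dict[str, float]:
    """L2-normalised TF-IDF vector; terms unseen during fitting are ignored."""
    counts = Counter(t for t in tokens if t in idf)
    vec = {t: c * idf[t] for t, c in counts.items()}
    norm = math.sqrt(sum(v * v for v in vec.values())) or 1.0
    return {t: v / norm for t, v in vec.items()}


def cosine(a: Dict[str, float], b: Dict[str, float]) -> float:
    """Dot product of two L2-normalised sparse vectors."""
    return sum(w * b.get(t, 0.0) for t, w in a.items())


# Toy training texts, one per document type (invented for illustration).
train = {
    "ICC": "individual credit conditions loan amount interest rate borrower",
    "DP": "surety agreement guarantor undertakes liability for borrower",
}
idf = fit_idf([tokenize(t) for t in train.values()])
prototypes = {label: tfidf(tokenize(t), idf) for label, t in train.items()}


def classify(text: str) -> str:
    """Return the label of the most similar prototype (stand-in for an SVM)."""
    vec = tfidf(tokenize(text), idf)
    return max(prototypes, key=lambda label: cosine(vec, prototypes[label]))
```

In a real deployment the set of prototypes (or trained models) would be the one selected by the process identifier, so that only the document types admissible for that process can be returned.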

[0034] Next, within module 13, a trained CRF (Conditional Random Fields) machine learning model is selected based on the document type data, and the sequence of vectors is processed by this trained model, which determines the set of entities. The trained CRF machine learning models for each type of document can be predefined in module 13 by the user of system 10. Machine learning models trained by the CRF method are widely used in various fields of AI, in particular in speech and image recognition and in the processing of textual information, as well as in other subject areas such as bioinformatics and computer graphics.

[0035] In an alternative embodiment of the claimed solution, entities can be extracted using Natural Language Processing (NLP) technology. This technology is widely known from the prior art (see, for example, the article "NLP. Basics. Techniques. Self-development. Part 2: NER", published on the Internet at: https://habr.com/ru/company/abbyy/blog/449514/) and is not disclosed in further detail in this application. The algorithm for processing the sequence of vectors can also be selected depending on the type of document.

[0036] The data extraction module 13 sends the obtained set of entities to the data reconciliation module 14. The business rules module 18 also supplies module 14 with a reference set of entities, which module 18 determines on the basis of the process identifier data previously received from the AS 2 of the Bank. The reference set of entities for each type of process can be predefined in module 18 by the user of system 10. Module 14 divides the obtained sets of entities into simple entities, consisting of 1-3 words, and complex entities, consisting of four or more words. For example, if an ICC document arrives at the input of system 10, the simple entities will be, for example, full name, credit amount, contract start date, passport number, passport issue date, etc., and the complex entities will be, for example, address, place of passport issue, etc.
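The split performed by module 14 can be sketched as a partition by word count. The sketch assumes that an entity of 1-3 words is simple and that an entity of four or more words is complex; the function name `split_entities` and the sample values are hypothetical.

```python
from typing import Dict, Tuple

SIMPLE_MAX_WORDS = 3  # simple entities consist of 1-3 words


def split_entities(entities: Dict[str, str]) -> Tuple[Dict[str, str], Dict[str, str]]:
    """Partition entities into (simple, complex) by the word count of the value."""
    simple: Dict[str, str] = {}
    complex_: Dict[str, str] = {}
    for name, value in entities.items():
        target = simple if len(value.split()) <= SIMPLE_MAX_WORDS else complex_
        target[name] = value
    return simple, complex_


simple, complex_ = split_entities({
    "full_name": "Ivanov Ivan Ivanovich",
    "credit_amount": "500000",
    "address": "Moscow, Vavilova street, building 19",
})
```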

[0037] Next, the data reconciliation module 14 proceeds to compare the entity set obtained from module 13 with the reference entity set. Module 14 brings the data of simple entities to a single format and then compares them. In complex entities, before comparison, generally accepted abbreviations are expanded and words that do not contain names are excluded. If the threshold values of matching words for simple and complex entities, set by the user of system 10, are reached, the set of entities received from module 13 passes reconciliation. If the matching word thresholds for simple and/or complex entities are not reached, the entity set fails reconciliation. As a result of comparing the sets of entities, the data reconciliation module 14 generates reconciliation results, which include information about the successful completion of the reconciliation or, if the set of entities has not passed, information about the entities in the set that have not passed reconciliation. The data reconciliation module 14 sends the information about the set of entities obtained from module 13, together with the text information and the reconciliation results, to the document properties analysis module 15.

[0038] All information collected during the operation of the preceding modules, in particular the text information and the reconciliation results from module 14 and the document images from source 1, is checked by module 15 to ensure that all necessary document items (or document fields) are contained in the text of the document. To this end, module 15 processes the received text information using NLP methods (fuzzy occurrence of keywords for each paragraph) and, from the results, determines the integrity of the document. The NLP processing algorithm can also be selected on the basis of the process identifier data previously received by module 18 from the AS 2 of the Bank.
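The reconciliation in [0037] can be sketched as word-level matching against per-kind thresholds. Everything specific here is an assumption: the abbreviation table, the list of words that carry no name, the default thresholds (1.0 for simple and 0.75 for complex entities) and the function names are hypothetical, and the share of matching words is measured relative to the reference value.

```python
import re
from typing import Dict, List

# Hypothetical abbreviation table and non-name word list; real ones would be
# configured per document type.
ABBREVIATIONS = {"st": "street", "bldg": "building", "apt": "apartment"}
NON_NAME_WORDS = {"street", "building", "apartment", "city"}


def normalize(value: str, complex_entity: bool) -> List[str]:
    """Lower-case, strip punctuation and, for complex entities, expand
    abbreviations and drop words that do not carry a name."""
    words = re.findall(r"\w+", value.lower())
    if complex_entity:
        words = [ABBREVIATIONS.get(w, w) for w in words]
        words = [w for w in words if w not in NON_NAME_WORDS]
    return words


def match_ratio(extracted: str, reference: str, complex_entity: bool) -> float:
    """Share of reference words that also occur in the extracted value."""
    ref = normalize(reference, complex_entity)
    got = set(normalize(extracted, complex_entity))
    return sum(w in got for w in ref) / len(ref) if ref else 1.0


def reconcile(extracted: Dict[str, str], reference: Dict[str, str],
              simple_threshold: float = 1.0,
              complex_threshold: float = 0.75) -> List[str]:
    """Return the names of entities that fail reconciliation (empty = success)."""
    failed = []
    for name, ref_value in reference.items():
        is_complex = len(ref_value.split()) > 3  # four or more words
        threshold = complex_threshold if is_complex else simple_threshold
        if match_ratio(extracted.get(name, ""), ref_value, is_complex) < threshold:
            failed.append(name)
    return failed
```

An empty list corresponds to a successful reconciliation; a non-empty list is the information about entities that did not pass.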

[0039] To prepare for processing the received text information using NLP methods, a set of typical documents was analyzed for the distribution of words across the paragraphs of the document, and characteristic words and/or phrases were found for each paragraph, taken from its different parts (beginning, middle, end). Thus, the characteristic words became known for each significant paragraph of the document, that is, each paragraph that must be present for the integrity check to pass. A rule was then created: if a certain proportion of these words or phrases occurs (by fuzzy search) in a paragraph of the document, the significant paragraph is considered found. If all the necessary paragraphs (items) of the document are found in the text, the integrity check is passed. In an alternative embodiment of the claimed solution, the integrity of the document can be checked using the means and methods disclosed in application US 2011134494 (A1).
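The rule in [0039] (a significant paragraph is found when a sufficient share of its characteristic words occurs, allowing for fuzzy matches) can be sketched with `difflib.SequenceMatcher` from the Python standard library. The similarity cutoff of 0.8, the required share of 0.6, the keyword lists and all function names are assumptions made for this example; for brevity the sketch searches the whole text rather than a single paragraph.

```python
from difflib import SequenceMatcher
from typing import Dict, List


def fuzzy_contains(text_words: List[str], keyword: str, cutoff: float = 0.8) -> bool:
    """True if some word of the text is similar enough to the keyword."""
    return any(SequenceMatcher(None, w, keyword).ratio() >= cutoff for w in text_words)


def paragraph_found(text: str, keywords: List[str], min_share: float = 0.6) -> bool:
    """A significant paragraph counts as found when at least `min_share`
    of its characteristic words occur (fuzzily) in the text."""
    words = text.lower().split()
    hits = sum(fuzzy_contains(words, k) for k in keywords)
    return hits / len(keywords) >= min_share


def check_integrity(text: str, required: Dict[str, List[str]]) -> List[str]:
    """Return the names of required paragraphs that were not found."""
    return [name for name, kws in required.items() if not paragraph_found(text, kws)]


# Characteristic words per paragraph are invented for illustration.
required = {
    "loan_amount": ["amount", "loan", "rubles"],
    "interest_rate": ["interest", "rate", "annum"],
}
ocr_text = "The amuont of the loan is 500000 rubles"  # note the OCR-style typo
missing = check_integrity(ocr_text, required)
```

The fuzzy comparison is what lets the misrecognized word "amuont" still count as a hit for "amount", which is the kind of tolerance to recognition errors that the paragraph describes.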

[0040] Based on the document integrity data and the reconciliation data, module 15 determines the scan quality of the document image. For example, if the reconciliation is successful and the document integrity data indicates that the document contains all items, then module 15 assigns a high scan quality score to the document image. If the reconciliation results indicate that the matching word thresholds for simple and/or complex entities are not reached, and the document integrity data indicates that the document does not contain all the items, then module 15 assigns a low scan quality score to the document image. Module 15 transmits the scan quality score to the decision module 16.
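The two cases spelled out in [0040] can be written as a small rule. The paragraph does not say what score is assigned in the mixed cases (reconciliation passed but items missing, or the reverse), so the sketch returns an explicit `UNDETERMINED` value for them. That value, like the names `ScanQuality` and `scan_quality`, is an assumption of this example.

```python
from enum import Enum


class ScanQuality(Enum):
    HIGH = "high"
    LOW = "low"
    UNDETERMINED = "undetermined"  # assumption: mixed cases are left open


def scan_quality(reconciliation_passed: bool, all_items_present: bool) -> ScanQuality:
    """Derive a scan quality score from reconciliation and integrity results."""
    if reconciliation_passed and all_items_present:
        return ScanQuality.HIGH
    if not reconciliation_passed and not all_items_present:
        return ScanQuality.LOW
    return ScanQuality.UNDETERMINED
```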

[0041] The document properties analysis module 15 is also configured to check whether a document belonging to another person has been attached. This check is performed on the basis of the document integrity data and the data on the unique entities of the set of entities, that is, entities that differ from client to client or can coincide among different clients only with a very low probability (for example, entities that identify the signer). Analyzing only unique entities makes it possible to exclude entities that may repeat across different clients, for example, the loan currency, which is most often rubles, and other entities depending on the type of document. For example, for an ICC or DP document, the name of the borrower is a unique entity. Unique entities can also be the TIN, SNILS, passport series and number, etc.

[0042] If the unique entities do not match (for example, for the ICC document, the name of the borrower) and the document integrity data indicates that all items in the document are present, then module 15 determines that the document whose image entered the system 10 belongs to another person. If module 15 has determined that the document is incomplete, while the unique entities of the set of entities, for example those identifying the signer, indicate that the document whose image entered the system 10 is a document of this person, then module 15 generates a list of entities that have not been verified. Accordingly, if the unique entities identifying the signer match the reference entity set and the document integrity data indicates that all items in the document are present, then module 15 determines that said document is a document of this person. The algorithms of the document properties analysis module 15 are parameterized by the process identifier.
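The three outcomes described in [0042] can be summarised as a decision over two booleans. In this sketch the names `Ownership` and `check_ownership` are hypothetical, and the fourth combination (unique entities do not match and the document is incomplete), which the paragraph does not cover, is reported as `UNDETERMINED` by assumption.

```python
from enum import Enum


class Ownership(Enum):
    SAME_PERSON = "document belongs to this person"
    OTHER_PERSON = "document belongs to another person"
    NEEDS_REVIEW = "same person, but some entities were not verified"
    UNDETERMINED = "case not covered by the described rules"


def check_ownership(unique_match: bool, all_items_present: bool) -> Ownership:
    """Combine the unique-entity comparison with the integrity result."""
    if unique_match and all_items_present:
        return Ownership.SAME_PERSON
    if not unique_match and all_items_present:
        return Ownership.OTHER_PERSON
    if unique_match and not all_items_present:
        return Ownership.NEEDS_REVIEW
    return Ownership.UNDETERMINED
```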

[0043] All information collected by the preceding modules, with the exception of document images, is sent to decision module 16. If the reconciliation results from module 14 are positive and the data received from module 19 indicates that all the necessary signer attributes are present in the document image in their respective areas (i.e. the rule for the location of all signatures is fulfilled; in this case it is determined by the number of signatures found and by their relative position, excluding places where a signature cannot be located), then module 16 writes information about the successful verification of the document to the results storage of the document processing web service 20. For example, if the package of documents contained only the ICC document and the DP was not required, then module 16 writes to said storage of web service 20 information about the successful verification of the document, as well as information about the decision, in particular that a loan can be issued. In addition, information about the set of entities and the reconciliation results is included in the document processing results generated and recorded in the storage by module 16. If the data received from module 19 indicates that a signer's attribute is not present in the corresponding area of the document image, then decision module 16 generates information that the document should be checked by a person, which also includes information about the verification results.

[0044] The corresponding areas (acceptable ranges of coordinates for the signer's attributes) can be determined by module 18 based on the document type, which is determined from the process identifier data received from the AS 2 of the Bank, and subsequently supplied to module 16.

[0045] If the reconciliation results are negative, decision module 16 extracts from the received data information about all entities in the set of entities that have not passed reconciliation and determines the types of these entities. If the entity type indicates that the entity is a simple entity, and the scan quality information from module 15 indicates that a high scan quality score has been assigned to the document image, then decision module 16 generates information that the document has not passed verification, which also includes information on the reconciliation results, and that the loan should be refused. If, however, the scan quality information indicates that a low scan quality score is assigned to the document image, then module 16 generates and records information that the document should be checked by a person, which also includes information about the reconciliation results.

[0046] If the entity that has not passed the data reconciliation is a complex entity, then the decision module 16, regardless of the quality index of scanning the document, generates and records in the storage of the results of processing documents of the web service 20 information that the document should be checked by a person, which also includes information on the results of the reconciliation. The generated document processing results in case of negative reconciliation results also include information about the presence or absence of the signer's attributes.
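The rules of decision module 16 described in [0043], [0045] and [0046] can be collected into one function. This is a sketch, not the claimed implementation: the names `Decision` and `decide` are hypothetical, and the handling of a document in which both simple and complex entities failed reconciliation (routed to manual review here) is an assumption, since the description treats the two kinds separately.

```python
from enum import Enum
from typing import Iterable


class Decision(Enum):
    APPROVE = "verification passed"
    REJECT = "verification failed"
    MANUAL_REVIEW = "document should be checked by a person"


def decide(failed_kinds: Iterable[str], signatures_ok: bool,
           high_scan_quality: bool) -> Decision:
    """`failed_kinds` holds "simple" or "complex" for each entity that failed
    reconciliation; an empty iterable means reconciliation was positive."""
    failed = list(failed_kinds)
    if not failed:
        # Positive reconciliation: the outcome depends on the signer attributes.
        return Decision.APPROVE if signatures_ok else Decision.MANUAL_REVIEW
    if "complex" in failed:
        # Complex entities go to a person regardless of scan quality.
        # Assumption: this also holds when simple entities failed as well.
        return Decision.MANUAL_REVIEW
    # Only simple entities failed: scan quality separates a real mismatch
    # from a probable recognition error.
    return Decision.REJECT if high_scan_quality else Decision.MANUAL_REVIEW
```

For an ICC document, `APPROVE` would correspond to the decision that a loan can be issued and `REJECT` to a refusal.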

[0047] The document processing results generated by the decision module 16 can be obtained via the interface of web service 20 or its API. Web service 20 generates a JSON response with the results of document processing. These results can be output to a data display device, for example, the display of a computing device such as a laptop or desktop computer, a communication terminal, a mobile phone or smartphone, a tablet, etc. For example, if the document was an ICC document, the decision to issue a loan, to refuse it, or to check the document manually can additionally be output to the data display device.

[0048] Thus, because the results of document processing are generated on the basis of comparing the set of entities extracted from the text information, taking into account the type of the document, with the reference set of entities for this document, high accuracy in identifying errors during automated intelligent document processing is provided, as well as its efficiency; that is, the specified technical result is achieved. In addition, the use of the machine learning algorithms and NLP methods disclosed in this application, together with data typing, further increases the efficiency and accuracy of error detection during automated intelligent document processing.
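The application does not specify the schema of the JSON response mentioned in [0047]. Purely as an illustration, a response could be assembled as follows; every field name and value in this sketch is hypothetical.

```python
import json
from typing import Dict, List


def build_response(decision: str, doc_type: str, entities: Dict[str, str],
                   failed_entities: List[str], signatures_found: int) -> str:
    """Serialise processing results; every field name here is hypothetical."""
    payload = {
        "document_type": doc_type,
        "decision": decision,
        "entities": entities,
        "failed_entities": failed_entities,
        "signatures_found": signatures_found,
    }
    # ensure_ascii=False keeps Cyrillic values readable in the response body.
    return json.dumps(payload, ensure_ascii=False, indent=2)


response = build_response(
    decision="manual_review",
    doc_type="ICC",
    entities={"full_name": "Иванов Иван"},
    failed_entities=["address"],
    signatures_found=2,
)
```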

[0049] In addition, the presented technical solution has enhanced functionality in comparison with the known solutions, in particular: it provides the ability to automatically make a decision on the issuance of a loan, identify the reason for the refusal, or justification of the transfer of the document for verification to a person; provides a mechanism for checking the legal validity and completeness of documents.

[0050] In general terms (see FIG. 3), the intelligent document processing system (200) comprises one or more processors (201) connected by a common data bus, memory means such as RAM (202) and ROM (203), input/output interfaces (204), input/output means (205), and a networking means (206).

[0051] The processor (201) (or multiple processors, a multi-core processor, etc.) can be selected from a range of devices currently in wide use, for example, from manufacturers such as Intel™, AMD™, Apple™, Samsung Exynos™, MediaTek™, Qualcomm Snapdragon™, etc. The processor, or one of the processors, used in the system (200) should also be understood to include a graphics processor, for example, an NVIDIA GPU with a CUDA-compatible programming model, or Graphcore, which is likewise suitable for full or partial execution of the method and can also be used for training and applying machine learning models in various information systems.

[0052] RAM (202) is a random access memory and is intended for storing machine-readable instructions executed by the processor (201) for performing the necessary operations for logical data processing. RAM (202), as a rule, contains executable instructions of the operating system and corresponding software components (applications, software modules, etc.). In this case, the available memory of a graphics card or a graphics processor can act as RAM (202).

[0053] ROM (203) is one or more persistent storage devices such as a hard disk drive (HDD), solid state drive (SSD), flash memory (EEPROM, NAND, etc.), optical storage media (CD-R/RW, DVD-R/RW, Blu-ray Disc, MD), etc.

[0054] Various types of I/O interfaces (204) are used to organize the operation of the components of the system (200) and of externally connected devices. The choice of interfaces depends on the specific version of the computing device and may include, but is not limited to: PCI, AGP, PS/2, IrDA, FireWire, LPT, COM, SATA, IDE, Lightning, USB (2.0, 3.0, 3.1, micro, mini, Type-C), TRS/audio jack (2.5, 3.5, 6.35), HDMI, DVI, VGA, DisplayPort, RJ45, RS232, etc.

[0055] To provide user interaction with the computing system (200), various input/output means (205) are used, for example, a keyboard, display (monitor), touch display, touchpad, joystick, mouse, light pen, stylus, touch panel, trackball, speakers, microphone, augmented reality devices, optical sensors, tablet, indicator lights, projector, camera, biometric identification means (retina scanner, fingerprint scanner, voice recognition module), etc.

[0056] The networking device (206) provides data transmission over an internal or external computer network, for example, an intranet, the Internet, a LAN, and the like. One or more devices (206) may be used, including, but not limited to: an Ethernet card, GSM modem, GPRS modem, LTE modem, 5G modem, satellite communication module, NFC module, Bluetooth and/or BLE module, Wi-Fi module, and others.

[0057] Additionally, satellite navigation means can be used as part of the system (200), for example, GPS, GLONASS, BeiDou, Galileo.

[0058] The specific choice of elements of the system (200) for implementing various software and hardware architectures can vary, provided that the required functionality is maintained.

[0059] Modifications and improvements to the above-described embodiments of the present technical solution will be apparent to those skilled in the art. The foregoing description is provided by way of example only and is not intended to be limiting in any way. Thus, the scope of the present technical solution is limited only by the scope of the attached claims.

Claims

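1. A method for intelligent processing of documents, performed by at least one computing device, comprising the steps of:

- obtaining at least one image of a document;

- recognizing symbols in the image of the document and converting them into text information;

- determining the type of the document on the basis of the text information;

- extracting a set of entities from the text information, taking into account the type of the document;

- comparing the set of entities with a reference set of entities for this document;

- generating the results of document processing based on the results of comparing the above sets of entities.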

2. The method according to claim 1, characterized in that the document is an agreement on individual credit conditions (ILC) or a surety agreement (DP).

3. The method according to claim 1, characterized in that it further comprises the steps of:

- detecting a signer attribute in the obtained image of the document;

- determining the location of at least one signer attribute on the page of the document;

wherein the results of document processing are generated taking into account the information about the location of the at least one signer attribute on the page of the document.

4. The method according to claim 3, further comprising the step of determining the status of the person to whom the detected signer attribute belongs.

5. The method according to claim 1, characterized in that the step of determining the document type based on the text information comprises the steps of:

- obtaining a process identifier;

- defining a set of text classification models based on the process identifier;

- transforming the obtained text information into a set of vectors;

- processing the set of vectors using the previously defined set of text classification models to determine the type of the document.

5. The method according to claim 1, characterized in that the step of comparing the set of entities with the reference set of entities comprises the steps in which:

- the set of entities is divided into simple entities, consisting of 1-3 words, and complex entities, consisting of four or more words; wherein, if comparing the above sets of entities shows that the threshold values of matching words for simple and complex entities are reached, reconciliation results are generated that include information on the successful completion of the data reconciliation; if the aforementioned threshold values of matching words for simple and complex entities are not reached, reconciliation results are generated that include information about the entities in the set of entities that have not passed the reconciliation; and the results of document processing are generated taking into account the reconciliation results.
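The reconciliation recited in this claim can be sketched as follows. This is a minimal illustration, not the claimed implementation: it assumes that simple entities have 1-3 words and complex entities have four or more, that an entity is classified by the word count of its reference text, and that matching is measured as the share of reference words found in the extracted entity. The function names and the threshold values `1.0` and `0.8` are hypothetical.

```python
def word_match_ratio(entity, reference):
    """Share of reference words that also occur in the extracted entity."""
    ref_words = reference.lower().split()
    if not ref_words:
        return 1.0
    found = set(entity.lower().split())
    return sum(word in found for word in ref_words) / len(ref_words)


def reconcile(extracted, reference, simple_threshold=1.0, complex_threshold=0.8):
    """Check every reference entity against its threshold of matching words."""
    failed = []
    for name, ref_text in reference.items():
        # Assumption: 1-3 words means a simple entity, otherwise a complex one.
        is_simple = len(ref_text.split()) <= 3
        threshold = simple_threshold if is_simple else complex_threshold
        if word_match_ratio(extracted.get(name, ""), ref_text) < threshold:
            failed.append(name)
    # Success only if no entity fell below its threshold.
    return {"success": not failed, "failed_entities": failed}
```

Using a stricter threshold for simple entities reflects the intuition that a short value such as a name should match word for word, while a long phrase can tolerate an occasional recognition error.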

6. The method according to claim 1, characterized in that it further comprises a step of determining the scan quality of the document, wherein the results of document processing are generated taking into account the scan quality of the document.

7. An intelligent document processing system comprising at least one computing device and at least one memory device containing machine-readable instructions that, when executed by the at least one computing device, perform the method according to any one of claims 1-6.