WO2023101636A2

WO2023101636A2 - A document reading system

Info

Publication number: WO2023101636A2
Application number: PCT/TR2022/050896
Authority: WO
Inventors: Sabri DEMIR; Mehmet Berk CAVDAR; Nurullah KUS
Original assignee: Dogus Bilgi Islem Ve Teknoloji Hiz. A.S.
Priority date: 2021-11-30
Filing date: 2022-08-23
Publication date: 2023-06-08
Also published as: WO2023101636A3

Abstract

The present invention relates to a system (1) which enables to automatically recognize document types of documents obtained and scanned from any external source, by means of smart image recognition techniques; to obtain and then save the requested fields within scanned documents.

Description

A DOCUMENT READING SYSTEM

Technical Field

The present invention relates to a system which enables to automatically recognize document types of documents obtained and scanned from any external source, by means of smart image recognition techniques; to obtain and then save the requested fields within scanned documents.

Background of the Invention

In order to carry out transaction on printed papers/ documents in digital environment or to transfer data from the said documents, it is required to transfer documents to digital media. Today, printed documents can be transferred to digital media upon being written, scanned or rendered in digital media manually. Transfer of documents to digital media is commonly used in all sectors and it is quite important in terms of saving of time and resource. However, incorrect data are transferred to system based on user error during transfer of documents to digital media manually.

In the state of the art, various systems are used in order to avoid user errors in transfer of printed documents to digital media. Some of the systems developed are technologies such as OCR (Optical Character Recognition) which enables to convert types of documents such as scanned paper documents, PDF files, or pictures taken by a digital camera into editable and searchable data, or image recognition which is an artificial intelligence technology used in order to automatically identify and categorize objects, places, people, texts and actions in images. Although use of OCR or image recognition technologies enables to convert scanned documents into processable digital files; it is not possible to receive documents, select the requested fields from the document or transfer them to different sources. Therefore, considering the studies and the deficiencies in the state of art, there is a need for a system which enables to automatically recognize document types of documents obtained and scanned from any external source, by means of smart image recognition techniques; to obtain and then save the requested fields within scanned documents; and transmit them to different sources.

The United States patent document no. US10796080, an application in the state of the art, discloses a system for executing artificial intelligence-based document processing transactions. In the system of the said invention; discrepancies needed in the document are determined from contents of a request that is received from the process initiated by the user to ensure that at least one document is automatically created with respect to a process, and the documents that are loaded by users; documents required for the process are automatically selected; and discrepancies needed within the document are created upon being automatically determined from within the documents. In the said invention, requests and documents are received by means of an objective setting engine and then they are examined by an optical character recognition unit, a compiler, a discrepancy identification unit and a natural language processing unit; characters included in documents are determined by the optical character recognition unit and discrepancies are determined by the discrepancy identification unit; discrepancies are added into documents upon being compiled by the compiler.

Summary of the Invention

An objective of the present invention is to realize a system which enables to automatically recognize document types of documents obtained from any external source periodically, by means of smart image recognition techniques; to obtain and then save the requested fields within scanned documents.

Detailed Description of the Invention “A Document Reading System” realized to fulfil the objective of the present invention is shown in the figure attached, in which:

Figure l is a schematic view of the inventive system.

The components illustrated in the figure are individually numbered, where the numbers refer to the following:

1. System

2. Database

3. Server

A. Document server

B. Instant messaging application

C. Chat application

The inventive system (1) which enables to obtain documents from an external source periodically; to automatically recognize document types of documents obtained, by means of smart image recognition techniques; to receive and then save the requested fields within scanned documents comprises: at least one database (2) which is configured to keep the obtained documents under record in it together with the document definitions and such that the parts received from documents are associated with the documents; at least one server (3) which can establish communication with the database (2) such that it will realize data exchange; manage the data within the database (2); is configured to establish communication with eternal servers (A,B,C) and to realize data exchange by using any remote communication protocol; to receive documents by being triggered at certain periods and/or upon certain events; to identify the received documents; to obtain texts by detecting the text fields on the identified documents; to extract the related fields by detecting the requested fields in the obtained text fields and to save them to the database (2).

The database (2) included in the inventive system (1) is in communication with the server (3) and is configured to be managed by the server (3). In a preferred embodiment of the invention, the database (2) is configured to keep the keyword and/or document definitions, that are related to the text fields to be extracted from documents, under record in it. The said database (2) is configured to keep the documents, that are obtained by the server (3), under record in it together with the detected document definitions. The database (2) is also configured to keep the text fields, that are detected on identified documents, under record in it.

The server (3) included in the inventive system (1) is configured to establish communication with remote servers such as document server (A), instant messaging application (B) and chat application (C) by using any remote communication protocol included in the state of the art and to realize data exchange over this communication established. The said server (3) is configured to manage the database (2) by means of transactions such as performing new data recording into the database (2), deleting the recorded data within the database (2) or changing the recorded data within the database (2) and updating the recorded data within the database (2) and to access the recorded data within the database (2). In a preferred embodiment of the invention, the server (3) is configured to run at least one artificial intelligence that can perform data processing or document reading, on it. The said server (3) is configured to receive documents to be identified and processed from external servers such as document server (A), instant messaging application (B) and chat application (C) by being triggered at certain periods. In one embodiment of the invention, the server (3) is configured to receive and then process document images from application servers such as document server (A), instant messaging application (B) and chat application (C), in base64 format that is a coding scheme enabling to transmit and store binary data only in media that use ASCII characters. In a different embodiment of the invention, the server (3) is configured to receive and then process the documents to be processed from external servers such as document server (A), instant messaging application (B) and chat application (C), by logging in to systems of external servers (A, B, C) by means of a robotic process automation that automatizes the processes being carried out in base64 format or manually over a web service by means of an API (Application Programming Interface). The server (3) is configured to process the documents received from external servers such as document server (A), instant messaging application (B) and chat application (C), by means of an artificial intelligence algorithm being run on it and to identify the said documents. In a preferred embodiment of the invention, the server (3) is configured to process and identify the documents received from external servers (A,B,C), by means of an artificial intelligence algorithm such as Inception V3 that is a convolutional neural network enabling image analysis and obj ect detection. The said server (3) is configured to detect text fields on a document by performing optical character recognition on identified documents and to ensure that texts are extracted in text fields. The said server (3) is configured to detect fields and data comprising pre-determined subject, word, expression and/or data by means of natural language processing and regular expression methods in accordance with keywords or definitions, that are kept under record on texts extracted from identified documents, and to parse the said text fields and then save them to the database (2). The server (3) is configured to transmit the text fields, that are extracted from document and documents saved to the database (2), to the related units at request and/or by using any communication means periodically.

Industrial Application of the Invention

In the inventive system (1), it is ensured that documents received automatically at certain periods are processed by means of artificial intelligence techniques and thereby the said documents are identified and saved by obtaining the requested fields in the documents. Within these basic concepts; it is possible to develop various embodiments of the inventive “Document Reading System (1)”; the invention cannot be limited to examples disclosed herein and it is essentially according to claims.

Claims

CLAIMS A system (1) which enables to obtain documents from an external source periodically; to automatically recognize document types of documents obtained, by means of smart image recognition techniques; to receive and then save the requested fields within scanned documents; comprising at least one database (2) which is configured to keep the obtained documents under record in it together with the document definitions and such that the parts received from documents are associated with the documents; and characterized by at least one server (3) which can establish communication with the database (2) such that it will realize data exchange; manage the data within the database (2); is configured to establish communication with eternal servers (A,B,C) and to realize data exchange by using any remote communication protocol; to receive documents by being triggered at certain periods and/or upon certain events; to identify the received documents; to obtain texts by detecting the text fields on the identified documents; to extract the related fields by detecting the requested fields in the obtained text fields and to save them to the database (2). A system (1) according to Claim 1 ; characterized by the database (2) which is in communication with the server (3) and is configured to be managed by the server (3). A system (1) according to Claim 1 or 2; characterized by the database (2) which is configured to keep the keyword and/or document definitions, that are related to the text fields to be extracted from documents, under record in it.

7

4. A system (1) according to any of the preceding claims; characterized by the database (2) which is configured to keep the documents, that are obtained by the server (3), under record in it together with the detected document definitions.

5. A system (1) according to any of the preceding claims; characterized by the database (2) which is configured to keep the text fields, that are detected on identified documents, under record in it.

6. A system (1) according to any of the preceding claims; characterized by the server (3) which is configured to establish communication with remote servers such as document server (A), instant messaging application (B) and chat application (C) by using any remote communication protocol and to realize data exchange over this communication established.

7. A system (1) according to any of the preceding claims; characterized by the server (3) which is configured to manage the database (2) by means of transactions such as performing new data recording into the database (2), deleting the recorded data within the database (2) or changing the recorded data within the database (2) and updating the recorded data within the database (2) and to access the recorded data within the database (2).

8. A system (1) according to any of the preceding claims; characterized by the server (3) which is configured to run at least one artificial intelligence that can perform data processing or document reading, on it.

9. A system (1) according to any of the preceding claims; characterized by the server (3) which is configured to receive documents to be identified and processed from external servers such as document server (A), instant messaging application (B) and chat application (C) by being triggered at certain periods.

8 A system (1) according to Claim 9; characterized by the server (3) which is configured to receive and then process document images from application servers such as document server (A), instant messaging application (B) and chat application (C), in base64 format that is a coding scheme enabling to transmit and store binary data only in media that use ASCII characters. A system (1) according to Claim 9; characterized by the server (3) which is configured to receive and then process the documents to be processed from external servers such as document server (A), instant messaging application (B) and chat application (C), by logging in to systems of external servers (A, B, C) by means of a robotic process automation that automatizes the processes being carried out in base64 format or manually over a web service by means of an API. A system (1) according to any of the preceding claims; characterized by the server (3) which is configured to process the documents received from external servers such as document server (A), instant messaging application (B) and chat application (C), by means of an artificial intelligence algorithm being run on it and to identify the said documents. A system (1) according to any of the preceding claims; characterized by the server (3) which is configured to process and identify the documents received from external servers (A,B,C), by means of an artificial intelligence algorithm such as Inception V3 that is a convolutional neural network enabling image analysis and object detection. A system (1) according to any of the preceding claims; characterized by the server (3) which is configured to detect text fields on the document by performing optical character recognition on identified documents and to ensure that texts are extracted in text fields.

9

15. A system (1) according to any of the preceding claims; characterized by the server (3) which is configured to detect fields and data comprising predetermined subject, word, expression and/or data by means of natural language processing and regular expression methods in accordance with keywords or definitions, that are kept under record on texts extracted from identified documents, and to parse the said text fields and then save them to the database (2). 16. A system (1) according to any of the preceding claims; characterized by the server (3) which is configured to transmit the text fields, that are extracted from document and documents saved to the database (2), to the related units at request and/or by using any communication means periodically.

10