CN111368828A

CN111368828A - Multi-bill identification method and device

Info

Publication number: CN111368828A
Application number: CN202010124850.8A
Authority: CN
Inventors: 李兴蒙; 刘平君; 张玲; 陈道龙; 叶京翔; 李晏光; 乔川
Original assignee: Ele Cloud Information Technology Co ltd
Current assignee: Ele Cloud Information Technology Co ltd
Priority date: 2020-02-27
Filing date: 2020-02-27
Publication date: 2020-07-03

Abstract

The invention discloses a method for identifying multiple bills, which comprises the following steps: preprocessing a plurality of bill images to be identified; simultaneously distinguishing each bill in a plurality of bill images to be recognized by using a YOLOv3 model for the preprocessed bill images, and positioning characters in each bill image; characters in each positioned bill image are identified by using a CRNN + CTC model, and information of a plurality of bills to be identified is acquired, so that the problems that in the prior art, an identification system based on optical characters is relatively low in identification rate and high in later maintenance cost, and only a single bill can be identified at one time are solved.

Description

Multi-bill identification method and device

Technical Field

The application relates to the field of artificial intelligence, in particular to a multi-bill identification method and a multi-bill identification device.

Background

Existing document recognition techniques, including ordinary OCR optical character positioning, also have in part used mainstream deep learning techniques. However, only single bill information can be identified, the common multi-bill reimbursement scene cannot be met, and the problem of bill identification cannot be fundamentally solved. In addition, the recognition system based on the optical characters has relatively low recognition rate and higher later maintenance cost, and can not meet the existing reimbursement market.

Disclosure of Invention

The application provides a method and a device for identifying multiple bills, which solve the problems that in the prior art, an identification system based on optical characters is relatively low in identification rate and high in later maintenance cost, and only a single bill can be identified at a time.

The application provides a multi-bill identification method, which comprises the following steps:

preprocessing a plurality of bill images to be identified;

simultaneously distinguishing each bill in a plurality of bill images to be recognized by using a YOLOv3 model for the preprocessed bill images, and positioning characters in each bill image;

and identifying the characters in each positioned bill image by using a CRNN + CTC model to obtain a plurality of pieces of bill information to be identified.

Preferably, the preprocessing is performed on the image containing a plurality of bills to be identified, and comprises the following steps:

carrying out normalization processing on a bill image containing a plurality of bills to be identified;

a standardized ticket image is obtained.

Preferably, the standardized ticket image comprises:

bill images with equal size and same gray level.

Preferably, the method for distinguishing each bill in the images containing the multiple bills to be recognized simultaneously by using the YOLOv3 model comprises the following steps:

distinguishing the outline of each bill in the image containing the bills to be recognized by using a YOLOv3 model, and further acquiring the image of each bill; and

and acquiring the number of the contained bills to be identified.

Preferably, after the step of positioning the text in each bill image, the method further comprises:

cutting the character part in each bill image to obtain an image containing the character part in each bill image;

the image including the text portion is passed into a CRNN + CTC model.

Preferably, the method for recognizing the characters in each located bill image by using the CRNN + CTC model to obtain a plurality of pieces of bill information to be recognized includes:

the CRNN + CTC model receives the image containing the text part sent by the YOLOv3 model;

and the CRNN + CTC model acquires the information of each invoice in a plurality of bill images to be identified by identifying the image containing the character part.

Preferably, the information of the invoice comprises:

the type of invoice, invoice code, invoice number, invoice date, buyer and seller information, invoice amount, and invoice detail.

This application provides the recognition device of many bills simultaneously, includes:

the device comprises a preprocessing unit, a recognition unit and a recognition unit, wherein the preprocessing unit is used for preprocessing a plurality of bill images to be recognized;

the bill distinguishing and positioning unit is used for distinguishing each bill in the bill images to be recognized simultaneously by using a YOLOv3 model and positioning characters in each bill image after the preprocessing;

and the bill identification unit is used for identifying the characters in each positioned bill image by using a CRNN + CTC model and acquiring a plurality of pieces of bill information to be identified.

Preferably, the bill distinguishing and positioning unit comprises:

the single bill image acquiring subunit is used for distinguishing the outline of each bill in the multiple bill images to be identified by using a YOLOv3 model so as to acquire the image of each bill; and

and the bill quantity acquiring subunit is used for acquiring the quantity of the contained bills to be identified.

Preferably, the bill identifying unit includes:

the image receiving subunit is used for receiving the image containing the text part sent by the YOLOv3 model through the CRNN + CTC model;

and the bill information identification subunit is used for acquiring the information of each invoice in the plurality of bill images to be identified by identifying the image containing the character part through the CRNN + CTC model.

The application provides a method and a device for identifying multiple bills, which are characterized in that a YOLOv3 model is used for simultaneously distinguishing single bills and positioning characters, then a CRNN + CTC model is used for identifying information of multiple materials, and the information of multiple bills is acquired simultaneously, so that the problems that in the prior art, an identification system based on optical characters is relatively low in identification rate, high in later maintenance cost and only capable of identifying a single bill at a time are solved.

Drawings

FIG. 1 is a schematic flow chart of a multi-bill identification method provided by the present application;

fig. 2 is a schematic diagram of a multi-bill identification device provided by the application.

Detailed Description

In the following description, numerous specific details are set forth in order to provide a thorough understanding of the present application. This application is capable of implementation in many different ways than those herein set forth and of similar import by those skilled in the art without departing from the spirit of this application and is therefore not limited to the specific implementations disclosed below.

Fig. 1 is a schematic flow chart of a method for identifying multiple tickets, which is described in detail below with reference to fig. 1.

Step S101, preprocessing the bill images containing a plurality of bills to be identified.

The method provided by the application can simultaneously identify a plurality of bills, firstly, the plurality of bills are tiled, images of the plurality of bills are shot through image acquisition equipment such as a camera, each shot image comprises a plurality of bills to be identified, and the plurality of bills can contain various types of bills such as common invoices, special invoices for value-added tax, special invoices and the like.

When the image of the bill is shot, due to the reasons of angle, explosion and the like, each image interferes with the used recognition model during recognition to influence the execution efficiency, so that the bill image needs to be preprocessed before bill recognition, and the preprocessing method is to perform normalization processing on the bill image containing a plurality of to-be-recognized bills to obtain a standardized bill image. The standardized bill images can be bill images with the same size and the same gray level, wherein the preferred value of the gray level value is 0-1. The standardized bill image can reduce a large amount of calculation for processing the original image in the identification process, and the execution efficiency of the corresponding identification model is higher.

Step S102, using a YOLOv3 model to simultaneously distinguish each bill in the bill images to be recognized and locate characters in each bill image after the preprocessing.

The principle of the YOLOv3 model is to detect a target image by using feature maps of 3 different scales to obtain a feature image of a corresponding size. In the present application, the image of the bill to be recognized includes information of a plurality of bills to be recognized, and the YOLOv3 model can distinguish the outline of each bill in the image of the bill to be recognized according to the preset characteristic diagram of each bill, so as to obtain the image of each bill, and the advising also obtains the number of the bills to be recognized.

Then, the text in each bill image is located, the text part in each bill image is cut, and an image including the text part in each bill image is obtained, for example, the text part of the bill type is cut, so that an image including the text part of the bill type, such as a "common invoice for value-added tax of a certain market" or a "special invoice for value-added tax of a certain market" can be obtained. The image containing the text portion is then passed into the CRNN + CTC model.

And step S103, identifying the characters in each positioned bill image by using a CRNN + CTC model, and acquiring a plurality of pieces of bill information to be identified.

The CRNN + CTC model is used to identify text in an image, and typically includes: character detection and character recognition.

The CRNN + CTC model receives the image containing the text part sent by the YOLOv3 model, and acquires information of each invoice in a plurality of to-be-identified bill images by identifying the image containing the text part. Specifically, the CRNN + CTC model further detects the image, and then locates the range of the text. For example, the bill code is positioned, the digit of the bill code is obtained, then the positioned text area is identified, the text area is lent as character information, and therefore information of each invoice in a plurality of bill images to be identified is obtained. Information of invoices, including: the type of invoice, invoice code, invoice number, invoicing date, buyer and seller information, invoicing amount, invoice detail and the like.

Based on the same inventive concept, the present application also provides a bill recognition apparatus 200, as shown in fig. 2, including:

the preprocessing unit 210 is used for preprocessing the bill images to be identified;

the bill distinguishing and positioning unit 220 is used for distinguishing each bill in the bill images to be recognized simultaneously by using a YOLOv3 model and positioning characters in each bill image after the preprocessing;

and the bill identification unit 230 is configured to identify the characters in each located bill image by using a CRNN + CTC model, and acquire information of a plurality of bills to be identified.

Preferably, the bill distinguishing and positioning unit comprises:

Preferably, the bill identifying unit includes:

In summary, the present application provides a method and an apparatus for identifying multiple bills, which perform single-sheet distinguishing and text positioning on multiple bills simultaneously through the YOLOv3 model, then identify information of multiple materials through the CRNN + CTC model, and obtain information of multiple bills simultaneously, thereby solving the problems that in the prior art, an identification system based on optical characters is relatively low in identification rate, high in later maintenance cost, and only a single bill can be identified at a time.

Finally, it should be noted that: although the present invention has been described in detail with reference to the above embodiments, it should be understood by those skilled in the art that various changes may be made and equivalents may be substituted for elements thereof without departing from the spirit and scope of the invention.

Claims

1. A method for identifying multiple bills, comprising:

preprocessing a plurality of bill images to be identified;

2. The method of claim 1, wherein preprocessing the image containing the plurality of documents to be recognized comprises:

a standardized ticket image is obtained.

3. The method of claim 2, wherein the normalized document image comprises:

bill images with equal size and same gray level.

4. The method of claim 1, wherein distinguishing each document containing a plurality of document images to be recognized simultaneously using the YOLOv3 model comprises:

and acquiring the number of the contained bills to be identified.

5. The method of claim 1, further comprising, after the step of locating text in each document image:

the image including the text portion is passed into a CRNN + CTC model.

6. The method of claim 1, wherein identifying the located text in each document image using a CRNN + CTC model to obtain a plurality of document information to be identified comprises:

7. The method of claim 6, wherein the information of the invoice comprises:

8. A multiple note identification device comprising:

9. The apparatus of claim 8, wherein the ticket differentiating and locating unit comprises:

10. The apparatus of claim 8, wherein the bill identifying unit comprises: