CN117351492A

CN117351492A - Value-added tax invoice information identification method

Info

Publication number: CN117351492A
Application number: CN202311257686.8A
Authority: CN
Inventors: 王彦瑞; 杜旭江
Original assignee: Beijing Langsi Intelligent Technology Co ltd
Current assignee: Beijing Langsi Intelligent Technology Co ltd
Priority date: 2023-09-27
Filing date: 2023-09-27
Publication date: 2024-01-05

Abstract

The invention discloses a value-added tax invoice information identification method, which relates to the technical field of image processing and comprises the following steps: acquiring an image of a value added tax invoice, and preprocessing the image of the value added tax invoice to obtain a target image; dividing the target image into areas to form a plurality of target areas containing target information; identifying the characters in the target area and extracting the target information; and storing and outputting the extracted target information. The invention realizes the rapid and accurate identification and information extraction of the value-added tax invoice by using the computer vision and machine learning technology, reduces the workload and the error rate of manually processing the value-added tax invoice, and improves the processing efficiency and the accuracy.

Description

Value-added tax invoice information identification method

Technical Field

The invention relates to the technical field of image processing, in particular to a value-added tax invoice information identification method.

Background

Processing a large number of value-added tax invoices is a tedious and time-consuming task, and is currently generally performed manually. The data is manually entered and processed at the time of processing. Not only is the efficiency low, but also mistakes are easy to occur, and the accuracy of data is affected.

Disclosure of Invention

The invention aims to solve the technical problem of overcoming the defects of the prior art and providing a value-added tax invoice information identification method.

In order to solve the technical problems, the technical scheme of the invention is as follows:

a value-added tax invoice information identification method is characterized in that: comprising the following steps:

acquiring an image of a value added tax invoice, and preprocessing the image of the value added tax invoice to obtain a target image;

dividing the target image into areas to form a plurality of target areas containing target information;

identifying the characters in the target area and extracting the target information;

and storing and outputting the extracted target information.

As a preferable scheme of the value-added tax invoice information identification method, the invention comprises the following steps: the acquiring the image of the value added tax invoice comprises the following steps:

and obtaining the image of the value added tax invoice by scanning and shooting the value added tax invoice or converting the value added tax invoice file in other formats into an image.

As a preferable scheme of the value-added tax invoice information identification method, the invention comprises the following steps: the preprocessing of the image of the value added tax invoice to obtain a target image comprises the following steps:

denoising, improving brightness, improving contrast and improving saturation of the image of the increment tax invoice, so that the image is enhanced;

judging whether the image of the value added tax invoice is skewed, converting the image into a single-channel gray level image when the image exists, detecting all contours in the image, calculating the area of each contour, determining the contour with the largest area as the position of the form of the value added tax invoice, calculating the perimeter of the contour, and determining the corresponding parameter coefficient to enable the contour to be infinitely close to the form grid line of the value added tax invoice.

As a preferable scheme of the value-added tax invoice information identification method, the invention comprises the following steps: the dividing the target image into regions to form a plurality of target regions containing target information comprises:

identifying a value-added tax invoice image in a standard format, determining the distance between the unit cells containing target information in the form of the value-added tax invoice and the ratio between the total length and the total width of the form, and forming a form rule;

identifying the target image and determining the outline position of a table in the target image;

and carrying out proportional mapping on the table outline in the target image through the table rule, and calculating the coordinate positions of each cell containing target information in the table outline in the target image to form a plurality of target areas containing the target information.

As a preferable scheme of the value-added tax invoice information identification method, the invention comprises the following steps: the identifying of the character in the target area includes:

performing open source character recognition on the target area, performing text detection on the target area through a text detection model, and determining a character area in the target area;

and carrying out character recognition on the character area through a text character recognition model to obtain character information in the target area.

As a preferable scheme of the value-added tax invoice information identification method, the invention comprises the following steps: the character recognition of the character area through the text character recognition model comprises the following steps:

judging whether a red seal shields characters in the character area, if so, performing the next step, and if not, performing character recognition on the character area through a text character recognition model;

performing color channel separation on the target image, separating the three-channel target image into a blue channel, a green channel and a red channel, removing gray elements in the separated red channel, desalting to obtain a single-channel target image, and converting the single-channel target image into a three-channel gray image;

and carrying out character recognition on the character area through a text character recognition model.

converting the pre-read RGB color space of the target image into an HSV color space, determining and selecting a red HSV color space color gamut in the target image as a red threshold value in threshold value screening, and setting a pixel value in the red threshold value range in the target image to 255;

As a preferable scheme of the value-added tax invoice information identification method, the invention comprises the following steps: the extracting the target information comprises the following steps:

performing polygon character frame coordinate ordering processing on the characters identified in each character area, and determining character frame sets in the same row;

ordering all character frames positioned in the same row according to the abscissa, and merging the character frames positioned in the same row to obtain target information in each character area;

and extracting the target information through the regular expression.

As a preferable scheme of the value-added tax invoice information identification method, the invention comprises the following steps: the storing and outputting the extracted target information comprises the following steps:

storing the extracted target information in a mode of key value pairs according to character strings, lists and JSON formats, and storing the extracted target information in a database or other storage media;

and outputting the target information to a user interface for display or further processing.

As a preferable scheme of the value-added tax invoice information identification method, the invention comprises the following steps: the outputting the target information to the user interface for display or further processing comprises the following steps:

and outputting the target information to a user interface for display or further processing according to a table form.

The beneficial effects of the invention are as follows:

(1) The invention realizes the rapid and accurate identification and information extraction of the value-added tax invoice by using the computer vision and machine learning technology, reduces the workload and the error rate of manually processing the value-added tax invoice, and improves the processing efficiency and the accuracy.

(2) Aiming at value-added tax invoices with different formats and styles, the invention has certain adaptability and flexibility, can adapt to various invoice layouts and languages, and simultaneously provides an extensible and customizable architecture, so that the system can adapt to application scenes with different scales and demands.

Drawings

In order to more clearly illustrate the technical solutions of the embodiments of the present invention, the drawings that are needed in the description of the embodiments will be briefly described below, it being obvious that the drawings in the following description are only some embodiments of the present invention, and that other drawings may be obtained according to these drawings without inventive effort for a person skilled in the art.

FIG. 1 is a flow chart of a method for identifying value added tax invoice information provided by the invention;

FIG. 2 is a schematic flow chart of step S102 in the method for identifying value-added tax invoice information provided by the invention;

fig. 3 is a schematic flow chart of extracting target information in the value-added tax invoice information identification method provided by the invention.

Detailed Description

In order that the invention may be more readily understood, a more particular description thereof will be rendered by reference to specific embodiments that are illustrated in the appended drawings.

Fig. 1 is a flow chart of a method for identifying value-added tax invoice information according to an embodiment of the present application. The method comprises the following steps of S101-S104, wherein the specific steps are as follows:

step S101: and acquiring an image of the value added tax invoice, and preprocessing the image of the value added tax invoice to obtain a target image.

Specifically, the image of the value-added tax invoice can be obtained by scanning or shooting the value-added tax invoice. Meanwhile, the image of the value added tax invoice can be obtained by converting the value added tax invoice file with other formats, such as a PDF format file, into a picture file.

And taking the obtained value-added tax invoice image as an image to be processed, and preprocessing the value-added tax invoice image to obtain a target image.

The preprocessing operation includes removing noise from the image to be processed to increase smoothness of the image, and then performing brightness enhancement, contrast enhancement and saturation enhancement on the image to be processed, so as to achieve the purpose of image enhancement.

In addition, the preprocessing operation also comprises judging whether the image of the value added tax invoice is skewed. For a value added tax invoice image with skew, converting the value added tax invoice image into a single-channel gray level image, detecting all contours in the value added tax invoice image on the basis of the gray level image, calculating the contour area of each contour, judging that the contour with the largest area is the position of a table in the value added tax invoice image, calculating the perimeter of the contour, determining corresponding parameter coefficients, enabling the contour to be in wireless proximity to the table line of the value added tax invoice, determining the specific coordinate standard of the table position in the value added tax invoice, improving the basic standard of the position of each specific unit in the follow-up segmentation value added tax invoice, and ensuring the accuracy of follow-up identification.

It is understood that the above operation mode is an operation mode in OpenCV.

Step S102: and dividing the target image into areas to form a plurality of target areas containing target information.

Specifically, because the electronic invoice of the value-added tax and the paper invoice of the value-added tax are in the national accurate standard format, namely, the electronic invoice of the value-added tax has the characteristics of specific form interval and fixed information printing position, the positions of a plurality of target areas containing target information in the image of the value-added tax can be determined by identifying the value-added tax invoice in the national standard format in advance. Referring to fig. 2, the specific steps are as follows:

step S102a: and identifying the value-added tax invoice image in the standard format, determining the spacing between the cells containing the target information in the form of the value-added tax invoice and the ratio between the total length and the total width of the form, and forming a form rule.

Step S102b: and identifying the target image and determining the outline position of the table in the target image.

Step S102c: and carrying out proportional mapping on the table outline in the target image through the table rule, calculating the coordinate positions of each cell containing target information in the table outline in the target image, and forming a plurality of target areas containing the target information.

Specifically, relevant information such as invoice head-up, billing date, purchaser name, purchaser address, password area, goods detail amount and the like is target information. If the form is the full-electric invoice form, the space ratio among the rows is not necessarily the same because of the characteristic of the fixed width and the variable length of the form of the full-electric invoice, so after the width of the full-electric invoice form is identified and mapped, the detail and the segmentation of the specific row and column project of the full-electric invoice are calculated by identifying all transverse lines in the width range, thereby being convenient for the next character identification and acquisition.

Step S103: and identifying the characters in the target area and extracting the target information.

Specifically, firstly, performing open source character recognition on a target area, performing text detection on the target area through a text detection model, and determining a character area in the target area. And then, character recognition is carried out on the character area through a text character recognition model, so that character information in the target area is obtained.

In this embodiment, an open source Optical Character Recognition (OCR) technique is used to select a text character detection model and a text character recognition model with recognition accuracy of 83.5%, perform character detection on an image, determine positions of possible chinese characters in all the image, and perform specific chinese character recognition on the recognized text region, thereby achieving the purpose of recognizing specific chinese characters in the image.

When character recognition is carried out on the character area through the text character recognition model, judging whether a red seal shielding character exists in the character area or not for a value-added tax invoice sample with poor character recognition effect, and if the character recognition effect is poor due to the red seal shielding, carrying out red seal removal on the value-added tax invoice by using a technology for removing the red seal. The method comprises the following steps:

and (3) carrying out color channel separation on the target image, separating the three-channel target image into a blue channel, a green channel and a red channel, removing gray elements in the separated red channel, desalting to obtain a single-channel target image, and converting the single-channel target image into a three-channel gray image. And then character recognition is carried out on the character area through a text character recognition model.

In addition, the removal of the red stamp may also be performed by threshold screening techniques. The RGB color space of the original pre-reading of the value added tax invoice is converted into an HSV color space, a plurality of sections of red HSV color space color gamut which occurs most frequently in the value added tax invoice are determined and selected as red threshold values in threshold value screening through specific color gamut judgment of red, and if the pixel value of the value added tax invoice is in the range of the color threshold values, the pixel of the value added tax invoice is set to 255 (namely white), so that the purpose of removing the red seal is achieved.

For the identified text field after the segmentation of the value added tax information area, information extraction needs to be performed on the text field, see fig. 3, and the specific steps are as follows:

step S201: and carrying out polygon character frame coordinate sorting processing on the characters identified in each character area, and determining character frame sets in the same row.

Step S202: and sequencing all character frames positioned in the same row according to the abscissa, and merging the character frames positioned in the same row to obtain invoice information in each character area.

Specifically, in the character frame set of the same row, the text sequence of the horizontal axis is consistent with the text sequence in the value-added tax invoice picture, and the accuracy of the text information is ensured. And combining the characters belonging to the same row in the cell, and performing character matching extraction processing.

Step S203: and extracting the target information through the regular expression.

Specifically, the key information of the required value-added tax invoice is extracted through specific rule information such as regular expressions and the like, and the key information of the value-added tax invoice is extracted and analyzed from the identified characters, such as invoice number, invoice date, seller name, account, address, bank information, purchaser name, account, address, bank information, commodity detail and amount, tax rate and the like.

Step S104: and storing and outputting the extracted target information.

Specifically, the extracted value added tax invoice information is stored in a key value pair mode (such as a buyer name: XX company) according to a character string, a list and a JSON format, and is stored in a database or other storage media.

For the target information such as detailed goods detail in the value-added tax invoice, the target information is displayed in a form special for use and can be exported in an Excel format, and can be output to a user interface or other systems for further processing or display.

Therefore, the technical scheme of the method and the device realize quick and accurate identification and information extraction of the value-added tax invoice by using computer vision and machine learning technology, reduce the cost of processing the value-added tax invoice by enterprises and individuals, reduce the workload and error rate of manually processing the value-added tax invoice, and improve the processing efficiency and accuracy.

In addition to the above embodiments, the present invention may have other embodiments; all technical schemes formed by equivalent substitution or equivalent transformation fall within the protection scope of the invention.

Claims

1. A value-added tax invoice information identification method is characterized in that: comprising the following steps:

and storing and outputting the extracted target information.

2. The value added tax invoice information recognition method as claimed in claim 1, wherein: the acquiring the image of the value added tax invoice comprises the following steps:

3. The value added tax invoice information recognition method as claimed in claim 1, wherein: the preprocessing of the image of the value added tax invoice to obtain a target image comprises the following steps:

4. The value added tax invoice information recognition method as claimed in claim 1, wherein: the dividing the target image into regions to form a plurality of target regions containing target information comprises:

5. The value added tax invoice information recognition method as claimed in claim 1, wherein: the identifying of the character in the target area includes:

6. The value added tax invoice information recognition method as claimed in claim 5, wherein: the character recognition of the character area through the text character recognition model comprises the following steps:

7. The value added tax invoice information recognition method as claimed in claim 5, wherein: the character recognition of the character area through the text character recognition model comprises the following steps:

8. The value added tax invoice information recognition method of claim 6 or 7, wherein: the extracting the target information comprises the following steps:

and extracting the target information through the regular expression.

9. The value added tax invoice information recognition method as claimed in claim 1, wherein: the storing and outputting the extracted target information comprises the following steps:

10. The value added tax invoice information recognition method of claim 9, wherein: the outputting the target information to the user interface for display or further processing comprises the following steps: