CN112435012A

CN112435012A - Customs data positioning, auditing and editing system and method based on computer vision and storage medium

Info

Publication number: CN112435012A
Application number: CN202011399703.8A
Authority: CN
Inventors: 陆欢旺; 冯玉静; 张东峰; 万晓磊
Original assignee: Shanghai Sandao Intelligent Technology Co ltd
Current assignee: Shanghai Sandao Intelligent Technology Co ltd
Priority date: 2020-12-02
Filing date: 2020-12-02
Publication date: 2021-03-02

Abstract

The application relates to the technical field of customs clearance, and discloses a system, a method and a storage medium for positioning, auditing and editing clearance data based on computer vision, wherein the method comprises the following steps: acquiring a file to be audited; extracting fields and/or elements needing to be audited from the files to be audited; generating an editable file to be edited according to the extracted fields and/or elements; and positioning the marking and/or editing position of the file to be edited, and highlighting the marking and/or editing position. The customs clearance data positioning, auditing and editing system, method and storage medium based on computer vision realize on-line auditing, improve the efficiency of examining list, and simultaneously highlight the positions with postil and modification in the auditing process, enlarge the visual effect and reduce the probability of label omission.

Description

Customs data positioning, auditing and editing system and method based on computer vision and storage medium

Technical Field

The application relates to the technical field of customs clearance, in particular to a system, a method and a storage medium for positioning, auditing and editing clearance data based on computer vision.

Background

Before shipping and shipping of goods at import and export, a declaration procedure needs to be handled to customs, and declaration materials comprise: import and export goods customs declaration form, goods invoice, land freight bill, air freight bill, shipping import bill, shipping export bill, goods packing bill, export receipt and reimbursement bill and the like. The forms such as customs declaration forms, delivery forms, loading forms and cargo packing forms need to be audited after being manufactured, the auditing and labeling efficiency of the traditional paper forms is low, visual fatigue is easy to occur, and the problem of label omission occurs.

Disclosure of Invention

In order to improve auditing efficiency and improve visual effect of labeling, the application provides a system, a method and a storage medium for customs data positioning, auditing and editing based on computer vision.

In a first aspect, the present application provides a method for positioning, reviewing and editing customs clearance data based on computer vision, including:

acquiring a file to be audited;

extracting fields and/or elements needing to be audited from the files to be audited;

generating an editable file to be edited according to the extracted fields and/or elements;

and positioning the marking and/or editing position of the file to be edited, and highlighting the marking and/or editing position.

By adopting the technical scheme, on one hand, on-line auditing is realized, the efficiency of the examination is improved, and meanwhile, the places with annotations and modifications in the auditing process are highlighted, so that the visual effect is enlarged, and the probability of label omission is reduced.

In some embodiments, the acquired file to be audited includes a picture class and a non-picture class, and the non-picture class is converted into a picture format and stored together with the picture class file.

By adopting the technical scheme, the received files are uniformly converted into the picture format, so that the application range of the document is expanded.

In some embodiments, after acquiring the file to be audited, the method further includes:

analyzing the file, and analyzing the type and format of the file to be examined;

image preprocessing, namely correcting the image imaging problem of the file to be examined;

detecting characters, namely detecting the position, the range and the layout of a text in a file to be examined;

and character recognition, namely recognizing the text content on the basis of character detection.

By adopting the technical scheme, the file is firstly analyzed, the image is processed, the image problem is corrected, the position, the range and the layout of the text are identified from the image, and the text content is identified on the basis of character detection, so that the characters in the file can be conveniently obtained.

In some embodiments, the image pre-processing comprises:

inputting an image of a file to be checked into a pre-trained image correction network for geometric change and/or distortion correction to obtain a corrected first target image;

performing small-angle correction on the first target image through a CV algorithm and an affine transformation matrix to obtain a second target image;

removing the blur of the second target image through a denoising algorithm to obtain a third target image;

and carrying out binarization processing on the third target image to obtain a binarized image.

In some embodiments, the text detection comprises:

inputting the binary image into a pre-trained feature extraction network;

extracting output information of at least two convolution layers in the feature extraction network, and fusing the output information;

inputting the fused information into a full connection layer in the feature extraction network, and outputting 2k vertical direction coordinates and coordinate scores of k anchors corresponding to the text region of the binary image and k boundary regression results to realize text positioning and obtain a rectangular text box.

The invoices and the case sheets in the customs clearance industry have different character typesetting structures according to different customers, and have the condition of one-to-many, and the data with any structure can be extracted and displayed by adopting the technical scheme.

In some embodiments, the text recognition comprises: and performing character recognition on the text content in the rectangular text box through a pre-trained character recognition network to acquire text content information.

In some embodiments, the extracting of the fields and/or elements requiring review from the document to be reviewed includes:

generating a basic semantic analysis engine based on a preset semantic database, wherein the semantic database comprises a field basic corpus, a field dictionary and a field knowledge map;

performing field analysis processing on the text content information based on a basic semantic analysis engine;

extracting the required fields and/or elements in the text content based on the extraction requirement extraction data set.

By adopting the technical scheme, the intelligent text place of the characters is identified by adopting natural language processing and combining with the industry: the extracted model is subjected to deep learning model training in combination with the industry, and the recognized data can be subjected to simple data cleaning.

In some embodiments, highlighting based on annotation and/or editing of the file to be edited includes:

positioning the marking and/or editing position of the file to be edited;

and highlighting the positioned position.

By adopting the technical scheme, the places with annotations and modifications in the auditing process are highlighted, the visual effect is enlarged, and the probability of annotation omission is reduced.

In a second aspect, the present application discloses a system for locating and auditing clearance data based on computer vision, comprising:

the file acquisition unit is used for acquiring a file to be audited;

the file analysis unit is used for receiving the file to be audited and analyzing the type and the format of the file to be audited;

the image preprocessing unit is used for correcting the image imaging problem of the analyzed file to be examined;

the character detection unit is used for detecting the position, the range and the layout of the text in the file to be checked on the basis of correcting the image imaging problem;

the character recognition unit is used for recognizing the text content on the basis of character detection;

the text extraction unit extracts required fields and/or elements from the text recognition result;

the editable generating unit generates an editable file to be edited according to the extracted fields and/or elements;

the positioning unit is used for positioning the marking and/or editing position of the file to be edited;

the marking unit is used for highlighting and marking the position positioned by the positioning unit; and

the system comprises a memory and a processor, wherein the memory is stored with a computer program which can be loaded by the processor and can execute the clearance data positioning, auditing and editing method based on computer vision.

In a third aspect, the present application discloses a computer-readable storage medium storing a computer program capable of being loaded by a processor and executing the above-mentioned customs data positioning, auditing and editing method based on computer vision.

In summary, the system, method and storage medium for customs clearance data positioning, auditing and editing based on computer vision provided by the application have at least one of the following beneficial technical effects:

1. by the system, online auditing is realized, the efficiency of the examination is improved, and meanwhile, places with annotations and modifications in the auditing process are highlighted, so that the visual effect is enlarged, and the probability of label omission is reduced.

Drawings

Fig. 1 is a block diagram of a system for locating, reviewing and editing customs clearance data based on computer vision according to the present invention.

In the figure:

1. a file acquisition unit; 2. a file parsing unit; 3. an image preprocessing unit; 4. a character detection unit; 5. a character recognition unit; 6. a text extraction unit; 7. an editable generation unit; 8. a positioning unit; 9. labeling units; 10. a memory; 11. a processor.

Detailed Description

The present application is described in further detail below with reference to the attached drawings.

The embodiment of the application provides a system, a method and a storage medium for positioning, auditing and editing clearance data based on computer vision.

The application provides a clearance data positioning, auditing and editing method based on computer vision, which comprises the following steps:

acquiring a file to be audited, wherein the file to be audited is acquired; the files to be processed comprise pictures and non-pictures, the non-pictures comprise a photocopy and a PDF file, and meanwhile, the non-pictures are converted into a picture format and are stored together with the pictures.

And simultaneously storing the input files to be processed into a file library, and performing model training based on manual labeling to obtain an image correction network, a feature extraction network, a character recognition network and a deep learning extraction data set.

In the embodiment of the application, the file analysis supports the processing of files with JPG, PNG, TIF and PDF formats.

Image preprocessing, namely correcting the image imaging problem of the file to be processed; the method specifically comprises the following steps:

inputting the image of the file to be processed into a pre-trained image correction network for geometric change and/or distortion correction to obtain a corrected first target image, namely:

regressing the network parameters of the space transformation corresponding to the first target image by utilizing a positioning network in the image correction network;

calculating the position of a pixel point in the corrected first target image in the first target image by using a grid generator in the image correction network and the network parameters;

outputting the corrected first target image by using a sampler in the image correction network and the calculated position;

then, the user can use the device to perform the operation,

carrying out binarization processing on the third target image to obtain a binarized image;

after image preprocessing, the following steps are carried out.

The method comprises the following steps of character detection, wherein the position, the range and the layout of a text in a file to be processed are detected, the layout analysis, the character line detection and the like are generally included, and the character detection mainly solves the problems of where characters exist and how large the range of the characters exists. The method comprises the following specific steps:

inputting the binary image into a pre-trained feature extraction network;

inputting the fused information into a full-connection layer in the feature extraction network, and outputting 2k vertical direction coordinates and coordinate scores of k anchors corresponding to the text region of the binarized image and k boundary regression results to realize text positioning and obtain a rectangular text box;

the processing algorithm adopted by the character detection comprises the following steps: fast-RCNN, Mask-RCNN, FPN, PANET, Unet, IoUNet, YOLO, SSD.

Then the step of character recognition is entered,

the character recognition is used for recognizing the text content on the basis of character detection, and the problem mainly solved by the character recognition is what each character is. In this embodiment of the present application, character recognition is performed on text contents in a rectangular text box through a pre-trained character recognition network to obtain text content information, and a processing algorithm adopted in the method includes: CRNN, AttentionOCR, RNNLM, BERT.

And then extracting required fields and/or elements from the text recognition result through text extraction, wherein the required fields and/or elements comprise:

extracting required fields and/or elements in text content from a data set based on extraction requirements, wherein the extraction requirements comprise: sequence labeling extraction, deep learning extraction and table extraction,

the processing algorithm adopted by the text extraction comprises the following steps: CRF, HMM, HAN, DPCNN, BilSTM + CRF, BERT + CRF, Regex.

And generating an editable file to be edited according to the extracted fields and/or elements.

And positioning the marking and/or editing position of the file to be edited, and highlighting the positioned position.

The application also discloses clearance data positioning, auditing system based on computer vision, includes:

the file acquisition unit 1 is used for acquiring a file to be audited;

the file analysis unit 2 is used for receiving the file to be examined and analyzing the type and format of the file to be examined;

the image preprocessing unit 3 is used for correcting the image imaging problem of the analyzed file to be examined;

the character detection unit 4 is used for detecting the position, the range and the layout of the text in the file to be examined on the basis of correcting the image imaging problem;

a character recognition unit 5 for recognizing the text content based on the character detection;

a text extraction unit 6 for extracting required fields and/or elements from the text recognition result;

an editable generation unit 7 which generates an editable file to be edited according to the extracted fields and/or elements;

the positioning unit 8 is used for positioning the marking and/or editing position of the file to be edited;

a marking unit 9 for highlighting the position positioned by the positioning unit 8; and

a memory 10 and a processor 11, wherein the memory 10 stores a computer program which can be loaded by the processor 11 and execute the above-mentioned method for locating, checking and editing the customs clearance data based on computer vision.

The embodiment of the application provides a storage medium, wherein the storage medium stores an instruction set, and the instruction set is suitable for a processor 11 to load and execute the steps of the automatic capturing and understanding method for the elements of the phenomenon of the feature of the dynamic analytic text image.

The computer storage medium includes, for example: various media capable of storing program codes, such as a usb disk, a removable hard disk, a Read-Only Memory (ROM), a Random Access Memory (RAM), a magnetic disk, or an optical disk.

The above embodiments are only used to describe the technical solutions of the present application in detail, but the above embodiments are only used to help understanding the method and the core idea of the present application, and should not be construed as limiting the present application. Those skilled in the art should also appreciate that various modifications and substitutions can be made without departing from the scope of the present disclosure.

Claims

1. A clearance data positioning, auditing and editing method based on computer vision is characterized by comprising the following steps:

acquiring a file to be audited;

2. The computer vision-based clearance data positioning, auditing and editing method of claim 1, characterized in that the acquired files to be audited include picture classes and non-picture classes, and the non-picture classes are converted into picture formats and stored with the picture class files.

3. The computer vision-based clearance data locating, auditing and editing method according to claim 1, further comprising, after obtaining the file to be audited:

4. A computer vision based clearance data locating, auditing, editing method according to claim 3 wherein the image pre-processing comprises:

5. The computer vision-based clearance data locating, auditing, editing method of claim 4 wherein the text detection comprises:

inputting the binary image into a pre-trained feature extraction network;

6. The computer vision-based clearance data locating, auditing and editing method of claim 5 wherein the text recognition comprises: and performing character recognition on the text content in the rectangular text box through a pre-trained character recognition network to acquire text content information.

7. The computer vision-based clearance data locating, auditing and editing method according to claim 6, wherein the extraction of the fields and/or elements to be audited from the document to be audited includes:

8. The computer vision-based clearance data locating, auditing, editing method of claim 7 wherein highlighting based on labeling and/or editing of a file to be edited comprises:

positioning the marking and/or editing position of the file to be edited;

and highlighting the positioned position.

9. Customs data positioning and auditing system based on computer vision is characterized by comprising:

the file acquisition unit (1) is used for acquiring a file to be audited;

the file analysis unit (2) is used for receiving the file to be examined and analyzing the type and the format of the file to be examined;

the image preprocessing unit (3) is used for correcting the image imaging problem of the analyzed file to be examined;

the character detection unit (4) is used for detecting the position, the range and the layout of the text in the file to be checked on the basis of correcting the image imaging problem;

a character recognition unit (5) for recognizing the text content on the basis of the character detection;

a text extraction unit (6) for extracting required fields and/or elements from the text recognition result;

an editable generation unit (7) which generates an editable file to be edited according to the extracted fields and/or elements;

the positioning unit (8) is used for positioning the marking and/or editing position of the file to be edited;

a marking unit (9) which highlights the position positioned by the positioning unit (8); and

a memory (10) and a processor (11), the memory (10) having stored thereon a computer program that can be loaded by the processor (11) and that executes the computer vision based clearance data locating, auditing, editing method according to any of claims 1 to 8.

10. A computer-readable storage medium, characterized in that a computer program is stored which can be loaded by a processor (11) and which performs the computer vision based clearance data locating, auditing, editing method according to any of claims 1 to 8.