CN112435012A - Customs data positioning, auditing and editing system and method based on computer vision and storage medium - Google Patents

Customs data positioning, auditing and editing system and method based on computer vision and storage medium Download PDF

Info

Publication number
CN112435012A
CN112435012A CN202011399703.8A CN202011399703A CN112435012A CN 112435012 A CN112435012 A CN 112435012A CN 202011399703 A CN202011399703 A CN 202011399703A CN 112435012 A CN112435012 A CN 112435012A
Authority
CN
China
Prior art keywords
file
auditing
text
computer vision
image
Prior art date
Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
Withdrawn
Application number
CN202011399703.8A
Other languages
Chinese (zh)
Inventor
陆欢旺
冯玉静
张东峰
万晓磊
Current Assignee (The listed assignees may be inaccurate. Google has not performed a legal analysis and makes no representation or warranty as to the accuracy of the list.)
Shanghai Sandao Intelligent Technology Co ltd
Original Assignee
Shanghai Sandao Intelligent Technology Co ltd
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
Filing date
Publication date
Application filed by Shanghai Sandao Intelligent Technology Co ltd filed Critical Shanghai Sandao Intelligent Technology Co ltd
Priority to CN202011399703.8A priority Critical patent/CN112435012A/en
Publication of CN112435012A publication Critical patent/CN112435012A/en
Withdrawn legal-status Critical Current

Links

Images

Classifications

    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06QINFORMATION AND COMMUNICATION TECHNOLOGY [ICT] SPECIALLY ADAPTED FOR ADMINISTRATIVE, COMMERCIAL, FINANCIAL, MANAGERIAL OR SUPERVISORY PURPOSES; SYSTEMS OR METHODS SPECIALLY ADAPTED FOR ADMINISTRATIVE, COMMERCIAL, FINANCIAL, MANAGERIAL OR SUPERVISORY PURPOSES, NOT OTHERWISE PROVIDED FOR
    • G06Q10/00Administration; Management
    • G06Q10/10Office automation; Time management
    • G06Q10/103Workflow collaboration or project management
    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06FELECTRIC DIGITAL DATA PROCESSING
    • G06F18/00Pattern recognition
    • G06F18/20Analysing
    • G06F18/25Fusion techniques
    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06FELECTRIC DIGITAL DATA PROCESSING
    • G06F40/00Handling natural language data
    • G06F40/10Text processing
    • G06F40/103Formatting, i.e. changing of presentation of documents
    • G06F40/117Tagging; Marking up; Designating a block; Setting of attributes
    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06FELECTRIC DIGITAL DATA PROCESSING
    • G06F40/00Handling natural language data
    • G06F40/10Text processing
    • G06F40/166Editing, e.g. inserting or deleting
    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06FELECTRIC DIGITAL DATA PROCESSING
    • G06F40/00Handling natural language data
    • G06F40/30Semantic analysis
    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06QINFORMATION AND COMMUNICATION TECHNOLOGY [ICT] SPECIALLY ADAPTED FOR ADMINISTRATIVE, COMMERCIAL, FINANCIAL, MANAGERIAL OR SUPERVISORY PURPOSES; SYSTEMS OR METHODS SPECIALLY ADAPTED FOR ADMINISTRATIVE, COMMERCIAL, FINANCIAL, MANAGERIAL OR SUPERVISORY PURPOSES, NOT OTHERWISE PROVIDED FOR
    • G06Q50/00Information and communication technology [ICT] specially adapted for implementation of business processes of specific business sectors, e.g. utilities or tourism
    • G06Q50/10Services
    • G06Q50/26Government or public services
    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06VIMAGE OR VIDEO RECOGNITION OR UNDERSTANDING
    • G06V10/00Arrangements for image or video recognition or understanding
    • G06V10/20Image preprocessing
    • G06V10/28Quantising the image, e.g. histogram thresholding for discrimination between background and foreground patterns
    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06VIMAGE OR VIDEO RECOGNITION OR UNDERSTANDING
    • G06V10/00Arrangements for image or video recognition or understanding
    • G06V10/20Image preprocessing
    • G06V10/30Noise filtering
    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06VIMAGE OR VIDEO RECOGNITION OR UNDERSTANDING
    • G06V30/00Character recognition; Recognising digital ink; Document-oriented image-based pattern recognition
    • G06V30/40Document-oriented image-based pattern recognition
    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06VIMAGE OR VIDEO RECOGNITION OR UNDERSTANDING
    • G06V30/00Character recognition; Recognising digital ink; Document-oriented image-based pattern recognition
    • G06V30/10Character recognition

Landscapes

  • Engineering & Computer Science (AREA)
  • Theoretical Computer Science (AREA)
  • Physics & Mathematics (AREA)
  • General Physics & Mathematics (AREA)
  • Business, Economics & Management (AREA)
  • Artificial Intelligence (AREA)
  • Health & Medical Sciences (AREA)
  • Human Resources & Organizations (AREA)
  • General Health & Medical Sciences (AREA)
  • Strategic Management (AREA)
  • General Engineering & Computer Science (AREA)
  • Audiology, Speech & Language Pathology (AREA)
  • Tourism & Hospitality (AREA)
  • Multimedia (AREA)
  • Computational Linguistics (AREA)
  • Data Mining & Analysis (AREA)
  • Computer Vision & Pattern Recognition (AREA)
  • Entrepreneurship & Innovation (AREA)
  • General Business, Economics & Management (AREA)
  • Marketing (AREA)
  • Economics (AREA)
  • Bioinformatics & Computational Biology (AREA)
  • Bioinformatics & Cheminformatics (AREA)
  • Quality & Reliability (AREA)
  • Operations Research (AREA)
  • Evolutionary Biology (AREA)
  • Life Sciences & Earth Sciences (AREA)
  • Evolutionary Computation (AREA)
  • Development Economics (AREA)
  • Educational Administration (AREA)
  • Primary Health Care (AREA)
  • Image Analysis (AREA)

Abstract

The application relates to the technical field of customs clearance, and discloses a system, a method and a storage medium for positioning, auditing and editing clearance data based on computer vision, wherein the method comprises the following steps: acquiring a file to be audited; extracting fields and/or elements needing to be audited from the files to be audited; generating an editable file to be edited according to the extracted fields and/or elements; and positioning the marking and/or editing position of the file to be edited, and highlighting the marking and/or editing position. The customs clearance data positioning, auditing and editing system, method and storage medium based on computer vision realize on-line auditing, improve the efficiency of examining list, and simultaneously highlight the positions with postil and modification in the auditing process, enlarge the visual effect and reduce the probability of label omission.

Description

Customs data positioning, auditing and editing system and method based on computer vision and storage medium
Technical Field
The application relates to the technical field of customs clearance, in particular to a system, a method and a storage medium for positioning, auditing and editing clearance data based on computer vision.
Background
Before shipping and shipping of goods at import and export, a declaration procedure needs to be handled to customs, and declaration materials comprise: import and export goods customs declaration form, goods invoice, land freight bill, air freight bill, shipping import bill, shipping export bill, goods packing bill, export receipt and reimbursement bill and the like. The forms such as customs declaration forms, delivery forms, loading forms and cargo packing forms need to be audited after being manufactured, the auditing and labeling efficiency of the traditional paper forms is low, visual fatigue is easy to occur, and the problem of label omission occurs.
Disclosure of Invention
In order to improve auditing efficiency and improve visual effect of labeling, the application provides a system, a method and a storage medium for customs data positioning, auditing and editing based on computer vision.
In a first aspect, the present application provides a method for positioning, reviewing and editing customs clearance data based on computer vision, including:
acquiring a file to be audited;
extracting fields and/or elements needing to be audited from the files to be audited;
generating an editable file to be edited according to the extracted fields and/or elements;
and positioning the marking and/or editing position of the file to be edited, and highlighting the marking and/or editing position.
By adopting the technical scheme, on one hand, on-line auditing is realized, the efficiency of the examination is improved, and meanwhile, the places with annotations and modifications in the auditing process are highlighted, so that the visual effect is enlarged, and the probability of label omission is reduced.
In some embodiments, the acquired file to be audited includes a picture class and a non-picture class, and the non-picture class is converted into a picture format and stored together with the picture class file.
By adopting the technical scheme, the received files are uniformly converted into the picture format, so that the application range of the document is expanded.
In some embodiments, after acquiring the file to be audited, the method further includes:
analyzing the file, and analyzing the type and format of the file to be examined;
image preprocessing, namely correcting the image imaging problem of the file to be examined;
detecting characters, namely detecting the position, the range and the layout of a text in a file to be examined;
and character recognition, namely recognizing the text content on the basis of character detection.
By adopting the technical scheme, the file is firstly analyzed, the image is processed, the image problem is corrected, the position, the range and the layout of the text are identified from the image, and the text content is identified on the basis of character detection, so that the characters in the file can be conveniently obtained.
In some embodiments, the image pre-processing comprises:
inputting an image of a file to be checked into a pre-trained image correction network for geometric change and/or distortion correction to obtain a corrected first target image;
performing small-angle correction on the first target image through a CV algorithm and an affine transformation matrix to obtain a second target image;
removing the blur of the second target image through a denoising algorithm to obtain a third target image;
and carrying out binarization processing on the third target image to obtain a binarized image.
In some embodiments, the text detection comprises:
inputting the binary image into a pre-trained feature extraction network;
extracting output information of at least two convolution layers in the feature extraction network, and fusing the output information;
inputting the fused information into a full connection layer in the feature extraction network, and outputting 2k vertical direction coordinates and coordinate scores of k anchors corresponding to the text region of the binary image and k boundary regression results to realize text positioning and obtain a rectangular text box.
The invoices and the case sheets in the customs clearance industry have different character typesetting structures according to different customers, and have the condition of one-to-many, and the data with any structure can be extracted and displayed by adopting the technical scheme.
In some embodiments, the text recognition comprises: and performing character recognition on the text content in the rectangular text box through a pre-trained character recognition network to acquire text content information.
In some embodiments, the extracting of the fields and/or elements requiring review from the document to be reviewed includes:
generating a basic semantic analysis engine based on a preset semantic database, wherein the semantic database comprises a field basic corpus, a field dictionary and a field knowledge map;
performing field analysis processing on the text content information based on a basic semantic analysis engine;
extracting the required fields and/or elements in the text content based on the extraction requirement extraction data set.
By adopting the technical scheme, the intelligent text place of the characters is identified by adopting natural language processing and combining with the industry: the extracted model is subjected to deep learning model training in combination with the industry, and the recognized data can be subjected to simple data cleaning.
In some embodiments, highlighting based on annotation and/or editing of the file to be edited includes:
positioning the marking and/or editing position of the file to be edited;
and highlighting the positioned position.
By adopting the technical scheme, the places with annotations and modifications in the auditing process are highlighted, the visual effect is enlarged, and the probability of annotation omission is reduced.
In a second aspect, the present application discloses a system for locating and auditing clearance data based on computer vision, comprising:
the file acquisition unit is used for acquiring a file to be audited;
the file analysis unit is used for receiving the file to be audited and analyzing the type and the format of the file to be audited;
the image preprocessing unit is used for correcting the image imaging problem of the analyzed file to be examined;
the character detection unit is used for detecting the position, the range and the layout of the text in the file to be checked on the basis of correcting the image imaging problem;
the character recognition unit is used for recognizing the text content on the basis of character detection;
the text extraction unit extracts required fields and/or elements from the text recognition result;
the editable generating unit generates an editable file to be edited according to the extracted fields and/or elements;
the positioning unit is used for positioning the marking and/or editing position of the file to be edited;
the marking unit is used for highlighting and marking the position positioned by the positioning unit; and
the system comprises a memory and a processor, wherein the memory is stored with a computer program which can be loaded by the processor and can execute the clearance data positioning, auditing and editing method based on computer vision.
In a third aspect, the present application discloses a computer-readable storage medium storing a computer program capable of being loaded by a processor and executing the above-mentioned customs data positioning, auditing and editing method based on computer vision.
In summary, the system, method and storage medium for customs clearance data positioning, auditing and editing based on computer vision provided by the application have at least one of the following beneficial technical effects:
1. by the system, online auditing is realized, the efficiency of the examination is improved, and meanwhile, places with annotations and modifications in the auditing process are highlighted, so that the visual effect is enlarged, and the probability of label omission is reduced.
Drawings
Fig. 1 is a block diagram of a system for locating, reviewing and editing customs clearance data based on computer vision according to the present invention.
In the figure:
1. a file acquisition unit; 2. a file parsing unit; 3. an image preprocessing unit; 4. a character detection unit; 5. a character recognition unit; 6. a text extraction unit; 7. an editable generation unit; 8. a positioning unit; 9. labeling units; 10. a memory; 11. a processor.
Detailed Description
The present application is described in further detail below with reference to the attached drawings.
The embodiment of the application provides a system, a method and a storage medium for positioning, auditing and editing clearance data based on computer vision.
The application provides a clearance data positioning, auditing and editing method based on computer vision, which comprises the following steps:
acquiring a file to be audited, wherein the file to be audited is acquired; the files to be processed comprise pictures and non-pictures, the non-pictures comprise a photocopy and a PDF file, and meanwhile, the non-pictures are converted into a picture format and are stored together with the pictures.
And simultaneously storing the input files to be processed into a file library, and performing model training based on manual labeling to obtain an image correction network, a feature extraction network, a character recognition network and a deep learning extraction data set.
In the embodiment of the application, the file analysis supports the processing of files with JPG, PNG, TIF and PDF formats.
Image preprocessing, namely correcting the image imaging problem of the file to be processed; the method specifically comprises the following steps:
inputting the image of the file to be processed into a pre-trained image correction network for geometric change and/or distortion correction to obtain a corrected first target image, namely:
regressing the network parameters of the space transformation corresponding to the first target image by utilizing a positioning network in the image correction network;
calculating the position of a pixel point in the corrected first target image in the first target image by using a grid generator in the image correction network and the network parameters;
outputting the corrected first target image by using a sampler in the image correction network and the calculated position;
then, the user can use the device to perform the operation,
performing small-angle correction on the first target image through a CV algorithm and an affine transformation matrix to obtain a second target image;
removing the blur of the second target image through a denoising algorithm to obtain a third target image;
carrying out binarization processing on the third target image to obtain a binarized image;
after image preprocessing, the following steps are carried out.
The method comprises the following steps of character detection, wherein the position, the range and the layout of a text in a file to be processed are detected, the layout analysis, the character line detection and the like are generally included, and the character detection mainly solves the problems of where characters exist and how large the range of the characters exists. The method comprises the following specific steps:
inputting the binary image into a pre-trained feature extraction network;
extracting output information of at least two convolution layers in the feature extraction network, and fusing the output information;
inputting the fused information into a full-connection layer in the feature extraction network, and outputting 2k vertical direction coordinates and coordinate scores of k anchors corresponding to the text region of the binarized image and k boundary regression results to realize text positioning and obtain a rectangular text box;
the processing algorithm adopted by the character detection comprises the following steps: fast-RCNN, Mask-RCNN, FPN, PANET, Unet, IoUNet, YOLO, SSD.
Then the step of character recognition is entered,
the character recognition is used for recognizing the text content on the basis of character detection, and the problem mainly solved by the character recognition is what each character is. In this embodiment of the present application, character recognition is performed on text contents in a rectangular text box through a pre-trained character recognition network to obtain text content information, and a processing algorithm adopted in the method includes: CRNN, AttentionOCR, RNNLM, BERT.
And then extracting required fields and/or elements from the text recognition result through text extraction, wherein the required fields and/or elements comprise:
generating a basic semantic analysis engine based on a preset semantic database, wherein the semantic database comprises a field basic corpus, a field dictionary and a field knowledge map;
performing field analysis processing on the text content information based on a basic semantic analysis engine;
extracting required fields and/or elements in text content from a data set based on extraction requirements, wherein the extraction requirements comprise: sequence labeling extraction, deep learning extraction and table extraction,
the processing algorithm adopted by the text extraction comprises the following steps: CRF, HMM, HAN, DPCNN, BilSTM + CRF, BERT + CRF, Regex.
And generating an editable file to be edited according to the extracted fields and/or elements.
And positioning the marking and/or editing position of the file to be edited, and highlighting the positioned position.
The application also discloses clearance data positioning, auditing system based on computer vision, includes:
the file acquisition unit 1 is used for acquiring a file to be audited;
the file analysis unit 2 is used for receiving the file to be examined and analyzing the type and format of the file to be examined;
the image preprocessing unit 3 is used for correcting the image imaging problem of the analyzed file to be examined;
the character detection unit 4 is used for detecting the position, the range and the layout of the text in the file to be examined on the basis of correcting the image imaging problem;
a character recognition unit 5 for recognizing the text content based on the character detection;
a text extraction unit 6 for extracting required fields and/or elements from the text recognition result;
an editable generation unit 7 which generates an editable file to be edited according to the extracted fields and/or elements;
the positioning unit 8 is used for positioning the marking and/or editing position of the file to be edited;
a marking unit 9 for highlighting the position positioned by the positioning unit 8; and
a memory 10 and a processor 11, wherein the memory 10 stores a computer program which can be loaded by the processor 11 and execute the above-mentioned method for locating, checking and editing the customs clearance data based on computer vision.
The embodiment of the application provides a storage medium, wherein the storage medium stores an instruction set, and the instruction set is suitable for a processor 11 to load and execute the steps of the automatic capturing and understanding method for the elements of the phenomenon of the feature of the dynamic analytic text image.
The computer storage medium includes, for example: various media capable of storing program codes, such as a usb disk, a removable hard disk, a Read-Only Memory (ROM), a Random Access Memory (RAM), a magnetic disk, or an optical disk.
The above embodiments are only used to describe the technical solutions of the present application in detail, but the above embodiments are only used to help understanding the method and the core idea of the present application, and should not be construed as limiting the present application. Those skilled in the art should also appreciate that various modifications and substitutions can be made without departing from the scope of the present disclosure.

Claims (10)

1. A clearance data positioning, auditing and editing method based on computer vision is characterized by comprising the following steps:
acquiring a file to be audited;
extracting fields and/or elements needing to be audited from the files to be audited;
generating an editable file to be edited according to the extracted fields and/or elements;
and positioning the marking and/or editing position of the file to be edited, and highlighting the marking and/or editing position.
2. The computer vision-based clearance data positioning, auditing and editing method of claim 1, characterized in that the acquired files to be audited include picture classes and non-picture classes, and the non-picture classes are converted into picture formats and stored with the picture class files.
3. The computer vision-based clearance data locating, auditing and editing method according to claim 1, further comprising, after obtaining the file to be audited:
analyzing the file, and analyzing the type and format of the file to be examined;
image preprocessing, namely correcting the image imaging problem of the file to be examined;
detecting characters, namely detecting the position, the range and the layout of a text in a file to be examined;
and character recognition, namely recognizing the text content on the basis of character detection.
4. A computer vision based clearance data locating, auditing, editing method according to claim 3 wherein the image pre-processing comprises:
inputting an image of a file to be checked into a pre-trained image correction network for geometric change and/or distortion correction to obtain a corrected first target image;
performing small-angle correction on the first target image through a CV algorithm and an affine transformation matrix to obtain a second target image;
removing the blur of the second target image through a denoising algorithm to obtain a third target image;
and carrying out binarization processing on the third target image to obtain a binarized image.
5. The computer vision-based clearance data locating, auditing, editing method of claim 4 wherein the text detection comprises:
inputting the binary image into a pre-trained feature extraction network;
extracting output information of at least two convolution layers in the feature extraction network, and fusing the output information;
inputting the fused information into a full connection layer in the feature extraction network, and outputting 2k vertical direction coordinates and coordinate scores of k anchors corresponding to the text region of the binary image and k boundary regression results to realize text positioning and obtain a rectangular text box.
6. The computer vision-based clearance data locating, auditing and editing method of claim 5 wherein the text recognition comprises: and performing character recognition on the text content in the rectangular text box through a pre-trained character recognition network to acquire text content information.
7. The computer vision-based clearance data locating, auditing and editing method according to claim 6, wherein the extraction of the fields and/or elements to be audited from the document to be audited includes:
generating a basic semantic analysis engine based on a preset semantic database, wherein the semantic database comprises a field basic corpus, a field dictionary and a field knowledge map;
performing field analysis processing on the text content information based on a basic semantic analysis engine;
extracting the required fields and/or elements in the text content based on the extraction requirement extraction data set.
8. The computer vision-based clearance data locating, auditing, editing method of claim 7 wherein highlighting based on labeling and/or editing of a file to be edited comprises:
positioning the marking and/or editing position of the file to be edited;
and highlighting the positioned position.
9. Customs data positioning and auditing system based on computer vision is characterized by comprising:
the file acquisition unit (1) is used for acquiring a file to be audited;
the file analysis unit (2) is used for receiving the file to be examined and analyzing the type and the format of the file to be examined;
the image preprocessing unit (3) is used for correcting the image imaging problem of the analyzed file to be examined;
the character detection unit (4) is used for detecting the position, the range and the layout of the text in the file to be checked on the basis of correcting the image imaging problem;
a character recognition unit (5) for recognizing the text content on the basis of the character detection;
a text extraction unit (6) for extracting required fields and/or elements from the text recognition result;
an editable generation unit (7) which generates an editable file to be edited according to the extracted fields and/or elements;
the positioning unit (8) is used for positioning the marking and/or editing position of the file to be edited;
a marking unit (9) which highlights the position positioned by the positioning unit (8); and
a memory (10) and a processor (11), the memory (10) having stored thereon a computer program that can be loaded by the processor (11) and that executes the computer vision based clearance data locating, auditing, editing method according to any of claims 1 to 8.
10. A computer-readable storage medium, characterized in that a computer program is stored which can be loaded by a processor (11) and which performs the computer vision based clearance data locating, auditing, editing method according to any of claims 1 to 8.
CN202011399703.8A 2020-12-02 2020-12-02 Customs data positioning, auditing and editing system and method based on computer vision and storage medium Withdrawn CN112435012A (en)

Priority Applications (1)

Application Number Priority Date Filing Date Title
CN202011399703.8A CN112435012A (en) 2020-12-02 2020-12-02 Customs data positioning, auditing and editing system and method based on computer vision and storage medium

Applications Claiming Priority (1)

Application Number Priority Date Filing Date Title
CN202011399703.8A CN112435012A (en) 2020-12-02 2020-12-02 Customs data positioning, auditing and editing system and method based on computer vision and storage medium

Publications (1)

Publication Number Publication Date
CN112435012A true CN112435012A (en) 2021-03-02

Family

ID=74691615

Family Applications (1)

Application Number Title Priority Date Filing Date
CN202011399703.8A Withdrawn CN112435012A (en) 2020-12-02 2020-12-02 Customs data positioning, auditing and editing system and method based on computer vision and storage medium

Country Status (1)

Country Link
CN (1) CN112435012A (en)

Cited By (1)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN113408446A (en) * 2021-06-24 2021-09-17 成都新希望金融信息有限公司 Bill accounting method and device, electronic equipment and storage medium

Cited By (2)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN113408446A (en) * 2021-06-24 2021-09-17 成都新希望金融信息有限公司 Bill accounting method and device, electronic equipment and storage medium
CN113408446B (en) * 2021-06-24 2022-11-29 成都新希望金融信息有限公司 Bill accounting method and device, electronic equipment and storage medium

Similar Documents

Publication Publication Date Title
Shen et al. Layoutparser: A unified toolkit for deep learning based document image analysis
US7697757B2 (en) Computer assisted document modification
RU2695489C1 (en) Identification of fields on an image using artificial intelligence
US11232300B2 (en) System and method for automatic detection and verification of optical character recognition data
JP2022541199A (en) A system and method for inserting data into a structured database based on image representations of data tables.
US10489645B2 (en) System and method for automatic detection and verification of optical character recognition data
CN112434691A (en) HS code matching and displaying method and system based on intelligent analysis and identification and storage medium
CN112434690A (en) Method, system and storage medium for automatically capturing and understanding elements of dynamically analyzing text image characteristic phenomena
CN112418812A (en) Distributed full-link automatic intelligent clearance system, method and storage medium
US9286526B1 (en) Cohort-based learning from user edits
US20140254886A1 (en) Method and system for inspecting variable-data printing
CN112149663A (en) RPA and AI combined image character extraction method and device and electronic equipment
US20200364452A1 (en) A heuristic method for analyzing content of an electronic document
CN112509661B (en) Methods, computing devices, and media for identifying physical examination reports
CN116052193B (en) RPA interface dynamic form picking and matching method and system
Elanwar et al. Extracting text from scanned Arabic books: a large-scale benchmark dataset and a fine-tuned Faster-R-CNN model
CN112435012A (en) Customs data positioning, auditing and editing system and method based on computer vision and storage medium
CN112418813B (en) AEO qualification intelligent rating management system and method based on intelligent analysis and identification and storage medium
Akanksh et al. Automated invoice data extraction using image processing
CN114463767A (en) Credit card identification method, device, computer equipment and storage medium
RU2597163C2 (en) Comparing documents using reliable source
CN113673294A (en) Method and device for extracting key information of document, computer equipment and storage medium
CN114092936A (en) Techniques for tagging, checking and correcting tag predictions for P & IDs
CN111241329A (en) Image retrieval-based ancient character interpretation method and device
Milleville et al. Automatic extraction of specimens from multi-specimen herbaria

Legal Events

Date Code Title Description
PB01 Publication
PB01 Publication
SE01 Entry into force of request for substantive examination
SE01 Entry into force of request for substantive examination
WW01 Invention patent application withdrawn after publication
WW01 Invention patent application withdrawn after publication

Application publication date: 20210302