CN110889402A - Business license content identification method and system based on deep learning - Google Patents

Business license content identification method and system based on deep learning Download PDF

Info

Publication number
CN110889402A
CN110889402A CN201911067919.1A CN201911067919A CN110889402A CN 110889402 A CN110889402 A CN 110889402A CN 201911067919 A CN201911067919 A CN 201911067919A CN 110889402 A CN110889402 A CN 110889402A
Authority
CN
China
Prior art keywords
text
image
training
model
module
Prior art date
Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
Pending
Application number
CN201911067919.1A
Other languages
Chinese (zh)
Inventor
陈曦
蓝志坚
喻春霞
李海燕
Current Assignee (The listed assignees may be inaccurate. Google has not performed a legal analysis and makes no representation or warranty as to the accuracy of the list.)
Guangzhou Feng Shi Technology Co Ltd
Original Assignee
Guangzhou Feng Shi Technology Co Ltd
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
Filing date
Publication date
Application filed by Guangzhou Feng Shi Technology Co Ltd filed Critical Guangzhou Feng Shi Technology Co Ltd
Priority to CN201911067919.1A priority Critical patent/CN110889402A/en
Publication of CN110889402A publication Critical patent/CN110889402A/en
Pending legal-status Critical Current

Links

Images

Classifications

    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06VIMAGE OR VIDEO RECOGNITION OR UNDERSTANDING
    • G06V10/00Arrangements for image or video recognition or understanding
    • G06V10/20Image preprocessing
    • G06V10/22Image preprocessing by selection of a specific region containing or referencing a pattern; Locating or processing of specific regions to guide the detection or recognition
    • G06V10/225Image preprocessing by selection of a specific region containing or referencing a pattern; Locating or processing of specific regions to guide the detection or recognition based on a marking or identifier characterising the area
    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06FELECTRIC DIGITAL DATA PROCESSING
    • G06F18/00Pattern recognition
    • G06F18/20Analysing
    • G06F18/21Design or setup of recognition systems or techniques; Extraction of features in feature space; Blind source separation
    • G06F18/214Generating training patterns; Bootstrap methods, e.g. bagging or boosting
    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06NCOMPUTING ARRANGEMENTS BASED ON SPECIFIC COMPUTATIONAL MODELS
    • G06N3/00Computing arrangements based on biological models
    • G06N3/02Neural networks
    • G06N3/04Architecture, e.g. interconnection topology
    • G06N3/045Combinations of networks
    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06VIMAGE OR VIDEO RECOGNITION OR UNDERSTANDING
    • G06V30/00Character recognition; Recognising digital ink; Document-oriented image-based pattern recognition
    • G06V30/10Character recognition
    • G06V30/14Image acquisition
    • G06V30/148Segmentation of character regions
    • G06V30/153Segmentation of character regions using recognition of characters or words

Landscapes

  • Engineering & Computer Science (AREA)
  • Theoretical Computer Science (AREA)
  • Physics & Mathematics (AREA)
  • General Physics & Mathematics (AREA)
  • Data Mining & Analysis (AREA)
  • Multimedia (AREA)
  • Life Sciences & Earth Sciences (AREA)
  • Artificial Intelligence (AREA)
  • Computer Vision & Pattern Recognition (AREA)
  • Evolutionary Computation (AREA)
  • General Engineering & Computer Science (AREA)
  • Computing Systems (AREA)
  • Bioinformatics & Cheminformatics (AREA)
  • General Health & Medical Sciences (AREA)
  • Computational Linguistics (AREA)
  • Mathematical Physics (AREA)
  • Software Systems (AREA)
  • Biophysics (AREA)
  • Molecular Biology (AREA)
  • Bioinformatics & Computational Biology (AREA)
  • Biomedical Technology (AREA)
  • Evolutionary Biology (AREA)
  • Health & Medical Sciences (AREA)
  • Image Analysis (AREA)
  • Character Input (AREA)
  • Character Discrimination (AREA)

Abstract

The invention discloses a business license content identification method and a system based on deep learning, wherein the method comprises the following steps: collecting images of business licenses for preprocessing, wherein the preprocessing comprises the following steps: graying, filtering and denoising, image binarization and tilt correction; constructing a text detection model, and respectively performing primary training and secondary training; constructing a text recognition model based on a convolutional neural network, and training the text recognition model by using a training sample generated randomly; inputting the preprocessed business license image into a text detection model obtained by secondary training, outputting a text line image, recognizing the text line image by using the trained text recognition model, and outputting text information; and performing semantic analysis on the text information, and connecting the contents of the same text line in series to obtain a final result of content identification of the business license. The invention reduces the amount of training samples, overcomes the defect of high difficulty in character cutting and improves the content recognition rate.

Description

Business license content identification method and system based on deep learning
Technical Field
The invention relates to the field of image content identification, in particular to a business license content identification method and system based on deep learning.
Background
Optical Character Recognition (OCR) is now mainly applied to document recognition and document recognition. The certificate identification is realized by digitizing a certificate original, a scanned part and a copied part, converting the certificate original, the scanned part and the copied part into pictures and then identifying the certificate content through texts, so that the working efficiency is improved, and the working intensity is reduced. Three key techniques of OCR in conventional image processing: character region detection, character cutting and recognition. Text region detection, also called text detection, extracts a text information region in an image. Text detection methods are now broadly divided into layout analysis, which extracts target regions from images using a feature extraction method, and deep learning, which is automatic recognition and extraction of text regions of document images. The character cutting is to divide the extracted text area into single characters according to lines. The recognition is to recognize the divided single characters one by one.
The optical character recognition method in the current image processing mainly has the following problems:
the training samples are difficult to collect in large quantities
When the content identification of the license certificate based on deep learning is needed, the deep neural network training using a large number of samples cannot be avoided. However, it is difficult to collect a large number of certificate pictures such as a license as a training sample.
The difficulty of character cutting is large
The single character segmentation of one line of characters is greatly influenced by language characters, and for multi-language mixture, for example, the certificate address type characters contain Chinese, numbers, English, symbols and the like, the difficulty of character segmentation is greatly increased. And the segmentation of the characters is basically the projection method adopted at present.
The recognition accuracy of the credit codes in the business license is low
For the application scenarios of OCR of certificate types such as business licenses, there are often strict requirements on the accuracy of recognition. Among them, the "unified social credit code" in the license is an important factor influencing the recognition rate. This is because the credit code is composed of numbers and letters, and the connection is compact, which is likely to cause false recognition and missed recognition.
In summary, the existing business license content identification method based on deep learning needs a large number of training samples, is difficult to cut characters, and has a low identification rate.
Disclosure of Invention
The invention provides a business license content identification method and system based on deep learning, aiming at overcoming the defects of large quantity of required training samples, high character cutting difficulty and low identification rate of the business license content identification method based on deep learning in the prior art.
The primary objective of the present invention is to solve the above technical problems, and the technical solution of the present invention is as follows:
the invention provides a business license content identification method based on deep learning, which comprises the following steps:
s1: collecting images of business licenses;
s2: pre-processing the acquired license images, the pre-processing comprising: graying, filtering and denoising, image binarization and tilt correction;
s3: constructing a text detection model, performing primary training by using an open source text detection data set, constructing a pre-labeled business license image data set by using the preprocessed images with set proportion, performing secondary training on the primarily trained detection model by using the pre-labeled business license image data set,
s4: constructing a text recognition model based on a convolutional neural network, and training the text recognition model by using a randomly generated training sample to obtain a trained text recognition model;
s5: inputting the preprocessed business license image into a text detection model obtained by secondary training, outputting a text line image, recognizing the text line image by using the trained text recognition model, and outputting text information;
s6: and performing semantic analysis on the text information identified by the text identification model, and connecting the contents of the same text line in series to obtain the final result of identifying the contents of the business license.
Further, the pretreatment specifically comprises:
carrying out gray processing on the collected business license image by adopting a weighted average method;
carrying out median filtering denoising on the image subjected to graying;
carrying out image binarization on the denoised image by using a point-by-point method;
and performing inclination correction on the image subjected to the binarization processing through perspective transformation.
Further, the text detection model is a fast RCNN model or a CTPN model or a SegLink model or an EAST model.
Further, the convolutional neural network-based text recognition model is a DenseNet + CTC text recognition model.
Based on the method, the invention also provides a business license content identification system based on deep learning, and the system comprises: the system comprises an image acquisition module, an image preprocessing module, a text detection module, a random sample generation module, a text recognition module and a text information integration module, wherein the image acquisition module is used for acquiring a complete business license image;
the image preprocessing module is used for preprocessing the acquired business license image, and the preprocessing comprises the following steps: graying, filtering and denoising, image binarization and tilt correction;
the text detection module is used for performing text line detection on the preprocessed image and outputting a text line image;
the random sample generation module provides a random generation training sample for the text recognition module;
the text recognition module is used for performing text recognition on the text line image and outputting text information;
the integrated text information module is used for performing semantic analysis on the text information output by the text recognition module and connecting the contents of the same text line in series to obtain the final result of content recognition of the business license.
Further, the randomly generated training samples provided by the random sample generation module are divided into a training data set and a verification data set according to a preset proportion.
Further, the randomly generated training sample provided by the random sample generation module contains preset noise and preset distortion characteristic amplitude.
Compared with the prior art, the technical scheme of the invention has the beneficial effects that:
according to the method, the training of the text detection model is divided into two stages, so that the number of samples required by training is reduced, and the accuracy of text detection is improved; the training of the generated random training samples on the text recognition model improves the accuracy of text recognition, and the defect that the traditional character recognition method needs to cut characters is overcome through recognition of text line images.
Drawings
FIG. 1 is a flow chart of the method of the present invention.
Detailed Description
In order that the above objects, features and advantages of the present invention can be more clearly understood, a more particular description of the invention will be rendered by reference to the appended drawings. It should be noted that the embodiments and features of the embodiments of the present application may be combined with each other without conflict.
In the following description, numerous specific details are set forth in order to provide a thorough understanding of the present invention, however, the present invention may be practiced in other ways than those specifically described herein, and therefore the scope of the present invention is not limited by the specific embodiments disclosed below.
Example 1
Fig. 1 shows a flowchart of a license content identification method based on deep learning.
The invention provides a business license content identification method based on deep learning, which comprises the following steps:
s1: collecting images of business licenses;
in a specific embodiment, the complete image of the business license is collected by an image collecting device, which includes but is not limited to a camera, a smart phone, a computer and a tablet computer, and can also receive the image of the business license collected and transmitted by other manners.
S2: pre-processing the acquired license images, the pre-processing comprising: graying, filtering and denoising, image binarization and tilt correction;
in a specific embodiment, the collected business license image is preprocessed, and the license image is grayed by adopting a weighted average method. And (3) selecting a self-adaptive median filtering method to carry out filtering and denoising on the image, wherein the filtering and denoising are carried out to ensure that the boundary characteristics of the image are not blurred. And (4) binarizing the denoised gray level image by using a point-by-point method, wherein the image binarization operation can highlight interested target content. In addition, in the plane image processing, due to a lens angle and the like, an image is prone to be inclined, deformed and the like, and for convenience of subsequent processing, the image is required to be inclined and corrected, and the deformed image can be corrected through perspective transformation.
Perspective Transformation (Perspective Transformation) is a nonlinear Transformation in three-dimensional space, which essentially projects the original image to a new viewing plane by a 3 × 3 Transformation matrix, and the visual intuitive expression is to generate or eliminate the sense of distance and proximity.
S3: constructing a text detection model, performing primary training by using an open source text detection data set, constructing a pre-labeled business license image data set by using the preprocessed images in a set proportion, and performing secondary training on the detection model after the primary training by using the pre-labeled business license image data set;
in a specific embodiment, an open-source text detection dataset of multilingual scene text detection and script recognition (MLT) may be used to perform preliminary training on a text detection model; the available text detection models include fast RCNN, CTPN, SegLink, EAST, etc., the fast RCNN is a general target detection model, and the latter three are network models optimized for text detection. The preliminarily trained text detection model can be applied to text information detection of business license pictures. However, due to the effect of text typesetting and font size of a license, the text detection effect cannot meet the actual requirement. Extracting the pre-processed license image with a set proportion (such as 20%), manually labeling the text box to form a pre-labeled license image data set, and performing secondary training on the preliminarily trained detection model by using the pre-labeled license image data set.
It should be noted that, by adopting two stages of text detection model training, the number of labels of business license image data required by the model training can be effectively reduced, and the accuracy of text detection is improved.
In a specific embodiment, a CTPN (connectionist Text forward network) Text detection model may be used, which converts a Text detection task into detection of a series of small-scale Text boxes.
S4: constructing a text recognition model based on a convolutional neural network, and training the text recognition model by using a random training sample generated by a preset text library to obtain a trained text recognition model;
in a specific embodiment, the random training samples include training samples with various preset noise and distortion characteristic amplitudes, that is, text pictures containing only one line of characters.
It should be noted that the text library for random sample generation includes various types of materials, such as news, encyclopedia, articles, and the like. The related various common Chinese characters, English, numbers, symbols and the like are enough to correspond to the content information identification in the business license.
In addition, a large number of training samples are generated by simulating the combination rule and the style of 'unified social credit codes' in a business license, so that the accuracy of the text recognition model is effectively improved.
The random sample generator may be divided into a training data set and a validation data set on a 99:1 scale.
S5: inputting the preprocessed business license image into a text detection model obtained by secondary training, outputting a text line image, recognizing the text line image by using the trained text recognition model, and outputting text information;
s6: and performing semantic analysis on the text information identified by the text identification model, and connecting the contents of the same text line in series to obtain the final result of identifying the contents of the business license.
In the invention, a DenseNet + CTC model is adopted, the DenseNet breaks away from the fixed thinking of deepening the network layer number (ResNet) and widening the network structure (inclusion) to improve the network performance, and in view of characteristics, through characteristic reuse and Bypass (Bypass) setting, the parameter quantity of the network is greatly reduced, and the generation of the gradientvanising problem is relieved to a certain extent. The network structure of DenseNet consists mainly of DenseBlock and Transition.
In the DenseBlock, the BN + ReLU +3x3 Conv structure is adopted, the feature maps of all layers are consistent in size and can be connected in channel dimension; for the Transition layer, mainly two adjacent DenseBlock are connected and the feature map size is reduced. The Transition layer comprises a convolution of 1x1 and AvgPooling of 2x2, with the structure BN + ReLU +1x1Conv +2x2 AvgPooling. In addition, the Transition layer can function as a compression model.
Ctc (connectionist Temporal classification), is a time-series classification algorithm that addresses the alignment of input data with a given tag.
And training the DenseNet + CTC text recognition model by using random training samples, and performing text recognition on the text line pictures output by the text detection model by using the trained model.
The text information identified by the text identification model is subjected to semantic analysis, and the text lines of the same attribute are concatenated, for example, the text information of each line in the text line image is subjected to semantic analysis to obtain the contents such as 'unified social credit code', 'number', 'name', 'type', 'address', and the like. And integrating the text recognition result after semantic analysis as a final result of the content recognition system of the business license.
The semantic analysis can be realized through a semantic analysis model, and the training of the semantic analysis model is to perform model training by generating some regular information samples, for example, names generally end in "company", registered capital ends in "element", general rules of address texts, and the like.
The invention also provides a business license content identification system based on deep learning based on the method, and the system comprises: the system comprises an image acquisition module, an image preprocessing module, a text detection module, a random sample generation module, a text recognition module and a text information integration module, wherein the image acquisition module is used for acquiring a complete business license image;
the image preprocessing module is used for preprocessing the acquired business license image, and the preprocessing comprises the following steps: graying, filtering and denoising, image binarization and tilt correction;
the text detection module is used for performing text line detection on the preprocessed image and outputting a text line image;
the random sample generation module provides a random generation training sample for the text recognition module;
the text recognition module is used for performing text recognition on the text line image and outputting text information;
the integrated text information module is used for performing semantic analysis on the text information output by the text recognition module and connecting the contents of the same text line in series to obtain the final result of the content recognition system of the business license.
Further, the random training samples provided by the random sample generation module are divided into a training data set and a verification data set according to a preset proportion.
In one particular embodiment, the random training samples may be divided into a training data set and a validation data set on a 99:1 scale.
Further, the random training sample provided by the random sample generation module contains preset noise and preset distortion characteristic amplitude.
The integrated text information module is the last step of the whole business license content identification system and is used for performing semantic analysis on all text information output by the text identification model and connecting the contents of the same text line in series to obtain the final result of business license content identification.
It should be understood that the above-described embodiments of the present invention are merely examples for clearly illustrating the present invention, and are not intended to limit the embodiments of the present invention. Other variations and modifications will be apparent to persons skilled in the art in light of the above description. And are neither required nor exhaustive of all embodiments. Any modification, equivalent replacement, and improvement made within the spirit and principle of the present invention should be included in the protection scope of the claims of the present invention.

Claims (7)

1. A business license content identification method based on deep learning is characterized by comprising the following steps:
s1: collecting images of business licenses;
s2: pre-processing the acquired license images, the pre-processing comprising: graying, filtering and denoising, image binarization and tilt correction;
s3: constructing a text detection model, performing primary training by using an open source text detection data set, constructing a pre-labeled business license image data set by using the preprocessed image with a set proportion, and performing secondary training on the primarily trained detection model by using the pre-labeled business license image data set;
s4: constructing a text recognition model based on a convolutional neural network, and training the text recognition model by using a randomly generated training sample to obtain a trained text recognition model;
s5: inputting the preprocessed business license image into a text detection model obtained by secondary training, outputting a text line image, recognizing the text line image by using the trained text recognition model, and outputting text information;
s6: and performing semantic analysis on the text information identified by the text identification model, and connecting the contents of the same text line in series to obtain the final result of identifying the contents of the business license.
2. The method for license content recognition based on deep learning of claim 1, wherein the preprocessing is specifically:
carrying out gray processing on the collected business license image by adopting a weighted average method;
carrying out median filtering denoising on the image subjected to graying;
carrying out image binarization on the denoised image by using a point-by-point method;
and performing inclination correction on the image subjected to the binarization processing through perspective transformation.
3. The method for recognizing the contents of a business license based on deep learning of claim 1, wherein the text detection model is fast RCNN model or CTPN model or SegLink model or EAST model.
4. The method of claim 1, wherein the text recognition model based on convolutional neural network is a DenseNet + CTC text recognition model.
5. A deep learning based license content recognition system, the system comprising: the system comprises an image acquisition module, an image preprocessing module, a text detection module, a random sample generation module, a text recognition module and a text information integration module, wherein the image acquisition module is used for acquiring a complete business license image;
the image preprocessing module is used for preprocessing the acquired business license image, and the preprocessing comprises the following steps: graying, filtering and denoising, image binarization and tilt correction;
the text detection module is used for performing text line detection on the preprocessed image and outputting a text line image;
the random sample generation module provides a random generation training sample for the text recognition module;
the text recognition module is used for performing text recognition on the text line image and outputting text information;
the integrated text information module is used for performing semantic analysis on the text information output by the text recognition module and connecting the contents of the same text line in series to obtain the final result of content recognition of the business license.
6. The system for recognizing the contents of a business license based on deep learning of claim 5, wherein the randomly generated training samples provided by the randomly generated sample generating module are divided into the training data set and the verification data set according to a predetermined ratio.
7. The system for license content recognition based on deep learning of claim 5, wherein the randomly generated training samples provided by the random sample generation module include a preset noise and a preset amplitude of distortion features.
CN201911067919.1A 2019-11-04 2019-11-04 Business license content identification method and system based on deep learning Pending CN110889402A (en)

Priority Applications (1)

Application Number Priority Date Filing Date Title
CN201911067919.1A CN110889402A (en) 2019-11-04 2019-11-04 Business license content identification method and system based on deep learning

Applications Claiming Priority (1)

Application Number Priority Date Filing Date Title
CN201911067919.1A CN110889402A (en) 2019-11-04 2019-11-04 Business license content identification method and system based on deep learning

Publications (1)

Publication Number Publication Date
CN110889402A true CN110889402A (en) 2020-03-17

Family

ID=69746873

Family Applications (1)

Application Number Title Priority Date Filing Date
CN201911067919.1A Pending CN110889402A (en) 2019-11-04 2019-11-04 Business license content identification method and system based on deep learning

Country Status (1)

Country Link
CN (1) CN110889402A (en)

Cited By (20)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN111539406A (en) * 2020-04-21 2020-08-14 招商局金融科技有限公司 Certificate copy information identification method, server and storage medium
CN111985574A (en) * 2020-08-31 2020-11-24 平安医疗健康管理股份有限公司 Medical image recognition method, device, equipment and storage medium
CN112036330A (en) * 2020-09-02 2020-12-04 北京中油瑞飞信息技术有限责任公司 Text recognition method, text recognition device and readable storage medium
CN112163508A (en) * 2020-09-25 2021-01-01 中国电子科技集团公司第十五研究所 Character recognition method and system based on real scene and OCR terminal
CN112329774A (en) * 2020-11-10 2021-02-05 杭州微洱网络科技有限公司 Commodity size table automatic generation method based on image
CN112507914A (en) * 2020-12-15 2021-03-16 江苏国光信息产业股份有限公司 OCR (optical character recognition) method and recognition system based on bankbook and bill characters
CN112668335A (en) * 2020-12-21 2021-04-16 广州市申迪计算机系统有限公司 Method for identifying and extracting business license structured information by using named entity
CN112883953A (en) * 2021-02-22 2021-06-01 中国工商银行股份有限公司 Card recognition device and method based on joint learning
CN113160623A (en) * 2021-02-08 2021-07-23 杭州高低科技有限公司 Chinese character splicing interactive learning system and method based on magnetic cards
CN113673507A (en) * 2020-08-10 2021-11-19 广东电网有限责任公司 Electric power professional equipment nameplate recognition algorithm
CN113869131A (en) * 2021-09-01 2021-12-31 南京烽火天地通信科技有限公司 Method for structuring textualized business license picture
CN113963147A (en) * 2021-09-26 2022-01-21 西安交通大学 Key information extraction method and system based on semantic segmentation
CN114155530A (en) * 2021-11-10 2022-03-08 北京中科闻歌科技股份有限公司 Text recognition and question-answering method, device, equipment and medium
CN114359928A (en) * 2022-01-12 2022-04-15 平安科技(深圳)有限公司 Electronic invoice identification method and device, computer equipment and storage medium
CN114596573A (en) * 2022-03-22 2022-06-07 中国平安人寿保险股份有限公司 Birth certificate identification method and device, computer equipment and storage medium
CN115376142A (en) * 2022-07-20 2022-11-22 北大荒信息有限公司 Image-based business license information extraction method, computer equipment and readable storage medium
CN115830620A (en) * 2023-02-14 2023-03-21 江苏联著实业股份有限公司 Archive text data processing method and system based on OCR
CN116681628A (en) * 2023-08-03 2023-09-01 湖南华菱电子商务有限公司 Business license data processing method and system based on deep learning
CN116912845A (en) * 2023-06-16 2023-10-20 广东电网有限责任公司佛山供电局 Intelligent content identification and analysis method and device based on NLP and AI
CN117197816A (en) * 2023-06-19 2023-12-08 珠海盈米基金销售有限公司 User material identification method and system

Citations (4)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN108764226A (en) * 2018-04-13 2018-11-06 顺丰科技有限公司 Image text recognition methods, device, equipment and its storage medium
CN110070045A (en) * 2019-04-23 2019-07-30 杭州智趣智能信息技术有限公司 A kind of text recognition method of business license, system and associated component
CN110135424A (en) * 2019-05-23 2019-08-16 阳光保险集团股份有限公司 Tilt text detection model training method and ticket image Method for text detection
CN110363196A (en) * 2019-06-20 2019-10-22 吴晓东 It is a kind of tilt text text precisely know method for distinguishing

Patent Citations (4)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN108764226A (en) * 2018-04-13 2018-11-06 顺丰科技有限公司 Image text recognition methods, device, equipment and its storage medium
CN110070045A (en) * 2019-04-23 2019-07-30 杭州智趣智能信息技术有限公司 A kind of text recognition method of business license, system and associated component
CN110135424A (en) * 2019-05-23 2019-08-16 阳光保险集团股份有限公司 Tilt text detection model training method and ticket image Method for text detection
CN110363196A (en) * 2019-06-20 2019-10-22 吴晓东 It is a kind of tilt text text precisely know method for distinguishing

Cited By (28)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN111539406A (en) * 2020-04-21 2020-08-14 招商局金融科技有限公司 Certificate copy information identification method, server and storage medium
CN111539406B (en) * 2020-04-21 2023-04-18 招商局金融科技有限公司 Certificate copy information identification method, server and storage medium
CN113673507A (en) * 2020-08-10 2021-11-19 广东电网有限责任公司 Electric power professional equipment nameplate recognition algorithm
CN111985574A (en) * 2020-08-31 2020-11-24 平安医疗健康管理股份有限公司 Medical image recognition method, device, equipment and storage medium
CN112036330A (en) * 2020-09-02 2020-12-04 北京中油瑞飞信息技术有限责任公司 Text recognition method, text recognition device and readable storage medium
CN112163508A (en) * 2020-09-25 2021-01-01 中国电子科技集团公司第十五研究所 Character recognition method and system based on real scene and OCR terminal
CN112329774A (en) * 2020-11-10 2021-02-05 杭州微洱网络科技有限公司 Commodity size table automatic generation method based on image
CN112507914A (en) * 2020-12-15 2021-03-16 江苏国光信息产业股份有限公司 OCR (optical character recognition) method and recognition system based on bankbook and bill characters
CN112668335A (en) * 2020-12-21 2021-04-16 广州市申迪计算机系统有限公司 Method for identifying and extracting business license structured information by using named entity
CN112668335B (en) * 2020-12-21 2024-05-31 广州市申迪计算机系统有限公司 Method for identifying and extracting business license structured information by using named entity
CN113160623A (en) * 2021-02-08 2021-07-23 杭州高低科技有限公司 Chinese character splicing interactive learning system and method based on magnetic cards
CN112883953A (en) * 2021-02-22 2021-06-01 中国工商银行股份有限公司 Card recognition device and method based on joint learning
CN113869131A (en) * 2021-09-01 2021-12-31 南京烽火天地通信科技有限公司 Method for structuring textualized business license picture
CN113869131B (en) * 2021-09-01 2024-03-29 南京烽火天地通信科技有限公司 Method for structuring text business license picture
CN113963147A (en) * 2021-09-26 2022-01-21 西安交通大学 Key information extraction method and system based on semantic segmentation
CN113963147B (en) * 2021-09-26 2023-09-15 西安交通大学 Key information extraction method and system based on semantic segmentation
CN114155530A (en) * 2021-11-10 2022-03-08 北京中科闻歌科技股份有限公司 Text recognition and question-answering method, device, equipment and medium
CN114359928A (en) * 2022-01-12 2022-04-15 平安科技(深圳)有限公司 Electronic invoice identification method and device, computer equipment and storage medium
CN114359928B (en) * 2022-01-12 2024-10-01 平安科技(深圳)有限公司 Electronic invoice identification method and device, computer equipment and storage medium
CN114596573A (en) * 2022-03-22 2022-06-07 中国平安人寿保险股份有限公司 Birth certificate identification method and device, computer equipment and storage medium
CN115376142B (en) * 2022-07-20 2023-09-01 北大荒信息有限公司 Image-based business license information extraction method, computer equipment and readable storage medium
CN115376142A (en) * 2022-07-20 2022-11-22 北大荒信息有限公司 Image-based business license information extraction method, computer equipment and readable storage medium
CN115830620A (en) * 2023-02-14 2023-03-21 江苏联著实业股份有限公司 Archive text data processing method and system based on OCR
CN116912845A (en) * 2023-06-16 2023-10-20 广东电网有限责任公司佛山供电局 Intelligent content identification and analysis method and device based on NLP and AI
CN116912845B (en) * 2023-06-16 2024-03-19 广东电网有限责任公司佛山供电局 Intelligent content identification and analysis method and device based on NLP and AI
CN117197816A (en) * 2023-06-19 2023-12-08 珠海盈米基金销售有限公司 User material identification method and system
CN116681628A (en) * 2023-08-03 2023-09-01 湖南华菱电子商务有限公司 Business license data processing method and system based on deep learning
CN116681628B (en) * 2023-08-03 2023-10-24 湖南华菱电子商务有限公司 Business license data processing method and system based on deep learning

Similar Documents

Publication Publication Date Title
CN110889402A (en) Business license content identification method and system based on deep learning
US10943105B2 (en) Document field detection and parsing
US9626555B2 (en) Content-based document image classification
Marinai Introduction to document analysis and recognition
KR101769918B1 (en) Recognition device based deep learning for extracting text from images
KR101377601B1 (en) System and method for providing recognition and translation of multiple language in natural scene image using mobile camera
CN112508011A (en) OCR (optical character recognition) method and device based on neural network
CN109670477B (en) PDF table-oriented automatic identification system and method
Čakić et al. The use of tesseract ocr number recognition for food tracking and tracing
Sidhwa et al. Text extraction from bills and invoices
KR101552525B1 (en) A system for recognizing a font and providing its information and the method thereof
CN112949455A (en) Value-added tax invoice identification system and method
WO2022103564A1 (en) Fraud detection via automated handwriting clustering
CN103559512B (en) A kind of Text region output intent and system
CN113673528B (en) Text processing method, text processing device, electronic equipment and readable storage medium
Jiju et al. OCR text extraction
CN115311666A (en) Image-text recognition method and device, computer equipment and storage medium
Karanje et al. Survey on text detection, segmentation and recognition from a natural scene images
Hung et al. Automatic vietnamese passport recognition on android phones
CN110414497A (en) Method, device, server and storage medium for electronizing object
CN118097688A (en) Universal certificate identification method based on large language model
Kumar et al. Line based robust script identification for indianlanguages
CN116844182A (en) Card character recognition method for automatically recognizing format
Duth et al. Recognition of hand written and printed text of cursive writing utilizing optical character recognition
CN115565193A (en) Questionnaire information input method and device, electronic equipment and storage medium

Legal Events

Date Code Title Description
PB01 Publication
PB01 Publication
SE01 Entry into force of request for substantive examination
SE01 Entry into force of request for substantive examination
RJ01 Rejection of invention patent application after publication
RJ01 Rejection of invention patent application after publication

Application publication date: 20200317