CN110889402A - Business license content identification method and system based on deep learning - Google Patents
Business license content identification method and system based on deep learning Download PDFInfo
- Publication number
- CN110889402A CN110889402A CN201911067919.1A CN201911067919A CN110889402A CN 110889402 A CN110889402 A CN 110889402A CN 201911067919 A CN201911067919 A CN 201911067919A CN 110889402 A CN110889402 A CN 110889402A
- Authority
- CN
- China
- Prior art keywords
- text
- image
- training
- model
- module
- Prior art date
- Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
- Pending
Links
- 238000000034 method Methods 0.000 title claims abstract description 30
- 238000013135 deep learning Methods 0.000 title claims abstract description 20
- 238000012549 training Methods 0.000 claims abstract description 57
- 238000001514 detection method Methods 0.000 claims abstract description 47
- 238000007781 pre-processing Methods 0.000 claims abstract description 21
- 238000001914 filtration Methods 0.000 claims abstract description 12
- 238000012937 correction Methods 0.000 claims abstract description 9
- 238000013527 convolutional neural network Methods 0.000 claims abstract description 6
- 238000012545 processing Methods 0.000 claims description 8
- 230000009466 transformation Effects 0.000 claims description 7
- 230000010354 integration Effects 0.000 claims description 3
- 238000012795 verification Methods 0.000 claims description 3
- 230000007547 defect Effects 0.000 abstract description 3
- 238000012015 optical character recognition Methods 0.000 description 5
- 230000007704 transition Effects 0.000 description 4
- 230000011218 segmentation Effects 0.000 description 3
- 230000000694 effects Effects 0.000 description 2
- 239000000284 extract Substances 0.000 description 2
- 238000000605 extraction Methods 0.000 description 2
- 230000004048 modification Effects 0.000 description 2
- 238000012986 modification Methods 0.000 description 2
- 238000010200 validation analysis Methods 0.000 description 2
- 238000013528 artificial neural network Methods 0.000 description 1
- 230000009286 beneficial effect Effects 0.000 description 1
- 238000007635 classification algorithm Methods 0.000 description 1
- 230000006835 compression Effects 0.000 description 1
- 238000007906 compression Methods 0.000 description 1
- 230000006870 function Effects 0.000 description 1
- 230000006872 improvement Effects 0.000 description 1
- 238000002372 labelling Methods 0.000 description 1
- 239000000463 material Substances 0.000 description 1
- 239000011159 matrix material Substances 0.000 description 1
- 239000000203 mixture Substances 0.000 description 1
- 230000002123 temporal effect Effects 0.000 description 1
- 230000000007 visual effect Effects 0.000 description 1
Images
Classifications
-
- G—PHYSICS
- G06—COMPUTING; CALCULATING OR COUNTING
- G06V—IMAGE OR VIDEO RECOGNITION OR UNDERSTANDING
- G06V10/00—Arrangements for image or video recognition or understanding
- G06V10/20—Image preprocessing
- G06V10/22—Image preprocessing by selection of a specific region containing or referencing a pattern; Locating or processing of specific regions to guide the detection or recognition
- G06V10/225—Image preprocessing by selection of a specific region containing or referencing a pattern; Locating or processing of specific regions to guide the detection or recognition based on a marking or identifier characterising the area
-
- G—PHYSICS
- G06—COMPUTING; CALCULATING OR COUNTING
- G06F—ELECTRIC DIGITAL DATA PROCESSING
- G06F18/00—Pattern recognition
- G06F18/20—Analysing
- G06F18/21—Design or setup of recognition systems or techniques; Extraction of features in feature space; Blind source separation
- G06F18/214—Generating training patterns; Bootstrap methods, e.g. bagging or boosting
-
- G—PHYSICS
- G06—COMPUTING; CALCULATING OR COUNTING
- G06N—COMPUTING ARRANGEMENTS BASED ON SPECIFIC COMPUTATIONAL MODELS
- G06N3/00—Computing arrangements based on biological models
- G06N3/02—Neural networks
- G06N3/04—Architecture, e.g. interconnection topology
- G06N3/045—Combinations of networks
-
- G—PHYSICS
- G06—COMPUTING; CALCULATING OR COUNTING
- G06V—IMAGE OR VIDEO RECOGNITION OR UNDERSTANDING
- G06V30/00—Character recognition; Recognising digital ink; Document-oriented image-based pattern recognition
- G06V30/10—Character recognition
- G06V30/14—Image acquisition
- G06V30/148—Segmentation of character regions
- G06V30/153—Segmentation of character regions using recognition of characters or words
Landscapes
- Engineering & Computer Science (AREA)
- Theoretical Computer Science (AREA)
- Physics & Mathematics (AREA)
- General Physics & Mathematics (AREA)
- Data Mining & Analysis (AREA)
- Multimedia (AREA)
- Life Sciences & Earth Sciences (AREA)
- Artificial Intelligence (AREA)
- Computer Vision & Pattern Recognition (AREA)
- Evolutionary Computation (AREA)
- General Engineering & Computer Science (AREA)
- Computing Systems (AREA)
- Bioinformatics & Cheminformatics (AREA)
- General Health & Medical Sciences (AREA)
- Computational Linguistics (AREA)
- Mathematical Physics (AREA)
- Software Systems (AREA)
- Biophysics (AREA)
- Molecular Biology (AREA)
- Bioinformatics & Computational Biology (AREA)
- Biomedical Technology (AREA)
- Evolutionary Biology (AREA)
- Health & Medical Sciences (AREA)
- Image Analysis (AREA)
- Character Input (AREA)
- Character Discrimination (AREA)
Abstract
The invention discloses a business license content identification method and a system based on deep learning, wherein the method comprises the following steps: collecting images of business licenses for preprocessing, wherein the preprocessing comprises the following steps: graying, filtering and denoising, image binarization and tilt correction; constructing a text detection model, and respectively performing primary training and secondary training; constructing a text recognition model based on a convolutional neural network, and training the text recognition model by using a training sample generated randomly; inputting the preprocessed business license image into a text detection model obtained by secondary training, outputting a text line image, recognizing the text line image by using the trained text recognition model, and outputting text information; and performing semantic analysis on the text information, and connecting the contents of the same text line in series to obtain a final result of content identification of the business license. The invention reduces the amount of training samples, overcomes the defect of high difficulty in character cutting and improves the content recognition rate.
Description
Technical Field
The invention relates to the field of image content identification, in particular to a business license content identification method and system based on deep learning.
Background
Optical Character Recognition (OCR) is now mainly applied to document recognition and document recognition. The certificate identification is realized by digitizing a certificate original, a scanned part and a copied part, converting the certificate original, the scanned part and the copied part into pictures and then identifying the certificate content through texts, so that the working efficiency is improved, and the working intensity is reduced. Three key techniques of OCR in conventional image processing: character region detection, character cutting and recognition. Text region detection, also called text detection, extracts a text information region in an image. Text detection methods are now broadly divided into layout analysis, which extracts target regions from images using a feature extraction method, and deep learning, which is automatic recognition and extraction of text regions of document images. The character cutting is to divide the extracted text area into single characters according to lines. The recognition is to recognize the divided single characters one by one.
The optical character recognition method in the current image processing mainly has the following problems:
the training samples are difficult to collect in large quantities
When the content identification of the license certificate based on deep learning is needed, the deep neural network training using a large number of samples cannot be avoided. However, it is difficult to collect a large number of certificate pictures such as a license as a training sample.
The difficulty of character cutting is large
The single character segmentation of one line of characters is greatly influenced by language characters, and for multi-language mixture, for example, the certificate address type characters contain Chinese, numbers, English, symbols and the like, the difficulty of character segmentation is greatly increased. And the segmentation of the characters is basically the projection method adopted at present.
The recognition accuracy of the credit codes in the business license is low
For the application scenarios of OCR of certificate types such as business licenses, there are often strict requirements on the accuracy of recognition. Among them, the "unified social credit code" in the license is an important factor influencing the recognition rate. This is because the credit code is composed of numbers and letters, and the connection is compact, which is likely to cause false recognition and missed recognition.
In summary, the existing business license content identification method based on deep learning needs a large number of training samples, is difficult to cut characters, and has a low identification rate.
Disclosure of Invention
The invention provides a business license content identification method and system based on deep learning, aiming at overcoming the defects of large quantity of required training samples, high character cutting difficulty and low identification rate of the business license content identification method based on deep learning in the prior art.
The primary objective of the present invention is to solve the above technical problems, and the technical solution of the present invention is as follows:
the invention provides a business license content identification method based on deep learning, which comprises the following steps:
s1: collecting images of business licenses;
s2: pre-processing the acquired license images, the pre-processing comprising: graying, filtering and denoising, image binarization and tilt correction;
s3: constructing a text detection model, performing primary training by using an open source text detection data set, constructing a pre-labeled business license image data set by using the preprocessed images with set proportion, performing secondary training on the primarily trained detection model by using the pre-labeled business license image data set,
s4: constructing a text recognition model based on a convolutional neural network, and training the text recognition model by using a randomly generated training sample to obtain a trained text recognition model;
s5: inputting the preprocessed business license image into a text detection model obtained by secondary training, outputting a text line image, recognizing the text line image by using the trained text recognition model, and outputting text information;
s6: and performing semantic analysis on the text information identified by the text identification model, and connecting the contents of the same text line in series to obtain the final result of identifying the contents of the business license.
Further, the pretreatment specifically comprises:
carrying out gray processing on the collected business license image by adopting a weighted average method;
carrying out median filtering denoising on the image subjected to graying;
carrying out image binarization on the denoised image by using a point-by-point method;
and performing inclination correction on the image subjected to the binarization processing through perspective transformation.
Further, the text detection model is a fast RCNN model or a CTPN model or a SegLink model or an EAST model.
Further, the convolutional neural network-based text recognition model is a DenseNet + CTC text recognition model.
Based on the method, the invention also provides a business license content identification system based on deep learning, and the system comprises: the system comprises an image acquisition module, an image preprocessing module, a text detection module, a random sample generation module, a text recognition module and a text information integration module, wherein the image acquisition module is used for acquiring a complete business license image;
the image preprocessing module is used for preprocessing the acquired business license image, and the preprocessing comprises the following steps: graying, filtering and denoising, image binarization and tilt correction;
the text detection module is used for performing text line detection on the preprocessed image and outputting a text line image;
the random sample generation module provides a random generation training sample for the text recognition module;
the text recognition module is used for performing text recognition on the text line image and outputting text information;
the integrated text information module is used for performing semantic analysis on the text information output by the text recognition module and connecting the contents of the same text line in series to obtain the final result of content recognition of the business license.
Further, the randomly generated training samples provided by the random sample generation module are divided into a training data set and a verification data set according to a preset proportion.
Further, the randomly generated training sample provided by the random sample generation module contains preset noise and preset distortion characteristic amplitude.
Compared with the prior art, the technical scheme of the invention has the beneficial effects that:
according to the method, the training of the text detection model is divided into two stages, so that the number of samples required by training is reduced, and the accuracy of text detection is improved; the training of the generated random training samples on the text recognition model improves the accuracy of text recognition, and the defect that the traditional character recognition method needs to cut characters is overcome through recognition of text line images.
Drawings
FIG. 1 is a flow chart of the method of the present invention.
Detailed Description
In order that the above objects, features and advantages of the present invention can be more clearly understood, a more particular description of the invention will be rendered by reference to the appended drawings. It should be noted that the embodiments and features of the embodiments of the present application may be combined with each other without conflict.
In the following description, numerous specific details are set forth in order to provide a thorough understanding of the present invention, however, the present invention may be practiced in other ways than those specifically described herein, and therefore the scope of the present invention is not limited by the specific embodiments disclosed below.
Example 1
Fig. 1 shows a flowchart of a license content identification method based on deep learning.
The invention provides a business license content identification method based on deep learning, which comprises the following steps:
s1: collecting images of business licenses;
in a specific embodiment, the complete image of the business license is collected by an image collecting device, which includes but is not limited to a camera, a smart phone, a computer and a tablet computer, and can also receive the image of the business license collected and transmitted by other manners.
S2: pre-processing the acquired license images, the pre-processing comprising: graying, filtering and denoising, image binarization and tilt correction;
in a specific embodiment, the collected business license image is preprocessed, and the license image is grayed by adopting a weighted average method. And (3) selecting a self-adaptive median filtering method to carry out filtering and denoising on the image, wherein the filtering and denoising are carried out to ensure that the boundary characteristics of the image are not blurred. And (4) binarizing the denoised gray level image by using a point-by-point method, wherein the image binarization operation can highlight interested target content. In addition, in the plane image processing, due to a lens angle and the like, an image is prone to be inclined, deformed and the like, and for convenience of subsequent processing, the image is required to be inclined and corrected, and the deformed image can be corrected through perspective transformation.
Perspective Transformation (Perspective Transformation) is a nonlinear Transformation in three-dimensional space, which essentially projects the original image to a new viewing plane by a 3 × 3 Transformation matrix, and the visual intuitive expression is to generate or eliminate the sense of distance and proximity.
S3: constructing a text detection model, performing primary training by using an open source text detection data set, constructing a pre-labeled business license image data set by using the preprocessed images in a set proportion, and performing secondary training on the detection model after the primary training by using the pre-labeled business license image data set;
in a specific embodiment, an open-source text detection dataset of multilingual scene text detection and script recognition (MLT) may be used to perform preliminary training on a text detection model; the available text detection models include fast RCNN, CTPN, SegLink, EAST, etc., the fast RCNN is a general target detection model, and the latter three are network models optimized for text detection. The preliminarily trained text detection model can be applied to text information detection of business license pictures. However, due to the effect of text typesetting and font size of a license, the text detection effect cannot meet the actual requirement. Extracting the pre-processed license image with a set proportion (such as 20%), manually labeling the text box to form a pre-labeled license image data set, and performing secondary training on the preliminarily trained detection model by using the pre-labeled license image data set.
It should be noted that, by adopting two stages of text detection model training, the number of labels of business license image data required by the model training can be effectively reduced, and the accuracy of text detection is improved.
In a specific embodiment, a CTPN (connectionist Text forward network) Text detection model may be used, which converts a Text detection task into detection of a series of small-scale Text boxes.
S4: constructing a text recognition model based on a convolutional neural network, and training the text recognition model by using a random training sample generated by a preset text library to obtain a trained text recognition model;
in a specific embodiment, the random training samples include training samples with various preset noise and distortion characteristic amplitudes, that is, text pictures containing only one line of characters.
It should be noted that the text library for random sample generation includes various types of materials, such as news, encyclopedia, articles, and the like. The related various common Chinese characters, English, numbers, symbols and the like are enough to correspond to the content information identification in the business license.
In addition, a large number of training samples are generated by simulating the combination rule and the style of 'unified social credit codes' in a business license, so that the accuracy of the text recognition model is effectively improved.
The random sample generator may be divided into a training data set and a validation data set on a 99:1 scale.
S5: inputting the preprocessed business license image into a text detection model obtained by secondary training, outputting a text line image, recognizing the text line image by using the trained text recognition model, and outputting text information;
s6: and performing semantic analysis on the text information identified by the text identification model, and connecting the contents of the same text line in series to obtain the final result of identifying the contents of the business license.
In the invention, a DenseNet + CTC model is adopted, the DenseNet breaks away from the fixed thinking of deepening the network layer number (ResNet) and widening the network structure (inclusion) to improve the network performance, and in view of characteristics, through characteristic reuse and Bypass (Bypass) setting, the parameter quantity of the network is greatly reduced, and the generation of the gradientvanising problem is relieved to a certain extent. The network structure of DenseNet consists mainly of DenseBlock and Transition.
In the DenseBlock, the BN + ReLU +3x3 Conv structure is adopted, the feature maps of all layers are consistent in size and can be connected in channel dimension; for the Transition layer, mainly two adjacent DenseBlock are connected and the feature map size is reduced. The Transition layer comprises a convolution of 1x1 and AvgPooling of 2x2, with the structure BN + ReLU +1x1Conv +2x2 AvgPooling. In addition, the Transition layer can function as a compression model.
Ctc (connectionist Temporal classification), is a time-series classification algorithm that addresses the alignment of input data with a given tag.
And training the DenseNet + CTC text recognition model by using random training samples, and performing text recognition on the text line pictures output by the text detection model by using the trained model.
The text information identified by the text identification model is subjected to semantic analysis, and the text lines of the same attribute are concatenated, for example, the text information of each line in the text line image is subjected to semantic analysis to obtain the contents such as 'unified social credit code', 'number', 'name', 'type', 'address', and the like. And integrating the text recognition result after semantic analysis as a final result of the content recognition system of the business license.
The semantic analysis can be realized through a semantic analysis model, and the training of the semantic analysis model is to perform model training by generating some regular information samples, for example, names generally end in "company", registered capital ends in "element", general rules of address texts, and the like.
The invention also provides a business license content identification system based on deep learning based on the method, and the system comprises: the system comprises an image acquisition module, an image preprocessing module, a text detection module, a random sample generation module, a text recognition module and a text information integration module, wherein the image acquisition module is used for acquiring a complete business license image;
the image preprocessing module is used for preprocessing the acquired business license image, and the preprocessing comprises the following steps: graying, filtering and denoising, image binarization and tilt correction;
the text detection module is used for performing text line detection on the preprocessed image and outputting a text line image;
the random sample generation module provides a random generation training sample for the text recognition module;
the text recognition module is used for performing text recognition on the text line image and outputting text information;
the integrated text information module is used for performing semantic analysis on the text information output by the text recognition module and connecting the contents of the same text line in series to obtain the final result of the content recognition system of the business license.
Further, the random training samples provided by the random sample generation module are divided into a training data set and a verification data set according to a preset proportion.
In one particular embodiment, the random training samples may be divided into a training data set and a validation data set on a 99:1 scale.
Further, the random training sample provided by the random sample generation module contains preset noise and preset distortion characteristic amplitude.
The integrated text information module is the last step of the whole business license content identification system and is used for performing semantic analysis on all text information output by the text identification model and connecting the contents of the same text line in series to obtain the final result of business license content identification.
It should be understood that the above-described embodiments of the present invention are merely examples for clearly illustrating the present invention, and are not intended to limit the embodiments of the present invention. Other variations and modifications will be apparent to persons skilled in the art in light of the above description. And are neither required nor exhaustive of all embodiments. Any modification, equivalent replacement, and improvement made within the spirit and principle of the present invention should be included in the protection scope of the claims of the present invention.
Claims (7)
1. A business license content identification method based on deep learning is characterized by comprising the following steps:
s1: collecting images of business licenses;
s2: pre-processing the acquired license images, the pre-processing comprising: graying, filtering and denoising, image binarization and tilt correction;
s3: constructing a text detection model, performing primary training by using an open source text detection data set, constructing a pre-labeled business license image data set by using the preprocessed image with a set proportion, and performing secondary training on the primarily trained detection model by using the pre-labeled business license image data set;
s4: constructing a text recognition model based on a convolutional neural network, and training the text recognition model by using a randomly generated training sample to obtain a trained text recognition model;
s5: inputting the preprocessed business license image into a text detection model obtained by secondary training, outputting a text line image, recognizing the text line image by using the trained text recognition model, and outputting text information;
s6: and performing semantic analysis on the text information identified by the text identification model, and connecting the contents of the same text line in series to obtain the final result of identifying the contents of the business license.
2. The method for license content recognition based on deep learning of claim 1, wherein the preprocessing is specifically:
carrying out gray processing on the collected business license image by adopting a weighted average method;
carrying out median filtering denoising on the image subjected to graying;
carrying out image binarization on the denoised image by using a point-by-point method;
and performing inclination correction on the image subjected to the binarization processing through perspective transformation.
3. The method for recognizing the contents of a business license based on deep learning of claim 1, wherein the text detection model is fast RCNN model or CTPN model or SegLink model or EAST model.
4. The method of claim 1, wherein the text recognition model based on convolutional neural network is a DenseNet + CTC text recognition model.
5. A deep learning based license content recognition system, the system comprising: the system comprises an image acquisition module, an image preprocessing module, a text detection module, a random sample generation module, a text recognition module and a text information integration module, wherein the image acquisition module is used for acquiring a complete business license image;
the image preprocessing module is used for preprocessing the acquired business license image, and the preprocessing comprises the following steps: graying, filtering and denoising, image binarization and tilt correction;
the text detection module is used for performing text line detection on the preprocessed image and outputting a text line image;
the random sample generation module provides a random generation training sample for the text recognition module;
the text recognition module is used for performing text recognition on the text line image and outputting text information;
the integrated text information module is used for performing semantic analysis on the text information output by the text recognition module and connecting the contents of the same text line in series to obtain the final result of content recognition of the business license.
6. The system for recognizing the contents of a business license based on deep learning of claim 5, wherein the randomly generated training samples provided by the randomly generated sample generating module are divided into the training data set and the verification data set according to a predetermined ratio.
7. The system for license content recognition based on deep learning of claim 5, wherein the randomly generated training samples provided by the random sample generation module include a preset noise and a preset amplitude of distortion features.
Priority Applications (1)
Application Number | Priority Date | Filing Date | Title |
---|---|---|---|
CN201911067919.1A CN110889402A (en) | 2019-11-04 | 2019-11-04 | Business license content identification method and system based on deep learning |
Applications Claiming Priority (1)
Application Number | Priority Date | Filing Date | Title |
---|---|---|---|
CN201911067919.1A CN110889402A (en) | 2019-11-04 | 2019-11-04 | Business license content identification method and system based on deep learning |
Publications (1)
Publication Number | Publication Date |
---|---|
CN110889402A true CN110889402A (en) | 2020-03-17 |
Family
ID=69746873
Family Applications (1)
Application Number | Title | Priority Date | Filing Date |
---|---|---|---|
CN201911067919.1A Pending CN110889402A (en) | 2019-11-04 | 2019-11-04 | Business license content identification method and system based on deep learning |
Country Status (1)
Country | Link |
---|---|
CN (1) | CN110889402A (en) |
Cited By (20)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
CN111539406A (en) * | 2020-04-21 | 2020-08-14 | 招商局金融科技有限公司 | Certificate copy information identification method, server and storage medium |
CN111985574A (en) * | 2020-08-31 | 2020-11-24 | 平安医疗健康管理股份有限公司 | Medical image recognition method, device, equipment and storage medium |
CN112036330A (en) * | 2020-09-02 | 2020-12-04 | 北京中油瑞飞信息技术有限责任公司 | Text recognition method, text recognition device and readable storage medium |
CN112163508A (en) * | 2020-09-25 | 2021-01-01 | 中国电子科技集团公司第十五研究所 | Character recognition method and system based on real scene and OCR terminal |
CN112329774A (en) * | 2020-11-10 | 2021-02-05 | 杭州微洱网络科技有限公司 | Commodity size table automatic generation method based on image |
CN112507914A (en) * | 2020-12-15 | 2021-03-16 | 江苏国光信息产业股份有限公司 | OCR (optical character recognition) method and recognition system based on bankbook and bill characters |
CN112668335A (en) * | 2020-12-21 | 2021-04-16 | 广州市申迪计算机系统有限公司 | Method for identifying and extracting business license structured information by using named entity |
CN112883953A (en) * | 2021-02-22 | 2021-06-01 | 中国工商银行股份有限公司 | Card recognition device and method based on joint learning |
CN113160623A (en) * | 2021-02-08 | 2021-07-23 | 杭州高低科技有限公司 | Chinese character splicing interactive learning system and method based on magnetic cards |
CN113673507A (en) * | 2020-08-10 | 2021-11-19 | 广东电网有限责任公司 | Electric power professional equipment nameplate recognition algorithm |
CN113869131A (en) * | 2021-09-01 | 2021-12-31 | 南京烽火天地通信科技有限公司 | Method for structuring textualized business license picture |
CN113963147A (en) * | 2021-09-26 | 2022-01-21 | 西安交通大学 | Key information extraction method and system based on semantic segmentation |
CN114155530A (en) * | 2021-11-10 | 2022-03-08 | 北京中科闻歌科技股份有限公司 | Text recognition and question-answering method, device, equipment and medium |
CN114359928A (en) * | 2022-01-12 | 2022-04-15 | 平安科技(深圳)有限公司 | Electronic invoice identification method and device, computer equipment and storage medium |
CN114596573A (en) * | 2022-03-22 | 2022-06-07 | 中国平安人寿保险股份有限公司 | Birth certificate identification method and device, computer equipment and storage medium |
CN115376142A (en) * | 2022-07-20 | 2022-11-22 | 北大荒信息有限公司 | Image-based business license information extraction method, computer equipment and readable storage medium |
CN115830620A (en) * | 2023-02-14 | 2023-03-21 | 江苏联著实业股份有限公司 | Archive text data processing method and system based on OCR |
CN116681628A (en) * | 2023-08-03 | 2023-09-01 | 湖南华菱电子商务有限公司 | Business license data processing method and system based on deep learning |
CN116912845A (en) * | 2023-06-16 | 2023-10-20 | 广东电网有限责任公司佛山供电局 | Intelligent content identification and analysis method and device based on NLP and AI |
CN117197816A (en) * | 2023-06-19 | 2023-12-08 | 珠海盈米基金销售有限公司 | User material identification method and system |
Citations (4)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
CN108764226A (en) * | 2018-04-13 | 2018-11-06 | 顺丰科技有限公司 | Image text recognition methods, device, equipment and its storage medium |
CN110070045A (en) * | 2019-04-23 | 2019-07-30 | 杭州智趣智能信息技术有限公司 | A kind of text recognition method of business license, system and associated component |
CN110135424A (en) * | 2019-05-23 | 2019-08-16 | 阳光保险集团股份有限公司 | Tilt text detection model training method and ticket image Method for text detection |
CN110363196A (en) * | 2019-06-20 | 2019-10-22 | 吴晓东 | It is a kind of tilt text text precisely know method for distinguishing |
-
2019
- 2019-11-04 CN CN201911067919.1A patent/CN110889402A/en active Pending
Patent Citations (4)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
CN108764226A (en) * | 2018-04-13 | 2018-11-06 | 顺丰科技有限公司 | Image text recognition methods, device, equipment and its storage medium |
CN110070045A (en) * | 2019-04-23 | 2019-07-30 | 杭州智趣智能信息技术有限公司 | A kind of text recognition method of business license, system and associated component |
CN110135424A (en) * | 2019-05-23 | 2019-08-16 | 阳光保险集团股份有限公司 | Tilt text detection model training method and ticket image Method for text detection |
CN110363196A (en) * | 2019-06-20 | 2019-10-22 | 吴晓东 | It is a kind of tilt text text precisely know method for distinguishing |
Cited By (28)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
CN111539406A (en) * | 2020-04-21 | 2020-08-14 | 招商局金融科技有限公司 | Certificate copy information identification method, server and storage medium |
CN111539406B (en) * | 2020-04-21 | 2023-04-18 | 招商局金融科技有限公司 | Certificate copy information identification method, server and storage medium |
CN113673507A (en) * | 2020-08-10 | 2021-11-19 | 广东电网有限责任公司 | Electric power professional equipment nameplate recognition algorithm |
CN111985574A (en) * | 2020-08-31 | 2020-11-24 | 平安医疗健康管理股份有限公司 | Medical image recognition method, device, equipment and storage medium |
CN112036330A (en) * | 2020-09-02 | 2020-12-04 | 北京中油瑞飞信息技术有限责任公司 | Text recognition method, text recognition device and readable storage medium |
CN112163508A (en) * | 2020-09-25 | 2021-01-01 | 中国电子科技集团公司第十五研究所 | Character recognition method and system based on real scene and OCR terminal |
CN112329774A (en) * | 2020-11-10 | 2021-02-05 | 杭州微洱网络科技有限公司 | Commodity size table automatic generation method based on image |
CN112507914A (en) * | 2020-12-15 | 2021-03-16 | 江苏国光信息产业股份有限公司 | OCR (optical character recognition) method and recognition system based on bankbook and bill characters |
CN112668335A (en) * | 2020-12-21 | 2021-04-16 | 广州市申迪计算机系统有限公司 | Method for identifying and extracting business license structured information by using named entity |
CN112668335B (en) * | 2020-12-21 | 2024-05-31 | 广州市申迪计算机系统有限公司 | Method for identifying and extracting business license structured information by using named entity |
CN113160623A (en) * | 2021-02-08 | 2021-07-23 | 杭州高低科技有限公司 | Chinese character splicing interactive learning system and method based on magnetic cards |
CN112883953A (en) * | 2021-02-22 | 2021-06-01 | 中国工商银行股份有限公司 | Card recognition device and method based on joint learning |
CN113869131A (en) * | 2021-09-01 | 2021-12-31 | 南京烽火天地通信科技有限公司 | Method for structuring textualized business license picture |
CN113869131B (en) * | 2021-09-01 | 2024-03-29 | 南京烽火天地通信科技有限公司 | Method for structuring text business license picture |
CN113963147A (en) * | 2021-09-26 | 2022-01-21 | 西安交通大学 | Key information extraction method and system based on semantic segmentation |
CN113963147B (en) * | 2021-09-26 | 2023-09-15 | 西安交通大学 | Key information extraction method and system based on semantic segmentation |
CN114155530A (en) * | 2021-11-10 | 2022-03-08 | 北京中科闻歌科技股份有限公司 | Text recognition and question-answering method, device, equipment and medium |
CN114359928A (en) * | 2022-01-12 | 2022-04-15 | 平安科技(深圳)有限公司 | Electronic invoice identification method and device, computer equipment and storage medium |
CN114359928B (en) * | 2022-01-12 | 2024-10-01 | 平安科技(深圳)有限公司 | Electronic invoice identification method and device, computer equipment and storage medium |
CN114596573A (en) * | 2022-03-22 | 2022-06-07 | 中国平安人寿保险股份有限公司 | Birth certificate identification method and device, computer equipment and storage medium |
CN115376142B (en) * | 2022-07-20 | 2023-09-01 | 北大荒信息有限公司 | Image-based business license information extraction method, computer equipment and readable storage medium |
CN115376142A (en) * | 2022-07-20 | 2022-11-22 | 北大荒信息有限公司 | Image-based business license information extraction method, computer equipment and readable storage medium |
CN115830620A (en) * | 2023-02-14 | 2023-03-21 | 江苏联著实业股份有限公司 | Archive text data processing method and system based on OCR |
CN116912845A (en) * | 2023-06-16 | 2023-10-20 | 广东电网有限责任公司佛山供电局 | Intelligent content identification and analysis method and device based on NLP and AI |
CN116912845B (en) * | 2023-06-16 | 2024-03-19 | 广东电网有限责任公司佛山供电局 | Intelligent content identification and analysis method and device based on NLP and AI |
CN117197816A (en) * | 2023-06-19 | 2023-12-08 | 珠海盈米基金销售有限公司 | User material identification method and system |
CN116681628A (en) * | 2023-08-03 | 2023-09-01 | 湖南华菱电子商务有限公司 | Business license data processing method and system based on deep learning |
CN116681628B (en) * | 2023-08-03 | 2023-10-24 | 湖南华菱电子商务有限公司 | Business license data processing method and system based on deep learning |
Similar Documents
Publication | Publication Date | Title |
---|---|---|
CN110889402A (en) | Business license content identification method and system based on deep learning | |
US10943105B2 (en) | Document field detection and parsing | |
US9626555B2 (en) | Content-based document image classification | |
Marinai | Introduction to document analysis and recognition | |
KR101769918B1 (en) | Recognition device based deep learning for extracting text from images | |
KR101377601B1 (en) | System and method for providing recognition and translation of multiple language in natural scene image using mobile camera | |
CN112508011A (en) | OCR (optical character recognition) method and device based on neural network | |
CN109670477B (en) | PDF table-oriented automatic identification system and method | |
Čakić et al. | The use of tesseract ocr number recognition for food tracking and tracing | |
Sidhwa et al. | Text extraction from bills and invoices | |
KR101552525B1 (en) | A system for recognizing a font and providing its information and the method thereof | |
CN112949455A (en) | Value-added tax invoice identification system and method | |
WO2022103564A1 (en) | Fraud detection via automated handwriting clustering | |
CN103559512B (en) | A kind of Text region output intent and system | |
CN113673528B (en) | Text processing method, text processing device, electronic equipment and readable storage medium | |
Jiju et al. | OCR text extraction | |
CN115311666A (en) | Image-text recognition method and device, computer equipment and storage medium | |
Karanje et al. | Survey on text detection, segmentation and recognition from a natural scene images | |
Hung et al. | Automatic vietnamese passport recognition on android phones | |
CN110414497A (en) | Method, device, server and storage medium for electronizing object | |
CN118097688A (en) | Universal certificate identification method based on large language model | |
Kumar et al. | Line based robust script identification for indianlanguages | |
CN116844182A (en) | Card character recognition method for automatically recognizing format | |
Duth et al. | Recognition of hand written and printed text of cursive writing utilizing optical character recognition | |
CN115565193A (en) | Questionnaire information input method and device, electronic equipment and storage medium |
Legal Events
Date | Code | Title | Description |
---|---|---|---|
PB01 | Publication | ||
PB01 | Publication | ||
SE01 | Entry into force of request for substantive examination | ||
SE01 | Entry into force of request for substantive examination | ||
RJ01 | Rejection of invention patent application after publication | ||
RJ01 | Rejection of invention patent application after publication |
Application publication date: 20200317 |