CN110135248A - Natural scene text detection method based on deep learning - Google Patents

Natural scene text detection method based on deep learning

Info

Publication number
CN110135248A
Authority
CN
China
Prior art keywords
text
natural scene
text detection
detection
deep learning
Prior art date
Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
Pending
Application number
CN201910270269.4A
Other languages
Chinese (zh)
Inventor
刘发贵
陈成
Current Assignee (The listed assignees may be inaccurate. Google has not performed a legal analysis and makes no representation or warranty as to the accuracy of the list.)
South China University of Technology SCUT
Original Assignee
South China University of Technology SCUT
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
Filing date
Publication date
Application filed by South China University of Technology SCUT
Priority to CN201910270269.4A
Publication of CN110135248A
Legal status: Pending

Classifications

    • G PHYSICS
    • G06 COMPUTING; CALCULATING OR COUNTING
    • G06N COMPUTING ARRANGEMENTS BASED ON SPECIFIC COMPUTATIONAL MODELS
    • G06N3/00 Computing arrangements based on biological models
    • G06N3/02 Neural networks
    • G06N3/04 Architecture, e.g. interconnection topology
    • G06N3/045 Combinations of networks
    • G PHYSICS
    • G06 COMPUTING; CALCULATING OR COUNTING
    • G06V IMAGE OR VIDEO RECOGNITION OR UNDERSTANDING
    • G06V30/00 Character recognition; Recognising digital ink; Document-oriented image-based pattern recognition
    • G06V30/40 Document-oriented image-based pattern recognition
    • G PHYSICS
    • G06 COMPUTING; CALCULATING OR COUNTING
    • G06V IMAGE OR VIDEO RECOGNITION OR UNDERSTANDING
    • G06V30/00 Character recognition; Recognising digital ink; Document-oriented image-based pattern recognition
    • G06V30/40 Document-oriented image-based pattern recognition
    • G06V30/41 Analysis of document content
    • G06V30/413 Classification of content, e.g. text, photographs or tables

Abstract

The invention discloses a natural scene text detection method based on deep learning. The method uses a CNN to extract multi-scale features of text, then encodes these features with an RNN to make full use of the contextual properties of text. The feature maps are then fed into an ROI pooling layer, which outputs a series of text proposals. After non-maximum suppression, a text connector links the resulting text proposals, giving flexible and efficient multi-scale, multi-oriented text detection. The invention improves the accuracy and recall of natural scene text detection under multi-orientation and variable-scale conditions.

Description

Natural scene text detection method based on deep learning
Technical field
The invention belongs to the technical field of image processing, and in particular relates to a natural scene text detection method based on deep learning.
Background art
Scene text detection is an important prerequisite for text recognition and is widely used in fields such as image retrieval, machine translation, and autonomous driving. However, text detection still faces many difficulties under complex backgrounds, multiple scales, multiple languages, uneven illumination, and blur.
Diversity and variability of natural scene text: compared with text in documents, natural scene text may be multi-scale and multilingual, and its shape, orientation, aspect ratio, and color may all vary. These variations pose many challenges for text detection.
Complex backgrounds: scene text may appear against arbitrary backgrounds, including signs, bricks, grass, and fences. These backgrounds may have features closely resembling text and can act as noise that interferes with the detection decision. In addition, occlusion by foreign objects can hide parts of the text and cause detection errors.
Uneven imaging quality: because acquisition conditions cannot be controlled, imaging quality cannot be guaranteed. Images to be detected may be distorted or out of focus because of different shooting angles or distances, or may contain noise and shadows because of different lighting at capture time.
Methods for natural scene text detection fall into two classes: traditional detection methods and detection methods based on deep learning. Traditional methods include texture-based methods, which use for example local intensity, filter responses, and wavelet coefficients, and region-based methods, such as the Stroke Width Transform (SWT), Maximally Stable Extremal Regions (MSER), and the Stroke Feature Transform (SFT). In recent years, with the development of deep neural networks, deep learning has shown growing advantages in computer vision. At present, the most popular approaches are deep learning methods based on convolutional neural networks (Convolutional Neural Networks, CNN). Deep learning has greatly improved the accuracy of text detection and has freed people from complex feature design work. Common deep-learning-based natural scene text detection models are usually built on general object detection models such as RCNN, YOLO, and SSD. The basic structure of these models is typically several convolutional and pooling layers that extract features, followed by fully connected layers that perform classification and regression of detection boxes.
Summary of the invention
To detect text in natural scenes more accurately and efficiently, and to solve the problem of detecting multi-oriented, variable-scale text in natural scenes, the invention proposes a natural scene text detection method based on deep learning.
The object of the invention is achieved by at least one of the following technical solutions.
A natural scene text detection method based on deep learning comprises the following steps:
(1) construct and train a neural-network-based natural scene text detection model, comprising the following sub-steps:
(1.1) construct a feature extractor based on a feature pyramid network (Feature Pyramid Networks, FPN);
(1.2) encode the features extracted by the feature extractor using a recurrent neural network (Recurrent Neural Network, RNN);
(1.3) further improve detection precision using an ROI pooling layer;
(1.4) finally perform classification and regression of detection boxes using fully connected layers, forming the text detection model;
(1.5) input annotated training images into the model, and compute the loss with a multi-task loss function comprising a classification loss and a regression loss to train the model;
(2) use the trained natural scene text detection model to detect natural scene text in a given image, comprising the following sub-steps:
(2.1) input the image to be detected, perform text detection on it using the trained model, and output the scores and coordinates of a series of text proposal detection boxes;
(2.2) apply non-maximum suppression to the text proposals obtained, to remove redundant detection boxes;
(2.3) connect the series of text proposals using a text connector to generate the final detection result.
Compared with the prior art, the invention has the following advantages and technical effects:
(1) For variable-scale text detection, the invention uses a feature pyramid network (Feature Pyramid Networks, FPN), which efficiently uses information from convolutional layers of different sizes at the same time. Compared with methods that use only the last feature map, it exploits both the strong semantic information of the high layers and the high-resolution information of the low layers, achieving higher recall and accuracy. Compared with methods based on image pyramids, it greatly reduces computation.
(2) For multi-oriented text detection, the invention outputs a series of text proposals and finally links them with a text connector. Compared with methods that use arbitrary quadrilaterals or rotated rectangles, this uses fewer parameters and is therefore more flexible and efficient for multi-oriented text.
Brief description of the drawings
Fig. 1 is a flow chart of natural scene text detection in the embodiment.
Fig. 2 is an architecture diagram of the natural scene text detection model used in the embodiment.
Fig. 3 shows actual detection results in different scenes obtained with the text detection method of the invention in the embodiment.
Detailed description of the embodiments
To make the technical solution and advantages of the invention clearer, the invention is described in further detail below with reference to the drawings, but the implementation and protection of the invention are not limited thereto.
The terms used in the invention are explained first:
Feature pyramid network (Feature Pyramid Networks, FPN): FPN modifies the original backbone network directly. The feature map at each resolution takes in the feature map of the next, coarser resolution, upscaled by a factor of two, and adds it element by element. Through these connections, the feature map used for prediction at each level fuses features of different resolutions and different semantic strengths, and each fused feature map performs object detection at its own scale. This ensures that every level has both a suitable resolution and strong semantic features.
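The element-wise merge described for FPN can be illustrated with a minimal sketch in plain Python. It assumes nearest-neighbor upsampling and single-channel feature maps stored as nested lists; the function names `upsample2x` and `merge_top_down` are hypothetical, and the lateral convolutions that FPN implementations normally apply before the addition are left out.

```python
def upsample2x(fmap):
    """Nearest-neighbor upsampling of a 2-D feature map by a factor of two."""
    out = []
    for row in fmap:
        wide = [v for v in row for _ in range(2)]  # repeat each column twice
        out.append(wide)
        out.append(list(wide))                     # repeat each row twice
    return out


def merge_top_down(fine, coarse):
    """Add the 2x-upsampled coarse map to the fine map element by element."""
    up = upsample2x(coarse)
    if len(up) != len(fine) or len(up[0]) != len(fine[0]):
        raise ValueError("fine map must be exactly twice the size of the coarse map")
    return [[f + u for f, u in zip(frow, urow)] for frow, urow in zip(fine, up)]


# Hypothetical 2x2 coarse level and 4x4 fine level (illustrative values only).
coarse = [[1, 2],
          [3, 4]]
fine = [[10] * 4 for _ in range(4)]
merged = merge_top_down(fine, coarse)
```

Each 2x2 block of `merged` carries the fine value plus the coarse value covering that block, which is how strong semantics from a coarse level reach a higher-resolution level.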
Residual network (ResNet): a deep convolutional network model proposed by Kaiming He in 2015. Depending on the number of layers used, the variants are named ResNet-34, ResNet-50, ResNet-101, ResNet-152, and so on.
Non-maximum suppression (Non-Maximum Suppression, NMS): suppresses elements that are not maxima, and can be understood as a local maximum search. Each output detection box has a score, and detection boxes may contain or overlap one another. NMS selects the highest-scoring detection box in a neighborhood and suppresses the lower-scoring ones.
As shown in Fig. 1, the deep-learning-based natural scene text detection method of the invention comprises the following steps:
(1) Construct and train the neural-network-based natural scene text detection model, as shown in Fig. 2, comprising the following sub-steps:
(1.1) Construct a feature extractor based on a feature pyramid network (Feature Pyramid Networks, FPN). ResNet-101 is used as the backbone network to generate the feature pyramid, and the features of levels P2 to P5 are used.
(1.2) Encode the extracted features using a recurrent neural network (Recurrent Neural Network, RNN). A bidirectional long short-term memory network (Bi-directional Long Short-Term Memory, Bi-LSTM) with 512 hidden units is used as the RNN to encode the extracted features.
(1.3) Further improve detection precision using an ROI pooling layer. ROI pooling works as follows:
(1.3.1) map the ROI to the corresponding position on the feature map according to the input image;
(1.3.2) divide the mapped region into sections of equal size, the number of sections being equal to the output dimension;
(1.3.3) apply max pooling to each section.
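The three ROI pooling steps can be sketched in plain Python as follows. This is an illustration under stated assumptions, not the patented implementation: the feature map is a single-channel nested list, the ROI is given as inclusive pixel coordinates that lie inside the image, `stride` is an assumed downsampling factor between image and feature map, and the name `roi_max_pool` is hypothetical. When the region does not divide evenly, the sections are only approximately equal in size.

```python
def roi_max_pool(fmap, roi, out_h, out_w, stride=1):
    """Max-pool the region `roi` of a 2-D feature map into an out_h x out_w grid.

    roi is (x1, y1, x2, y2) in input-image pixels, inclusive.
    """
    # Step 1: map the ROI from image coordinates to feature-map coordinates.
    x1, y1, x2, y2 = (c // stride for c in roi)
    h, w = y2 - y1 + 1, x2 - x1 + 1
    pooled = []
    for i in range(out_h):
        # Step 2: split the mapped region into out_h x out_w sections.
        r0 = y1 + (i * h) // out_h
        r1 = y1 + max(((i + 1) * h) // out_h, (i * h) // out_h + 1)
        row = []
        for j in range(out_w):
            c0 = x1 + (j * w) // out_w
            c1 = x1 + max(((j + 1) * w) // out_w, (j * w) // out_w + 1)
            # Step 3: keep the maximum value of each section.
            row.append(max(fmap[r][c]
                           for r in range(r0, r1)
                           for c in range(c0, c1)))
        pooled.append(row)
    return pooled


# Hypothetical 4x4 feature map produced from an 8x8 image (stride 2).
fmap = [[0, 1, 2, 3],
        [4, 5, 6, 7],
        [8, 9, 10, 11],
        [12, 13, 14, 15]]
pooled = roi_max_pool(fmap, roi=(0, 0, 7, 7), out_h=2, out_w=2, stride=2)
```

Whatever the size of the ROI, the output grid has a fixed shape, which is what lets the fully connected layers that follow accept proposals of different sizes.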
(1.4) Finally perform classification and regression of detection boxes using fully connected layers. The ROI-pooled features pass through two fully connected layers, one for classification and one for regression. If the number of output detection boxes is k, the classification layer outputs 2k values, corresponding to text and background, and the regression layer outputs 4k values, corresponding to the coordinates of the top-left and bottom-right corners of each detection box.
(1.5) Input annotated training images to train the model. Training images may be annotated with quadrilaterals or with rectangles. Before being input to the model, however, each annotation must be divided into segments of a given width: if the annotation is a quadrilateral, the minimum bounding rectangle of each segment is taken after division; if it is a rectangle, it is divided directly.
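The division of annotations into fixed-width segments can be sketched as below. Several details are assumptions, since the text does not specify them: the segment width of 16 pixels, segments starting at the left edge of the annotation, and the bounding rectangle being axis-aligned. The names `split_rect` and `split_quad` are hypothetical.

```python
def split_rect(x1, y1, x2, y2, width=16):
    """Divide a rectangle annotation directly into vertical strips."""
    strips = []
    x = x1
    while x < x2:
        strips.append((x, y1, min(x + width, x2), y2))
        x += width
    return strips


def split_quad(quad, width=16):
    """Divide a quadrilateral (list of four (x, y) vertices in order) into
    strips and return the axis-aligned bounding rectangle of each strip."""
    x_min = min(px for px, _ in quad)
    x_max = max(px for px, _ in quad)
    strips = []
    x = x_min
    while x < x_max:
        xa, xb = x, min(x + width, x_max)
        # Vertices of the quadrilateral that fall inside this strip.
        ys = [py for px, py in quad if xa <= px <= xb]
        # Points where each edge crosses the left and right strip borders.
        for (ax, ay), (bx, by) in zip(quad, quad[1:] + quad[:1]):
            for cut in (xa, xb):
                if ax != bx and min(ax, bx) <= cut <= max(ax, bx):
                    ys.append(ay + (by - ay) * (cut - ax) / (bx - ax))
        strips.append((xa, min(ys), xb, max(ys)))
        x += width
    return strips


# Hypothetical annotations (illustrative values only).
rect_strips = split_rect(0, 0, 40, 20)
quad_strips = split_quad([(0, 0), (32, 8), (32, 24), (0, 16)])
```

For a slanted quadrilateral, each strip gets its own vertical extent, so the sequence of strips follows the slope of the text line.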
A multi-task loss function comprising a classification loss and a regression loss is designed, and the loss is computed with it:
where L, L_cls, and L_reg are the total loss, the classification loss, and the regression loss respectively, and λ is a weight coefficient balancing the classification loss against the regression loss. p_i is the predicted class of the i-th detection box, and p_i* is the true class of the i-th detection box. t_i is the predicted coordinates of the i-th detection box, and t_i* is the true coordinates of the i-th detection box.
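The loss formula itself is not reproduced in this text, so the following Python sketch is only one plausible reading of the symbols defined above. It assumes a form common in two-stage detectors: a binary cross-entropy classification loss averaged over all boxes, plus λ times a smooth L1 regression loss counted only for boxes whose true class is text. The choice of these two loss terms, the normalization, and the names `smooth_l1` and `multitask_loss` are assumptions, not statements from the source.

```python
import math


def smooth_l1(x):
    """Smooth L1 penalty (assumed form of the per-coordinate regression loss)."""
    ax = abs(x)
    return 0.5 * x * x if ax < 1.0 else ax - 0.5


def multitask_loss(p, p_star, t, t_star, lam=1.0):
    """Return (L, L_cls, L_reg) for predicted text probabilities p, 0/1 labels
    p_star, predicted boxes t and ground-truth boxes t_star (4 values each)."""
    eps = 1e-12  # avoids log(0)
    # Classification loss: binary cross-entropy averaged over all boxes.
    l_cls = -sum(ps * math.log(pi + eps) + (1 - ps) * math.log(1 - pi + eps)
                 for pi, ps in zip(p, p_star)) / len(p)
    # Regression loss: only boxes labelled as text (p_star == 1) contribute.
    n_pos = max(sum(p_star), 1)
    l_reg = sum(ps * sum(smooth_l1(a - b) for a, b in zip(ti, tsi))
                for ps, ti, tsi in zip(p_star, t, t_star)) / n_pos
    return l_cls + lam * l_reg, l_cls, l_reg


# Two hypothetical detection boxes: one text box, one background box.
total, l_cls, l_reg = multitask_loss(
    p=[0.9, 0.2], p_star=[1, 0],
    t=[[0, 0, 10, 10], [0, 0, 0, 0]],
    t_star=[[0, 0, 10, 12], [5, 5, 5, 5]],
    lam=1.0,
)
```

In this example the background box contributes nothing to `l_reg` even though its coordinates are far from the target, because its true class is 0.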
(2) Use the trained natural scene text detection model to detect natural scene text in a given image, comprising the following sub-steps:
(2.1) Input the image to be detected, perform text detection on it using the trained model, and output the scores and coordinates of a series of text proposal detection boxes.
(2.2) Apply non-maximum suppression to the text proposals obtained, to remove redundant detection boxes. The procedure is as follows:
Given the list B of text proposal detection boxes and their corresponding scores S, select the detection box M with the highest score, remove it from B, and add it to the final detection result D. Then remove from B every remaining detection box whose IoU with M is greater than a threshold. Repeat this process until B is empty.
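The suppression loop over B, S, M, and D can be written as a short Python sketch. Boxes are assumed to be `(x1, y1, x2, y2)` tuples, and the IoU threshold of 0.5 is an illustrative default, since the text does not give a value; the function names are hypothetical.

```python
def iou(a, b):
    """Intersection over union of two (x1, y1, x2, y2) boxes."""
    iw = max(0.0, min(a[2], b[2]) - max(a[0], b[0]))
    ih = max(0.0, min(a[3], b[3]) - max(a[1], b[1]))
    inter = iw * ih
    union = (a[2] - a[0]) * (a[3] - a[1]) + (b[2] - b[0]) * (b[3] - b[1]) - inter
    return inter / union if union > 0 else 0.0


def nms(boxes, scores, threshold=0.5):
    """Return the indices of the boxes kept in the final result D."""
    # B: indices of the remaining boxes, highest score first.
    remaining = sorted(range(len(boxes)), key=lambda i: scores[i], reverse=True)
    kept = []
    while remaining:
        m = remaining.pop(0)          # M: box with the largest score
        kept.append(m)                # move M from B to D
        # Drop every remaining box whose IoU with M exceeds the threshold.
        remaining = [i for i in remaining
                     if iou(boxes[m], boxes[i]) <= threshold]
    return kept


# Hypothetical proposals: the first two overlap heavily, the third is separate.
boxes = [(0, 0, 10, 10), (1, 0, 11, 10), (20, 0, 30, 10)]
scores = [0.9, 0.8, 0.7]
kept = nms(boxes, scores)
```

Here the second box is suppressed because it overlaps the higher-scoring first box, while the third box survives.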
(2.3) Connect the series of text proposals using the text connector to generate the final detection result. Text proposals are connected by the following steps:
If proposal P_j and proposal P_i (i and j here denote different proposals) satisfy the following two conditions, P_j is defined as a neighbor of P_i:
(1) P_j is the nearest proposal to P_i and the distance between them is less than w_j + w_i;
(2) P_j and P_i have a vertical overlap greater than 0.5;
where w_i and w_j are the widths of P_i and P_j respectively. If P_i is a neighbor of P_j and P_j is a neighbor of P_i, the two proposals are connected into the same detection box. The above steps are repeated until all proposals have been connected; the resulting detection boxes are the final output. As Fig. 2 and Fig. 3 show, the invention detects variable-scale, multi-oriented text in natural scenes well.
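The neighbor rule and the connection step can be sketched in Python as follows. The text leaves several details open, so the sketch makes labeled assumptions: distance is the horizontal distance between box centers, vertical overlap is the overlapping height divided by the smaller box height, and "nearest" is searched separately to the right and to the left so that chains of proposals can form. All function names are hypothetical.

```python
def center_x(b):
    return (b[0] + b[2]) / 2.0


def vertical_overlap(a, b):
    """Overlapping height divided by the smaller box height (assumed measure)."""
    inter = min(a[3], b[3]) - max(a[1], b[1])
    return max(0.0, inter) / min(a[3] - a[1], b[3] - b[1])


def neighbor(i, boxes, direction):
    """Index of the neighbor of boxes[i] on one side (+1 right, -1 left), or None."""
    best, best_d = None, None
    for j, b in enumerate(boxes):
        d = (center_x(b) - center_x(boxes[i])) * direction
        if j != i and d > 0 and (best is None or d < best_d):
            best, best_d = j, d       # nearest proposal on this side
    if best is None:
        return None
    w_sum = (boxes[i][2] - boxes[i][0]) + (boxes[best][2] - boxes[best][0])
    # Condition (1): distance below w_i + w_j; condition (2): overlap above 0.5.
    if best_d < w_sum and vertical_overlap(boxes[i], boxes[best]) > 0.5:
        return best
    return None


def connect(boxes):
    """Merge mutually neighboring proposals into final detection boxes."""
    parent = list(range(len(boxes)))

    def find(i):
        while parent[i] != i:
            parent[i] = parent[parent[i]]
            i = parent[i]
        return i

    for i in range(len(boxes)):
        j = neighbor(i, boxes, +1)
        if j is not None and neighbor(j, boxes, -1) == i:  # mutual neighbors
            parent[find(j)] = find(i)
    groups = {}
    for i, b in enumerate(boxes):
        groups.setdefault(find(i), []).append(b)
    return [(min(b[0] for b in g), min(b[1] for b in g),
             max(b[2] for b in g), max(b[3] for b in g))
            for g in groups.values()]


# Three adjacent hypothetical proposals on one line and one far away.
proposals = [(0, 0, 16, 20), (16, 1, 32, 21), (32, 0, 48, 20), (200, 0, 216, 20)]
lines = connect(proposals)
```

The first three proposals chain into one detection box, while the distant proposal stays on its own because its distance exceeds the sum of the widths.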

Claims (8)

1. A natural scene text detection method based on deep learning, characterized by comprising the following steps:
(1) constructing and training a neural-network-based natural scene text detection model, comprising:
(1.1) constructing a feature extractor based on a feature pyramid network (Feature Pyramid Networks, FPN);
(1.2) encoding the features extracted by the feature extractor using a recurrent neural network (Recurrent Neural Network, RNN);
(1.3) further improving detection precision using an ROI pooling layer;
(1.4) finally performing classification and regression of detection boxes using fully connected layers, forming the text detection model;
(1.5) inputting annotated training images into the model;
(1.6) computing the loss with a multi-task loss function comprising a classification loss and a regression loss to train the model;
(2) detecting natural scene text in a given image using the trained natural scene text detection model, comprising the following sub-steps:
(2.1) inputting the image to be detected, performing text detection on it using the trained natural scene text detection model, and outputting the scores and coordinates of a series of text proposal detection boxes;
(2.2) applying non-maximum suppression to the text proposals obtained, to remove redundant detection boxes;
(2.3) connecting the series of text proposals using a text connector to generate the final detection result.
2. The natural scene text detection method based on deep learning according to claim 1, characterized in that, in constructing the neural-network-based natural scene text detection model, only levels P2 to P5 of the feature pyramid network (Feature Pyramid Networks, FPN) are used.
3. The natural scene text detection method based on deep learning according to claim 1, characterized in that, in constructing the neural-network-based natural scene text detection model, the feature pyramid network (Feature Pyramid Networks, FPN) uses ResNet-101 as the backbone network.
4. The natural scene text detection method based on deep learning according to claim 1, characterized in that, in constructing the neural-network-based natural scene text detection model, a bidirectional long short-term memory network (Bi-directional Long Short-Term Memory, Bi-LSTM) with 512 hidden units is used as the recurrent neural network (Recurrent Neural Network, RNN) to encode the extracted features.
5. The natural scene text detection method based on deep learning according to claim 1, characterized in that, in constructing the neural-network-based natural scene text detection model, the loss is computed using the following loss function:
where L, L_cls, and L_reg are the total loss, the classification loss, and the regression loss respectively, λ is a weight coefficient balancing the classification loss against the regression loss, and p_i* is the true class of the i-th detection box.
6. The natural scene text detection method based on deep learning according to claim 5, characterized in that the classification loss is defined as follows:
where p_i is the predicted class of the i-th detection box and p_i* is the true class of the i-th detection box.
7. The natural scene text detection method based on deep learning according to claim 5, characterized in that the regression loss is defined as follows:
where t_i is the predicted coordinates of the i-th detection box and t_i* is the true coordinates of the i-th detection box.
8. The natural scene text detection method based on deep learning according to claim 1, characterized in that, in detecting natural scene text in the given image, text proposals are connected by the following steps:
if proposal P_j and proposal P_i satisfy the following two conditions, P_j is defined as a neighbor of P_i:
(1) P_j is the nearest proposal to P_i and the distance between them is less than w_j + w_i;
(2) P_j and P_i have a vertical overlap greater than 0.5;
where w_i and w_j are the widths of P_i and P_j respectively; if P_i is a neighbor of P_j and P_j is a neighbor of P_i, the two proposals are connected into the same detection box.
CN201910270269.4A 2019-04-03 2019-04-03 Natural scene text detection method based on deep learning Pending CN110135248A (en)

Priority Applications (1)

Application Number Priority Date Filing Date Title
CN201910270269.4A CN110135248A (en) 2019-04-03 2019-04-03 Natural scene text detection method based on deep learning

Applications Claiming Priority (1)

Application Number Priority Date Filing Date Title
CN201910270269.4A CN110135248A (en) 2019-04-03 2019-04-03 Natural scene text detection method based on deep learning

Publications (1)

Publication Number Publication Date
CN110135248A true CN110135248A (en) 2019-08-16

Family

ID=67569376

Family Applications (1)

Application Number Title Priority Date Filing Date
CN201910270269.4A Pending CN110135248A (en) 2019-04-03 2019-04-03 Natural scene text detection method based on deep learning

Country Status (1)

Country Link
CN (1) CN110135248A (en)

Citations (20)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN101436299A (en) * 2008-11-19 2009-05-20 哈尔滨工业大学 Method for detecting natural scene image words
CN103942550A (en) * 2014-05-04 2014-07-23 厦门大学 Scene text recognition method based on sparse coding characteristics
CN104537362A (en) * 2015-01-16 2015-04-22 中国科学院自动化研究所 Domain-based self-adaptive English scene character recognition method
CN105740909A (en) * 2016-02-02 2016-07-06 华中科技大学 Text recognition method under natural scene on the basis of spatial transformation
CN106650725A (en) * 2016-11-29 2017-05-10 华南理工大学 Full convolutional neural network-based candidate text box generation and text detection method
CN107122342A (en) * 2017-04-21 2017-09-01 东莞中国科学院云计算产业技术创新与育成中心 Text code recognition methods and device
CN107203606A (en) * 2017-05-17 2017-09-26 西北工业大学 Text detection and recognition methods under natural scene based on convolutional neural networks
CN107402947A (en) * 2017-03-29 2017-11-28 北京粉笔未来科技有限公司 Picture retrieval method for establishing model and device, picture retrieval method and device
CN107622267A (en) * 2017-10-16 2018-01-23 天津师范大学 A kind of scene character recognition method based on embedded bilateral convolution activation
CN108288088A (en) * 2018-01-17 2018-07-17 浙江大学 A kind of scene text detection method based on end-to-end full convolutional neural networks
CN108304835A (en) * 2018-01-30 2018-07-20 百度在线网络技术(北京)有限公司 character detecting method and device
CN108399419A (en) * 2018-01-25 2018-08-14 华南理工大学 Chinese text recognition methods in natural scene image based on two-dimentional Recursive Networks
CN108427665A (en) * 2018-03-15 2018-08-21 广州大学 A kind of text automatic generation method based on LSTM type RNN models
CN108549893A (en) * 2018-04-04 2018-09-18 华中科技大学 A kind of end-to-end recognition methods of the scene text of arbitrary shape
CN108573257A (en) * 2017-03-14 2018-09-25 奥多比公司 Image is divided automatically based on natural language phrase
CN108764133A (en) * 2018-05-25 2018-11-06 北京旷视科技有限公司 Image-recognizing method, apparatus and system
US20180365560A1 (en) * 2017-06-19 2018-12-20 International Business Machines Corporation Context aware sensitive information detection
CN109299274A (en) * 2018-11-07 2019-02-01 南京大学 A kind of natural scene Method for text detection based on full convolutional neural networks
CN109344824A (en) * 2018-09-21 2019-02-15 泰康保险集团股份有限公司 A kind of line of text method for detecting area, device, medium and electronic equipment
CN109492638A (en) * 2018-11-07 2019-03-19 北京旷视科技有限公司 Method for text detection, device and electronic equipment

Non-Patent Citations (2)

* Cited by examiner, † Cited by third party
Title
YU SONG ETC.: "Scene Text Detection via Deep Semantic Feature Fusion and Attention-based Refinement", 《2018 24TH INTERNATIONAL CONFERENCE ON PATTERN RECOGNITION(ICPR)》 *
蔡华杰等: "基于 WT-BTC 特征和 SVM 组合分类的场景文本检测", 《科学技术与工程》 *

Cited By (8)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
WO2020221298A1 (en) * 2019-04-30 2020-11-05 北京金山云网络技术有限公司 Text detection model training method and apparatus, text region determination method and apparatus, and text content determination method and apparatus
CN110766020A (en) * 2019-10-30 2020-02-07 哈尔滨工业大学 System and method for detecting and identifying multi-language natural scene text
CN110807422A (en) * 2019-10-31 2020-02-18 华南理工大学 Natural scene text detection method based on deep learning
CN110807422B (en) * 2019-10-31 2023-05-23 华南理工大学 Natural scene text detection method based on deep learning
CN111753714A (en) * 2020-06-23 2020-10-09 中南大学 Multidirectional natural scene text detection method based on character segmentation
CN111753714B (en) * 2020-06-23 2023-09-01 中南大学 Multidirectional natural scene text detection method based on character segmentation
CN113591829A (en) * 2021-05-25 2021-11-02 上海一谈网络科技有限公司 Character recognition method, device, equipment and storage medium
CN113591829B (en) * 2021-05-25 2024-02-13 上海一谈网络科技有限公司 Character recognition method, device, equipment and storage medium

Similar Documents

Publication Publication Date Title
CN110135248A (en) Natural scene text detection method based on deep learning
Yang et al. Real-time face detection based on YOLO
CN108428229B (en) Lung texture recognition method based on appearance and geometric features extracted by deep neural network
CN110807422B (en) Natural scene text detection method based on deep learning
CN106599854B (en) Automatic facial expression recognition method based on multi-feature fusion
CN104463250B (en) A kind of Sign Language Recognition interpretation method based on Davinci technology
CN109034210A (en) Object detection method based on super Fusion Features Yu multi-Scale Pyramid network
CN103049763B (en) Context-constraint-based target identification method
CN109272500B (en) Fabric classification method based on adaptive convolutional neural network
CN108288271A (en) Image detecting system and method based on three-dimensional residual error network
CN108288075A (en) A kind of lightweight small target detecting method improving SSD
CN106610969A (en) Multimodal information-based video content auditing system and method
CN110135296A (en) Airfield runway FOD detection method based on convolutional neural networks
CN107016405A (en) A kind of insect image classification method based on classification prediction convolutional neural networks
CN109446982B (en) AR glasses-based electric power cabinet pressing plate state identification method and system
CN107133622A (en) The dividing method and device of a kind of word
CN104992223A (en) Dense population estimation method based on deep learning
CN104680127A (en) Gesture identification method and gesture identification system
CN111680706A (en) Double-channel output contour detection method based on coding and decoding structure
CN104268590B (en) The blind image quality evaluating method returned based on complementary combination feature and multiphase
CN106557740B (en) The recognition methods of oil depot target in a kind of remote sensing images
CN111445459A (en) Image defect detection method and system based on depth twin network
CN106228528A (en) A kind of multi-focus image fusing method based on decision diagram Yu rarefaction representation
Huang et al. A multi-feature fusion-based deep learning for insulator image identification and fault detection
CN114972312A (en) Improved insulator defect detection method based on YOLOv4-Tiny

Legal Events

Date Code Title Description
PB01 Publication
SE01 Entry into force of request for substantive examination
RJ01 Rejection of invention patent application after publication

Application publication date: 20190816