CN110889404A - Irregular text recognition system and method based on correction network - Google Patents
Irregular text recognition system and method based on correction network Download PDFInfo
- Publication number
- CN110889404A CN110889404A CN201911145879.8A CN201911145879A CN110889404A CN 110889404 A CN110889404 A CN 110889404A CN 201911145879 A CN201911145879 A CN 201911145879A CN 110889404 A CN110889404 A CN 110889404A
- Authority
- CN
- China
- Prior art keywords
- text
- network
- layer
- irregular
- picture
- Prior art date
- Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
- Pending
Links
- 230000001788 irregular Effects 0.000 title claims abstract description 86
- 238000012937 correction Methods 0.000 title claims abstract description 45
- 238000000034 method Methods 0.000 title claims abstract description 31
- 238000012545 processing Methods 0.000 claims abstract description 10
- 238000013527 convolutional neural network Methods 0.000 claims description 24
- 238000011176 pooling Methods 0.000 claims description 24
- 238000012549 training Methods 0.000 claims description 19
- 230000006870 function Effects 0.000 claims description 14
- 238000010606 normalization Methods 0.000 claims description 13
- 230000004913 activation Effects 0.000 claims description 12
- 230000007246 mechanism Effects 0.000 claims description 11
- 238000013528 artificial neural network Methods 0.000 claims description 9
- 230000002457 bidirectional effect Effects 0.000 claims description 9
- 125000004122 cyclic group Chemical group 0.000 claims description 9
- 238000006243 chemical reaction Methods 0.000 claims description 6
- 238000000605 extraction Methods 0.000 claims description 5
- 238000012360 testing method Methods 0.000 claims description 3
- 230000002194 synthesizing effect Effects 0.000 claims 1
- 238000005457 optimization Methods 0.000 description 8
- 238000010586 diagram Methods 0.000 description 2
- 230000008569 process Effects 0.000 description 2
- 230000008859 change Effects 0.000 description 1
- 230000007547 defect Effects 0.000 description 1
- 230000000694 effects Effects 0.000 description 1
- 238000006467 substitution reaction Methods 0.000 description 1
- 230000009466 transformation Effects 0.000 description 1
Images
Classifications
-
- G—PHYSICS
- G06—COMPUTING; CALCULATING OR COUNTING
- G06V—IMAGE OR VIDEO RECOGNITION OR UNDERSTANDING
- G06V10/00—Arrangements for image or video recognition or understanding
- G06V10/20—Image preprocessing
- G06V10/22—Image preprocessing by selection of a specific region containing or referencing a pattern; Locating or processing of specific regions to guide the detection or recognition
-
- G—PHYSICS
- G06—COMPUTING; CALCULATING OR COUNTING
- G06N—COMPUTING ARRANGEMENTS BASED ON SPECIFIC COMPUTATIONAL MODELS
- G06N3/00—Computing arrangements based on biological models
- G06N3/02—Neural networks
- G06N3/04—Architecture, e.g. interconnection topology
-
- G—PHYSICS
- G06—COMPUTING; CALCULATING OR COUNTING
- G06N—COMPUTING ARRANGEMENTS BASED ON SPECIFIC COMPUTATIONAL MODELS
- G06N3/00—Computing arrangements based on biological models
- G06N3/02—Neural networks
- G06N3/08—Learning methods
-
- G—PHYSICS
- G06—COMPUTING; CALCULATING OR COUNTING
- G06V—IMAGE OR VIDEO RECOGNITION OR UNDERSTANDING
- G06V10/00—Arrangements for image or video recognition or understanding
- G06V10/20—Image preprocessing
- G06V10/24—Aligning, centring, orientation detection or correction of the image
- G06V10/243—Aligning, centring, orientation detection or correction of the image by compensating for image skew or non-uniform image deformations
Landscapes
- Engineering & Computer Science (AREA)
- Theoretical Computer Science (AREA)
- Physics & Mathematics (AREA)
- General Physics & Mathematics (AREA)
- Computational Linguistics (AREA)
- General Health & Medical Sciences (AREA)
- Life Sciences & Earth Sciences (AREA)
- Artificial Intelligence (AREA)
- Biomedical Technology (AREA)
- Biophysics (AREA)
- Multimedia (AREA)
- Data Mining & Analysis (AREA)
- Evolutionary Computation (AREA)
- Health & Medical Sciences (AREA)
- Molecular Biology (AREA)
- Computing Systems (AREA)
- General Engineering & Computer Science (AREA)
- Mathematical Physics (AREA)
- Software Systems (AREA)
- Image Analysis (AREA)
- Character Discrimination (AREA)
Abstract
The invention discloses an irregular text recognition system and method based on a correction network, the recognition system of the invention comprises a text correction network and a text recognition network, the invention also relates to an irregular text recognition method based on the correction network, comprising the following steps: converting the irregular text picture into a regular text picture through a text correction network; and identifying the regular text picture through a text identification network and outputting corresponding text information. The recognition method of the invention firstly corrects the irregular text through the text correction network, for example, the text in the picture is presented in the horizontal direction, irrelevant noise information in the picture is removed, and then the recognition is carried out through the subsequent text recognition network, the geometric constraint can be avoided by processing the irregular text picture based on the correction network, various complicated irregular text pictures can be corrected, the difficulty of subsequent text recognition is reduced, and the recognition efficiency is higher.
Description
Technical Field
The invention relates to the technical field of computer vision, in particular to an irregular text recognition system and method based on a correction network.
Background
The text recognition technology in the natural scene can help people to better and more conveniently acquire information in real life and help people to know the surrounding environment. However, the text contained in natural scenes is mostly irregular text, and may be in the form of curved, cut, or text containing a lot of noise information. At present, the recognition technology for the regular text has been well developed by virtue of the advantages of a deep network, but cannot be directly applied to the recognition problem of the irregular text, and most of the methods based on the attention mechanism are adopted for the irregular text picture at present, so that the method does not need to correct the irregular text, and directly combines the attention force to locate the position of the text information required to be concerned in each step on the original picture to directly recognize the text information in the picture. Due to some limitations of previous methods, such as the need for more supervisory information during training, the possibility of introducing more noise using radial transformation,
disclosure of Invention
The invention aims to provide a system and a method for recognizing irregular texts based on a correction network, aiming at the defects.
The technical scheme adopted by the invention is as follows:
an irregular text recognition system based on a correction network, comprising a text correction network and a text recognition network, wherein:
text correction network: the system is used for converting the irregular text picture into a regular text picture;
text recognition network: and the text picture is used for identifying the rule and generating text information.
As an optimization, the text correction network of the present invention includes a prediction network and a picture gridding module, wherein:
predicting the network: obtaining the position deviation of each corresponding pixel when the irregular text picture is converted into the regular text picture based on the convolutional neural network;
the picture gridding module: and generating a grid graph for the irregular text picture, obtaining the coordinate information of each pixel on the irregular text picture, integrating the coordinate information of each pixel with the corresponding position deviation, and outputting the converted coordinate information of each pixel.
As optimization, the text recognition network adopts an encoder-decoder structure, the encoder adopts a convolutional neural network and a cyclic neural network for feature extraction, and the decoder adopts a bidirectional LSTM and combines an attention mechanism.
The invention also relates to an irregular text recognition method based on the correction network, which comprises the following steps:
converting the irregular text picture into a regular text picture through a text correction network;
and identifying the regular text picture through a text identification network and outputting corresponding text information.
As an optimization, the step of converting the irregular text picture into the regular text picture comprises:
obtaining the position deviation coordinates of each pixel corresponding to the irregular text picture converted into the regular text picture based on the convolutional neural network;
acquiring the original position coordinates of each pixel on the original irregular text picture based on a regularization processing mode;
normalizing the original position coordinates of each pixel to obtain normalized coordinates of each pixel;
and summing the normalized coordinates of each pixel and the corresponding position deviation coordinates to obtain the conversion position coordinates of each pixel.
As an optimization, the step of obtaining the position deviation coordinates of each pixel corresponding to the conversion from the irregular text picture to the regular text picture based on the convolutional neural network of the present invention includes:
taking the pixel value of the irregular text picture as input, obtaining a dual-channel characteristic graph through a prediction network, wherein one channel corresponds to an X-axis deviation coordinate, the other channel corresponds to a Y-axis deviation coordinate, and the prediction network is established based on a convolutional neural network;
and the size of the feature map is smaller than that of the irregular text picture, and the feature map is converted into the size of the irregular text picture through a resize function, so that the position deviation coordinate corresponding to each pixel on the irregular text picture is obtained.
As optimization, the prediction network comprises five layers: the first layer comprises a maximum pooling layer, the second layer comprises a convolutional layer and a maximum pooling layer, the third layer comprises a convolutional layer and a maximum pooling layer, the fourth layer comprises a convolutional layer and a maximum pooling layer, and the fifth layer comprises a convolutional layer, wherein: the convolution layers of the second layer to the fourth layer are all followed by a batch normalization layer and a RELU activation function layer, and the convolution layer of the fifth layer is followed by a batch normalization layer and a Tanh activation function layer.
As an optimization, the step of normalizing the original position coordinates of each pixel to obtain the normalized coordinates of each pixel in the present invention includes:
and acquiring the width w and the height h of the original picture, and dividing the original position coordinate of each pixel by [ w/2, h/2] to obtain a normalized coordinate.
As an optimization, the steps of identifying the regular text pictures and outputting corresponding text information in the invention are as follows: the text recognition network adopts a coder-decoder structure, the coder adopts a convolutional neural network and a cyclic neural network for feature extraction, and the decoder adopts a bidirectional LSTM and combines an attention mechanism to finally obtain output based on character probability distribution.
As optimization, the establishment of the text correction network and the text recognition network comprises the following steps:
building a text correction network structure: build the prediction network of text correction network based on convolutional neural network to the pixel value of irregular text picture is output, builds five layers of structure, and the first layer includes that the one deck is the biggest pooling layer to be constituteed, and the second floor includes that one deck convolution layer and one deck are the biggest pooling layer, and the third layer includes that one deck convolution layer and one deck are the biggest pooling layer, and the fourth layer includes that three-layer convolution layer and one deck are the biggest pooling layer, and the fifth layer includes one deck convolution layer, wherein: the convolution layers from the second layer to the fourth layer are all followed by a batch normalization layer and a RELU activation function layer, and the convolution layer from the fifth layer is followed by a batch normalization layer and a Tanh activation function layer, and the deviation prediction information of two channels is output;
building a text recognition network structure: establishing a text recognition network based on an encoder-decoder structure, establishing an encoder based on a convolutional neural network and a cyclic neural network, establishing a decoder based on a bidirectional LSTM, and outputting characters based on character probability distribution in combination with an attention mechanism;
establishing a data set: selecting a data set, and dividing the data set into a training set and a test set;
network training: the method comprises the steps of learning network parameters of a text correction network and a text recognition network by adopting a course learning strategy, training the text recognition network by regular text pictures, fixing the text recognition network, training the text correction network by irregular texts, and finally training the text correction network and the text recognition network simultaneously in an end-to-end mode.
The invention has the following advantages:
1. the identification method of the invention firstly corrects the irregular text through the text correction network, for example, the text in the picture is presented in the horizontal direction, irrelevant noise information in the picture is removed, and then the identification is carried out through the subsequent text identification network, the geometric constraint can be avoided by processing the irregular text picture based on the correction network, various complicated irregular text pictures can be corrected, the difficulty of subsequent text identification is reduced, and the identification efficiency is higher;
2. the structural attention mechanism in the text recognition network can obtain more context text information and stronger robustness, and improves the recognition accuracy;
3. the invention trains the network structure in a weak supervision mode during network training, only needs original pictures and corresponding text labels, and does not need other additional supervision information.
4. The invention adopts the strategy of course learning during training and iteratively trains and updates the network structure, so that the network training effect is better and the efficiency is higher.
Drawings
In order to more clearly illustrate the technical solutions in the embodiments of the present invention, the drawings needed to be used in the description of the embodiments will be briefly introduced below, and it is obvious that the drawings in the following description are only some embodiments of the present invention, and it is obvious for those skilled in the art to obtain other drawings based on these drawings without creative efforts.
The invention is further described below with reference to the accompanying drawings:
FIG. 1 is a schematic flow chart of the present invention.
Detailed Description
The present invention is further described in the following with reference to the drawings and the specific embodiments so that those skilled in the art can better understand the present invention and can implement the present invention, but the embodiments are not to be construed as limiting the present invention, and the embodiments and the technical features of the embodiments can be combined with each other without conflict.
It is to be understood that the terms first, second, and the like in the description of the embodiments of the invention are used for distinguishing between the descriptions and not necessarily for describing a sequential or chronological order. The "plurality" in the embodiment of the present invention means two or more.
The term "and/or" in the embodiment of the present invention is only an association relationship describing an associated object, and indicates that three relationships may exist, for example, a and/or B may indicate: a exists alone, B exists alone, and A and B exist at the same time. In addition, the character "/" herein generally indicates that the former and latter associated objects are in an "or" relationship.
Example one
As shown in fig. 1, the present embodiment provides an irregular text recognition system based on a modified network, which includes a text modified network and a text recognition network, wherein:
text correction network: the system is used for converting the irregular text picture into a regular text picture; specifically, the text correction network includes a prediction network and a picture gridding module, wherein: the prediction network is used for obtaining the position deviation of each corresponding pixel when the irregular text picture is converted into the regular text picture based on the convolutional neural network; the picture gridding module is used for generating a grid graph from the irregular text picture, obtaining the coordinate information of each pixel on the irregular text picture, integrating the coordinate information of each pixel with the corresponding position deviation, and outputting the converted coordinate information of each pixel.
Text recognition network: the system is used for identifying the regular text pictures and generating text information, the text identification network adopts an encoder-decoder structure, the encoder adopts a convolutional neural network and a cyclic neural network for feature extraction, and the decoder adopts a bidirectional LSTM and combines an attention mechanism.
Example two
The embodiment provides a method for constructing an irregular text recognition system based on a correction network, which comprises the following steps:
building a text correction network structure: build the prediction network of text correction network based on convolutional neural network to the pixel value of irregular text picture is output, builds five layers of structure, and the first layer includes that the one deck is the biggest pooling layer to be constituteed, and the second floor includes that one deck convolution layer and one deck are the biggest pooling layer, and the third layer includes that one deck convolution layer and one deck are the biggest pooling layer, and the fourth layer includes that three-layer convolution layer and one deck are the biggest pooling layer, and the fifth layer includes one deck convolution layer, wherein: the convolution layers from the second layer to the fourth layer are all followed by a batch normalization layer and a RELU activation function layer, and the convolution layer from the fifth layer is followed by a batch normalization layer and a Tanh activation function layer, and the deviation prediction information of two channels is output;
building a text recognition network structure: establishing a text recognition network based on an encoder-decoder structure, establishing an encoder based on a convolutional neural network and a cyclic neural network, establishing a decoder based on a bidirectional LSTM, and outputting characters based on character probability distribution in combination with an attention mechanism;
establishing a data set: selecting data sets, such as IIIT5K-Words, Street View Text, ICDAR2003, ICDAR2013, CUTE80, ICDAR2015, etc., and dividing the data sets into training sets and test sets;
network training: the method comprises the steps of learning network parameters of a text correction network and a text recognition network by adopting a course learning strategy, training the text recognition network by regular text pictures, fixing the text recognition network, training the text correction network by irregular texts, and finally training the text correction network and the text recognition network simultaneously in an end-to-end mode.
EXAMPLE III
As shown in fig. 1, the irregular text recognition method based on the modified network provided by the embodiment based on the irregular text recognition system trained in the second embodiment includes the following steps:
s1, converting the irregular text picture into a regular text picture through a text correction network; the steps specifically comprise the following contents:
s11, obtaining the position deviation coordinates of each pixel corresponding to the conversion from the irregular text picture to the regular text picture based on the convolutional neural network: specifically, an irregular text picture is input into a prediction network, a pixel value of the irregular text picture is used as input, a double-channel characteristic diagram is obtained through the prediction network, one channel corresponds to an X-axis deviation coordinate, the other channel corresponds to a Y-axis deviation coordinate, the corresponding relation between the channel and an X axis and the corresponding relation between the channel and the Y axis are correspondingly judged in a processing mode, the X axis corresponds to the width direction of the irregular text picture, and the Y axis corresponds to the height direction of the irregular picture; and the sizes of the generated feature images of the two channels are smaller than the size of the irregular text picture, and the feature images are converted into the sizes of the resize irregular text picture by a bilinear interpolation method to obtain the position deviation coordinates corresponding to each pixel on the irregular text picture. It should be noted here that the corresponding channel values on the feature map output by the prediction network are all within the range of [ -1,1], and when the size of the original irregular text picture of the feature map resize is expressed by a linear interpolation method, the size of the irregular text picture is expressed by a pixel value, that is, the pixels of the feature map are equally equal to the pixel values of the original irregular text picture by the linear interpolation method, so that the obtained position deviation coordinates of each pixel are within the range of [ -1,1 ];
in the above, the prediction network is established based on a convolutional neural network, and the prediction network includes five layers: the first layer comprises a maximum pooling layer, the second layer comprises a convolutional layer and a maximum pooling layer, the third layer comprises a convolutional layer and a maximum pooling layer, the fourth layer comprises a convolutional layer and a maximum pooling layer, and the fifth layer comprises a convolutional layer, wherein: the convolution layers from the second layer to the fourth layer are all followed by a batch normalization layer and a RELU activation function layer, and the convolution layer from the fifth layer is followed by a batch normalization layer and a Tanh activation function layer;
s12, acquiring the original position coordinates of each pixel on the original irregular text picture based on the regularization processing mode; the original position coordinates obtained at the position are consistent with the characteristic diagram by taking the center of the irregular text image as the origin of coordinates, taking the width direction of the irregular text image as the X-axis direction and the right direction as the positive direction of the X-axis, taking the height direction of the irregular text image as the Y-axis direction and the downward direction as the positive direction of the Y-axis. The size of the irregular text picture in the processing process is measured in length units, so that the obtained original position coordinate value of the pixel is based on the numerical value of the length units;
s13, carrying out normalization processing on the original position coordinates of each pixel to obtain the normalized coordinates of each pixel, wherein the specific processing process comprises the steps of obtaining the width w and the height h of an original picture, dividing the original position coordinates of each pixel by [ w/2, h/2] to obtain the normalized coordinates, and after normalizing the original position coordinates of the pixels, enabling the obtained normalized coordinates to correspond to the position deviation coordinates so as to facilitate subsequent summation processing;
s14, summing the normalized coordinates of each pixel and the corresponding position deviation coordinates to obtain the conversion position coordinates of each pixel, and splicing all the converted pixels into regular text pictures;
and S2, recognizing the regular text picture through a text recognition network and outputting corresponding text information, wherein the text recognition network adopts an encoder-decoder structure, the encoder is based on a convolutional neural network and a cyclic neural network, the convolutional neural network can adopt network structures such as AlexNet, VGG, ResNet and the like, and the decoder is based on bidirectional LSTM and combines an attention mechanism to finally obtain output based on character probability distribution. The technology for generating text information by using regular text pictures is mature, and the technical scheme of the embodiment can be understood and optimized by referring to the prior art.
The above-mentioned embodiments are merely preferred embodiments for fully illustrating the present invention, and the scope of the present invention is not limited thereto. The equivalent substitution or change made by the technical personnel in the technical field on the basis of the invention is all within the protection scope of the invention. The protection scope of the invention is subject to the claims.
Claims (10)
1. An irregular text recognition system based on a correction network is characterized in that: the method comprises a text correction network and a text recognition network, wherein:
text correction network: the system is used for converting the irregular text picture into a regular text picture;
text recognition network: and the text picture is used for identifying the rule and generating text information.
2. The revised network-based irregular text recognition system of claim 1, wherein: the text correction network comprises a prediction network and a picture gridding module, wherein:
predicting the network: acquiring the position deviation of each corresponding pixel when the irregular text picture is converted into the regular text picture based on the convolutional neural network;
the picture gridding module: and generating a grid graph for the irregular text picture, obtaining the coordinate information of each pixel on the irregular text picture, synthesizing the coordinate information of each pixel and the corresponding position deviation, outputting the converted coordinate information of each pixel, and further obtaining the regular text picture.
3. The revised network-based irregular text recognition system of claim 1, wherein: the text recognition network adopts an encoder-decoder structure, the encoder adopts a convolutional neural network and a cyclic neural network for feature extraction, and the decoder adopts a bidirectional LSTM and combines an attention mechanism.
4. An irregular text recognition method based on a correction network is characterized in that: the method comprises the following steps:
converting the irregular text picture into a regular text picture through a text correction network;
and identifying the regular text picture through a text identification network and outputting corresponding text information.
5. The irregular text recognition method based on modified network as claimed in claim 4, wherein: the step of converting the irregular text picture into the regular text picture comprises:
obtaining the position deviation coordinates of each pixel corresponding to the irregular text picture converted into the regular text picture based on the convolutional neural network;
acquiring the original position coordinates of each pixel on the original irregular text picture based on a regularization processing mode;
normalizing the original position coordinates of each pixel to obtain normalized coordinates of each pixel;
and summing the normalized coordinates of each pixel and the corresponding position deviation coordinates to obtain the conversion position coordinates of each pixel.
6. The irregular text recognition method based on modified network of claim 5, wherein: the step of obtaining the position deviation coordinates of each pixel corresponding to the conversion from the irregular text picture to the regular text picture based on the convolutional neural network comprises:
taking the pixel value of the irregular text picture as input, obtaining a dual-channel characteristic graph through a prediction network, wherein one channel corresponds to an X-axis deviation coordinate, the other channel corresponds to a Y-axis deviation coordinate, and the prediction network is established based on a convolutional neural network;
and the size of the feature map is smaller than that of the irregular text picture, and the feature map is converted into the size of the irregular text picture through a resize function, so that the position deviation coordinate corresponding to each pixel on the irregular text picture is obtained.
7. The irregular text recognition method based on modified network of claim 6, wherein: the prediction network comprises five layers: the first layer comprises a maximum pooling layer, the second layer comprises a convolutional layer and a maximum pooling layer, the third layer comprises a convolutional layer and a maximum pooling layer, the fourth layer comprises a convolutional layer and a maximum pooling layer, and the fifth layer comprises a convolutional layer, wherein: the convolution layers of the second layer to the fourth layer are all followed by a batch normalization layer and a RELU activation function layer, and the convolution layer of the fifth layer is followed by a batch normalization layer and a Tanh activation function layer.
8. The irregular text recognition method based on modified network of claim 6, wherein: the step of obtaining the normalized coordinate of each pixel by normalizing the original position coordinate of each pixel comprises the following steps:
and acquiring the width w and the height h of the original picture, and dividing the original position coordinate of each pixel by [ w/2, h/2] to obtain a normalized coordinate.
9. The irregular text recognition method based on modified network as claimed in claim 4, wherein: the step of identifying the regular text picture and outputting corresponding text information comprises: the text recognition network adopts an encoder-decoder structure, the encoder adopts a convolutional neural network for feature extraction, and the decoder adopts a bidirectional LSTM cyclic neural network and combines an attention mechanism to finally obtain output based on character probability distribution.
10. The irregular text recognition method based on modified network as claimed in claim 4, wherein: the establishment of the text correction network and the text recognition network comprises the following steps:
building a text correction network structure: build the prediction network of text correction network based on convolutional neural network to the pixel value of irregular text picture is output, builds five layers of structure, and the first layer includes that the one deck is the biggest pooling layer to be constituteed, and the second floor includes that one deck convolution layer and one deck are the biggest pooling layer, and the third layer includes that one deck convolution layer and one deck are the biggest pooling layer, and the fourth layer includes that three-layer convolution layer and one deck are the biggest pooling layer, and the fifth layer includes one deck convolution layer, wherein: the convolution layers from the second layer to the fourth layer are all followed by a batch normalization layer and a RELU activation function layer, and the convolution layer from the fifth layer is followed by a batch normalization layer and a Tanh activation function layer, and the deviation prediction information of two channels is output;
building a text recognition network structure: establishing a text recognition network based on an encoder-decoder structure, establishing an encoder based on a convolutional neural network and a cyclic neural network, establishing a decoder based on a bidirectional LSTM, and outputting characters based on character probability distribution in combination with an attention mechanism;
establishing a data set: selecting a data set, and dividing the data set into a training set and a test set;
network training: the method comprises the steps of learning network parameters of a text correction network and a text recognition network by adopting a course learning strategy, training the text recognition network by regular text pictures, fixing the text recognition network, training the text correction network by irregular texts, and finally training the text correction network and the text recognition network simultaneously in an end-to-end mode.
Priority Applications (1)
Application Number | Priority Date | Filing Date | Title |
---|---|---|---|
CN201911145879.8A CN110889404A (en) | 2019-11-21 | 2019-11-21 | Irregular text recognition system and method based on correction network |
Applications Claiming Priority (1)
Application Number | Priority Date | Filing Date | Title |
---|---|---|---|
CN201911145879.8A CN110889404A (en) | 2019-11-21 | 2019-11-21 | Irregular text recognition system and method based on correction network |
Publications (1)
Publication Number | Publication Date |
---|---|
CN110889404A true CN110889404A (en) | 2020-03-17 |
Family
ID=69748191
Family Applications (1)
Application Number | Title | Priority Date | Filing Date |
---|---|---|---|
CN201911145879.8A Pending CN110889404A (en) | 2019-11-21 | 2019-11-21 | Irregular text recognition system and method based on correction network |
Country Status (1)
Country | Link |
---|---|
CN (1) | CN110889404A (en) |
Cited By (3)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
CN111612009A (en) * | 2020-05-21 | 2020-09-01 | 腾讯科技(深圳)有限公司 | Text recognition method, device, equipment and storage medium |
CN111639657A (en) * | 2020-06-03 | 2020-09-08 | 浪潮软件股份有限公司 | Irregular character recognition method and device |
US11537781B1 (en) * | 2021-09-15 | 2022-12-27 | Lumos Information Services, LLC | System and method to support synchronization, closed captioning and highlight within a text document or a media file |
Citations (5)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
CN107679531A (en) * | 2017-06-23 | 2018-02-09 | 平安科技(深圳)有限公司 | Licence plate recognition method, device, equipment and storage medium based on deep learning |
CN108694393A (en) * | 2018-05-30 | 2018-10-23 | 深圳市思迪信息技术股份有限公司 | A kind of certificate image text area extraction method based on depth convolution |
CN109582786A (en) * | 2018-10-31 | 2019-04-05 | 中国科学院深圳先进技术研究院 | A kind of text representation learning method, system and electronic equipment based on autocoding |
CN110321894A (en) * | 2019-04-23 | 2019-10-11 | 浙江工业大学 | A kind of library book method for rapidly positioning based on deep learning OCR |
CN110427938A (en) * | 2019-07-26 | 2019-11-08 | 中科视语(北京)科技有限公司 | A kind of irregular character recognition device and method based on deep learning |
-
2019
- 2019-11-21 CN CN201911145879.8A patent/CN110889404A/en active Pending
Patent Citations (5)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
CN107679531A (en) * | 2017-06-23 | 2018-02-09 | 平安科技(深圳)有限公司 | Licence plate recognition method, device, equipment and storage medium based on deep learning |
CN108694393A (en) * | 2018-05-30 | 2018-10-23 | 深圳市思迪信息技术股份有限公司 | A kind of certificate image text area extraction method based on depth convolution |
CN109582786A (en) * | 2018-10-31 | 2019-04-05 | 中国科学院深圳先进技术研究院 | A kind of text representation learning method, system and electronic equipment based on autocoding |
CN110321894A (en) * | 2019-04-23 | 2019-10-11 | 浙江工业大学 | A kind of library book method for rapidly positioning based on deep learning OCR |
CN110427938A (en) * | 2019-07-26 | 2019-11-08 | 中科视语(北京)科技有限公司 | A kind of irregular character recognition device and method based on deep learning |
Cited By (4)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
CN111612009A (en) * | 2020-05-21 | 2020-09-01 | 腾讯科技(深圳)有限公司 | Text recognition method, device, equipment and storage medium |
CN111612009B (en) * | 2020-05-21 | 2021-10-22 | 腾讯科技(深圳)有限公司 | Text recognition method, device, equipment and storage medium |
CN111639657A (en) * | 2020-06-03 | 2020-09-08 | 浪潮软件股份有限公司 | Irregular character recognition method and device |
US11537781B1 (en) * | 2021-09-15 | 2022-12-27 | Lumos Information Services, LLC | System and method to support synchronization, closed captioning and highlight within a text document or a media file |
Similar Documents
Publication | Publication Date | Title |
---|---|---|
CN110363252B (en) | End-to-end trend scene character detection and identification method and system | |
CN110889404A (en) | Irregular text recognition system and method based on correction network | |
WO2022257578A1 (en) | Method for recognizing text, and apparatus | |
CN110689012A (en) | End-to-end natural scene text recognition method and system | |
CN110084172B (en) | Character recognition method and device and electronic equipment | |
CN112668588B (en) | Parking space information generation method, device, equipment and computer readable medium | |
CN107749048B (en) | Image correction system and method, and color blindness image correction system and method | |
CN110942071A (en) | License plate recognition method based on license plate classification and LSTM | |
CN108846387B (en) | Traffic police gesture recognition method and device | |
CN113591566A (en) | Training method and device of image recognition model, electronic equipment and storage medium | |
CN109523558A (en) | A kind of portrait dividing method and system | |
CN111008979A (en) | Robust night image semantic segmentation method | |
CN116030074A (en) | Identification method, re-identification method and related equipment for road diseases | |
CN113807340A (en) | Method for recognizing irregular natural scene text based on attention mechanism | |
CN112016560A (en) | Overlay text recognition method and device, electronic equipment and storage medium | |
CN112733777B (en) | Road extraction method, device and equipment of remote sensing image and storage medium | |
CN111062388B (en) | Advertisement character recognition method, system, medium and equipment based on deep learning | |
CN115272667B (en) | Farmland image segmentation model training method and device, electronic equipment and medium | |
CN117237681A (en) | Image processing method, device and related equipment | |
CN107221019B (en) | Chart conversion method and device | |
CN115861922A (en) | Sparse smoke and fire detection method and device, computer equipment and storage medium | |
CN113627243B (en) | Text recognition method and related device | |
CN114882047A (en) | Medical image segmentation method and system based on semi-supervision and Transformers | |
CN110807784B (en) | Method and device for segmenting an object | |
CN111291758B (en) | Method and device for recognizing seal characters |
Legal Events
Date | Code | Title | Description |
---|---|---|---|
PB01 | Publication | ||
PB01 | Publication | ||
SE01 | Entry into force of request for substantive examination | ||
SE01 | Entry into force of request for substantive examination | ||
RJ01 | Rejection of invention patent application after publication |
Application publication date: 20200317 |
|
RJ01 | Rejection of invention patent application after publication |