CN103942550A - Scene text recognition method based on sparse coding characteristics - Google Patents
Scene text recognition method based on sparse coding characteristics Download PDFInfo
- Publication number
- CN103942550A CN103942550A CN201410184072.6A CN201410184072A CN103942550A CN 103942550 A CN103942550 A CN 103942550A CN 201410184072 A CN201410184072 A CN 201410184072A CN 103942550 A CN103942550 A CN 103942550A
- Authority
- CN
- China
- Prior art keywords
- sparse coding
- character
- feature
- image
- scene text
- Prior art date
- Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
- Granted
Links
Landscapes
- Character Discrimination (AREA)
Abstract
The invention discloses a scene text recognition method based on sparse coding characteristics, and relates to computer vision and pattern recognition. The method includes the steps: inputting a natural scene text image to be recognized; by the aid of a multi-scale sliding window method, detecting and recognizing a window area in the image by a character classifier, for each character class, determining a large output area of the classifier as a candidate character area, determining a small output area as a background area, finding the candidate character area in the image, retaining the area with the largest output value of the classifier and the corresponding character class for the area with a large overlapping ratio by the aid of a non-maximum suppression method, and removing the repetitive and redundant candidate character area to obtain a character detection result; combining detected characters into a word or text line; outputting a scene text recognition result. Structural characteristics of the characters can be more effectively expressed and extracted, so that the recognition rate of a scene text is increased.
Description
Technical field
The present invention relates to computer vision and pattern-recognition, especially relate to a kind of scene text recognition methods based on sparse coding feature.
Background technology
Along with the product such as smart mobile phone and digital camera becomes more and more popular, to obtain picture and video and become easy, analysis and the understanding of image and video become one of research direction having broad prospect of application.In image and video, text message has comprised important semantic information, understanding to image and video has important value, such as the captions in the other billboard of cover page, the road of books, mark information, video etc. have all comprised bulk information, these information are convenient to the mankind and computer understanding and storage more.So at computer vision field, scene image text identification has attracted increasing concern.Due to scene image background complexity, size, the font of scene word, vary in color, and be subject to the impact of illumination variation and image degradation, this makes the identification of scene text have larger challenge.
Traditional OCR (optical character identification) technology can well be identified the fairly simple scan text document of background, but while being used for identifying on scene text, discrimination is very low.Scene text is different from the text document of scanning, in scene text, due to background more complicated, text filed just can identify afterwards must first be detected when text in identification.And in text document, carry out simple binary conversion treatment just can obtain text filedly clearly, adopt OCR to identify and just can obtain reasonable effect.So the identification of scene text not only will be identified text, has also comprised the detection to text.
The current thought that the identification of scene text is mainly adopted to the target detection in computer vision is carried out text detection and identification simultaneously.Its basic thought is, each class character is used as to a sensation target, and then detects character zone from scene text image, also provided identification classification and the identification mark in candidate characters region simultaneously.On the basis of character detection and Identification, then candidate characters region and corresponding character class are coupled together, obtain the recognition result of scene text.This method of simultaneously carrying out detection and Identification puts forward at international top-level meeting ICCV2011, has shown the recognition performance that is better than traditional OCR.The research of this respect has also been carried out in a lot of research that has for several years afterwards, has improved the performance of scene text identification.But, in the scene text recognition methods detecting at these based targets, (what use due to character detection and Identification is same sorter to character classification device, the unified character classification device of using below) what adopt is gradient orientation histogram feature (being HOG, Histogram of Gradients) conventional in target detection.HOG feature can be expressed local appearance features and the shape facility of target preferably, and to illumination-insensitive, so HOG feature is widely used in the Computer Vision Task such as face detection, pedestrian detection.In the scene text recognizer of current proposition, HOG feature extracting method is also used as the feature extraction algorithm of character classification device.
Although HOG feature can represent local feature (such as edge etc.), HOG feature can not effective expression structural information.Especially to character recognition, the structural information of character is very important information, can effectively distinguish the textural difference between character, thereby improves character identification rate.Scene text recognition methods based on sparse coding feature, does not also have the report of Patents or document.
Summary of the invention
The object of the invention is to for the feature extraction of character classification device in current scene text identification can not effective expression charcter topology information etc. problem, a kind of scene text recognition methods based on sparse coding feature is provided.
The present invention includes following steps:
Step S1: input natural scene text image to be identified;
Step S2: the method that adopts multi-scale sliding window mouth, window area in image is carried out to detection and Identification with character classification device, to each character class, be candidate characters region by the larger regional determination of sorter output, export less region and think background area, find out like this candidate characters region comprising in image, adopt again non-maximum value inhibition method, the larger region of Duplication is only retained to region and the corresponding character class of sorter output valve maximum, the candidate characters region of removing repeated and redundant, obtains character testing result;
In step S2, the feature extraction of described character classification device can adopt the feature based on sparse coding, sorter training adopts training comparatively simply and recognition speed Random Fern sorter or svm classifier device faster, and the characteristic extraction procedure of described sparse coding comprises the steps:
Step S201: by a large amount of natural scene picture data, obtain a sparse coding dictionary with general applicability with K-SVD Algorithm Learning;
In step S201, described K-SVD algorithm is when study dictionary (representing with D), each element of dictionary D is designed to 9 × 9 picture, represent the total architectural feature that study obtains, dictionary D comprises 100 elements (size that is dictionary is 100) altogether, this makes dictionary have higher expression ability, makes calculated amount be controlled at acceptable scope simultaneously.
Step S202: the sparse coding dictionary that study is obtained is preserved, wherein, what in dictionary, each element was described is some important structural informations;
Step S203: utilize the dictionary of preserving in step S202, extract the sparse coding feature of image;
In step S203, the concrete grammar of the sparse coding feature of described extraction image can be: to each pixel of image, decode and obtain the sparse coding of pixel by Orthogonal Matching Pursuit (OMP) algorithm, the sparse coding obtaining being added up to the histogram that obtains sparse coding (is Histogram of Sparse Codes again, HSC), thereby obtain the sparse coding feature of image, i.e. HSC feature;
The described histogram that obtains sparse coding that the sparse coding obtaining is added up, thereby the method that obtains the sparse coding feature of image can be: when sparse coding is added up to the histogram that obtains sparse coding, having adopted and being similar to histogram of gradients feature (is HOG feature, Histogram of Oriented Gradients) method, concrete steps comprise:
First, the picture of input is divided into 8 × 8 junior unit piece, adds up the sparse coding of each junior unit piece;
Then, use bilinear interpolation to utilize the adjacent block of each junior unit piece to calculate the sparse coding feature of each junior unit piece, also the feature on each junior unit piece is to ask interpolation to obtain on the neighborhood of 16 × 16;
Finally, the proper vector of all junior unit pieces is linked up to the sparse coding feature that obtains whole image, i.e. HSC feature.
Step S3: the character detecting is merged into a word or line of text;
In step S3, described the character detecting is merged into a word or line of text, owing to each character class having been retained to a large amount of candidate characters regions, when being merged into word, character has a large amount of array modes, therefore can adopt dynamic programming algorithm search to obtain identifying the character combination mode of mark maximum, obtain final text identification result;
Described employing dynamic programming algorithm search obtains identifying the character combination mode of mark maximum, needs an objective function to evaluate the score of each combination; The design of described objective function can adopt following methods:
With w=(c
1, c
2..., c
n) expression candidate word, wherein a c
i(i=1,2 ..., character class n) comprising in expression candidate word, n is character number (being text size), x
irepresent c
icandidate characters region, objective function is designed to:
Wherein S (c
i, x
i) be that character classification device is by candidate characters x
ibe identified as c
iscore, g (x
i, x
i+1) be the output of geometric model, candidate characters x has been described
iand x
i+1compatibility in geometric relationship, α and β are two and regulate parameter.
In described objective function, geometric model g (x
i, x
i+1) what describe is whether two geometric properties between adjacent character are intercharacter features, two class classification problems, geometric properties is carried out to modeling with a svm classifier device, the geometric properties extracting when modeling comprises Duplication, the distance of up-and-down boundary etc. of yardstick similarity, adjacent character.
In described objective function, consider the impact of text size, therefore (additive method is not considered the number of character can to overcome the impact of character length on recognition result, character number is larger, objective function can be larger, cause the recognition result of identifying additive method to tend to the more text of number of characters), thus text identification rate improved.
In described objective function, regulate parameter alpha and β to adopt minimum classification error rate training method (Minimum Classification Error Training) to obtain at scene text database learning.
Step S4: output scene text identification result.
The present invention proposes a kind of scene text recognition methods based on sparse coding feature, character classification device of the present invention has adopted the feature extracting method based on sparse coding, can more effectively represent and extract the architectural feature of character, thereby improve the discrimination of scene text.
The sparse coding feature adopting in the present invention, i.e. Histogram of Sparse Codes (HSC) feature, can automatic learning and represent the structural information of character, thereby can describe better the feature of character, improves text identification rate.Meanwhile, text recognition method of the present invention is the also integrated output of character classification device, the output of geometric model, and considered the impact of text size (character number comprising in text) on recognition result.Parameter in text identification obtains by minimum classification error rate training method automatic learning, the higher performance of gain of parameter that this sets than experience.The present invention can be widely used in the occasions such as scene text identification.
Scene text recognition methods based on sparse coding feature provided by the invention, compared with additive method, the advantage and the beneficial effect that have are as follows:
1, character classification device of the present invention has adopted the feature extraction algorithm (being HSC) based on sparse coding, this feature extraction algorithm can represent abundant structural information better, improve the discriminating power of feature, thus detection and Identification character better.
2, the feature extraction algorithm based on sparse coding of the present invention is in the time extracting feature, and feature is directly learnt to obtain by sparse decode procedure, does not therefore need manual setting.
3, method of the present invention is in the time that search obtains optimum character combination mode, how much compatible (being geometric model) between candidate characters are also considered, this has effectively utilized the effective informations such as the geometric properties between character, has therefore improved text identification rate.
4, in method of the present invention, objective function has been considered the impact of text size, therefore can overcome the impact of text size on recognition result, thereby has improved scene text discrimination.
5, in method of the present invention, the parameter in objective function is obtained by MCE automatic learning, therefore can obtain more superior recognition effect.
6, method of the present invention, applicable to the scene text recognition methods of Chinese or the language such as English, in the time of training character sorter, adopts corresponding character database to train.
Brief description of the drawings
Fig. 1 is method flow block diagram of the present invention.
Fig. 2 is the sparse coding dictionary obtaining with K-SVD Algorithm Learning.Wherein, be (a) 5 × 5, be (b) 7 × 7, be (c) 9 × 9.
Fig. 3 is the comparative examples of HSC and HOG feature extraction result.Wherein, (1) is original character image, and (2) are HSC mark sheet diagram, and (3) are HOG mark sheet diagram.
Fig. 4 implements identifying and the result example that the present invention obtains.
Embodiment
For technical method of the present invention and advantage are further explained, below in conjunction with the drawings and specific embodiments, the present invention is described further.
As shown in the method flow diagram in Fig. 1, the present invention includes following steps:
Step S1: input natural scene text image to be identified;
Step S2: the method that adopts multi-scale sliding window mouth, window area in image is carried out to detection and Identification with character classification device, to each character class, be candidate characters region by the larger regional determination of sorter output, export less region and think background area, find out like this candidate characters region comprising in image.Adopt again non-maximum value inhibition method, the larger region of Duplication is only retained to region and the corresponding character class of sorter output valve maximum, remove like this candidate characters region of a large amount of repeated and redundant, obtain character testing result;
In this step, use a character classification device that precondition is good.Character classification device of the present invention adopts the feature extracting method based on sparse coding, and sorter adopts conventional Random Fern or svm classifier device, and other machine learning algorithm is such as Boosting, neural network etc., all can be used for learning character sorter.The database adopting when training is individual character database, can select as required English data number (for english identification) or Chinese database (Chinese sentence identification) to train.
Wherein, above, the leaching process of the described feature extraction algorithm based on sparse coding is as follows:
Step S201: by a large amount of natural scene picture data, obtain a sparse coding dictionary with general applicability with K-SVD Algorithm Learning.
Wherein, described K-SVD algorithm is when study dictionary (representing with D), each element of dictionary D is designed to 9 × 9 picture, represent the total architectural feature that study obtains, dictionary D comprises 100 elements (size that is dictionary is 100) altogether, this makes dictionary have higher expression ability, makes calculated amount be controlled at acceptable scope simultaneously.As shown in Figure 2, be the dictionary example obtaining with K-SVD Algorithm Learning, wherein dictionary comprises 100 elements, each element can make the image of 5 × 5,7 × 7 or 9 × 9 sizes, the pixel of element is more, and the structural information that can express is abundanter, but corresponding calculated amount is also larger.In the specific embodiment of the present invention, selecting image size is 9 × 9.
Step S202: the sparse coding dictionary that study is obtained saves, wherein, what in dictionary, each element was described is some important structural informations.
Step S203: utilize the dictionary obtaining in step S202, extract the sparse coding feature of image, while extracting feature, to each pixel of image, decode and obtain the sparse coding of pixel by Orthogonal Matching Pursuit (OMP) algorithm, then the sparse coding obtaining is added up to the histogram (being Histogram of Sparse Codes, HSC) that obtains sparse coding, thereby obtain the sparse coding feature of image, i.e. HSC feature.
Wherein, the process that sparse coding statistics is obtained to HSC feature is as follows: when sparse coding is added up to the histogram that obtains sparse coding, having adopted and being similar to histogram of gradients feature (is HOG feature, Histogram of Oriented Gradients) method, concrete steps comprise:
First, the picture of input is divided into 8 × 8 junior unit piece, adds up the sparse coding of each junior unit piece;
Then, use bilinear interpolation to utilize the adjacent block of each junior unit piece to calculate the sparse coding feature of each junior unit piece, also the feature on each junior unit piece is to ask interpolation to obtain on the neighborhood of 16 × 16;
Finally, the proper vector of all junior unit pieces is linked up to the sparse coding feature that obtains whole image, i.e. HSC feature.
As shown in Figure 3, it is the visualization result contrast of the feature to several character samples and the extraction of non-character sample with HSC and HOG feature extraction algorithm, the abundanter structural information of can having found out HSC character representation, such as information such as texture, edge, angle points, the information of HOG character representation is taking edge as main, and the structural information that does not have HSC to provide is abundant.
Step S3, the character detecting is merged into a word (or being line of text), owing to each character class having been retained to a large amount of candidate characters regions, when being merged into word, character has a large amount of array modes, therefore this step adopts dynamic programming algorithm, the character combination mode that obtains identifying mark maximum according to described objective function search below, obtains final text identification result.Objective function is:
Wherein, S (c
i, x
i) be that character classification device is by candidate characters x
ibe identified as c
iscore, g (x
i, x
i+1) be candidate characters x
iand x
i+1compatibility in geometric relationship, α and β are two and regulate parameter.Regulate parameter alpha and β to be obtained by minimum classification error rate training method (being MCE training method) study, the database using is text database.The optimum character combination mode that the search of dynamic programming searching algorithm obtains, is last text identification result.
Step S4: output scene text identification result.
As shown in Figure 4, be to implement identifying and the result example that the present invention obtains.
Essential characteristic of the present invention is that character classification device has used the feature extracting method based on sparse coding, can describe better the structural information of character, effectively improves the accuracy that character identification rate and character detect, thereby improves scene text discrimination.The present invention has overcome HOG feature and can not better describe the problem of character feature, in conjunction with the feature of scene text detection and Identification, designs the scene text recognition methods based on sparse coding feature.
Claims (10)
1. the scene text recognition methods based on sparse coding feature, is characterized in that comprising the steps:
Step S1: input natural scene text image to be identified;
Step S2: the method that adopts multi-scale sliding window mouth, window area in image is carried out to detection and Identification with character classification device, to each character class, be candidate characters region by the larger regional determination of sorter output, export less region and think background area, find out like this candidate characters region comprising in image, adopt again non-maximum value inhibition method, the larger region of Duplication is only retained to region and the corresponding character class of sorter output valve maximum, the candidate characters region of removing repeated and redundant, obtains character testing result;
Step S3: the character detecting is merged into a word or line of text;
Step S4: output scene text identification result.
2. a kind of scene text recognition methods based on sparse coding feature as claimed in claim 1, it is characterized in that in step S2, the feature extraction of described character classification device is the feature adopting based on sparse coding, and sorter training adopts training comparatively simply and recognition speed Random Fern sorter or svm classifier device faster.
3. a kind of scene text recognition methods based on sparse coding feature as claimed in claim 2, is characterized in that the characteristic extraction procedure of described sparse coding comprises the steps:
Step S201: by a large amount of natural scene picture data, obtain a sparse coding dictionary with general applicability with K-SVD Algorithm Learning;
Step S202: the sparse coding dictionary that study is obtained is preserved, wherein, what in dictionary, each element was described is some important structural informations;
Step S203: utilize the dictionary of preserving in step S202, extract the sparse coding feature of image.
4. a kind of scene text recognition methods based on sparse coding feature as claimed in claim 3, it is characterized in that in step S201, described K-SVD algorithm is in the time of study dictionary, each element of dictionary is designed to 9 × 9 picture, represented the total architectural feature that study obtains, dictionary comprises 100 elements altogether, and the size of dictionary is 100, this makes dictionary have higher expression ability, makes calculated amount be controlled at acceptable scope simultaneously.
5. a kind of scene text recognition methods based on sparse coding feature as claimed in claim 3, it is characterized in that in step S203, the concrete grammar of the sparse coding feature of described extraction image is: to each pixel of image, decode and obtain the sparse coding of pixel by Orthogonal Matching Pursuit algorithm, again the sparse coding obtaining is added up to the histogram that obtains sparse coding, thereby obtain the sparse coding feature of image, i.e. HSC feature.
6. a kind of scene text recognition methods based on sparse coding feature as claimed in claim 5, it is characterized in that the described histogram that obtains sparse coding that the sparse coding obtaining is added up, thereby the method that obtains the sparse coding feature of image is: when sparse coding is added up to the histogram that obtains sparse coding, adopted the method that is similar to histogram of gradients feature, concrete steps comprise:
First, the picture of input is divided into 8 × 8 junior unit piece, adds up the sparse coding of each junior unit piece;
Then, use bilinear interpolation to utilize the adjacent block of each junior unit piece to calculate the sparse coding feature of each junior unit piece, also the feature on each junior unit piece is to ask interpolation to obtain on the neighborhood of 16 × 16;
Finally, the proper vector of all junior unit pieces is linked up to the sparse coding feature that obtains whole image, i.e. HSC feature.
7. a kind of scene text recognition methods based on sparse coding feature as claimed in claim 1, it is characterized in that in step S3, described the character detecting is merged into a word or line of text, owing to each character class having been retained to a large amount of candidate characters regions, when being merged into word, character has a large amount of array modes, therefore adopt dynamic programming algorithm search to obtain identifying the character combination mode of mark maximum, obtain final text identification result.
8. a kind of scene text recognition methods based on sparse coding feature as claimed in claim 7, it is characterized in that described employing dynamic programming algorithm search obtains identifying the character combination mode of mark maximum, needs an objective function to evaluate the score of each combination; The design of described objective function adopts following methods:
With w=(c
1, c
2..., c
n) expression candidate word, wherein a c
i(i=1,2 ..., character class n) comprising in expression candidate word, n is character number (being text size), x
irepresent c
icandidate characters region, objective function is designed to:
Wherein S (c
i, x
i) be that character classification device is by candidate characters x
ibe identified as c
iscore, g (x
i, x
i+1) be the output of geometric model, candidate characters x has been described
iand x
i+1compatibility in geometric relationship, α and β are two and regulate parameter.
9. a kind of scene text recognition methods based on sparse coding feature as claimed in claim 8, is characterized in that in described objective function geometric model g (x
i, x
i+1) what describe is whether two geometric properties between adjacent character are intercharacter features, two class classification problems, geometric properties is carried out to modeling with a svm classifier device, the geometric properties extracting when modeling comprises the Duplication of yardstick similarity, adjacent character, the distance of up-and-down boundary.
10. a kind of scene text recognition methods based on sparse coding feature as claimed in claim 8, it is characterized in that in described objective function, consider the impact of text size, therefore can overcome the impact of character length on recognition result, thereby improved text identification rate; Regulate parameter alpha and β can adopt minimum classification error rate training method to obtain at scene text database learning.
Priority Applications (1)
Application Number | Priority Date | Filing Date | Title |
---|---|---|---|
CN201410184072.6A CN103942550B (en) | 2014-05-04 | 2014-05-04 | A kind of scene text recognition methods based on sparse coding feature |
Applications Claiming Priority (1)
Application Number | Priority Date | Filing Date | Title |
---|---|---|---|
CN201410184072.6A CN103942550B (en) | 2014-05-04 | 2014-05-04 | A kind of scene text recognition methods based on sparse coding feature |
Publications (2)
Publication Number | Publication Date |
---|---|
CN103942550A true CN103942550A (en) | 2014-07-23 |
CN103942550B CN103942550B (en) | 2018-11-02 |
Family
ID=51190213
Family Applications (1)
Application Number | Title | Priority Date | Filing Date |
---|---|---|---|
CN201410184072.6A Active CN103942550B (en) | 2014-05-04 | 2014-05-04 | A kind of scene text recognition methods based on sparse coding feature |
Country Status (1)
Country | Link |
---|---|
CN (1) | CN103942550B (en) |
Cited By (24)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
CN104239872A (en) * | 2014-09-26 | 2014-12-24 | 南开大学 | Abnormal Chinese character identification method |
CN104268536A (en) * | 2014-10-11 | 2015-01-07 | 烽火通信科技股份有限公司 | Face detection method through images |
CN104537362A (en) * | 2015-01-16 | 2015-04-22 | 中国科学院自动化研究所 | Domain-based self-adaptive English scene character recognition method |
CN104598881A (en) * | 2015-01-12 | 2015-05-06 | 中国科学院信息工程研究所 | Feature compression and feature selection based skew scene character recognition method |
CN105404868A (en) * | 2015-11-19 | 2016-03-16 | 电子科技大学 | Interaction platform based method for rapidly detecting text in complex background |
CN105608456A (en) * | 2015-12-22 | 2016-05-25 | 华中科技大学 | Multi-directional text detection method based on full convolution network |
CN105631469A (en) * | 2015-12-18 | 2016-06-01 | 华南理工大学 | Bird image recognition method by multilayer sparse coding features |
CN105740909A (en) * | 2016-02-02 | 2016-07-06 | 华中科技大学 | Text recognition method under natural scene on the basis of spatial transformation |
CN106446909A (en) * | 2016-09-06 | 2017-02-22 | 广东顺德中山大学卡内基梅隆大学国际联合研究院 | Chinese food image feature extraction method |
CN107886065A (en) * | 2017-11-06 | 2018-04-06 | 哈尔滨工程大学 | A kind of Serial No. recognition methods of mixing script |
CN108304835A (en) * | 2018-01-30 | 2018-07-20 | 百度在线网络技术(北京)有限公司 | character detecting method and device |
US20190156156A1 (en) * | 2017-11-17 | 2019-05-23 | Hong Kong Applied Science and Technology Research Institute Company Limited | Flexible integrating recognition and semantic processing |
CN109993040A (en) * | 2018-01-03 | 2019-07-09 | 北京世纪好未来教育科技有限公司 | Text recognition method and device |
CN110135248A (en) * | 2019-04-03 | 2019-08-16 | 华南理工大学 | A kind of natural scene Method for text detection based on deep learning |
CN110345954A (en) * | 2018-04-03 | 2019-10-18 | 奥迪股份公司 | Navigation system and method |
CN110688949A (en) * | 2019-09-26 | 2020-01-14 | 北大方正集团有限公司 | Font identification method and apparatus |
CN110796129A (en) * | 2018-08-03 | 2020-02-14 | 珠海格力电器股份有限公司 | Text line region detection method and device |
CN110796143A (en) * | 2019-10-31 | 2020-02-14 | 天津大学 | Scene text recognition method based on man-machine cooperation |
CN111738326A (en) * | 2020-06-16 | 2020-10-02 | 中国工商银行股份有限公司 | Sentence granularity marking training sample generation method and device |
CN111898411A (en) * | 2020-06-16 | 2020-11-06 | 华南理工大学 | Text image labeling system, method, computer device and storage medium |
CN113642584A (en) * | 2021-08-13 | 2021-11-12 | 北京百度网讯科技有限公司 | Character recognition method, device, equipment, storage medium and intelligent dictionary pen |
CN114708580A (en) * | 2022-04-08 | 2022-07-05 | 北京百度网讯科技有限公司 | Text recognition method, model training method, device, apparatus, storage medium, and program |
CN114758340A (en) * | 2021-12-21 | 2022-07-15 | 上海沐冉信息科技有限公司 | Intelligent identification method, device and equipment for logistics address and storage medium |
US11645733B2 (en) | 2020-06-16 | 2023-05-09 | Bank Of America Corporation | System and method for providing artificial intelligence architectures to people with disabilities |
Citations (3)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
CN102799879A (en) * | 2012-07-12 | 2012-11-28 | 中国科学技术大学 | Method for identifying multi-language multi-font characters from natural scene image |
CN103336961A (en) * | 2013-07-22 | 2013-10-02 | 中国科学院自动化研究所 | Interactive natural scene text detection method |
US8934716B2 (en) * | 2012-01-11 | 2015-01-13 | Electronics And Telecommunications Research Institute | Method and apparatus for sequencing off-line character from natural scene |
-
2014
- 2014-05-04 CN CN201410184072.6A patent/CN103942550B/en active Active
Patent Citations (3)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
US8934716B2 (en) * | 2012-01-11 | 2015-01-13 | Electronics And Telecommunications Research Institute | Method and apparatus for sequencing off-line character from natural scene |
CN102799879A (en) * | 2012-07-12 | 2012-11-28 | 中国科学技术大学 | Method for identifying multi-language multi-font characters from natural scene image |
CN103336961A (en) * | 2013-07-22 | 2013-10-02 | 中国科学院自动化研究所 | Interactive natural scene text detection method |
Non-Patent Citations (3)
Title |
---|
KAI WANG等: "End-to-End Scene Text Recognition", 《2011 IEEE INTERNATIONAL CONFERENCE ON COMPUTER VISION》 * |
XIAOFENG REN等: "Histograms of Sparse Codes for Object Detection", 《2013 IEEE CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION》 * |
张伟伟 等: "基于DPM的自然场景下汉字识别方法", 《计算机应用研究》 * |
Cited By (37)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
CN104239872A (en) * | 2014-09-26 | 2014-12-24 | 南开大学 | Abnormal Chinese character identification method |
CN104268536B (en) * | 2014-10-11 | 2017-07-18 | 南京烽火软件科技有限公司 | A kind of image method for detecting human face |
CN104268536A (en) * | 2014-10-11 | 2015-01-07 | 烽火通信科技股份有限公司 | Face detection method through images |
CN104598881A (en) * | 2015-01-12 | 2015-05-06 | 中国科学院信息工程研究所 | Feature compression and feature selection based skew scene character recognition method |
CN104598881B (en) * | 2015-01-12 | 2017-09-29 | 中国科学院信息工程研究所 | Feature based compresses the crooked scene character recognition method with feature selecting |
CN104537362A (en) * | 2015-01-16 | 2015-04-22 | 中国科学院自动化研究所 | Domain-based self-adaptive English scene character recognition method |
CN104537362B (en) * | 2015-01-16 | 2017-12-01 | 中国科学院自动化研究所 | A kind of English scene character recognition method adaptive based on domain |
CN105404868A (en) * | 2015-11-19 | 2016-03-16 | 电子科技大学 | Interaction platform based method for rapidly detecting text in complex background |
CN105404868B (en) * | 2015-11-19 | 2019-05-10 | 电子科技大学 | The rapid detection method of text in a kind of complex background based on interaction platform |
CN105631469A (en) * | 2015-12-18 | 2016-06-01 | 华南理工大学 | Bird image recognition method by multilayer sparse coding features |
CN105608456B (en) * | 2015-12-22 | 2017-07-18 | 华中科技大学 | A kind of multi-direction Method for text detection based on full convolutional network |
CN105608456A (en) * | 2015-12-22 | 2016-05-25 | 华中科技大学 | Multi-directional text detection method based on full convolution network |
CN105740909A (en) * | 2016-02-02 | 2016-07-06 | 华中科技大学 | Text recognition method under natural scene on the basis of spatial transformation |
CN106446909A (en) * | 2016-09-06 | 2017-02-22 | 广东顺德中山大学卡内基梅隆大学国际联合研究院 | Chinese food image feature extraction method |
CN107886065A (en) * | 2017-11-06 | 2018-04-06 | 哈尔滨工程大学 | A kind of Serial No. recognition methods of mixing script |
CN109983473A (en) * | 2017-11-17 | 2019-07-05 | 香港应用科技研究院有限公司 | Flexible integrated identification and semantic processes |
US10810467B2 (en) | 2017-11-17 | 2020-10-20 | Hong Kong Applied Science and Technology Research Institute Company Limited | Flexible integrating recognition and semantic processing |
WO2019096270A1 (en) * | 2017-11-17 | 2019-05-23 | Hong Kong Applied Science and Technology Research Institute Company Limited | Flexible integrating recognition and semantic processing |
CN109983473B (en) * | 2017-11-17 | 2023-01-31 | 香港应用科技研究院有限公司 | Flexible integrated recognition and semantic processing |
US20190156156A1 (en) * | 2017-11-17 | 2019-05-23 | Hong Kong Applied Science and Technology Research Institute Company Limited | Flexible integrating recognition and semantic processing |
CN109993040A (en) * | 2018-01-03 | 2019-07-09 | 北京世纪好未来教育科技有限公司 | Text recognition method and device |
CN109993040B (en) * | 2018-01-03 | 2021-07-30 | 北京世纪好未来教育科技有限公司 | Text recognition method and device |
CN108304835A (en) * | 2018-01-30 | 2018-07-20 | 百度在线网络技术(北京)有限公司 | character detecting method and device |
CN110345954A (en) * | 2018-04-03 | 2019-10-18 | 奥迪股份公司 | Navigation system and method |
CN110796129A (en) * | 2018-08-03 | 2020-02-14 | 珠海格力电器股份有限公司 | Text line region detection method and device |
CN110135248A (en) * | 2019-04-03 | 2019-08-16 | 华南理工大学 | A kind of natural scene Method for text detection based on deep learning |
CN110688949B (en) * | 2019-09-26 | 2022-11-01 | 北大方正集团有限公司 | Font identification method and apparatus |
CN110688949A (en) * | 2019-09-26 | 2020-01-14 | 北大方正集团有限公司 | Font identification method and apparatus |
CN110796143A (en) * | 2019-10-31 | 2020-02-14 | 天津大学 | Scene text recognition method based on man-machine cooperation |
CN111898411A (en) * | 2020-06-16 | 2020-11-06 | 华南理工大学 | Text image labeling system, method, computer device and storage medium |
CN111738326A (en) * | 2020-06-16 | 2020-10-02 | 中国工商银行股份有限公司 | Sentence granularity marking training sample generation method and device |
US11645733B2 (en) | 2020-06-16 | 2023-05-09 | Bank Of America Corporation | System and method for providing artificial intelligence architectures to people with disabilities |
CN113642584A (en) * | 2021-08-13 | 2021-11-12 | 北京百度网讯科技有限公司 | Character recognition method, device, equipment, storage medium and intelligent dictionary pen |
CN113642584B (en) * | 2021-08-13 | 2023-11-28 | 北京百度网讯科技有限公司 | Character recognition method, device, equipment, storage medium and intelligent dictionary pen |
CN114758340A (en) * | 2021-12-21 | 2022-07-15 | 上海沐冉信息科技有限公司 | Intelligent identification method, device and equipment for logistics address and storage medium |
CN114708580A (en) * | 2022-04-08 | 2022-07-05 | 北京百度网讯科技有限公司 | Text recognition method, model training method, device, apparatus, storage medium, and program |
CN114708580B (en) * | 2022-04-08 | 2024-04-16 | 北京百度网讯科技有限公司 | Text recognition method, text recognition model training method, text recognition device, model training device, text recognition program, model training program, and computer-readable storage medium |
Also Published As
Publication number | Publication date |
---|---|
CN103942550B (en) | 2018-11-02 |
Similar Documents
Publication | Publication Date | Title |
---|---|---|
CN103942550A (en) | Scene text recognition method based on sparse coding characteristics | |
Yi et al. | Feature representations for scene text character recognition: A comparative study | |
Shahab et al. | ICDAR 2011 robust reading competition challenge 2: Reading text in scene images | |
Coates et al. | Text detection and character recognition in scene images with unsupervised feature learning | |
John et al. | Offline handwritten Malayalam Character Recognition based on chain code histogram | |
CN103984943A (en) | Scene text identification method based on Bayesian probability frame | |
CN106127222B (en) | A kind of the similarity of character string calculation method and similitude judgment method of view-based access control model | |
Zhou et al. | Detecting multilingual text in natural scene | |
Zhang et al. | Automatic discrimination of text and non-text natural images | |
Huang et al. | Text detection and recognition in natural scene images | |
Afakh et al. | Aksara jawa text detection in scene images using convolutional neural network | |
CN102136074B (en) | Man-machine interface (MMI) based wood image texture analyzing and identifying method | |
Asad et al. | High performance OCR for camera-captured blurred documents with LSTM networks | |
Ayesh et al. | A robust line segmentation algorithm for Arabic printed text with diacritics | |
Parameshwaran et al. | Transfer learning for classifying single hand gestures on comprehensive Bharatanatyam Mudra dataset | |
Ren et al. | A CNN based scene Chinese text recognition algorithm with synthetic data engine | |
Yokobayashi et al. | Segmentation and recognition of characters in scene images using selective binarization in color space and gat correlation | |
Antony et al. | Haar features based handwritten character recognition system for Tulu script | |
Wshah et al. | A novel lexicon reduction method for Arabic handwriting recognition | |
Yildirim et al. | Text recognition in natural images using multiclass hough forests | |
Mukhiddinov | Scene text detection and localization using fully convolutional network | |
Daood et al. | Handwriting detection and recognition of Arabic numbers and characters using deep learning methods | |
Khan et al. | A holistic approach to Urdu language word recognition using deep neural networks | |
Kumar et al. | Scene text recognition using artificial neural network: a survey | |
Ghosh et al. | Efficient indexing for query by string text retrieval |
Legal Events
Date | Code | Title | Description |
---|---|---|---|
C06 | Publication | ||
PB01 | Publication | ||
C10 | Entry into substantive examination | ||
SE01 | Entry into force of request for substantive examination | ||
GR01 | Patent grant | ||
GR01 | Patent grant |