CN103942550A - Scene text recognition method based on sparse coding characteristics - Google Patents

Scene text recognition method based on sparse coding characteristics Download PDF

Info

Publication number
CN103942550A
CN103942550A CN201410184072.6A CN201410184072A CN103942550A CN 103942550 A CN103942550 A CN 103942550A CN 201410184072 A CN201410184072 A CN 201410184072A CN 103942550 A CN103942550 A CN 103942550A
Authority
CN
China
Prior art keywords
sparse coding
character
feature
image
scene text
Prior art date
Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
Granted
Application number
CN201410184072.6A
Other languages
Chinese (zh)
Other versions
CN103942550B (en
Inventor
王菡子
王大寒
章冬
Current Assignee (The listed assignees may be inaccurate. Google has not performed a legal analysis and makes no representation or warranty as to the accuracy of the list.)
Xiamen University
Original Assignee
Xiamen University
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
Filing date
Publication date
Application filed by Xiamen University filed Critical Xiamen University
Priority to CN201410184072.6A priority Critical patent/CN103942550B/en
Publication of CN103942550A publication Critical patent/CN103942550A/en
Application granted granted Critical
Publication of CN103942550B publication Critical patent/CN103942550B/en
Active legal-status Critical Current
Anticipated expiration legal-status Critical

Links

Landscapes

  • Character Discrimination (AREA)

Abstract

The invention discloses a scene text recognition method based on sparse coding characteristics, and relates to computer vision and pattern recognition. The method includes the steps: inputting a natural scene text image to be recognized; by the aid of a multi-scale sliding window method, detecting and recognizing a window area in the image by a character classifier, for each character class, determining a large output area of the classifier as a candidate character area, determining a small output area as a background area, finding the candidate character area in the image, retaining the area with the largest output value of the classifier and the corresponding character class for the area with a large overlapping ratio by the aid of a non-maximum suppression method, and removing the repetitive and redundant candidate character area to obtain a character detection result; combining detected characters into a word or text line; outputting a scene text recognition result. Structural characteristics of the characters can be more effectively expressed and extracted, so that the recognition rate of a scene text is increased.

Description

A kind of scene text recognition methods based on sparse coding feature
Technical field
The present invention relates to computer vision and pattern-recognition, especially relate to a kind of scene text recognition methods based on sparse coding feature.
Background technology
Along with the product such as smart mobile phone and digital camera becomes more and more popular, to obtain picture and video and become easy, analysis and the understanding of image and video become one of research direction having broad prospect of application.In image and video, text message has comprised important semantic information, understanding to image and video has important value, such as the captions in the other billboard of cover page, the road of books, mark information, video etc. have all comprised bulk information, these information are convenient to the mankind and computer understanding and storage more.So at computer vision field, scene image text identification has attracted increasing concern.Due to scene image background complexity, size, the font of scene word, vary in color, and be subject to the impact of illumination variation and image degradation, this makes the identification of scene text have larger challenge.
Traditional OCR (optical character identification) technology can well be identified the fairly simple scan text document of background, but while being used for identifying on scene text, discrimination is very low.Scene text is different from the text document of scanning, in scene text, due to background more complicated, text filed just can identify afterwards must first be detected when text in identification.And in text document, carry out simple binary conversion treatment just can obtain text filedly clearly, adopt OCR to identify and just can obtain reasonable effect.So the identification of scene text not only will be identified text, has also comprised the detection to text.
The current thought that the identification of scene text is mainly adopted to the target detection in computer vision is carried out text detection and identification simultaneously.Its basic thought is, each class character is used as to a sensation target, and then detects character zone from scene text image, also provided identification classification and the identification mark in candidate characters region simultaneously.On the basis of character detection and Identification, then candidate characters region and corresponding character class are coupled together, obtain the recognition result of scene text.This method of simultaneously carrying out detection and Identification puts forward at international top-level meeting ICCV2011, has shown the recognition performance that is better than traditional OCR.The research of this respect has also been carried out in a lot of research that has for several years afterwards, has improved the performance of scene text identification.But, in the scene text recognition methods detecting at these based targets, (what use due to character detection and Identification is same sorter to character classification device, the unified character classification device of using below) what adopt is gradient orientation histogram feature (being HOG, Histogram of Gradients) conventional in target detection.HOG feature can be expressed local appearance features and the shape facility of target preferably, and to illumination-insensitive, so HOG feature is widely used in the Computer Vision Task such as face detection, pedestrian detection.In the scene text recognizer of current proposition, HOG feature extracting method is also used as the feature extraction algorithm of character classification device.
Although HOG feature can represent local feature (such as edge etc.), HOG feature can not effective expression structural information.Especially to character recognition, the structural information of character is very important information, can effectively distinguish the textural difference between character, thereby improves character identification rate.Scene text recognition methods based on sparse coding feature, does not also have the report of Patents or document.
Summary of the invention
The object of the invention is to for the feature extraction of character classification device in current scene text identification can not effective expression charcter topology information etc. problem, a kind of scene text recognition methods based on sparse coding feature is provided.
The present invention includes following steps:
Step S1: input natural scene text image to be identified;
Step S2: the method that adopts multi-scale sliding window mouth, window area in image is carried out to detection and Identification with character classification device, to each character class, be candidate characters region by the larger regional determination of sorter output, export less region and think background area, find out like this candidate characters region comprising in image, adopt again non-maximum value inhibition method, the larger region of Duplication is only retained to region and the corresponding character class of sorter output valve maximum, the candidate characters region of removing repeated and redundant, obtains character testing result;
In step S2, the feature extraction of described character classification device can adopt the feature based on sparse coding, sorter training adopts training comparatively simply and recognition speed Random Fern sorter or svm classifier device faster, and the characteristic extraction procedure of described sparse coding comprises the steps:
Step S201: by a large amount of natural scene picture data, obtain a sparse coding dictionary with general applicability with K-SVD Algorithm Learning;
In step S201, described K-SVD algorithm is when study dictionary (representing with D), each element of dictionary D is designed to 9 × 9 picture, represent the total architectural feature that study obtains, dictionary D comprises 100 elements (size that is dictionary is 100) altogether, this makes dictionary have higher expression ability, makes calculated amount be controlled at acceptable scope simultaneously.
Step S202: the sparse coding dictionary that study is obtained is preserved, wherein, what in dictionary, each element was described is some important structural informations;
Step S203: utilize the dictionary of preserving in step S202, extract the sparse coding feature of image;
In step S203, the concrete grammar of the sparse coding feature of described extraction image can be: to each pixel of image, decode and obtain the sparse coding of pixel by Orthogonal Matching Pursuit (OMP) algorithm, the sparse coding obtaining being added up to the histogram that obtains sparse coding (is Histogram of Sparse Codes again, HSC), thereby obtain the sparse coding feature of image, i.e. HSC feature;
The described histogram that obtains sparse coding that the sparse coding obtaining is added up, thereby the method that obtains the sparse coding feature of image can be: when sparse coding is added up to the histogram that obtains sparse coding, having adopted and being similar to histogram of gradients feature (is HOG feature, Histogram of Oriented Gradients) method, concrete steps comprise:
First, the picture of input is divided into 8 × 8 junior unit piece, adds up the sparse coding of each junior unit piece;
Then, use bilinear interpolation to utilize the adjacent block of each junior unit piece to calculate the sparse coding feature of each junior unit piece, also the feature on each junior unit piece is to ask interpolation to obtain on the neighborhood of 16 × 16;
Finally, the proper vector of all junior unit pieces is linked up to the sparse coding feature that obtains whole image, i.e. HSC feature.
Step S3: the character detecting is merged into a word or line of text;
In step S3, described the character detecting is merged into a word or line of text, owing to each character class having been retained to a large amount of candidate characters regions, when being merged into word, character has a large amount of array modes, therefore can adopt dynamic programming algorithm search to obtain identifying the character combination mode of mark maximum, obtain final text identification result;
Described employing dynamic programming algorithm search obtains identifying the character combination mode of mark maximum, needs an objective function to evaluate the score of each combination; The design of described objective function can adopt following methods:
With w=(c 1, c 2..., c n) expression candidate word, wherein a c i(i=1,2 ..., character class n) comprising in expression candidate word, n is character number (being text size), x irepresent c icandidate characters region, objective function is designed to:
O = Σ i = 1 n S ( c i , x i ) + α Σ i = 1 n - 1 g ( x i , x i + 1 ) + βn ,
Wherein S (c i, x i) be that character classification device is by candidate characters x ibe identified as c iscore, g (x i, x i+1) be the output of geometric model, candidate characters x has been described iand x i+1compatibility in geometric relationship, α and β are two and regulate parameter.
In described objective function, geometric model g (x i, x i+1) what describe is whether two geometric properties between adjacent character are intercharacter features, two class classification problems, geometric properties is carried out to modeling with a svm classifier device, the geometric properties extracting when modeling comprises Duplication, the distance of up-and-down boundary etc. of yardstick similarity, adjacent character.
In described objective function, consider the impact of text size, therefore (additive method is not considered the number of character can to overcome the impact of character length on recognition result, character number is larger, objective function can be larger, cause the recognition result of identifying additive method to tend to the more text of number of characters), thus text identification rate improved.
In described objective function, regulate parameter alpha and β to adopt minimum classification error rate training method (Minimum Classification Error Training) to obtain at scene text database learning.
Step S4: output scene text identification result.
The present invention proposes a kind of scene text recognition methods based on sparse coding feature, character classification device of the present invention has adopted the feature extracting method based on sparse coding, can more effectively represent and extract the architectural feature of character, thereby improve the discrimination of scene text.
The sparse coding feature adopting in the present invention, i.e. Histogram of Sparse Codes (HSC) feature, can automatic learning and represent the structural information of character, thereby can describe better the feature of character, improves text identification rate.Meanwhile, text recognition method of the present invention is the also integrated output of character classification device, the output of geometric model, and considered the impact of text size (character number comprising in text) on recognition result.Parameter in text identification obtains by minimum classification error rate training method automatic learning, the higher performance of gain of parameter that this sets than experience.The present invention can be widely used in the occasions such as scene text identification.
Scene text recognition methods based on sparse coding feature provided by the invention, compared with additive method, the advantage and the beneficial effect that have are as follows:
1, character classification device of the present invention has adopted the feature extraction algorithm (being HSC) based on sparse coding, this feature extraction algorithm can represent abundant structural information better, improve the discriminating power of feature, thus detection and Identification character better.
2, the feature extraction algorithm based on sparse coding of the present invention is in the time extracting feature, and feature is directly learnt to obtain by sparse decode procedure, does not therefore need manual setting.
3, method of the present invention is in the time that search obtains optimum character combination mode, how much compatible (being geometric model) between candidate characters are also considered, this has effectively utilized the effective informations such as the geometric properties between character, has therefore improved text identification rate.
4, in method of the present invention, objective function has been considered the impact of text size, therefore can overcome the impact of text size on recognition result, thereby has improved scene text discrimination.
5, in method of the present invention, the parameter in objective function is obtained by MCE automatic learning, therefore can obtain more superior recognition effect.
6, method of the present invention, applicable to the scene text recognition methods of Chinese or the language such as English, in the time of training character sorter, adopts corresponding character database to train.
Brief description of the drawings
Fig. 1 is method flow block diagram of the present invention.
Fig. 2 is the sparse coding dictionary obtaining with K-SVD Algorithm Learning.Wherein, be (a) 5 × 5, be (b) 7 × 7, be (c) 9 × 9.
Fig. 3 is the comparative examples of HSC and HOG feature extraction result.Wherein, (1) is original character image, and (2) are HSC mark sheet diagram, and (3) are HOG mark sheet diagram.
Fig. 4 implements identifying and the result example that the present invention obtains.
Embodiment
For technical method of the present invention and advantage are further explained, below in conjunction with the drawings and specific embodiments, the present invention is described further.
As shown in the method flow diagram in Fig. 1, the present invention includes following steps:
Step S1: input natural scene text image to be identified;
Step S2: the method that adopts multi-scale sliding window mouth, window area in image is carried out to detection and Identification with character classification device, to each character class, be candidate characters region by the larger regional determination of sorter output, export less region and think background area, find out like this candidate characters region comprising in image.Adopt again non-maximum value inhibition method, the larger region of Duplication is only retained to region and the corresponding character class of sorter output valve maximum, remove like this candidate characters region of a large amount of repeated and redundant, obtain character testing result;
In this step, use a character classification device that precondition is good.Character classification device of the present invention adopts the feature extracting method based on sparse coding, and sorter adopts conventional Random Fern or svm classifier device, and other machine learning algorithm is such as Boosting, neural network etc., all can be used for learning character sorter.The database adopting when training is individual character database, can select as required English data number (for english identification) or Chinese database (Chinese sentence identification) to train.
Wherein, above, the leaching process of the described feature extraction algorithm based on sparse coding is as follows:
Step S201: by a large amount of natural scene picture data, obtain a sparse coding dictionary with general applicability with K-SVD Algorithm Learning.
Wherein, described K-SVD algorithm is when study dictionary (representing with D), each element of dictionary D is designed to 9 × 9 picture, represent the total architectural feature that study obtains, dictionary D comprises 100 elements (size that is dictionary is 100) altogether, this makes dictionary have higher expression ability, makes calculated amount be controlled at acceptable scope simultaneously.As shown in Figure 2, be the dictionary example obtaining with K-SVD Algorithm Learning, wherein dictionary comprises 100 elements, each element can make the image of 5 × 5,7 × 7 or 9 × 9 sizes, the pixel of element is more, and the structural information that can express is abundanter, but corresponding calculated amount is also larger.In the specific embodiment of the present invention, selecting image size is 9 × 9.
Step S202: the sparse coding dictionary that study is obtained saves, wherein, what in dictionary, each element was described is some important structural informations.
Step S203: utilize the dictionary obtaining in step S202, extract the sparse coding feature of image, while extracting feature, to each pixel of image, decode and obtain the sparse coding of pixel by Orthogonal Matching Pursuit (OMP) algorithm, then the sparse coding obtaining is added up to the histogram (being Histogram of Sparse Codes, HSC) that obtains sparse coding, thereby obtain the sparse coding feature of image, i.e. HSC feature.
Wherein, the process that sparse coding statistics is obtained to HSC feature is as follows: when sparse coding is added up to the histogram that obtains sparse coding, having adopted and being similar to histogram of gradients feature (is HOG feature, Histogram of Oriented Gradients) method, concrete steps comprise:
First, the picture of input is divided into 8 × 8 junior unit piece, adds up the sparse coding of each junior unit piece;
Then, use bilinear interpolation to utilize the adjacent block of each junior unit piece to calculate the sparse coding feature of each junior unit piece, also the feature on each junior unit piece is to ask interpolation to obtain on the neighborhood of 16 × 16;
Finally, the proper vector of all junior unit pieces is linked up to the sparse coding feature that obtains whole image, i.e. HSC feature.
As shown in Figure 3, it is the visualization result contrast of the feature to several character samples and the extraction of non-character sample with HSC and HOG feature extraction algorithm, the abundanter structural information of can having found out HSC character representation, such as information such as texture, edge, angle points, the information of HOG character representation is taking edge as main, and the structural information that does not have HSC to provide is abundant.
Step S3, the character detecting is merged into a word (or being line of text), owing to each character class having been retained to a large amount of candidate characters regions, when being merged into word, character has a large amount of array modes, therefore this step adopts dynamic programming algorithm, the character combination mode that obtains identifying mark maximum according to described objective function search below, obtains final text identification result.Objective function is:
O = Σ i = 1 n S ( c i , x i ) + α Σ i = 1 n - 1 g ( x i , x i + 1 ) + βn ,
Wherein, S (c i, x i) be that character classification device is by candidate characters x ibe identified as c iscore, g (x i, x i+1) be candidate characters x iand x i+1compatibility in geometric relationship, α and β are two and regulate parameter.Regulate parameter alpha and β to be obtained by minimum classification error rate training method (being MCE training method) study, the database using is text database.The optimum character combination mode that the search of dynamic programming searching algorithm obtains, is last text identification result.
Step S4: output scene text identification result.
As shown in Figure 4, be to implement identifying and the result example that the present invention obtains.
Essential characteristic of the present invention is that character classification device has used the feature extracting method based on sparse coding, can describe better the structural information of character, effectively improves the accuracy that character identification rate and character detect, thereby improves scene text discrimination.The present invention has overcome HOG feature and can not better describe the problem of character feature, in conjunction with the feature of scene text detection and Identification, designs the scene text recognition methods based on sparse coding feature.

Claims (10)

1. the scene text recognition methods based on sparse coding feature, is characterized in that comprising the steps:
Step S1: input natural scene text image to be identified;
Step S2: the method that adopts multi-scale sliding window mouth, window area in image is carried out to detection and Identification with character classification device, to each character class, be candidate characters region by the larger regional determination of sorter output, export less region and think background area, find out like this candidate characters region comprising in image, adopt again non-maximum value inhibition method, the larger region of Duplication is only retained to region and the corresponding character class of sorter output valve maximum, the candidate characters region of removing repeated and redundant, obtains character testing result;
Step S3: the character detecting is merged into a word or line of text;
Step S4: output scene text identification result.
2. a kind of scene text recognition methods based on sparse coding feature as claimed in claim 1, it is characterized in that in step S2, the feature extraction of described character classification device is the feature adopting based on sparse coding, and sorter training adopts training comparatively simply and recognition speed Random Fern sorter or svm classifier device faster.
3. a kind of scene text recognition methods based on sparse coding feature as claimed in claim 2, is characterized in that the characteristic extraction procedure of described sparse coding comprises the steps:
Step S201: by a large amount of natural scene picture data, obtain a sparse coding dictionary with general applicability with K-SVD Algorithm Learning;
Step S202: the sparse coding dictionary that study is obtained is preserved, wherein, what in dictionary, each element was described is some important structural informations;
Step S203: utilize the dictionary of preserving in step S202, extract the sparse coding feature of image.
4. a kind of scene text recognition methods based on sparse coding feature as claimed in claim 3, it is characterized in that in step S201, described K-SVD algorithm is in the time of study dictionary, each element of dictionary is designed to 9 × 9 picture, represented the total architectural feature that study obtains, dictionary comprises 100 elements altogether, and the size of dictionary is 100, this makes dictionary have higher expression ability, makes calculated amount be controlled at acceptable scope simultaneously.
5. a kind of scene text recognition methods based on sparse coding feature as claimed in claim 3, it is characterized in that in step S203, the concrete grammar of the sparse coding feature of described extraction image is: to each pixel of image, decode and obtain the sparse coding of pixel by Orthogonal Matching Pursuit algorithm, again the sparse coding obtaining is added up to the histogram that obtains sparse coding, thereby obtain the sparse coding feature of image, i.e. HSC feature.
6. a kind of scene text recognition methods based on sparse coding feature as claimed in claim 5, it is characterized in that the described histogram that obtains sparse coding that the sparse coding obtaining is added up, thereby the method that obtains the sparse coding feature of image is: when sparse coding is added up to the histogram that obtains sparse coding, adopted the method that is similar to histogram of gradients feature, concrete steps comprise:
First, the picture of input is divided into 8 × 8 junior unit piece, adds up the sparse coding of each junior unit piece;
Then, use bilinear interpolation to utilize the adjacent block of each junior unit piece to calculate the sparse coding feature of each junior unit piece, also the feature on each junior unit piece is to ask interpolation to obtain on the neighborhood of 16 × 16;
Finally, the proper vector of all junior unit pieces is linked up to the sparse coding feature that obtains whole image, i.e. HSC feature.
7. a kind of scene text recognition methods based on sparse coding feature as claimed in claim 1, it is characterized in that in step S3, described the character detecting is merged into a word or line of text, owing to each character class having been retained to a large amount of candidate characters regions, when being merged into word, character has a large amount of array modes, therefore adopt dynamic programming algorithm search to obtain identifying the character combination mode of mark maximum, obtain final text identification result.
8. a kind of scene text recognition methods based on sparse coding feature as claimed in claim 7, it is characterized in that described employing dynamic programming algorithm search obtains identifying the character combination mode of mark maximum, needs an objective function to evaluate the score of each combination; The design of described objective function adopts following methods:
With w=(c 1, c 2..., c n) expression candidate word, wherein a c i(i=1,2 ..., character class n) comprising in expression candidate word, n is character number (being text size), x irepresent c icandidate characters region, objective function is designed to:
O = Σ i = 1 n S ( c i , x i ) + α Σ i = 1 n - 1 g ( x i , x i + 1 ) + βn ,
Wherein S (c i, x i) be that character classification device is by candidate characters x ibe identified as c iscore, g (x i, x i+1) be the output of geometric model, candidate characters x has been described iand x i+1compatibility in geometric relationship, α and β are two and regulate parameter.
9. a kind of scene text recognition methods based on sparse coding feature as claimed in claim 8, is characterized in that in described objective function geometric model g (x i, x i+1) what describe is whether two geometric properties between adjacent character are intercharacter features, two class classification problems, geometric properties is carried out to modeling with a svm classifier device, the geometric properties extracting when modeling comprises the Duplication of yardstick similarity, adjacent character, the distance of up-and-down boundary.
10. a kind of scene text recognition methods based on sparse coding feature as claimed in claim 8, it is characterized in that in described objective function, consider the impact of text size, therefore can overcome the impact of character length on recognition result, thereby improved text identification rate; Regulate parameter alpha and β can adopt minimum classification error rate training method to obtain at scene text database learning.
CN201410184072.6A 2014-05-04 2014-05-04 A kind of scene text recognition methods based on sparse coding feature Active CN103942550B (en)

Priority Applications (1)

Application Number Priority Date Filing Date Title
CN201410184072.6A CN103942550B (en) 2014-05-04 2014-05-04 A kind of scene text recognition methods based on sparse coding feature

Applications Claiming Priority (1)

Application Number Priority Date Filing Date Title
CN201410184072.6A CN103942550B (en) 2014-05-04 2014-05-04 A kind of scene text recognition methods based on sparse coding feature

Publications (2)

Publication Number Publication Date
CN103942550A true CN103942550A (en) 2014-07-23
CN103942550B CN103942550B (en) 2018-11-02

Family

ID=51190213

Family Applications (1)

Application Number Title Priority Date Filing Date
CN201410184072.6A Active CN103942550B (en) 2014-05-04 2014-05-04 A kind of scene text recognition methods based on sparse coding feature

Country Status (1)

Country Link
CN (1) CN103942550B (en)

Cited By (24)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN104239872A (en) * 2014-09-26 2014-12-24 南开大学 Abnormal Chinese character identification method
CN104268536A (en) * 2014-10-11 2015-01-07 烽火通信科技股份有限公司 Face detection method through images
CN104537362A (en) * 2015-01-16 2015-04-22 中国科学院自动化研究所 Domain-based self-adaptive English scene character recognition method
CN104598881A (en) * 2015-01-12 2015-05-06 中国科学院信息工程研究所 Feature compression and feature selection based skew scene character recognition method
CN105404868A (en) * 2015-11-19 2016-03-16 电子科技大学 Interaction platform based method for rapidly detecting text in complex background
CN105608456A (en) * 2015-12-22 2016-05-25 华中科技大学 Multi-directional text detection method based on full convolution network
CN105631469A (en) * 2015-12-18 2016-06-01 华南理工大学 Bird image recognition method by multilayer sparse coding features
CN105740909A (en) * 2016-02-02 2016-07-06 华中科技大学 Text recognition method under natural scene on the basis of spatial transformation
CN106446909A (en) * 2016-09-06 2017-02-22 广东顺德中山大学卡内基梅隆大学国际联合研究院 Chinese food image feature extraction method
CN107886065A (en) * 2017-11-06 2018-04-06 哈尔滨工程大学 A kind of Serial No. recognition methods of mixing script
CN108304835A (en) * 2018-01-30 2018-07-20 百度在线网络技术(北京)有限公司 character detecting method and device
US20190156156A1 (en) * 2017-11-17 2019-05-23 Hong Kong Applied Science and Technology Research Institute Company Limited Flexible integrating recognition and semantic processing
CN109993040A (en) * 2018-01-03 2019-07-09 北京世纪好未来教育科技有限公司 Text recognition method and device
CN110135248A (en) * 2019-04-03 2019-08-16 华南理工大学 A kind of natural scene Method for text detection based on deep learning
CN110345954A (en) * 2018-04-03 2019-10-18 奥迪股份公司 Navigation system and method
CN110688949A (en) * 2019-09-26 2020-01-14 北大方正集团有限公司 Font identification method and apparatus
CN110796129A (en) * 2018-08-03 2020-02-14 珠海格力电器股份有限公司 Text line region detection method and device
CN110796143A (en) * 2019-10-31 2020-02-14 天津大学 Scene text recognition method based on man-machine cooperation
CN111738326A (en) * 2020-06-16 2020-10-02 中国工商银行股份有限公司 Sentence granularity marking training sample generation method and device
CN111898411A (en) * 2020-06-16 2020-11-06 华南理工大学 Text image labeling system, method, computer device and storage medium
CN113642584A (en) * 2021-08-13 2021-11-12 北京百度网讯科技有限公司 Character recognition method, device, equipment, storage medium and intelligent dictionary pen
CN114708580A (en) * 2022-04-08 2022-07-05 北京百度网讯科技有限公司 Text recognition method, model training method, device, apparatus, storage medium, and program
CN114758340A (en) * 2021-12-21 2022-07-15 上海沐冉信息科技有限公司 Intelligent identification method, device and equipment for logistics address and storage medium
US11645733B2 (en) 2020-06-16 2023-05-09 Bank Of America Corporation System and method for providing artificial intelligence architectures to people with disabilities

Citations (3)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN102799879A (en) * 2012-07-12 2012-11-28 中国科学技术大学 Method for identifying multi-language multi-font characters from natural scene image
CN103336961A (en) * 2013-07-22 2013-10-02 中国科学院自动化研究所 Interactive natural scene text detection method
US8934716B2 (en) * 2012-01-11 2015-01-13 Electronics And Telecommunications Research Institute Method and apparatus for sequencing off-line character from natural scene

Patent Citations (3)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
US8934716B2 (en) * 2012-01-11 2015-01-13 Electronics And Telecommunications Research Institute Method and apparatus for sequencing off-line character from natural scene
CN102799879A (en) * 2012-07-12 2012-11-28 中国科学技术大学 Method for identifying multi-language multi-font characters from natural scene image
CN103336961A (en) * 2013-07-22 2013-10-02 中国科学院自动化研究所 Interactive natural scene text detection method

Non-Patent Citations (3)

* Cited by examiner, † Cited by third party
Title
KAI WANG等: "End-to-End Scene Text Recognition", 《2011 IEEE INTERNATIONAL CONFERENCE ON COMPUTER VISION》 *
XIAOFENG REN等: "Histograms of Sparse Codes for Object Detection", 《2013 IEEE CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION》 *
张伟伟 等: "基于DPM的自然场景下汉字识别方法", 《计算机应用研究》 *

Cited By (37)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN104239872A (en) * 2014-09-26 2014-12-24 南开大学 Abnormal Chinese character identification method
CN104268536B (en) * 2014-10-11 2017-07-18 南京烽火软件科技有限公司 A kind of image method for detecting human face
CN104268536A (en) * 2014-10-11 2015-01-07 烽火通信科技股份有限公司 Face detection method through images
CN104598881A (en) * 2015-01-12 2015-05-06 中国科学院信息工程研究所 Feature compression and feature selection based skew scene character recognition method
CN104598881B (en) * 2015-01-12 2017-09-29 中国科学院信息工程研究所 Feature based compresses the crooked scene character recognition method with feature selecting
CN104537362A (en) * 2015-01-16 2015-04-22 中国科学院自动化研究所 Domain-based self-adaptive English scene character recognition method
CN104537362B (en) * 2015-01-16 2017-12-01 中国科学院自动化研究所 A kind of English scene character recognition method adaptive based on domain
CN105404868A (en) * 2015-11-19 2016-03-16 电子科技大学 Interaction platform based method for rapidly detecting text in complex background
CN105404868B (en) * 2015-11-19 2019-05-10 电子科技大学 The rapid detection method of text in a kind of complex background based on interaction platform
CN105631469A (en) * 2015-12-18 2016-06-01 华南理工大学 Bird image recognition method by multilayer sparse coding features
CN105608456B (en) * 2015-12-22 2017-07-18 华中科技大学 A kind of multi-direction Method for text detection based on full convolutional network
CN105608456A (en) * 2015-12-22 2016-05-25 华中科技大学 Multi-directional text detection method based on full convolution network
CN105740909A (en) * 2016-02-02 2016-07-06 华中科技大学 Text recognition method under natural scene on the basis of spatial transformation
CN106446909A (en) * 2016-09-06 2017-02-22 广东顺德中山大学卡内基梅隆大学国际联合研究院 Chinese food image feature extraction method
CN107886065A (en) * 2017-11-06 2018-04-06 哈尔滨工程大学 A kind of Serial No. recognition methods of mixing script
CN109983473A (en) * 2017-11-17 2019-07-05 香港应用科技研究院有限公司 Flexible integrated identification and semantic processes
US10810467B2 (en) 2017-11-17 2020-10-20 Hong Kong Applied Science and Technology Research Institute Company Limited Flexible integrating recognition and semantic processing
WO2019096270A1 (en) * 2017-11-17 2019-05-23 Hong Kong Applied Science and Technology Research Institute Company Limited Flexible integrating recognition and semantic processing
CN109983473B (en) * 2017-11-17 2023-01-31 香港应用科技研究院有限公司 Flexible integrated recognition and semantic processing
US20190156156A1 (en) * 2017-11-17 2019-05-23 Hong Kong Applied Science and Technology Research Institute Company Limited Flexible integrating recognition and semantic processing
CN109993040A (en) * 2018-01-03 2019-07-09 北京世纪好未来教育科技有限公司 Text recognition method and device
CN109993040B (en) * 2018-01-03 2021-07-30 北京世纪好未来教育科技有限公司 Text recognition method and device
CN108304835A (en) * 2018-01-30 2018-07-20 百度在线网络技术(北京)有限公司 character detecting method and device
CN110345954A (en) * 2018-04-03 2019-10-18 奥迪股份公司 Navigation system and method
CN110796129A (en) * 2018-08-03 2020-02-14 珠海格力电器股份有限公司 Text line region detection method and device
CN110135248A (en) * 2019-04-03 2019-08-16 华南理工大学 A kind of natural scene Method for text detection based on deep learning
CN110688949B (en) * 2019-09-26 2022-11-01 北大方正集团有限公司 Font identification method and apparatus
CN110688949A (en) * 2019-09-26 2020-01-14 北大方正集团有限公司 Font identification method and apparatus
CN110796143A (en) * 2019-10-31 2020-02-14 天津大学 Scene text recognition method based on man-machine cooperation
CN111898411A (en) * 2020-06-16 2020-11-06 华南理工大学 Text image labeling system, method, computer device and storage medium
CN111738326A (en) * 2020-06-16 2020-10-02 中国工商银行股份有限公司 Sentence granularity marking training sample generation method and device
US11645733B2 (en) 2020-06-16 2023-05-09 Bank Of America Corporation System and method for providing artificial intelligence architectures to people with disabilities
CN113642584A (en) * 2021-08-13 2021-11-12 北京百度网讯科技有限公司 Character recognition method, device, equipment, storage medium and intelligent dictionary pen
CN113642584B (en) * 2021-08-13 2023-11-28 北京百度网讯科技有限公司 Character recognition method, device, equipment, storage medium and intelligent dictionary pen
CN114758340A (en) * 2021-12-21 2022-07-15 上海沐冉信息科技有限公司 Intelligent identification method, device and equipment for logistics address and storage medium
CN114708580A (en) * 2022-04-08 2022-07-05 北京百度网讯科技有限公司 Text recognition method, model training method, device, apparatus, storage medium, and program
CN114708580B (en) * 2022-04-08 2024-04-16 北京百度网讯科技有限公司 Text recognition method, text recognition model training method, text recognition device, model training device, text recognition program, model training program, and computer-readable storage medium

Also Published As

Publication number Publication date
CN103942550B (en) 2018-11-02

Similar Documents

Publication Publication Date Title
CN103942550A (en) Scene text recognition method based on sparse coding characteristics
Yi et al. Feature representations for scene text character recognition: A comparative study
Shahab et al. ICDAR 2011 robust reading competition challenge 2: Reading text in scene images
Coates et al. Text detection and character recognition in scene images with unsupervised feature learning
John et al. Offline handwritten Malayalam Character Recognition based on chain code histogram
CN103984943A (en) Scene text identification method based on Bayesian probability frame
CN106127222B (en) A kind of the similarity of character string calculation method and similitude judgment method of view-based access control model
Zhou et al. Detecting multilingual text in natural scene
Zhang et al. Automatic discrimination of text and non-text natural images
Huang et al. Text detection and recognition in natural scene images
Afakh et al. Aksara jawa text detection in scene images using convolutional neural network
CN102136074B (en) Man-machine interface (MMI) based wood image texture analyzing and identifying method
Asad et al. High performance OCR for camera-captured blurred documents with LSTM networks
Ayesh et al. A robust line segmentation algorithm for Arabic printed text with diacritics
Parameshwaran et al. Transfer learning for classifying single hand gestures on comprehensive Bharatanatyam Mudra dataset
Ren et al. A CNN based scene Chinese text recognition algorithm with synthetic data engine
Yokobayashi et al. Segmentation and recognition of characters in scene images using selective binarization in color space and gat correlation
Antony et al. Haar features based handwritten character recognition system for Tulu script
Wshah et al. A novel lexicon reduction method for Arabic handwriting recognition
Yildirim et al. Text recognition in natural images using multiclass hough forests
Mukhiddinov Scene text detection and localization using fully convolutional network
Daood et al. Handwriting detection and recognition of Arabic numbers and characters using deep learning methods
Khan et al. A holistic approach to Urdu language word recognition using deep neural networks
Kumar et al. Scene text recognition using artificial neural network: a survey
Ghosh et al. Efficient indexing for query by string text retrieval

Legal Events

Date Code Title Description
C06 Publication
PB01 Publication
C10 Entry into substantive examination
SE01 Entry into force of request for substantive examination
GR01 Patent grant
GR01 Patent grant