CN108681735A - Optical character recognition method based on convolutional neural networks deep learning model - Google Patents

Optical character recognition method based on convolutional neural networks deep learning model Download PDF

Info

Publication number
CN108681735A
CN108681735A CN201810270374.3A CN201810270374A CN108681735A CN 108681735 A CN108681735 A CN 108681735A CN 201810270374 A CN201810270374 A CN 201810270374A CN 108681735 A CN108681735 A CN 108681735A
Authority
CN
China
Prior art keywords
model
deep learning
optical character
neural networks
convolutional neural
Prior art date
Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
Pending
Application number
CN201810270374.3A
Other languages
Chinese (zh)
Inventor
陆成学
Current Assignee (The listed assignees may be inaccurate. Google has not performed a legal analysis and makes no representation or warranty as to the accuracy of the list.)
China Science And Technology Co Ltd (beijing) Technology Co Ltd
Original Assignee
China Science And Technology Co Ltd (beijing) Technology Co Ltd
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
Filing date
Publication date
Application filed by China Science And Technology Co Ltd (beijing) Technology Co Ltd filed Critical China Science And Technology Co Ltd (beijing) Technology Co Ltd
Priority to CN201810270374.3A priority Critical patent/CN108681735A/en
Publication of CN108681735A publication Critical patent/CN108681735A/en
Pending legal-status Critical Current

Links

Classifications

    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06VIMAGE OR VIDEO RECOGNITION OR UNDERSTANDING
    • G06V30/00Character recognition; Recognising digital ink; Document-oriented image-based pattern recognition
    • G06V30/10Character recognition
    • G06V30/14Image acquisition
    • G06V30/148Segmentation of character regions
    • G06V30/153Segmentation of character regions using recognition of characters or words
    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06FELECTRIC DIGITAL DATA PROCESSING
    • G06F18/00Pattern recognition
    • G06F18/20Analysing
    • G06F18/21Design or setup of recognition systems or techniques; Extraction of features in feature space; Blind source separation
    • G06F18/214Generating training patterns; Bootstrap methods, e.g. bagging or boosting
    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06FELECTRIC DIGITAL DATA PROCESSING
    • G06F18/00Pattern recognition
    • G06F18/20Analysing
    • G06F18/285Selection of pattern recognition techniques, e.g. of classifiers in a multi-classifier system
    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06NCOMPUTING ARRANGEMENTS BASED ON SPECIFIC COMPUTATIONAL MODELS
    • G06N3/00Computing arrangements based on biological models
    • G06N3/02Neural networks
    • G06N3/08Learning methods

Abstract

The present invention discloses a kind of optical character recognition method based on convolutional neural networks deep learning model.This approach includes the following steps:The Chinese characters in common use and 10 Arabic numerals of collection different fonts and 26 English alphabet data sets are simultaneously converted into picture format;Slight distortion and rotation are carried out to enhance the robustness of model to picture, generate model training database;Establish the deep learning model of optical character identification;Training set image input model is continued to optimize into object function using convolutional neural networks model by the method for supervised learning, learns a multi-categorizer;For new test sample, feature extraction is carried out to it based on model obtained in the previous step and application model grader obtains final classification result.The present invention proposes new model and method to application of the deep learning based on convolutional neural networks in optical character identification, this method can be applied in general pattern classification task, especially text identification problem, the optical character identification model proposed by the present invention based on deep learning can significantly improve the recognition correct rate of character recognition.

Description

Optical character recognition method based on convolutional neural networks deep learning model
Technical field
The present invention relates to computer vision, pattern-recognition, the technical fields such as natural scene feature recognition, especially a kind of bases In the optical character recognition method of convolutional neural networks deep learning model.
Background technology
Optical character identification is because it in real-life practicability has obtained the extensive concern of domestic and foreign scholars, base at present It is concentrated mainly on scanned document character recognition in the application of optical character identification.Optical character identification streetscape identifier identify, The foreground of being widely applied also is used in bank's ID card information identification, classroom blackboard-writing identification etc..Optical character identification has height The advantages of effect property and convenience.There is a large amount of research effort just constantly promoting the development of field of optical character recognition at present.
A usual character recognition system is acquired by character, Character segmentation, feature extraction, several step structures such as characteristic matching At.Wherein feature extraction has most important influence for the accuracy of character recognition.When the feature using most identification When matching is compared to character, better discrimination can be usually obtained, it is on the contrary then will be greatly reduced character recognition system Accuracy.And the research of character recognition is also concentrated mainly in the method for character feature extraction, it is based on convolutional neural networks Deep learning method in detection feature automatically and extract characteristic aspect and have big advantage.
In recent years, the deep learning model based on convolutional neural networks is prominent in numerous computer vision problems because of it Go out performance and obtains great concern.Its basic thought is to carry an original image automatically behind multilayer convolution sum pond Take wherein most representational feature.Deep learning is all obtained in character recognition, image classification, natural language processing etc. in fields Obtained howling success.And with the development of technology, how to learn to suitable for particular problem (such as be used for image classification, character Identification) model become scholars' focus of attention.
Using the method for deep learning, a weight matrix with identification can be obtained by study and is biased towards Amount.Weight vector and biasing constitute a grader, and classification knot will be can be obtained after character input grader to be tested Fruit.Research under this theoretical frame is mainly concentrated in so that the model learnt has differentiation performance more outstanding.
However, in character recognition problem under practical application scene, it is not usually to mark that we, which can be obtained character picture, Accurate character picture.Due to intensity of illumination, the factors such as placement position, character picture usually has a degree of rotation or torsion It is bent.If the character picture of standard is directly used in above-mentioned model, is had in the model acquired and greatly represent judgement index Weaker requires character picture very stringent information, then the recognition correct rate of model can substantially reduce.And if it is intended to obtaining Obtain good recognition effect, it usually needs the additional capacity for increasing character training set is to expand its coverage area.
For deep learning model have the characteristics that good ability in feature extraction this, it is proposed that existing optics word Symbol identification model is improved, and learns a grader under deep learning frame to complete the identification to character.In this way in reality It, can be in a unification from the identification of the input character picture of non-standard (including but not limited to) to the end under the application environment of border Frame in be resolved.
Invention content
(1) technical problems to be solved
The problem of for input picture in character recognition problem under actual environment may be non-standard image, the present invention propose Character feature extraction and character recognition are placed on by a kind of optical character recognition method based on neural network deep learning model It is resolved under one unified frame so that it is correct that the interaction of above-mentioned two step improves final character recognition jointly Rate.
(2) technical solution
A kind of technical solution of optical character recognition method based on neural network deep learning model proposed by the present invention It is as follows:
Step S1 collects the Chinese character of common different fonts, 10 Arabic numerals and 26 English alphabets and generates figure The data set of piece format.
Step S2, training set and test set sample to acquisition suitably carry out slight rotation and distortion.
Step S3, each layer weight matrix parameter W and biasing b of the grader of Optimization Learning training set, passes through stochastic gradient The optimal way of descent method (SGD) minimizes object function, study optimum classifier parameter W and b.
Step S4 carries out a propagated forward, calculates the probability value of its affiliated each classification, obtains the classification of test character As a result.
Beneficial effects of the present invention:The present invention is directed to the character recognition problem under actual application environment, can directly input Non-standard character image carries out character recognition.It is placed on a unified model frame by expressing character feature, with character recognition It is solved under frame, it is hereby achieved that higher discrimination, enhances the robustness of algorithm.
Description of the drawings
Fig. 1 is the system flow chart of the optical character recognition method based on neural network deep learning model.
Specific implementation mode
To make the objectives, technical solutions, and advantages of the present invention clearer, below in conjunction with specific example, and with reference to detailed Thin attached drawing, the present invention is described in more detail.But described embodiment is intended merely to facilitate the understanding of the present invention, and right It does not play any restriction effect.
Fig. 1 is flow chart of the method for the present invention, as shown in Figure 1, proposed by the present invention a kind of based on neural network depth The optical character recognition method for practising model includes following steps:
Step S1 collects the Chinese character of common different fonts, 26 English alphabets of 10 Arabic numerals and English alphabet And generate the data set of picture format.
Step S2, training set and test set sample to acquisition suitably carry out slight rotation and distortion.
Step S3, each layer weight matrix parameter W of the grader of Optimization Learning training set, passes through stochastic gradient descent method (SGD) optimal way minimizes object function, study optimum classifier parameter W and b.
S31 initializes weights square for multiple convolution kernels of each convolutional layer in training set by Gaussian Profile Battle array.Next, entering alternately error propagated forward and gradient back-propagation process, each of which volume is provided simultaneously by SGD algorithms The weights of product core.S32 and S33 is recycled until restraining or reaching iterations requirement.
This is the object function of a typical classification problem, and the optimization for completing this object function can be in the hope of one group Sorting parameter W and b.
S32, the value of propagated forward counting loss function:
S33, the Grad of backpropagation counting loss function pair parameters.
Wherein, f is hidden layer.
Step S4 carries out a propagated forward, calculates the probability value of its affiliated each classification.
Wherein, s=g (xi;W, b).
Case study on implementation:
For the specific implementation mode and verification effectiveness of the invention that the present invention will be described in detail, we propose the present invention Method be applied to the database that forms of picture generated by Chinese characters in common use, 10 Arabic numerals and 26 letters.The data Library is included in the image that rotation in various degree and distortion obtain.In our embodiment, we extract every in image first A character.Using the single character after extraction as the input feature vector of training and test.
According to the step S3 in the technical detail introduced before, we first carry out all training set data input models Training, wherein training parameter W are set as Gaussian Profile, mean value 0, standard deviation 0.01.Next according to step S31, S32 and S33 completes the training to model.Grader is inputted to obtain final classification results by step S4 to new test image.
Particular embodiments described above has carried out further in detail the purpose of the present invention, technical solution and advantageous effect It describes in detail bright, it should be understood that the above is only a specific embodiment of the present invention, is not intended to restrict the invention, it is all Within the spirit and principles in the present invention, any modification, equivalent substitution, improvement and etc. done should be included in the guarantor of the present invention Within the scope of shield.

Claims (6)

1. a kind of optical character recognition method based on convolutional neural networks deep learning model, which is characterized in that this method Specific steps include:
Step S1 collects the Chinese character of common different fonts, 10 Arabic numerals and 26 English alphabets and generates picture lattice The data set of formula.
Step S2, training set and test set sample to acquisition suitably carry out slight rotation and distortion processing.
Step S3, each layer weight matrix parameter W and biasing b of the grader of Optimization Learning training set, passes through stochastic gradient descent The optimal way of method (SGD) minimizes object function, study optimum classifier parameter W and b.
2. the optical character recognition method according to claim 1 based on convolutional neural networks deep learning model, special Sign is, in the step S1, collects the Chinese characters in common use and 10 Arabic numerals and 26 that all identity cards are related to English alphabet.
3. the optical character recognition method according to claim 1 based on convolutional neural networks deep learning model, special Sign is, in step s 2, to training set and test set sample differ mild distortion and the rotation of degree, after processing Image as input feature vector.
4. the optical character recognition method according to claim 1 based on convolutional neural networks deep learning model, special Sign is that in step s3, the optimization of deep learning model needs to complete by stochastic gradient descent method iteration optimization strategy, Specific process is summarized as follows:
S31 initializes weight matrix for multiple convolution kernels of each convolutional layer in training set by Gaussian Profile.It connects Get off, into alternately error propagated forward and gradient back-propagation process, each of which convolution kernel is provided by SGD algorithms simultaneously Weights.S32 and S33 is recycled until restraining or reaching iterations requirement.
S32, the value of propagated forward counting loss function:
This is the object function of a typical classification problem, and the optimization for completing this object function can be in the hope of one group of classification Parameter W and b.
S33, the Grad of backpropagation counting loss function pair parameters.
Wherein, f is hidden layer.
5. the optical character recognition method according to claim 1 based on convolutional neural networks deep learning model, special Sign is, in step s3, after model training, for a new test sample ytest, depth is acquired currently It practises in model and predicts its value.Its concrete operation step is as follows:
S4 carries out a propagated forward, calculates the probability value of its affiliated each classification.
Wherein, si=g (xi;W, b), i=1,2 ... m.
6. the optical character recognition method according to claim 5 based on convolutional neural networks deep learning model, special Sign is, in steps of 5, after the probability value for calculating all categories, last classification results is determined according to the size of probability value.
CN201810270374.3A 2018-03-28 2018-03-28 Optical character recognition method based on convolutional neural networks deep learning model Pending CN108681735A (en)

Priority Applications (1)

Application Number Priority Date Filing Date Title
CN201810270374.3A CN108681735A (en) 2018-03-28 2018-03-28 Optical character recognition method based on convolutional neural networks deep learning model

Applications Claiming Priority (1)

Application Number Priority Date Filing Date Title
CN201810270374.3A CN108681735A (en) 2018-03-28 2018-03-28 Optical character recognition method based on convolutional neural networks deep learning model

Publications (1)

Publication Number Publication Date
CN108681735A true CN108681735A (en) 2018-10-19

Family

ID=63800544

Family Applications (1)

Application Number Title Priority Date Filing Date
CN201810270374.3A Pending CN108681735A (en) 2018-03-28 2018-03-28 Optical character recognition method based on convolutional neural networks deep learning model

Country Status (1)

Country Link
CN (1) CN108681735A (en)

Cited By (9)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN109598270A (en) * 2018-12-04 2019-04-09 龙马智芯(珠海横琴)科技有限公司 Distort recognition methods and the device, storage medium and processor of text
CN109858305A (en) * 2019-01-17 2019-06-07 柳州康云互联科技有限公司 A kind of two dimensional code positioning identification system and method based on deep learning
CN110059705A (en) * 2019-04-22 2019-07-26 厦门商集网络科技有限责任公司 A kind of OCR recognition result decision method and equipment based on modeling
CN110956133A (en) * 2019-11-29 2020-04-03 上海眼控科技股份有限公司 Training method of single character text normalization model, text recognition method and device
CN111797908A (en) * 2020-06-18 2020-10-20 浪潮金融信息技术有限公司 Training set generation method of deep learning model for print character recognition
CN112580657A (en) * 2020-12-23 2021-03-30 陕西天诚软件有限公司 Self-learning character recognition method
CN113191251A (en) * 2021-04-28 2021-07-30 北京有竹居网络技术有限公司 Method and device for detecting stroke order, electronic equipment and storage medium
US11295155B2 (en) 2020-04-08 2022-04-05 Konica Minolta Business Solutions U.S.A., Inc. Online training data generation for optical character recognition
CN117173716A (en) * 2023-09-01 2023-12-05 湖南天桥嘉成智能科技有限公司 Deep learning-based high-temperature slab ID character recognition method and system

Citations (4)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
EP0784828A2 (en) * 1994-10-05 1997-07-23 United Parcel Service Of America, Inc. Method of and apparatus for segmenting foreground and background information for optical character recognition of labels employing single layer recurrent neural network
CN104966097A (en) * 2015-06-12 2015-10-07 成都数联铭品科技有限公司 Complex character recognition method based on deep learning
CN105184312A (en) * 2015-08-24 2015-12-23 中国科学院自动化研究所 Character detection method and device based on deep learning
CN107273897A (en) * 2017-07-04 2017-10-20 华中科技大学 A kind of character recognition method based on deep learning

Patent Citations (4)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
EP0784828A2 (en) * 1994-10-05 1997-07-23 United Parcel Service Of America, Inc. Method of and apparatus for segmenting foreground and background information for optical character recognition of labels employing single layer recurrent neural network
CN104966097A (en) * 2015-06-12 2015-10-07 成都数联铭品科技有限公司 Complex character recognition method based on deep learning
CN105184312A (en) * 2015-08-24 2015-12-23 中国科学院自动化研究所 Character detection method and device based on deep learning
CN107273897A (en) * 2017-07-04 2017-10-20 华中科技大学 A kind of character recognition method based on deep learning

Non-Patent Citations (1)

* Cited by examiner, † Cited by third party
Title
张超群: ""基于深度学习的字符识别"", 《中国优秀硕士学位论文全文数据库信息科技辑》 *

Cited By (13)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN109598270B (en) * 2018-12-04 2020-05-05 龙马智芯(珠海横琴)科技有限公司 Method and device for identifying distorted characters, storage medium and processor
CN109598270A (en) * 2018-12-04 2019-04-09 龙马智芯(珠海横琴)科技有限公司 Distort recognition methods and the device, storage medium and processor of text
CN109858305A (en) * 2019-01-17 2019-06-07 柳州康云互联科技有限公司 A kind of two dimensional code positioning identification system and method based on deep learning
CN110059705A (en) * 2019-04-22 2019-07-26 厦门商集网络科技有限责任公司 A kind of OCR recognition result decision method and equipment based on modeling
CN110956133A (en) * 2019-11-29 2020-04-03 上海眼控科技股份有限公司 Training method of single character text normalization model, text recognition method and device
US11295155B2 (en) 2020-04-08 2022-04-05 Konica Minolta Business Solutions U.S.A., Inc. Online training data generation for optical character recognition
CN111797908B (en) * 2020-06-18 2022-08-09 浪潮金融信息技术有限公司 Training set generation method of deep learning model for print character recognition
CN111797908A (en) * 2020-06-18 2020-10-20 浪潮金融信息技术有限公司 Training set generation method of deep learning model for print character recognition
CN112580657A (en) * 2020-12-23 2021-03-30 陕西天诚软件有限公司 Self-learning character recognition method
CN112580657B (en) * 2020-12-23 2022-11-01 陕西天诚软件有限公司 Self-learning character recognition method
CN113191251A (en) * 2021-04-28 2021-07-30 北京有竹居网络技术有限公司 Method and device for detecting stroke order, electronic equipment and storage medium
CN117173716A (en) * 2023-09-01 2023-12-05 湖南天桥嘉成智能科技有限公司 Deep learning-based high-temperature slab ID character recognition method and system
CN117173716B (en) * 2023-09-01 2024-03-26 湖南天桥嘉成智能科技有限公司 Deep learning-based high-temperature slab ID character recognition method and system

Similar Documents

Publication Publication Date Title
CN108681735A (en) Optical character recognition method based on convolutional neural networks deep learning model
CN111325203B (en) American license plate recognition method and system based on image correction
CN107368831B (en) English words and digit recognition method in a kind of natural scene image
CN110490081B (en) Remote sensing object interpretation method based on focusing weight matrix and variable-scale semantic segmentation neural network
WO2017016240A1 (en) Banknote serial number identification method
CN106096602A (en) A kind of Chinese licence plate recognition method based on convolutional neural networks
CN109117885B (en) Stamp identification method based on deep learning
JP2022532177A (en) Forged face recognition methods, devices, and non-temporary computer-readable storage media
CN106228166B (en) The recognition methods of character picture
CN106372624B (en) Face recognition method and system
CN103544504B (en) Scene character recognition method based on multi-scale map matching core
CN104951781B (en) Character recognition device and recognition function generation method
CN109886161A (en) A kind of road traffic index identification method based on possibility cluster and convolutional neural networks
CN112307919B (en) Improved YOLOv 3-based digital information area identification method in document image
CN112069900A (en) Bill character recognition method and system based on convolutional neural network
CN108364037A (en) Method, system and the equipment of Handwritten Chinese Character Recognition
Hossain et al. Recognition and solution for handwritten equation using convolutional neural network
He et al. Aggregating local context for accurate scene text detection
Kobchaisawat et al. Thai text localization in natural scene images using convolutional neural network
Harizi et al. Convolutional neural network with joint stepwise character/word modeling based system for scene text recognition
Liu et al. Scene text recognition with high performance CNN classifier and efficient word inference
Yildirim et al. Text recognition in natural images using multiclass hough forests
Wicht et al. Camera-based sudoku recognition with deep belief network
CN110766001B (en) Bank card number positioning and end-to-end identification method based on CNN and RNN
Zuo et al. An intelligent knowledge extraction framework for recognizing identification information from real-world ID card images

Legal Events

Date Code Title Description
PB01 Publication
PB01 Publication
SE01 Entry into force of request for substantive examination
SE01 Entry into force of request for substantive examination
RJ01 Rejection of invention patent application after publication

Application publication date: 20181019

RJ01 Rejection of invention patent application after publication