CN106228166A

CN106228166A - The recognition methods of character picture

Info

Publication number: CN106228166A
Application number: CN201610603330.9A
Authority: CN
Inventors: 李清勇; 薛文元; 张振
Original assignee: Beijing Jiaotong University
Current assignee: Beijing Jiaotong University
Priority date: 2016-07-27
Filing date: 2016-07-27
Publication date: 2016-12-14
Anticipated expiration: 2036-07-27
Also published as: CN106228166B

Abstract

The invention provides the recognition methods of a kind of character picture.The method specifically includes that to be normalized original character image Image and obtains floating type image array ImageSquare；Use multiple filtering core that floating type image array ImageSquare is carried out convolution algorithm, obtain the one group of image array ImageDst comprising different texture feature；Obtain original character image Image characteristic of correspondence vector Features, from one group of image array ImageDst, extract multilayer feature, multilayer feature is arranged in characteristic vector Features, obtain data sample；Use the multi-level grader with cascade connection that data sample carries out successively classification to process, identify the character in original character image Image.The thought that the present invention classifies based on image, SVM classifier is utilized to devise a kind of multi-level cascade classifier model, achieve a kind of recognition methods step by step first sorted out and identify afterwards, decrease the class number of monolayer grader to a certain extent, thus improve recognition accuracy.The method of the present invention is succinct, workable.

Description

The recognition methods of character picture

Technical field

The present invention relates to character recognition technologies field, particularly relate to the recognition methods of a kind of character picture.

Background technology

Optical character recognition (Optical Character Recognition, OCR) is pattern recognition (Pattern Recognition) an important research direction in field, its main purpose be by hand-written, print, scan or text in image Be converted to machine code, be a kind of digitized process of text message.In recent years, along with the development of science and technology, OCR technique not only existed The Digitization of literature of ancient book plays great function, is also applied in many intelligent scene, as normal in life The charge station's Car license recognition seen, and Google's glasses provide wearer according to the word seen and advise accordingly.Character recognition Process is generally divided into several steps such as Image semantic classification, Character segmentation, character recognition and post processing, and each step is required for multiple Technology participates in jointly.

Abroad the research work of OCR technique is started to walk relatively early, and English character has total amount character stroke little, single to connect Deng distinct characteristic, these features are all prone to from image extract.Chinese character recognition is then the most challenging in OCR technique One content, one of them main difficulty is that the feature extraction of Chinese character.Chinese character is developed by pictograph, character Quantity is many, font type is many, structure is complicated, and has a number of nearly word form, and these features both increase extraction Chinese character to be had The difficulty of effect feature, thus have influence on recognition result.

The conventional method extracting character feature has statistical nature to extract, architectural feature is extracted.Statistical nature extracts main It is, by methods such as projection, statistics with histogram, each pixel in image carried out half-tone information statistics, thus forms character Characteristic vector.The method is poor to distinguishing property of details, it is impossible to effectively distinguish nearly word form.Feature extraction side based on structure Method is mainly extracted the stroke information of character, combines the feature such as direction, cross point, can preferably identify detailed information, but The most also can be the most sensitive to noise.In recent years due to the burning hot development of degree of depth study, occur in that and utilize neutral net to carry out The method of feature extraction, although there is the accuracy of identification that comparison is high, but technology realizes cost height, and Chinese character quantity is various, god Becoming sufficiently complex through the design of network, these are all unfavorable for the exploitation of middle-size and small-size OCR system.Based on the problems referred to above, the present invention Propose a kind of hierarchical Design model based on Gabor filtering to be used for carrying out feature extraction.Gabor filtering is a kind of sensitive to texture Method, it is possible to reflect image textural characteristics on different scale and direction, its extensive application image process in edge Test problems.

Summary of the invention

The embodiment provides the recognition methods of a kind of character picture, to realize effectively knowing from character picture Do not go out character.

To achieve these goals, this invention takes following technical scheme.

A kind of recognition methods of character picture, including:

Original character image Image is normalized and obtains floating type image array ImageSquare；

Use multiple filtering core that described floating type image array ImageSquare is carried out convolution algorithm, obtain comprising not One group of image array ImageDst with textural characteristics；

Obtain described original character image Image characteristic of correspondence vector Features, from described one group of image array ImageDst extracts multilayer feature, described multilayer feature is arranged in described characteristic vector Features, obtains data Sample；

Use the multi-level grader with cascade connection that described data sample carries out successively classification to process, identify Character in described original character image Image.

Further, described be normalized original character image Image obtains floating type image array ImageSquare, including:

If original character image is Image, the image after image Image gray processing processes is ImageGray, takes image The longest edge of ImageGray is designated as LenSide, calculates image ImageGray and is positioned at the gray average at four angular vertexs MeanVal, two long limits of image ImageGray outside fill equal number, gray value be the pixel of meanVald, Making image ImageGray become a size is the image array of LenSide × LenSide, then by described matrix image normalizing Turn to the floating type image array ImageSquare of 32 × 32.

Further, the multiple filtering core of described use carries out convolution to described floating type image array ImageSquare Computing, obtains the one group of image array ImageDst comprising different texture feature, including:

Designing 12 Gabor filtering cores, the kernel function of described Gabor filtering is as follows:

X '=x cos θ+y sin θ

Y '=-x sin θ+y cos θ

Wherein θ represents the direction of filtering core,Representing the phase place of cosine part, γ represents the space aspect ratio of filtering core, λ Representing the wavelength of cosine part, σ represents the standard deviation in Gaussian function；

Use described 12 Gabor filtering cores that described floating type image array ImageSquare is carried out convolution fortune respectively Calculating, extract 12 character feature images altogether of 3 kinds of different scales on 4 directions, kinds of characters characteristic image comprises not Same textural characteristics, forms one group of image array ImageDst by all of character feature image.

Further, described acquisition described original character image Image characteristic of correspondence vector Features, from described One group of image array group ImageDst extracts multilayer feature, described multilayer feature is arranged described characteristic vector In Features, obtain data sample, including:

Note image Image characteristic of correspondence vector is Features, calculates described one group of image array respectively The average of all pixels and standard deviation the feature as ground floor in every characteristic image in ImageDst, by described first The feature of layer is arranged in characteristic vector Features；

It is divided into 4 sizes to be the image moment of 16 × 16 every characteristic image in described one group of image array ImageDst Battle array, calculates the average of the image array of each 16 × 16 and standard deviation the feature as the second layer, respectively by described second The feature of layer is arranged in characteristic vector Features；

It is divided into 16 sizes to be the image array of 8 × 8 every characteristic image in described one group of image array ImageDst, Calculate the average of the image array of each 8 × 8 and standard deviation the feature as third layer respectively, by the spy of described third layer Levy and arrange in characteristic vector Features；

Using contain three layers of characteristic information characteristic vector Features as data sample.

Further, described use has the multi-level grader of cascade connection and carries out described data sample successively Classification processes, and identifies the character in described original character image Image, including:

Arrange the multi-level grader with cascade connection according to sample attribute, each layer includes multiple grader, often The grader on individual upper strata cascades the grader of one or more lower floor, and sub classification device is the upper strata grader cascading it Refinement, the sample attribute of each grader corresponding kind respectively, choose the training data of every kind respectively to correspondence Grader is trained, and obtains the cascade classifier trained；

The cascade classifier trained described in using is according to top-down order, to described data sample from different perspectives Successively classify, when described data sample is at classification results certain grader corresponding of last layer time, then choose described certain The grader of next layer of grader cascade carries out classification further to described data sample, until the classification of the bottom Device completes the classification to described data sample and processes, and the grader on all levels is entered the classification results of described sample data Row is comprehensive, obtains the character identification result of described original character image Image.

The technical scheme that thered is provided by embodiments of the invention described above is it can be seen that what the embodiment of the present invention was classified based on image Thought, utilizes SVM classifier to devise a kind of multi-level cascade classifier model, it is achieved that a kind of first sort out identify afterwards by Level recognition methods, decreases the class number of monolayer grader to a certain extent, thus improves recognition accuracy.The present invention is real The method executing example is succinct, workable.

Aspect and advantage that the present invention adds will part be given in the following description, and these will become from the following description Obtain substantially, or recognized by the practice of the present invention.

Accompanying drawing explanation

In order to be illustrated more clearly that the technical scheme of the embodiment of the present invention, required use in embodiment being described below Accompanying drawing be briefly described, it should be apparent that, below describe in accompanying drawing be only some embodiments of the present invention, for this From the point of view of the those of ordinary skill of field, on the premise of not paying creative work, it is also possible to obtain other according to these accompanying drawings Accompanying drawing.

A kind of level characteristics model for character picture identification that Fig. 1 provides for the embodiment of the present invention and cascade classifier Design flow diagram.

A kind of expression original character image that Fig. 2 provides for the embodiment of the present invention, and original character image is carried out gray scale The schematic diagram filled.

The visualization signal of a kind of kernelGroup being made up of 12 filtering cores that Fig. 3 provides for the embodiment of the present invention Figure.

A kind of hierarchical Design model schematic that Fig. 4 provides for the embodiment of the present invention.

A kind of cascade classifier design diagram that Fig. 5 provides for the embodiment of the present invention.

Detailed description of the invention

Embodiments of the present invention are described below in detail, and the example of described embodiment is shown in the drawings, the most ad initio Represent same or similar element to same or similar label eventually or there is the element of same or like function.Below by ginseng The embodiment examining accompanying drawing description is exemplary, is only used for explaining the present invention, and is not construed as limiting the claims.

Those skilled in the art of the present technique are appreciated that unless expressly stated, singulative used herein " ", " Individual ", " described " and " being somebody's turn to do " may also comprise plural form.It is to be further understood that use in the description of the present invention arranges Diction " including " refers to there is described feature, integer, step, operation, element and/or assembly, but it is not excluded that existence or adds Other features one or more, integer, step, operation, element, assembly and/or their group.It should be understood that when we claim unit Part is " connected " or during " coupled " to another element, and it can be directly connected or coupled to other elements, or can also exist Intermediary element.Additionally, " connection " used herein or " coupling " can include wireless connections or couple.Wording used herein "and/or" includes one or more any cell listing item being associated and all combinations.

Those skilled in the art of the present technique are appreciated that unless otherwise defined, and all terms used herein (include technology art Language and scientific terminology) have with the those of ordinary skill in art of the present invention be commonly understood by identical meaning.Also should Being understood by, those terms defined in such as general dictionary should be understood that the meaning having with the context of prior art The meaning that justice is consistent, and unless defined as here, will not explain by idealization or the most formal implication.

For ease of the understanding to the embodiment of the present invention, do as a example by several specific embodiments further below in conjunction with accompanying drawing Explanation, and each embodiment is not intended that the restriction to the embodiment of the present invention.

The embodiment of the present invention sets mainly for the character feature extracting method in character recognition process and recognition classifier Meter.The embodiment of the present invention is level characteristics model based on Gabor filter design, can not only reflect that the overall situation of character picture is special Levy, moreover it is possible to show the local feature of character stroke, thus reach to extract the purpose of character picture validity feature.Setting of grader Meter is also a ring important for OCR, and good classifier design can reasonably utilize the character picture feature being drawn into, and draws accurately Character identification result reliably.Along with the increase of classification, the identification error rate of monolayer grader also can be greatly increased.Base of the present invention In the thought of image classification, utilize a kind of level of SVM (support vector machine, support vector machine) classifier design The model of connection grader, it is achieved that a kind of first sorts out the recognition methods step by step identified afterwards, decreases monolayer to a certain extent and classifies The class number of device, thus improve recognition accuracy.

The embodiment of the present invention proposes the level characteristics model for character picture identification and cascade classifier model, the party First method is normalized pretreatment to input picture, then devises and can extract different directions and different scale feature Gabor filtering core, and be filtered image processing.It is followed by hierarchical model is designed, and extracts character picture feature. Finally design cascade classifier, according to cascade classifier sample data classified and train step by step.

The handling process of the recognition methods of a kind of character picture that the embodiment of the present invention proposes is as it is shown in figure 1, include as follows Process step:

Step S110, the pretreatment of character picture.

First to original character image be normalized.Original character image is generally the rectangle that length and width do not wait, First original character image is carried out gray scale filling, make the pros of a length of length of side with original character image longest edge Shape image block, is finally normalized to the floating type image array of 32 × 32.

Note original character image is Image, and the image after gray processing is ImageGray, takes the longest edge note of ImageGray For LenSide, the gray average that calculating ImageGray is positioned at four angular vertexs is designated as meanVal, ImageGray two Equal number is filled in the outside on long limit, and gray value is the pixel of meanVald, makes ImageGray become a size and is The image array of LenSide × LenSide, is designated as ImageSquare.A kind of expression original character that the embodiment of the present invention provides Image, and original character image is carried out the schematic diagram of gray scale filling as in figure 2 it is shown, the outside of arrow indication black line is The region being filled.Finally, ImageSquare is normalized to the floating type image array of 32 × 32.

Size based on different original images, above-mentioned size 32 × 32 be can be varied from (suitable increase or Reduce), but by experiment gained, for the picture on major part mobile phone or camera shooting A4 paper, each word in figure Size, close to 32 × 32, is normalized to such a size and is advantageous for the reservation of original image information, so this size Setting be relatively reasonable.

Step S120, the design of Gabor filtering core and the Filtering Processing of character picture.

Gabor (adding the primary) filtering is the important filter function of detection image texture characteristic, and the stroke feature of character can also Regard a kind of textural characteristics as, so, this method devises 12 Gabor filtering cores, uses this 12 Gabor filtering cores respectively Described floating type image array ImageSquare is carried out convolution algorithm, 3 kinds of different scales on 4 directions can be extracted 12 character feature images altogether, kinds of characters characteristic image comprises different textural characteristics, by all of character feature figure As composition image array ImageDst.

The kernel function of Gabor filtering is as follows:

X '=x cos θ+y sin θ

Y '=-x sin θ+y cos θ

Wherein θ represents the direction of filtering core,Representing the phase place of cosine part, γ represents the space aspect ratio of filtering core, λ Representing the wavelength of cosine part, σ represents the standard deviation in Gaussian function.Take σ=π/3, γ=0.5,θ=0, π/4, Pi/2,3 π/4, λ=2,4,6, can obtain after combination representing 4 directions, 12 filtering core functions of 3 yardsticks, set The template size of kernel function is 11 × 11, and constructs corresponding one group of kernel function template kernelGroup.

The present invention obtains one group of Gabor filter core by the different parameter group arranging the kernel function of Gabor filtering is incompatible, enters And extraction characteristics of image effectively and reasonably.

As shown in Figure 3.These 12 filtering cores are used respectively ImageSquare (Size32 × 32) to be carried out convolution algorithm, Obtain 12 one group of image arrays comprising different texture feature, be denoted as ImageDst.

Step S130, the design of hierarchical model and the feature extraction of character picture.

The local feature of global characteristics Yu stroke in order to preferably extract character, this method progressively will comprise textural characteristics Image be subdivided into some pieces, try to achieve average and the standard deviation of every piece, and by the average of the block of all formed objects and side every time Differential pressure enters characteristic vector, as the eigenvalue of this layer.

In order to extract global characteristics and the local feature of character, the one that Fig. 4 provides for the embodiment of the present invention simultaneously Hierarchical Design model schematic.

Note Image characteristic of correspondence vector is Features, seeks every characteristic image in one group of image array ImageDst The average of all pixels and standard deviation are as the feature of ground floor, and the feature of this ground floor is arranged characteristic vector In Features.

Then, it is divided into 4 sizes to be the image moment of 16 × 16 every characteristic image in one group of image array ImageDst Battle array, tries to achieve average and the standard deviation of each image array (Size16 × 16), respectively by each image array (Size16 × 16) Average and standard deviation as the feature of the second layer, and the feature of this second layer is arranged in characteristic vector Features.

Finally, it is divided into 16 sizes to be the image moment of 8 × 8 every characteristic image in one group of image array ImageDst Battle array, tries to achieve average and the standard deviation of each image array (Size8 × 8) respectively, equal by each image array (Size8 × 8) Value and standard deviation are as the feature of third layer, and the feature of this third layer are arranged in characteristic vector Features.

In actual applications, it is also possible to continue to be subdivided into more layers, the method at least two-layer, but it is not limited only to two-layer, than As, can there be the 4th layer, layer 5 feature.

So, we have just obtained not only comprising global characteristics but also comprise characteristic vector Features of local feature, will bag Contain characteristic vector Features of multilayer feature information as the data sample extracting feature.

Step S140, the design of cascade classifier and training.

The problem too much in order to solve monolayer grader output classification, this method is first by sample according to natural quality step by step Classify, and the label on appropriate level is set for sample.The most top-down, go training point according to the classification of current level Class device, finally obtains the multi-level grader with cascade connection.When using cascade classifier, same employing is top-down Process, every time according to the classification results to sample, select the grader of next stage, and finally give desired result.

In order to reduce the output classification of monolayer grader, improving recognition accuracy, the present invention arranges tool according to sample attribute There is the multi-level cascade classifier of cascade connection.Each layer includes multiple grader, and the grader on each upper strata cascades one Or the grader of multiple lower floors, sub classification device is the refinement to the upper strata grader that it cascades, and each grader is the most right Answering the sample attribute of a kind, the grader choosing the training data of every kind respectively corresponding is trained, and is trained Good cascade classifier.This cascade classifier builds based on SVM classifier.

The cascade classifier that use trains is according to top-down order, to described data sample the most successively Classify, when described data sample is at classification results certain grader corresponding of last layer time, then choose certain classification described The grader of next layer of device cascade carries out classification further to described data sample, until the grader of the bottom is complete The classification of paired described data sample processes, and the grader on all levels is combined the classification results of described sample data Close, obtain the character identification result of described original character image Image.

First, according to the natural quality of sample, classifying sample the most step by step, each layer of classification can be regarded as One-level, every one-level is all the segmentation to upper level.For the data sample of all extraction features, i.e. characteristic vector Features, marks i label { y for it₁,y_2j,…,y_ij, y_ijRepresent belonging to this sample is under i-stage jth grader Classification.We are model based on SVM classifier, uses and all has label y_ijTraining data training i-stage jth classification Device.During identifying, the top-down process of same employing, by grader at the classification results of i-stage, determine to survey This grader that should select in i+1 level of sample, finally completes to identify the process of character recognition on afterbody grader.

A kind of cascade classifier design diagram that Fig. 5 provides for the embodiment of the present invention, first order grader is by feature sample Originally be divided into four class Chinese characters, English, numeral and other, if input is for " Chinese " image block, by first order grader, it is judged that its The first order is categorized as " Chinese character ", and then enters second level grader, by second level grader, it is judged that its specific category is " Chinese ".

In sum, the thought that the embodiment of the present invention is classified based on image, utilize SVM classifier to devise a kind of multi-level Cascade classifier model, it is achieved that a kind of first sort out the recognition methods step by step identified afterwards, decrease monolayer to a certain extent and divide The class number of class device, thus improve recognition accuracy.The method of the embodiment of the present invention is succinct, workable.

The embodiment of the present invention is by the design to Gabor filtering core and hierarchical model so that the character picture of extraction is special Levy more significantly, be more beneficial for the identification of image.The design of cascade classifier, substantially uses the thought divided and rule to reduce The complicated classification degree of single grader, compared with single grader, can be greatly enhanced the accuracy rate of image recognition.

One of ordinary skill in the art will appreciate that: accompanying drawing is the schematic diagram of an embodiment, module in accompanying drawing or Flow process is not necessarily implemented necessary to the present invention.

As seen through the above description of the embodiments, those skilled in the art it can be understood that to the present invention can The mode adding required general hardware platform by software realizes.Based on such understanding, technical scheme essence On the part that in other words prior art contributed can embody with the form of software product, this computer software product Can be stored in storage medium, such as ROM/RAM, magnetic disc, CD etc., including some instructions with so that a computer equipment (can be personal computer, server, or the network equipment etc.) performs some of each embodiment of the present invention or embodiment Method described in part.

Each embodiment in this specification all uses the mode gone forward one by one to describe, identical similar portion between each embodiment Dividing and see mutually, what each embodiment stressed is the difference with other embodiments.Especially for device or For system embodiment, owing to it is substantially similar to embodiment of the method, so describing fairly simple, relevant part sees method The part of embodiment illustrates.Apparatus and system embodiment described above is only schematically, wherein said conduct The unit of separating component explanation can be or may not be physically separate, the parts shown as unit can be or Person may not be physical location, i.e. may be located at a place, or can also be distributed on multiple NE.Can root Factually border need select some or all of module therein to realize the purpose of the present embodiment scheme.Ordinary skill Personnel, in the case of not paying creative work, are i.e. appreciated that and implement.

The above, the only present invention preferably detailed description of the invention, but protection scope of the present invention is not limited thereto, Any those familiar with the art in the technical scope that the invention discloses, the change that can readily occur in or replacement, All should contain within protection scope of the present invention.Therefore, protection scope of the present invention should be with scope of the claims It is as the criterion.

Claims

1. the recognition methods of a character picture, it is characterised in that including:

Use multiple filtering core that described floating type image array ImageSquare is carried out convolution algorithm, obtain comprising different stricture of vagina One group of image array ImageDst of reason feature；

Use the multi-level grader with cascade connection that described data sample carries out successively classification to process, identify described Character in original character image Image.

Method the most according to claim 1, described be normalized original character image Image obtains floating-point Type image array ImageSquare, including:

The recognition methods of character picture the most according to claim 2, it is characterised in that the multiple filtering core pair of described use Described floating type image array ImageSquare carries out convolution algorithm, obtains the one group of image array comprising different texture feature ImageDst, including:

X '=x cos θ+y sin θ

Y '=-x sin θ+y cos θ

Wherein θ represents the direction of filtering core,Representing the phase place of cosine part, γ represents the space aspect ratio of filtering core, and λ represents The wavelength of cosine part, σ represents the standard deviation in Gaussian function；

Use described 12 Gabor filtering cores that described floating type image array ImageSquare is carried out convolution algorithm respectively, carry Getting 12 character feature images altogether of 3 kinds of different scales on 4 directions, kinds of characters characteristic image comprises different stricture of vaginas Reason feature, forms one group of image array ImageDst by all of character feature image.

The recognition methods of character picture the most according to claim 3, it is characterised in that the described original character of described acquisition Image Image characteristic of correspondence vector Features, extracts multilayer feature from described one group of image array group ImageDst, Described multilayer feature is arranged in described characteristic vector Features, obtains data sample, including:

Note image Image characteristic of correspondence vector is Features, calculates respectively in described one group of image array ImageDst In every characteristic image, the average of all pixels and standard deviation the feature as ground floor, set the feature of described ground floor Put in characteristic vector Features；

It is divided into 4 sizes to be the image array of 16 × 16 every characteristic image in described one group of image array ImageDst, point Do not calculate the average of the image array of each 16 × 16 and standard deviation the feature as the second layer, by the spy of the described second layer Levy and arrange in characteristic vector Features；

It is divided into 16 sizes to be the image array of 8 × 8 every characteristic image in described one group of image array ImageDst, respectively Calculate the average of the image array of each 8 × 8 and standard deviation the feature as third layer, the feature of described third layer is set Put in characteristic vector Features；

The recognition methods of character picture the most according to claim 4, it is characterised in that described use has cascade connection Multi-level grader described data sample is carried out successively classification process, identify in described original character image Image Character, including:

Arrange the multi-level grader with cascade connection according to sample attribute, each layer includes multiple grader, Mei Geshang The grader of layer cascades the grader of one or more lower floor, and sub classification device is thin to the upper strata grader that it cascades Changing, the sample attribute of each grader corresponding kind respectively, correspondence is divided by the training data choosing every kind respectively Class device is trained, and obtains the cascade classifier trained；

The cascade classifier trained described in using is according to top-down order, to described data sample the most successively Classify, when described data sample is at classification results certain grader corresponding of last layer time, then choose certain classification described The grader of next layer of device cascade carries out classification further to described data sample, until the grader of the bottom is complete The classification of paired described data sample processes, and the grader on all levels is combined the classification results of described sample data Close, obtain the character identification result of described original character image Image.