CN106127198A - A kind of image character recognition method based on Multi-classifers integrated - Google Patents
A kind of image character recognition method based on Multi-classifers integrated Download PDFInfo
- Publication number
- CN106127198A CN106127198A CN201610442435.0A CN201610442435A CN106127198A CN 106127198 A CN106127198 A CN 106127198A CN 201610442435 A CN201610442435 A CN 201610442435A CN 106127198 A CN106127198 A CN 106127198A
- Authority
- CN
- China
- Prior art keywords
- image
- word
- character
- gray level
- method based
- Prior art date
- Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
- Pending
Links
Classifications
-
- G—PHYSICS
- G06—COMPUTING; CALCULATING OR COUNTING
- G06V—IMAGE OR VIDEO RECOGNITION OR UNDERSTANDING
- G06V10/00—Arrangements for image or video recognition or understanding
- G06V10/20—Image preprocessing
- G06V10/22—Image preprocessing by selection of a specific region containing or referencing a pattern; Locating or processing of specific regions to guide the detection or recognition
-
- G—PHYSICS
- G06—COMPUTING; CALCULATING OR COUNTING
- G06F—ELECTRIC DIGITAL DATA PROCESSING
- G06F18/00—Pattern recognition
- G06F18/20—Analysing
- G06F18/24—Classification techniques
- G06F18/241—Classification techniques relating to the classification model, e.g. parametric or non-parametric approaches
- G06F18/2413—Classification techniques relating to the classification model, e.g. parametric or non-parametric approaches based on distances to training or reference patterns
- G06F18/24147—Distances to closest patterns, e.g. nearest neighbour classification
Landscapes
- Engineering & Computer Science (AREA)
- Theoretical Computer Science (AREA)
- Physics & Mathematics (AREA)
- Data Mining & Analysis (AREA)
- General Physics & Mathematics (AREA)
- Bioinformatics & Cheminformatics (AREA)
- Computer Vision & Pattern Recognition (AREA)
- Evolutionary Biology (AREA)
- Evolutionary Computation (AREA)
- Bioinformatics & Computational Biology (AREA)
- General Engineering & Computer Science (AREA)
- Artificial Intelligence (AREA)
- Life Sciences & Earth Sciences (AREA)
- Multimedia (AREA)
- Character Input (AREA)
- Character Discrimination (AREA)
Abstract
The present invention provides a kind of image character recognition method based on Multi-classifers integrated, and colored image to be identified is converted to gray level image;Gray level image is carried out binary conversion treatment, and the image region segmentation of Word message will be comprised out;Each Chinese character is split from monoblock character image;Extract grid search-engine and the direction character of each Chinese character;Use minimum distance classifier, select stroke density total length feature to carry out the rough sort of ground floor;Use nearest neighbour classification device, select peripheral characteristic, grid search-engine and direction character to combine the classification and matching of the second layer respectively.The invention have the advantages that the existing stronger capacity of resisting disturbance of Text region, have again the ability of stronger descriptive text partial structurtes, and affected by stroke width less;Have employed minimum distance classifier, the combining classifiers technology combined closest to grader complementation, make system more reliability;Can Intelligent Recognition word, improve the adaptability of system and discrimination be high.
Description
Technical field
The present invention relates to pictograph identification field, more particularly, to a kind of image based on Multi-classifers integrated literary composition
Word recognition methods.
Background technology
Social development enters the information age, and along with expansion, deep and socialization's needs of practical activity, human needs goes to know
The information of the form content complexity of a lot of classes.People the most no longer rest on oneself ear and eyes go to directly obtain these
Information, but use computer that word inputs computer automatically.Owing to scientific and technological level improves constantly so that various different
Object of study obtains " image conversion " and " digitized ", and the multimedia messages based on image rapidly becomes important information transmission matchmaker
Being situated between, the Word message in image contains abundant high-layer semantic information.Extract these words, semanteme high-level for image
Understanding, index and retrieve the most helpful.
Now for the research of character image identification technology, being also faced with Railway Project, one is that image data amount is big, typically comes
Saying, will obtain higher accuracy of identification, original image should have higher resolution, at least should be greater than 64 × 64.Two is image
It is stained, owing to the interference of target environment, the error of transmission, the error of sensor, noise, ambient interferences, deformation etc. can be stained figure
Picture.Three is accuracy, and the vision of displacement, rotation, dimensional variation, distortion, with the mankind is the same, there are between target and sensor
The change of position, it is therefore desirable to system is when target produces displacement, rotation, dimensional variation, distortion, still is able to correctly identify mesh
Mark.Four is real-time, and in the application of military field, the identification target that the system that is desirable that greatly can be real-time, this just requires system
There is the out speed and recognition efficiency being exceedingly fast.
In view of the current situation of current character identification system, the discrimination how improving print hand writing is still current grinding
Study carefully focus, under World Scene, how to identify that word will be a direction of character identification system development.In addition, how tool is built
Have the space of a whole page to automatically analyze, fault-tolerance is strong, discrimination is high, mistake self study self-correction, the character identification system of easy extension characteristic are
The goal in research of Text region automatization.So, the research of pictograph identification technology is particularly important.
Summary of the invention
The present invention is to overcome at least one defect described in above-mentioned prior art, it is provided that a kind of automatization, discrimination height
Image character recognition method based on Multi-classifers integrated.
For solving above-mentioned technical problem, technical scheme is as follows:
A kind of image character recognition method based on Multi-classifers integrated, said method comprising the steps of:
S1: colored image to be identified is converted to gray level image, if image to be identified is originally as gray level image, omits this step
Suddenly;
S2: the gray level image obtained is carried out binary conversion treatment, and the image region segmentation of Word message will be comprised out;
S3: each Chinese character is split from monoblock character image;
S4: extract grid search-engine and the direction character of each Chinese character;
S5: use minimum distance classifier, selects stroke density total length feature to carry out the rough sort of ground floor;
S6: use nearest neighbour classification device, selects peripheral characteristic, grid search-engine and direction character to combine the second layer respectively
Classification and matching.
In the preferred scheme of one, in step S1, when colored image to be identified is converted to gray level image, employing adds
Weight average value method carries out gradation conversion, i.e. the value weighted average to R, G, B: R=G=B=a*R+b*G+c*B;Wherein, R, G,
B represents red, green and blue respectively, and a, b, c are respectively the weights of R, G, B, wherein b > a > c.
When pictograph identification, the image to be identified of input is typically all color RGB image, it comprises substantial amounts of face
Color information, if image carrying out place comprehend the execution speed of reduction system, RGB image includes a lot of and Text region in addition
Unrelated colouring information, is unfavorable for the location of word, and gray level image, only comprise monochrome information, do not comprise color information, favorably
In later stage process further to image, the speed of service can be improved, be conducive to next step text location.Owing to human eye is to green
Color is most sensitive, takes second place red sensitivity, minimum to blue sensitivity, so when at b > under conditions of a > c, can obtain
To the gray level image being relatively easy to identification.
In the preferred scheme of one, in step S2, use OTSU algorithm (Da-Jin algorithm or maximum variance between clusters) to ash
Degree image carries out binary conversion treatment.
The binary conversion treatment of image, is that the gray value to the pixel on image is set to 0 or 255, i.e. big when all gray scales
In or be judged as specific object equal to the pixel of threshold values, its gray value is 255, and otherwise, its gray value is 0, represents it
His object area or background, the image after process will present obvious black and white effect.The binaryzation of image will be to have 256
The gray level of pixel, after suitable threshold values is chosen, is divided into 2 grades by the gray level image of individual tonal gradation.Through binary conversion treatment
After image, its character is only the most relevant with the position of the pixel that gray value is 0 or 255, no longer relates to the picture of other gray levels
Vegetarian refreshments, it is simple to image is further processed, and the treating capacity of data and decrement less, and obtain binary image still
The old feature that can reflect image entirety and local.In order to obtain preferable binary image, choosing of threshold values is most important.Choosing
Take suitable threshold values, be possible not only to effectively remove noise, and image can be divided into target area and background significantly, significantly
Reduce quantity of information, the speed that raising processes.
In the preferred scheme of one, in step S3, use the single word in character segmentation method identification image-region, i.e. profit
Single character is divided by the crest formed by the upright projection on image level direction of the white space between word and word with trough
Cut out.
In the preferred scheme of one, in step S3, in order to improve accuracy rate, use regression equation character segmentation method identification single
Word, is i.e. square figure according to Chinese character, has the feature of uniform-dimension substantially, the word height obtained when utilizing row cutting
Estimate the width of word, thus predict the position of next word.
In the preferred scheme of one, in step S4, the concrete grammar extracting word grid search-engine is as follows:
1) word lattice is divided into 8 × 8 parts;
2) the stain number in obtaining every part, with P11, P12 ... P18, P21 ... P88 represents;
3) the stain number P=P11+P12+ that word is total is obtained ...+Pl8+P21+ ...+P88;
4) the percentage ratio Pij=Pij × 100/P of whole word stain number shared by stain number in obtaining every part, wherein i, j are big
In the integer equal to 1 and less than or equal to 8, and characteristic vector (P11, P12 ... P18, P21 ... P88) it is exactly the grid search-engine of word.
In the preferred scheme of one, in step S4, the concrete grammar extracting words direction feature is as follows:
Word lattice image is carried out binaryzation and normalization, and extracts profile information, to each imparting one on profile
Or the attribute of both direction, direction takes level, vertical and positive and negative 45 ° of totally four angles, word lattice is divided into n × n net
Lattice, calculate the number of 4 direction attributes that each grid includes, thus constitute 4 dimensional vectors, and comprehensive all of grid is special
Levy, form the characteristic vector of a 4 × n × n dimension, be direction character.
In the preferred scheme of one, in step S5, the concrete grammar building minimum distance classifier is as follows:
1) from sample, extract the stroke density length characteristic vector as rough sort of word.2) each classification is calculated respectively
The feature corresponding to sample, the most one-dimensional of each class has characteristic set, by set, an average can be calculated, also
It it is exactly eigencenter.3) generally for eliminating different characteristic because the different impact of dimension, the most one-dimensional feature is needed by we
Do a normalization, or scaling is to intervals such as (-1,1) so that it is go dimension.4) utilize the distance criterion chosen, treat
Originally judging of classification.
In the preferred scheme of one, in step S6, the concrete grammar building nearest neighbour classification device is as follows:
1) initializing distance is maximum
2) unknown sample and distance dist of each training sample are calculated
3) K up till now is obtained individual closest to ultimate range maxdist in sample
4) if dist is less than maxdist, then using this training sample as K-nearest samples
5) repeat step 2,3,4, until the distance of unknown sample and all training samples is the completeest
6) number of times that in statistics K-nearest samples, each class label occurs
7) the class label class label as unknown sample of frequency of occurrences maximum is selected
Compared with prior art, technical solution of the present invention provides the benefit that: the present invention provides a kind of based on Multi-classifers integrated
Image character recognition method, colored image to be identified is converted to gray level image;Gray level image is carried out binary conversion treatment,
And the image region segmentation of Word message will be comprised out;Each Chinese character is split from monoblock character image;Extract every
The grid search-engine of individual Chinese character and direction character;Use minimum distance classifier, select stroke density total length feature to carry out the
The rough sort of one layer;Use nearest neighbour classification device, select peripheral characteristic, grid search-engine and direction character to combine respectively
The classification and matching of the second layer.For feature extraction, the method using grid and direction character to combine, make Text region existing stronger
Capacity of resisting disturbance, have again the ability of stronger descriptive text partial structurtes, and affected less by stroke width;For figure
As, in Text region, applying artificial intelligence's learning art, the adaptability and the discrimination that improve system are high;Grader is set
Meter, have employed minimum distance classifier, the combining classifiers technology combined closest to grader complementation, makes system the most reliable
Property.
Accompanying drawing explanation
Fig. 1 is the flow chart of image character recognition method based on Multi-classifers integrated.
Fig. 2 is the schematic diagram of gradation conversion and binaryzation.
Fig. 3 is the schematic diagram of regression equation character segmentation method.
Fig. 4 is the schematic diagram extracting word grid search-engine.
Fig. 5 is the schematic diagram extracting direction grid search-engine.
Fig. 6 is the Text region schematic diagram of multiple combining classifiers.
Fig. 7 is the schematic diagram that whole section of Text segmentation becomes single font.
Fig. 8 is the schematic diagram of the form output character with text box.
Detailed description of the invention
Accompanying drawing being merely cited for property explanation, it is impossible to be interpreted as the restriction to this patent;
In order to the present embodiment is more preferably described, some parts of accompanying drawing have omission, zoom in or out, and do not represent the chi of actual product
Very little;
To those skilled in the art, in accompanying drawing, some known features and explanation thereof may will be understood by omission.
With embodiment, technical scheme is described further below in conjunction with the accompanying drawings.
Embodiment 1
As it is shown in figure 1, a kind of image character recognition method based on Multi-classifers integrated, said method comprising the steps of:
S1: colored image to be identified is converted to gray level image, if image to be identified is originally as gray level image, omits this step
Suddenly;
When colored image to be identified is converted to gray level image, use weighted average method to carry out gradation conversion, i.e. to R,
The value weighted average of G, B: R=G=B=a*R+b*G+c*B;Wherein, R, G, B represent red, green and blue respectively, a, b, c
It is respectively the weights of R, G, B, wherein b > a > c.
When pictograph identification, the image to be identified of input is typically all color RGB image, it comprises substantial amounts of face
Color information, if image carrying out place comprehend the execution speed of reduction system, RGB image includes a lot of and Text region in addition
Unrelated colouring information, is unfavorable for the location of word, and gray level image, only comprise monochrome information, do not comprise color information, favorably
In later stage process further to image, the speed of service can be improved, be conducive to next step text location.Owing to human eye is to green
Color is most sensitive, takes second place red sensitivity, minimum to blue sensitivity, so when at b > under conditions of a > c, can obtain
To the gray level image being relatively easy to identification.
S2: the gray level image obtained is carried out binary conversion treatment, and the image region segmentation of Word message will be comprised out;
As in figure 2 it is shown, in step S2, use OTSU algorithm that gray level image is carried out binary conversion treatment.The binary conversion treatment of image,
It is that the gray value to the pixel on image is set to 0 or 255, is i.e. determined more than or equal to the pixel of threshold values when all gray scales
For specific object, its gray value is 255, and otherwise, its gray value is 0, represents other object area or background, after process
Image will present obvious black and white effect.The binaryzation of image will be that the gray level image with 256 tonal gradations is through closing
After suitable threshold values is chosen, the gray level of pixel is divided into 2 grades.Image after binary conversion treatment, its character and gray value
The position being the pixel of 0 or 255 is relevant, no longer relates to the pixel of other gray levels, it is simple to make image further
Process, and the treating capacity of data and decrement less, and the binary image obtained still can reflect that image is overall with local
Feature.In order to obtain preferable binary image, choosing of threshold values is most important.Choose suitable threshold values, be possible not only to
Noise is removed on effect ground, and image can be divided into target area and background significantly, greatly reduces quantity of information, the speed that raising processes
Degree.
OTSU algorithm is by the gamma characteristic of image, divides the image into background and target 2 part, between background and target
Inter-class variance is the biggest, illustrates that the difference of 2 parts of pie graph picture is the biggest, when partial target mistake is divided into background or part background mistake to divide
2 part difference all can be caused to diminish for target, therefore, the segmentation making inter-class variance maximum means that misclassification probability is minimum;
Otsu algorithm steps is as follows:
If image comprise L gray level (0,1 ..., L-1), gray value be i pixel count as Ni, the picture element that image is total
Number is N=N0+N1+...+N (L-1), and gray value is that the probability of the point of i is: P (i)=N (i)/N;
Entire image is divided into dark space c1 and clear zone c2 two class by thresholding t, then inter-class variance σ is the function of t: σ=a1*a2 (u1-u2)
^2 ;In formula, aj is the area ratio with the image gross area of class cj, a1=sum (P (i)) i-> t, a2=1-a1;
Uj is the average of class cj, u1=sum (i*P (i))/a1 0-> t, u2=sum (i*P (i))/a2, t+1-> L-1, should
Method selects optimum thresholding t^ to make inter-class variance maximum, it may be assumed that make Δ u=u1-u2, σ b=max{a1 (t) * a2 (t) Δ u^2}.
S3: each Chinese character is split from monoblock character image;
As it is shown on figure 3, in step S3, use the single word in character segmentation method identification image-region, i.e. utilize between word and word
The crest that formed of white space upright projection on image level direction with trough by single Character segmentation out.In order to carry
High-accuracy, uses the regression equation character segmentation single word of method identification, is i.e. square figure according to Chinese character, has uniform chi substantially
Very little feature, the word height obtained when utilizing row cutting is to estimate the width of word, thus predicts the position of next word.
S4: extract grid search-engine and the direction character of each Chinese character;
The feature of extraction single kind carries out Chinese Character Recognition, and misclassification rate is difficult to reduce, and anti-interference is not easy to improve.Because this
The Chinese character information amount that sample is utilized is limited, it is impossible to reflect the feature of Chinese character comprehensively, for any feature, certainly exists it
" dead angle " identified, i.e. utilizes this feature to be difficult to distinguish Chinese character.From the perspective of pattern recognition, if by whole arrows of Chinese character
The space that quantization characteristic is formed be referred to as space Ω (i=1,2 ...), then utilize the information of whole space Ω to carry out Chinese character knowledge
Not, owing to the Chinese character information provided is very abundant, anti-interference can be greatly enhanced.But, in actual applications, it is necessary to consider that know
Other accuracy and recognition speed (operand) and system resource three's is compromise.So the OCR system of any one practicality only profit
Information with wherein parton space.Due to the defect of information, the problem the most inevitably running into identification " dead angle ".
On the basis of these technique study, the present invention have selected the grid search-engine of Chinese character and direction character carries out Chinese character knowledge
Not, these features have stronger capacity of resisting disturbance, have again the ability of stronger descriptive text partial structurtes, and by stroke width
The impact of degree is less, brings out the best in each other, makes " dead angle " of Chinese Character Recognition significantly to reduce, thus improve discrimination.
As shown in Figure 4, in step S4, the concrete grammar extracting word grid search-engine is as follows:
1) word lattice is divided into m × m part, the present embodiment is divided into 8 × 8 parts.
2) the stain number in obtaining every part, with P11, P12 ... P18, P21 ... P88 represents.
3) the stain number P=P11+P12+ that word is total is obtained ...+Pl8+P21+ ...+P88.
4) the percentage ratio Pij=Pij × 100/P, wherein i, j of whole word stain number shared by stain number in obtaining every part
For the integer more than or equal to 1 and less than or equal to 8, and characteristic vector (P11, P12 ... P18, P21 ... P88) it is exactly that the grid of word is special
Levy.
As it is shown in figure 5, in step S4, the concrete grammar extracting words direction feature is as follows:
Word lattice image is carried out binaryzation and normalization, and extracts profile information, to each imparting one on profile
Or the attribute of both direction, direction takes level, vertical and positive and negative 45 ° of totally four angles, word lattice is divided into n × n net
Lattice, calculate the number of 4 direction attributes that each grid includes, thus constitute 4 dimensional vectors, and comprehensive all of grid is special
Levy, form the characteristic vector of a 4 × n × n dimension, be direction character.
S5: as shown in Figure 6, uses minimum distance classifier, selects stroke density total length feature to carry out ground floor
Rough sort;
Minimum distance classifier selects stroke density total length feature to carry out the rough sort of ground floor.In this approach, quilt
Recognition mode is minimum with the distance of affiliated pattern class sample.Assuming that c classification represents characteristic vector R1 of pattern ..., Rc
Representing, x is the characteristic vector of identified pattern, | x-Ri | be x Yu Ri (i=1,2 ..., c) between distance, if | x-Ri |
Minimum, then be divided into the i-th class x.
S6: use nearest neighbour classification device, selects peripheral characteristic, grid search-engine and direction character to combine the respectively
The classification and matching of two layers.
Nearest neighbour classification device selects grid search-engine and direction character to combine the classification and matching of the second layer respectively.?
Nearest Neighbor Classifier is to be extended on the basis of minimum distance classification, is depended on as differentiation by each sample in training set
According to, find the sample in the training set that sample to be sorted is nearest, classify on this basis.
Through test of many times and research, conclusion shows can not fundamentally improve systematicness based on single evaluator principle
Can, recognition result integrated of multiple grader should be relied on.Multi-classifers integrated is i.e. improved by the grader of multiple complementations
The performance of single grader, obtains a reliability higher identification system.Therefore, the present invention use minimum distance classifier and
Nearest neighbour classification device is integrated, and by the optimization in classifier design, further increase word can not rate and accuracy rate.
For checking effectiveness of the invention, need to carry out related experiment, the present invention uses the original image comprising 697 Chinese characters
Test.First this original image is converted into gray level image to carry out next step operation.Cut by regression equation word
Point-score becomes single font whole section of Text segmentation, tests effect such as Fig. 7, can split each Chinese character exactly.Finally, use
The word that the method identification of multi-feature extraction and Multi-classifers integrated splits, and export with the form of text box, test knot
Fruit such as Fig. 8, result is the most correct.
Multi-feature extraction method and multi-classifier integrating method make raising pictograph discrimination be possibly realized, and it is good
Recognition effect causes the most attention of people, has broad application prospects.The present invention is real based on multi-classifier integrating method
Existing pictograph identification, makes the process of pictograph information and extraction be more feasible.
Obviously, the above embodiment of the present invention is only for clearly demonstrating example of the present invention, and is not right
The restriction of embodiments of the present invention.For those of ordinary skill in the field, the most also may be used
To make other changes in different forms.Here without also cannot all of embodiment be given exhaustive.All at this
Any amendment, equivalent and the improvement etc. made within the spirit of invention and principle, should be included in the claims in the present invention
Protection domain within.
Claims (7)
1. an image character recognition method based on Multi-classifers integrated, it is characterised in that said method comprising the steps of:
S1: colored image to be identified is converted to gray level image, if image to be identified is originally as gray level image, omits this step
Suddenly;
S2: the gray level image obtained is carried out binary conversion treatment, and the image region segmentation of Word message will be comprised out;
S3: each Chinese character is split from monoblock character image;
S4: extract grid search-engine and the direction character of each Chinese character;
S5: use minimum distance classifier, selects stroke density total length feature to carry out the rough sort of ground floor;
S6: use nearest neighbour classification device, selects peripheral characteristic, grid search-engine and direction character to combine the second layer respectively
Classification and matching.
Image character recognition method based on Multi-classifers integrated the most according to claim 1, it is characterised in that step S1
In, when colored image to be identified is converted to gray level image, use weighted average method to carry out gradation conversion, i.e. to R, G,
The value weighted average of B: R=G=B=a*R+b*G+c*B;Wherein, R, G, B represent red, green and blue respectively, and a, b, c divide
Not Wei the weights of R, G, B, wherein b > a > c.
Image character recognition method based on Multi-classifers integrated the most according to claim 1, it is characterised in that step S2
In, use OTSU algorithm that gray level image carries out binary conversion treatment, OTSU algorithm is by the gamma characteristic of image, divides the image into
Background and target 2 part, the inter-class variance between background and target is the biggest, illustrates that the difference of 2 parts of pie graph picture is the biggest, when
Partial target mistake is divided into background or part background mistake to be divided into target that 2 part difference all can be caused to diminish, and therefore, makes inter-class variance
Big segmentation means that misclassification probability is minimum;
Otsu algorithm steps is as follows:
If image comprise L gray level (0,1 ..., L-1), gray value be i pixel count as Ni, the picture element that image is total
Number is N=N0+N1+...+N (L-1), and gray value is that the probability of the point of i is: P (i)=N (i)/N;
Entire image is divided into dark space c1 and clear zone c2 two class by thresholding t, then inter-class variance σ is the function of t: σ=a1*a2 (u1-u2)
^2 ;In formula, aj is the area ratio with the image gross area of class cj, a1=sum (P (i)) i-> t, a2=1-a1;
Uj is the average of class cj, u1=sum (i*P (i))/a1 0-> t, u2=sum (i*P (i))/a2, t+1-> L-1, should
Method selects optimum thresholding t^ to make inter-class variance maximum, it may be assumed that make Δ u=u1-u2, σ b=max{a1 (t) * a2 (t) Δ u^2}.
Image character recognition method based on Multi-classifers integrated the most according to claim 1, it is characterised in that step S3
In, use the single word in character segmentation method identification image-region, i.e. utilize the white space between word and word at image level
The crest that upright projection on direction is formed is with trough by single Character segmentation out.
Image character recognition method based on Multi-classifers integrated the most according to claim 4, it is characterised in that step S3
In, use the regression equation character segmentation single word of method identification, be i.e. square figure according to Chinese character, there is the spy of uniform-dimension substantially
Point, the word height obtained when utilizing row cutting is to estimate the width of word, thus predicts the position of next word.
Image character recognition method based on Multi-classifers integrated the most according to claim 1, it is characterised in that step S4
In, the concrete grammar extracting word grid search-engine is as follows:
1) word lattice is divided into 8 × 8 parts;
2) the stain number in obtaining every part, with P11, P12 ... P18, P21 ... P88 represents;
3) the stain number P=P11+P12+ that word is total is obtained ...+Pl8+P21+ ...+P88;
4) the percentage ratio Pij=Pij × 100/P of whole word stain number shared by stain number in obtaining every part, wherein i, j are big
In the integer equal to 1 and less than or equal to 8, and characteristic vector (P11, P12 ... P18, P21 ... P88) it is exactly the grid search-engine of word.
Image character recognition method based on Multi-classifers integrated the most according to claim 1, it is characterised in that step S4
In, the concrete grammar extracting words direction feature is as follows:
Word lattice image is carried out binaryzation and normalization, and extracts profile information, to each imparting one on profile
Or the attribute of both direction, direction takes level, vertical and positive and negative 45 ° of totally four angles, word lattice is divided into n × n net
Lattice, calculate the number of 4 direction attributes that each grid includes, thus constitute 4 dimensional vectors, and comprehensive all of grid is special
Levy, form the characteristic vector of a 4 × n × n dimension, be direction character.
Priority Applications (1)
Application Number | Priority Date | Filing Date | Title |
---|---|---|---|
CN201610442435.0A CN106127198A (en) | 2016-06-20 | 2016-06-20 | A kind of image character recognition method based on Multi-classifers integrated |
Applications Claiming Priority (1)
Application Number | Priority Date | Filing Date | Title |
---|---|---|---|
CN201610442435.0A CN106127198A (en) | 2016-06-20 | 2016-06-20 | A kind of image character recognition method based on Multi-classifers integrated |
Publications (1)
Publication Number | Publication Date |
---|---|
CN106127198A true CN106127198A (en) | 2016-11-16 |
Family
ID=57470615
Family Applications (1)
Application Number | Title | Priority Date | Filing Date |
---|---|---|---|
CN201610442435.0A Pending CN106127198A (en) | 2016-06-20 | 2016-06-20 | A kind of image character recognition method based on Multi-classifers integrated |
Country Status (1)
Country | Link |
---|---|
CN (1) | CN106127198A (en) |
Cited By (13)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
CN109034166A (en) * | 2017-06-08 | 2018-12-18 | 北京君正集成电路股份有限公司 | Confusable character identification model training method and device |
CN109389061A (en) * | 2018-09-26 | 2019-02-26 | 苏州友教习亦教育科技有限公司 | Paper recognition methods and system |
CN110009065A (en) * | 2019-01-14 | 2019-07-12 | 岭南师范学院 | A kind of calligraphy comparison method based on image binaryzation |
CN110378287A (en) * | 2019-07-19 | 2019-10-25 | 腾讯科技(深圳)有限公司 | Document direction recognizing method, device and storage medium |
CN110879965A (en) * | 2019-10-12 | 2020-03-13 | 中国平安财产保险股份有限公司 | Automatic reading and amending method of test paper objective questions, electronic device, equipment and storage medium |
CN110942085A (en) * | 2019-10-25 | 2020-03-31 | 深圳猛犸电动科技有限公司 | Image classification method, image classification device and terminal equipment |
CN111104980A (en) * | 2019-12-19 | 2020-05-05 | 腾讯科技(深圳)有限公司 | Method, device, equipment and storage medium for determining classification result |
CN111428069A (en) * | 2020-03-11 | 2020-07-17 | 中交第二航务工程局有限公司 | Construction data acquisition method for slot milling machine |
CN112906686A (en) * | 2021-03-11 | 2021-06-04 | 北京小米移动软件有限公司 | Character recognition method and device, electronic equipment and storage medium |
CN113420767A (en) * | 2021-07-22 | 2021-09-21 | 凌云光技术股份有限公司 | Method, system and device for extracting features for font classification |
CN114627730A (en) * | 2022-03-31 | 2022-06-14 | 北京科技大学 | Braille electronic book |
CN114937277A (en) * | 2022-05-18 | 2022-08-23 | 北京百度网讯科技有限公司 | Image-based text acquisition method and device, electronic equipment and storage medium |
CN118155210A (en) * | 2024-05-08 | 2024-06-07 | 中国电子产品可靠性与环境试验研究所((工业和信息化部电子第五研究所)(中国赛宝实验室)) | Word processing function test system and method based on machine learning |
Citations (3)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
US20070133883A1 (en) * | 2005-12-12 | 2007-06-14 | Microsoft Corporation | Logical structure and layout based offline character recognition |
CN101630367A (en) * | 2009-07-31 | 2010-01-20 | 北京科技大学 | Rejection method for identifying handwritten character based on multiple classifiers |
CN103996055A (en) * | 2014-06-13 | 2014-08-20 | 上海珉智信息科技有限公司 | Identification method based on classifiers in image document electronic material identification system |
-
2016
- 2016-06-20 CN CN201610442435.0A patent/CN106127198A/en active Pending
Patent Citations (3)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
US20070133883A1 (en) * | 2005-12-12 | 2007-06-14 | Microsoft Corporation | Logical structure and layout based offline character recognition |
CN101630367A (en) * | 2009-07-31 | 2010-01-20 | 北京科技大学 | Rejection method for identifying handwritten character based on multiple classifiers |
CN103996055A (en) * | 2014-06-13 | 2014-08-20 | 上海珉智信息科技有限公司 | Identification method based on classifiers in image document electronic material identification system |
Non-Patent Citations (2)
Title |
---|
吴丹等: "《基于直方图分析和OTSU算法的文字图像二值化》", 《计算机与现代化》 * |
罗笑玲等: "《基于多分类器集成的图像文字识别技术及其应用研究》", 《软件》 * |
Cited By (17)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
CN109034166B (en) * | 2017-06-08 | 2021-09-24 | 北京君正集成电路股份有限公司 | Confusable character recognition model training method and device |
CN109034166A (en) * | 2017-06-08 | 2018-12-18 | 北京君正集成电路股份有限公司 | Confusable character identification model training method and device |
CN109389061A (en) * | 2018-09-26 | 2019-02-26 | 苏州友教习亦教育科技有限公司 | Paper recognition methods and system |
CN110009065A (en) * | 2019-01-14 | 2019-07-12 | 岭南师范学院 | A kind of calligraphy comparison method based on image binaryzation |
CN110378287A (en) * | 2019-07-19 | 2019-10-25 | 腾讯科技(深圳)有限公司 | Document direction recognizing method, device and storage medium |
CN110879965A (en) * | 2019-10-12 | 2020-03-13 | 中国平安财产保险股份有限公司 | Automatic reading and amending method of test paper objective questions, electronic device, equipment and storage medium |
CN110942085A (en) * | 2019-10-25 | 2020-03-31 | 深圳猛犸电动科技有限公司 | Image classification method, image classification device and terminal equipment |
CN110942085B (en) * | 2019-10-25 | 2024-04-09 | 深圳猛犸电动科技有限公司 | Image classification method, image classification device and terminal equipment |
CN111104980A (en) * | 2019-12-19 | 2020-05-05 | 腾讯科技(深圳)有限公司 | Method, device, equipment and storage medium for determining classification result |
CN111428069A (en) * | 2020-03-11 | 2020-07-17 | 中交第二航务工程局有限公司 | Construction data acquisition method for slot milling machine |
CN112906686A (en) * | 2021-03-11 | 2021-06-04 | 北京小米移动软件有限公司 | Character recognition method and device, electronic equipment and storage medium |
CN113420767A (en) * | 2021-07-22 | 2021-09-21 | 凌云光技术股份有限公司 | Method, system and device for extracting features for font classification |
CN113420767B (en) * | 2021-07-22 | 2024-04-26 | 凌云光技术股份有限公司 | Feature extraction method, system and device for font classification |
CN114627730A (en) * | 2022-03-31 | 2022-06-14 | 北京科技大学 | Braille electronic book |
CN114937277A (en) * | 2022-05-18 | 2022-08-23 | 北京百度网讯科技有限公司 | Image-based text acquisition method and device, electronic equipment and storage medium |
CN118155210A (en) * | 2024-05-08 | 2024-06-07 | 中国电子产品可靠性与环境试验研究所((工业和信息化部电子第五研究所)(中国赛宝实验室)) | Word processing function test system and method based on machine learning |
CN118155210B (en) * | 2024-05-08 | 2024-07-12 | 中国电子产品可靠性与环境试验研究所((工业和信息化部电子第五研究所)(中国赛宝实验室)) | Word processing function test system and method based on machine learning |
Similar Documents
Publication | Publication Date | Title |
---|---|---|
CN106127198A (en) | A kind of image character recognition method based on Multi-classifers integrated | |
CN111368886B (en) | Sample screening-based label-free vehicle picture classification method | |
CN110533631B (en) | SAR image change detection method based on pyramid pooling twin network | |
CN110135267A (en) | A kind of subtle object detection method of large scene SAR image | |
CN114841972A (en) | Power transmission line defect identification method based on saliency map and semantic embedded feature pyramid | |
CN110991549A (en) | Countermeasure sample generation method and system for image data | |
CN109711448A (en) | Based on the plant image fine grit classification method for differentiating key field and deep learning | |
CN101609509B (en) | Image and object detection method and system based on pre-classifier | |
Cui et al. | Locality preserving genetic algorithms for spatial-spectral hyperspectral image classification | |
CN104504362A (en) | Face detection method based on convolutional neural network | |
CN105184298A (en) | Image classification method through fast and locality-constrained low-rank coding process | |
Aditya et al. | Batik classification using neural network with gray level co-occurence matrix and statistical color feature extraction | |
CN113011357A (en) | Depth fake face video positioning method based on space-time fusion | |
CN110009030A (en) | Sewage treatment method for diagnosing faults based on stacking meta learning strategy | |
CN108520215B (en) | Single-sample face recognition method based on multi-scale joint feature encoder | |
CN110363230A (en) | Stacking integrated sewage handling failure diagnostic method based on weighting base classifier | |
CN110390347A (en) | Conditions leading formula confrontation for deep neural network generates test method and system | |
CN106096655A (en) | A kind of remote sensing image airplane detection method based on convolutional neural networks | |
CN104252625A (en) | Sample adaptive multi-feature weighted remote sensing image method | |
CN114255403A (en) | Optical remote sensing image data processing method and system based on deep learning | |
Vil’kin et al. | Algorithm for segmentation of documents based on texture features | |
CN114332075A (en) | Rapid structural defect identification and classification method based on lightweight deep learning model | |
CN106203482A (en) | Characteristics of The Remote Sensing Images dimension reduction method based on mRMR and KPCA | |
WO2020119624A1 (en) | Class-sensitive edge detection method based on deep learning | |
CN107766792A (en) | A kind of remote sensing images ship seakeeping method |
Legal Events
Date | Code | Title | Description |
---|---|---|---|
C06 | Publication | ||
PB01 | Publication | ||
C10 | Entry into substantive examination | ||
SE01 | Entry into force of request for substantive examination | ||
RJ01 | Rejection of invention patent application after publication |
Application publication date: 20161116 |
|
RJ01 | Rejection of invention patent application after publication |