CN105447522A - Complex image character identification system - Google Patents
Complex image character identification system Download PDFInfo
- Publication number
- CN105447522A CN105447522A CN201510831651.XA CN201510831651A CN105447522A CN 105447522 A CN105447522 A CN 105447522A CN 201510831651 A CN201510831651 A CN 201510831651A CN 105447522 A CN105447522 A CN 105447522A
- Authority
- CN
- China
- Prior art keywords
- cut
- projection
- module
- subsequent use
- character
- Prior art date
- Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
- Pending
Links
Classifications
-
- G—PHYSICS
- G06—COMPUTING; CALCULATING OR COUNTING
- G06F—ELECTRIC DIGITAL DATA PROCESSING
- G06F18/00—Pattern recognition
- G06F18/20—Analysing
- G06F18/24—Classification techniques
- G06F18/241—Classification techniques relating to the classification model, e.g. parametric or non-parametric approaches
- G06F18/2415—Classification techniques relating to the classification model, e.g. parametric or non-parametric approaches based on parametric or probabilistic models, e.g. based on likelihood ratio or false acceptance rate versus a false rejection rate
- G06F18/24155—Bayesian classification
-
- G—PHYSICS
- G06—COMPUTING; CALCULATING OR COUNTING
- G06V—IMAGE OR VIDEO RECOGNITION OR UNDERSTANDING
- G06V30/00—Character recognition; Recognising digital ink; Document-oriented image-based pattern recognition
- G06V30/10—Character recognition
- G06V30/22—Character recognition characterised by the type of writing
- G06V30/224—Character recognition characterised by the type of writing of printed characters having additional code marks or containing code marks
-
- G—PHYSICS
- G06—COMPUTING; CALCULATING OR COUNTING
- G06V—IMAGE OR VIDEO RECOGNITION OR UNDERSTANDING
- G06V30/00—Character recognition; Recognising digital ink; Document-oriented image-based pattern recognition
- G06V30/10—Character recognition
- G06V30/32—Digital ink
Landscapes
- Engineering & Computer Science (AREA)
- Physics & Mathematics (AREA)
- Theoretical Computer Science (AREA)
- General Physics & Mathematics (AREA)
- Computer Vision & Pattern Recognition (AREA)
- Data Mining & Analysis (AREA)
- Multimedia (AREA)
- Life Sciences & Earth Sciences (AREA)
- Artificial Intelligence (AREA)
- Bioinformatics & Cheminformatics (AREA)
- Bioinformatics & Computational Biology (AREA)
- Evolutionary Biology (AREA)
- Evolutionary Computation (AREA)
- General Engineering & Computer Science (AREA)
- Probability & Statistics with Applications (AREA)
- Character Input (AREA)
Abstract
The invention relates to the image identification field and particularly relates to a complex image character identification system. By aiming at the structure characteristics of the complex test image like the image table, the invention decomposes the complex structure one layer by one layer by firstly detecting and segmenting the table cell and then segmenting and identifying the content of the table cell, achieves the identification effect which makes the hard thing simple and improves the identification accuracy. The invention adopts a Bayes classifier to perform optimization on a character block segmentation position which is found by a projection method, avoids the left-right-part structure Chinese character from being segmented into two parts due to the gap between the left part and the right part of the character, and enables the segmentation of the character block to be complete and accurate. The invention also adopts the nerve network classifier to perform automatic identification on the segmented character, and has high identification efficiency and accurate identification result. In a word, the invention enables the identification of the complex image character to be rapidly realized, has a complete system structure and high identification efficiency, and has great application prospect in the information exploitation and information analysis field.
Description
Technical field
The present invention relates to field of image recognition, particularly a kind of complicated image character identification system.
Background technology
Image recognition technology is very important developing direction in current intelligent identification technology field, the development experience of image recognition three phases: Text region, Digital Image Processing and identification, object identification; Wherein in numerous image recognition technologys, the recognition technology of pictograph is seemed particularly important, this is because pictograph often comprises more importantly available information than simplicial graph picture, Text region is generally identification letter, numbers and symbols, recognize handwriting identification from printing word, application widely.The mathematics essence of problem of image recognition belongs to the mapping problems of model space to classification space.At present, in the development of image recognition, mainly contain three kinds of recognition methodss: statistical-simulation spectrometry, configuration mode identification, Fuzzy Pattern Recognition.
Along with the deep development of pictograph, the pictograph data of increasing kind, be included in the category of pictograph identification, wherein form gathers as lteral data record and simplifies expression way most, or expression format the most frequently used in data statistics, interpretation of result is master tool in various data analysis tool.The popularity that form is applied in all trades and professions is self-evident.Be not difficult to find out the importance for the identification of sheet format pictograph, such as to enterprise's year earning report, most important statistics and analysis result in enterprise's year achievement may be comprised, the importance of these information and generality are that other any materials are not compared, various form information is flooded with in the current network information, but a lot of form is all provide with the form of picture, such as various scanning files, pdf document, automatically these image form informations are identified, the table content of picture/mb-type being reduced into numerical data is the basis of these data being carried out fast processing and analysis.
The complex structure of form is abundant in content, and compared to document picture, require very high to the accurate rate of the identification of form Image, but due to the architectural feature that form information itself is complicated, making the identification of form pictograph more difficult than general normal image written historical materials, in prior art when carrying out the identification of word in image, first needing the character string cutting in image to open, form the little picture comprising single word, then use certain method to identify the word after cutting.And to carry out the most frequently used method of character segmentation be sciagraphy, be namely by after pictograph binary conversion treatment, find the separatrix between two words by vertical projection, according to separatrix, character segmentation is come.Due to the existence of the frame line of form, directly use traditional projection to send out realized character segmentation and identify just unrealistic.Complicated and changeable simultaneously due to this body structure of form, frame contour line complicated and changeable makes sciagraphy be difficult to realize the cutting of form Word message and identification.
Summary of the invention
The object of the invention is to overcome above-mentioned deficiency existing in prior art, a kind of complicated image character identification system is provided, the cell in the image form of labyrinth can be detected fast and accurately; And complete fast on this basis the Word message cutting in the image form of complexity to be identified.First present system detects the cell profile in image form to be identified; Word message in corresponding unit lattice cuts out by the corner apex coordinate corresponding to cell profile; Character segmentation module utilizes sciagraphy to find out the dicing position of character block in cell content on this basis, and utilizes Bayes classifier to be optimized dicing position, and then is cut out successively by character block; The character block cut out is input in Text region module, identifies corresponding word content.Native system is for the architectural feature point of the textual image of this complexity of image form, from first detection and cutting cell to the cutting of cell content and identification, the structure of complexity is successively decomposed, reach recognition effect of simplifying, the identification of complicated pictograph is realized fast, system architecture is complete, and recognition efficiency is high.
In order to realize foregoing invention object, the invention provides following technical scheme:
A kind of complicated image character identification system, comprises image input module, cell detection module, cell cutting module, character segmentation module and Text region module;
Wherein, image form to be identified is input in described cell detection module by image input module, and described cell detection module by the cell contour detecting in image form to be identified out, and is input in described cell cutting module;
Unit lattice content in image form to be identified cuts out by described cell cutting module, forms corresponding sub-pictures;
Input described sub-pictures is wherein cut into character block to be identified by described character segmentation block module;
Described character block is input to the identification completing corresponding content in described Text region module.
Concrete, in this complicated image character identification system, described cell detection module uses the findcontours function in openCV image processing tool to detect image table unit lattice profile.
Described cell cutting module, by calling minAreaRect function, constructs the rotation rectangular area that the area that comprises cell profile point set is minimum, and extracts the corner apex coordinate point of this rotation rectangular area; According to described corner apex coordinate value, the Word message in corresponding unit lattice is cut out as a whole, form corresponding sub-pictures.
Further, described character segmentation module, comprises cut-off detection module for subsequent use and Bayes classifier module; Described cut-off detection module for subsequent use is connected with described Bayes classifier module, and the class categories of described Bayes classifier comprises: word part, digital, alphabetical, punctuation mark.
Described cut-off detection module for subsequent use detects the left and right sides of character block dicing position for subsequent use in sub-pictures by sciagraphy, and testing result is input in described Bayes classifier module, described Bayes classifier module judges the classification of content between the cut-off for subsequent use of character block left and right, is cut out by two, the adjacent left and right character block belonging to word part as a character block.
Described cut-off detection module for subsequent use carries out side projection to sub-pictures, passes through formula:
calculate the number of the non-zero pixel of a line, and be put in projection_y class by this value, wherein i is line number, and j is columns, and pix (i, j) is respective pixel value, and n is the coordinate figure of last row; Element in projection_y class is traveled through; If projection_y [k]=0, projection_y [k+1] > 0, projection_y [k+2] > 0, then judge that k is the initial cut-off of certain a line, and put it in class vector<int>top; If projection_y [k]=0, projection_y [k-1] > 0, projection_y [k-2] > 0, then judge that k is the terminal cut-off of certain a line, and put it in class vector<int>bottom; Based on the element value in class top and bottom by the literal line in cell to cutting out.
Described cut-off detection module for subsequent use carries out vertical projection for the character image of the every a line cut out, and adopts formula:
calculate the number of the non-zero pixel of row, and be put in projection_x class by this value, wherein j is columns, and i is line number, and pix (i, j) is respective pixel value, and m is the coordinate figure of last column; Element in projection_x class is traveled through; If projection_x [k]=0, projection_x [k+1] > 0, projection_x [k+2] > 0, then judge that k is the initial cut-off of a certain character block, and put it in class vector<int>left; If projection_x [k]=0, projection_x [k-1] > 0, projection_x [k-2] > 0, then judge that k is the terminal cut-off of a certain character block, and put it in class vector<int>right; Element value in class left and right is the dicing position for subsequent use of single word.
Described Bayes classifier is after the given first left side for subsequent use cut-off coordinate, judge whether the content between described first left side for subsequent use cut-off and the first right side for subsequent use cut-off is word part, if not word part, then think that this content is numeral, letter or punctuate, directly cut out;
If word part, then continue to judge whether the content between the second left side for subsequent use cut-off and the second right side for subsequent use cut-off is word part, if word part, then the content between the first left side for subsequent use cut-off and the second right side for subsequent use cut-off is cut out.
Further, the process that described Bayes classifier realizes classifying comprises:
To contents extraction eigenwert to be identified;
Eigenwert is inputted in described Bayes classifier, calculates the probability that this eigenwert belongs to of all categories;
Each for content to be identified eigenwert is belonged to such probability multiplication, obtain the probability that this content belongs to such;
Select the classification that generally maximum probable value is corresponding, as the classification of this content;
Wherein, the computing formula that judging characteristic belongs to certain class probable value is: p=(w × ni+1)/(w × nj+q), and wherein w is the proper vector sample size that in Bayes classifier used, three classifications are total; Ni is the number of times that this feature of this object occurs in a certain classification; Nj is proper vector number total in this classification; Q is empirical value.
Further, described Text region module is neural network classifier or is the recognition system based on OCR.
Compared with prior art, beneficial effect of the present invention: the invention provides a kind of complicated image character identification system, native system is for the such complex text of image form word, adopt and first detect cutting cell, again the word content in cell is carried out to the process of cutting, achieve the object of the text structure of complexity successively decomposition, present system detects the cell in image form to be identified by cell detection module, information content entirety in each cell is cut out and forms corresponding sub-pictures, described character segmentation module adopts sciagraphy to find out the cut-off of character block, by cut-off, character block is cut out, in order to improve the accuracy rate of character block cutting, this method, system adopts Bayes classifier to be optimized the cut-off for subsequent use found out by sciagraphy, avoid and the Chinese character of tiled configuration is cut into two-part problem because of the gap existed between the parts of left and right, improve the accuracy rate of character block cutting, and by numeral, letter, punctuation mark cuts out respectively, cutting of classifying so is accurately that the accurate identification of character block content creates basis, adopt the content of neural network classifier to character block to identify on such basis, the accuracy rate of identification is high, quick, complete, ultrahigh in efficiency, and the extensive batch processing for complicated image word is supplied to the processing platform of a high efficient and reliable, in pictograph identification, information excavating, information analysis field has huge application prospect.
Accompanying drawing illustrates:
Fig. 1 is the system architecture schematic diagram of this complicated image character identification system.
Fig. 2 is the image form schematic diagram to be identified of binaryzation.
The cell profile schematic diagram of Fig. 3 for being extracted by the findcontours function in openCV.
Fig. 4 is that scope schematic diagram cut by the unit cell picture in Fig. 3.
The schematic diagram of the unit cell picture that Fig. 5 is syncopated as the cutting scope determined through the cutting of Fig. 4.
Fig. 6 is the cut-off schematic diagram for subsequent use found out by sciagraphy.
Fig. 7 is the deterministic process schematic diagram be optimized by Bayes classifier the cut-off for subsequent use of Fig. 6.
Fig. 8 is the concrete structure schematic diagram of present system.
It should be noted that, institute of the present invention drawings attached is schematically, does not represent actual size and ratio.
Embodiment
Below in conjunction with test example and embodiment, the present invention is described in further detail.But this should be interpreted as that the scope of the above-mentioned theme of the present invention is only limitted to following embodiment, all technology realized based on content of the present invention all belong to scope of the present invention.
The invention provides a kind of complicated image character identification system, the cell in the image form of labyrinth can be detected fast and accurately; And complete fast on this basis the Word message cutting in the image form of complexity to be identified.First present system detects the cell profile in image form to be identified; Word message in corresponding unit lattice cuts out by the corner apex coordinate corresponding to cell profile; Character segmentation module utilizes sciagraphy to find out the dicing position of character block in cell content on this basis, and is cut out successively by character block; The character block cut out is input in Text region module, identifies corresponding word content.Native system is for the architectural feature point of the textual image of this complexity of image form, from first detection and cutting cell to the cutting of cell content and identification, the structure of complexity is successively decomposed, reach recognition effect of simplifying, the identification of complicated pictograph is realized fast, system architecture is complete, and recognition efficiency is high.
In order to realize foregoing invention object, the invention provides following technical scheme:
A kind of complicated image character identification system, as shown in Figure 1: comprise image input module, cell detection module, cell cutting module, character segmentation module and Text region module;
Wherein, image form to be identified is input in described cell detection module by image input module, and described cell detection module by the cell contour detecting in image form to be identified out, and is input in described cell cutting module;
Unit lattice content in image form to be identified cuts out by described cell cutting module, forms corresponding sub-pictures;
Input described sub-pictures is wherein cut into character block to be identified by described character segmentation block module;
Described character block is input to the identification completing corresponding content in described Text region module.
Concrete, in this complicated image character identification system, described image input module can be scanner, the image acquisition such as digital camera or memory device, the a large amount of pictograph collected is input in present system by described image input module, identifies.
Described cell detection module, after image form to be identified is carried out binary conversion treatment, uses the findcontours function in openCV image processing tool to detect image table unit lattice profile.The point set of the cell profile detected is extracted and is placed in corresponding some class (as: with Vector1, Vector2, Vector3 ... represent), openCV is a kind of image processing tool efficiently, a lot of simple image processing function is efficiently comprised in openCV, wherein findcontours function can according to the feature of Form Frame Line profile by the contour detecting of cell out, and the cell profile point set detected is extracted, detection efficiency is high, Fig. 2 is an exemplary plot of pending image form, pending picture is input in cell detection module by described image input module, after image is carried out binary conversion treatment by described cell detection module, call the outline line of the table cell to be identified that findcontours function is found out as shown in Figure 3.
Described cell cutting module is by calling minAreaRect function (minAreaRect function is also a function in openCV), construct the rotation rectangular area that the area that comprises cell profile point set is minimum, and extract the corner apex coordinate point of this rotation rectangular area; Cut out as a whole by Word message in corresponding unit lattice according to described corner apex coordinate value, form corresponding sub-pictures, the cell picture cut out as shown in Figure 4.When carrying out cell content cutting, should first the Form Frame Line of cell be got rid of, described cell cutting module moves a certain distance determined new region by corresponding to the direction in rectangular area for corner apex coordinate; The scope covered by new region carrys out cutting cell content, just the frame of cell can be cut away, leave the sub-pictures only comprising the inner Word message of cell word, as shown in Figure 5, concrete process repeats no more the sub-pictures that the cell shown in Fig. 4 is formed through cutting.Further, described character segmentation module, comprises cut-off detection module for subsequent use and Bayes classifier module; Described cut-off detection module for subsequent use is connected with described Bayes classifier module, and the class categories of described Bayes classifier comprises: word part, digital, alphabetical, punctuation mark.
Described cut-off detection module for subsequent use detects the left and right sides of character block dicing position for subsequent use in sub-pictures by sciagraphy, and testing result is input in described Bayes classifier module, described cut-off detection module for subsequent use carries out side projection to sub-pictures, passes through formula:
calculate the number of the non-zero pixel of a line, and be put in projection_y class by this value, wherein i is line number, and j is columns, and pix (i, j) is respective pixel value, and n is the coordinate figure of last row; Element in projection_y class is traveled through; If projection_y [k]=0, projection_y [k+1] > 0, projection_y [k+2] > 0, then judge that k is the initial cut-off of certain a line, and put it in class vector<int>top; If projection_y [k]=0, projection_y [k-1] > 0, projection_y [k-2] > 0, then judge that k is the terminal cut-off of certain a line, and put it in class vector<int>bottom; Based on the element value in class top and bottom by the literal line in cell to cutting out.
Described cut-off detection module for subsequent use carries out vertical projection for the character image of the every a line cut out, and adopts formula:
calculate the number of the non-zero pixel of row, and be put in proiection_x class by this value, wherein j is columns, and i is line number, and pix (i, j) is respective pixel value, and m is the coordinate figure of last column; Element in projection_x class is traveled through; If projection_x [k]=0, projection_x [k+1] > 0, projection_x [k+2] > 0, then judge that k is the initial cut-off of a certain character block, and put it in class vector<int>left; If projection_x [k]=0, projection_x [k-1] > 0, projection_x [k-2] > 0, then judge that k is the terminal cut-off of a certain character block, and put it in class vector<int>right; Element value in class left and right is the dicing position for subsequent use of single word.
The coordinate position of each character block just can be determined according to transverse projection and longitudinal projection's coordinate; Single character block can be cut out according to coordinate position.But such situation likely appears in sciagraphy: when a word is the word of tiled configuration, because the gap existed between tiled configuration parts, each word part may be cut out separately by mistake, have impact on the accuracy of Text region.In order to improve the accuracy of character block cutting, further calculate in cut-off information input Bayes classifier for subsequent use; Described Bayes classifier module judges the classification of content between the cut-off for subsequent use of character block left and right, by two, the adjacent left and right character block belonging to word part integrally character block cut out.
Described Bayes classifier is after the given first left side for subsequent use cut-off coordinate, judge whether the content between described first left side for subsequent use cut-off and the first right side for subsequent use cut-off is word part, if not word part, then think that this content is numeral, letter or punctuate, directly cut out;
If word part, then continue to judge whether the content between the second left side for subsequent use cut-off and the second right side for subsequent use cut-off is word part, if word part, then the content between the first left side for subsequent use cut-off and the second right side for subsequent use cut-off is cut out.
Further, the process that described Bayes classifier realizes classifying comprises:
To contents extraction eigenwert to be identified;
Eigenwert is inputted in described Bayes classifier, calculates the probability that this eigenwert belongs to of all categories;
Each for content to be identified eigenwert is belonged to such probability multiplication, obtain the probability that this content belongs to such;
Select the classification that generally maximum probable value is corresponding, as the classification of this content;
Wherein, the computing formula that judging characteristic belongs to certain class probable value is: p=(w × ni+1)/(w × nj+q), and wherein w is the proper vector sample size that in Bayes classifier used, three classifications are total; Ni is the number of times that this feature of this object occurs in a certain classification; Nj is proper vector number total in this classification; Q is empirical value.In the sorter sample that present system is taked, the eigenwert selected comprises: depth-width ratio, coverage rate, vertical center line passes through stroke number, vertical center line passes through stroke number upper and lower two ends ultimate range and ratio highly, horizontal center line passes through stroke number, and horizontal center line passes through the ratio of stroke number left and right two ends ultimate range and width.
Concrete, the width of character block and the threshold value of setting compare after finding out the left and right cut-off for subsequent use of character block by invention system, if the character block between this left and right cut-off just can directly cut out by the threshold value being greater than setting.This is because the Chinese character of type-script under normal circumstances, the width of single character is relatively homogeneous, and be greater than the width of word part, numeral, letter or punctuation mark, when carrying out character segmentation, first can be compared by the threshold value of the width between cut-off and setting, the word between adjacent left and right cut-off larger for width is cut out as single character block.
For the character block that left and right cut-off width is less, several situation may be there is, such as may be word part, numeral, letter or punctuate; Accurately can not judge which kind of type this block belongs to only by width, if be not optimized the left and right parts of the word of tiled configuration may be cut out respectively because of the gap of middle existence cut-off, have impact on the recognition effect of word.Present system utilizes Bayes classifier to judge the content between the cut-off for subsequent use of the left and right sides, on the basis of cut-off for subsequent use, finds out optimization dicing position, improves the accuracy rate of cutting.When carrying out cut-off optimization, (the wherein schematic diagram of for subsequent use cut-off of Fig. 6 for finding out through sciagraphy as shown in Figure 6, Figure 7, A, B, C, D are the left side for subsequent use cut-off of corresponding character block, the right side for subsequent use cut-off that A ', B ', C ', D ' are corresponding character block, be only and schematically illustrate, do not represent the true cut-off position of character block; Fig. 7 uses Bayes classifier to be optimized cut-off for subsequent use and content is carried out the process schematic of classification cutting), after fixing left side cutting coordinate (the first left prepped side cut-off A), the content between Bayes classifier pair and its right side cut-off for subsequent use (the first right prepped side cut-off A ') being close to thereafter is used to classify; If numeral, letter or punctuation mark, then it is directly cut out according to the type of sorter identification, if word part, then judgement position is moved on to next left side for subsequent use dicing position (the second left side cut-off B for subsequent use), judge this left side for subsequent use cut-off and and its right side for subsequent use cut-off (the second right side cut-off B ' for subsequent use) immediately between content whether be word part, if word part, then the character block between the first left side cut-off A for subsequent use and the second right side cut-off B ' for subsequent use is integrally cut out; Doing so avoids and become two parts to carry out situation about identifying the Chinese character segmentation of tiled configuration, ensure the integrality of character segmentation.
Further, as shown in Figure 8, described Text region module is neural network classifier or is the recognition system based on OCR.The wherein application of neural network at present in voice and image recognition very extensive, to the recognition technology relative maturity of the picture of the word segmented, neural network is similar to the neural training of human brain and learning process, can the feature of learning sample, pattern and rule, after needing to build neural network according to identification, prepare some training samples according to the complexity identified and be input to the training carrying out neural network in neural network, before these training samples are input to neural network, need manually to mark this training sample, the selection of training sample will have influence on the recognition result of neural network, the object identified in the present invention is image form, abundant Chinese character is comprised in image form, numeral, letter and symbol, the kind of the character set comprised and quantity are all abundanter, neural network of the present invention can select the sample character set consistent with image form character set to be identified, such as the Chinese Character Set comprised in form Image to be identified is at about 2000, set of digits 0-9, glossary of symbols comprises punctuation mark, mathematical formulae symbol, such as branch, percentage sign, various measurement unit symbol, the sample set then selected when carrying out neural metwork training also should comprise these Chinese characters accordingly, numbers and symbols, the Output rusults of such guarantee character block identification is correct.Neural network has adaptive error transfer factor ability, can according to regulative modes such as error back propagations, constantly reduce the difference between learning outcome and annotation results, last progressively level off to stable correct identification direction, after neural metwork training completes, the test sample book (development sample) of some is input in neural network, the accuracy of the input results of test neural network, when accuracy reaches the threshold value of setting, can think that the training of neural network completes, after neural metwork training is good, the character block picture segmented is input in neural network, complete the identifying of picture.
Claims (10)
1. a complicated image character identification system, is characterized in that: comprise image input module, cell detection module, cell cutting module, character segmentation module and Text region module;
Wherein, image form to be identified is input in described cell detection module by image input module, and described cell detection module by the cell contour detecting in image form to be identified out, and is input in described cell cutting module;
Unit lattice content in image form to be identified cuts out by described cell cutting module, forms corresponding sub-pictures;
Input described sub-pictures is wherein cut into character block to be identified by described character segmentation block module;
Described character block is input to the identification completing corresponding content in described Text region module.
2. the system as claimed in claim 1, is characterized in that: described cell detection module uses the findcontours function in openCV image processing tool to carry out detected image table cell profile.
3. system as claimed in claim 2, it is characterized in that: described cell cutting module is by calling minAreaRect function, construct the rotation rectangular area that the area that comprises cell profile point set is minimum, and extract the corner apex coordinate point of this rotation rectangular area; According to described corner apex coordinate value, the Word message in corresponding unit lattice is cut out as a whole, form corresponding sub-pictures.
4. system as claimed in claim 3, is characterized in that: described character segmentation module comprises cut-off detection module for subsequent use and Bayes classifier module; Described cut-off detection module for subsequent use is connected with described Bayes classifier module, and the class categories of described Bayes classifier comprises: word part, digital, alphabetical, punctuation mark.
5. system as claimed in claim 4, it is characterized in that: described cut-off detection module for subsequent use detects the left and right sides of character block dicing position for subsequent use in sub-pictures by sciagraphy, and testing result is input in described Bayes classifier module, described Bayes classifier module judges the classification of content between the cut-off for subsequent use of character block left and right, is cut out by two, the adjacent left and right character block belonging to word part as a character block.
6. system as claimed in claim 5, is characterized in that: described cut-off detection module for subsequent use carries out side projection to sub-pictures, passes through formula:
calculate the number of the non-zero pixel of a line, and be put in projection_y class by this value, wherein i is line number, and j is columns, and pix (i, j) is respective pixel value, and n is the coordinate figure of last row; Element in projection_y class is traveled through; If projection_y [k]=0, projection_y [k+1] > 0, projection_y [k+2] > 0, then judge that k is the initial cut-off of certain a line, and put it in class vector<int > top;
If projection_y [k]=0, projection_y [k-1] > 0, projection_y [k-2] > 0, then judge that k is the terminal cut-off of certain a line, and put it in class vector<int>bottom;
Based on the element value in class top and bottom by the literal line in cell to cutting out.
7. system as claimed in claim 6, is characterized in that: described cut-off detection module for subsequent use carries out vertical projection for the character image of the every a line cut out, and adopts formula:
calculate the number of the non-zero pixel of row, and be put in projection_x class by this value, wherein j is columns, and i is line number, and pix (i, j) is respective pixel value, and m is the coordinate figure of last column; Element in projection_x class is traveled through; If projection_x [k]=0, projection_x [k+1] > 0, projection_x [k+2] > 0, then judge that k is the initial cut-off of a certain character block, and put it in class vector<int>left;
If projection_x [k]=0, projection_x [k-1] > 0, projection_x [k-2] > 0, then judge that k is the terminal cut-off of a certain character block, and put it in class vector<int>right;
Element value in class left and right is the dicing position for subsequent use of single word.
8. system as claimed in claim 7, it is characterized in that: described Bayes classifier is after the given first left side for subsequent use cut-off coordinate, judge whether the content between described first left side for subsequent use cut-off and the first right side for subsequent use cut-off is word part, if not word part, then think that this content is numeral, letter or punctuate, directly cut out;
If word part, then continue to judge whether the content between the second left side for subsequent use cut-off and the second right side for subsequent use cut-off is word part, if word part, then the content between the first left side for subsequent use cut-off and the second right side for subsequent use cut-off is cut out.
9. system as claimed in claim 8, is characterized in that: the process that described Bayes classifier realizes classification comprises:
To contents extraction eigenwert to be identified;
Eigenwert is inputted in described Bayes classifier, calculates the probability that this eigenwert belongs to of all categories;
Each for content to be identified eigenwert is belonged to such probability multiplication, obtain the probability that this content belongs to such;
Select the classification that generally maximum probable value is corresponding, as the classification of this content;
Wherein, the computing formula that judging characteristic belongs to certain class probable value is: p=(w × ni+1)/(w × nj+q), and wherein w is the proper vector sample size that in Bayes classifier used, three classifications are total; Ni is the number of times that this feature of this object occurs in a certain classification; Nj is proper vector number total in this classification; Q is empirical value.
10. the system as described in one of claim 1 to 9, is characterized in that: described Text region module is neural network classifier.
Priority Applications (1)
Application Number | Priority Date | Filing Date | Title |
---|---|---|---|
CN201510831651.XA CN105447522A (en) | 2015-11-25 | 2015-11-25 | Complex image character identification system |
Applications Claiming Priority (1)
Application Number | Priority Date | Filing Date | Title |
---|---|---|---|
CN201510831651.XA CN105447522A (en) | 2015-11-25 | 2015-11-25 | Complex image character identification system |
Publications (1)
Publication Number | Publication Date |
---|---|
CN105447522A true CN105447522A (en) | 2016-03-30 |
Family
ID=55557681
Family Applications (1)
Application Number | Title | Priority Date | Filing Date |
---|---|---|---|
CN201510831651.XA Pending CN105447522A (en) | 2015-11-25 | 2015-11-25 | Complex image character identification system |
Country Status (1)
Country | Link |
---|---|
CN (1) | CN105447522A (en) |
Cited By (16)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
CN106682671A (en) * | 2016-12-29 | 2017-05-17 | 成都数联铭品科技有限公司 | Image character recognition system |
CN106778759A (en) * | 2016-12-29 | 2017-05-31 | 成都数联铭品科技有限公司 | For the feature image automatic creation system of pictograph identification |
CN109284712A (en) * | 2018-09-20 | 2019-01-29 | 浙江口碑网络技术有限公司 | The configuration method and device of merchandise news |
CN109492549A (en) * | 2018-10-24 | 2019-03-19 | 杭州睿琪软件有限公司 | A kind of processing of training sample set, model training method and system |
CN109522805A (en) * | 2018-10-18 | 2019-03-26 | 成都中科信息技术有限公司 | A kind of form processing method for Form ballot paper in community election |
CN109614923A (en) * | 2018-12-07 | 2019-04-12 | 上海智臻智能网络科技股份有限公司 | The recognition methods of OCR document and its device |
CN109871910A (en) * | 2019-03-12 | 2019-06-11 | 成都工业学院 | A kind of hand-written character recognition method and device |
CN109981920A (en) * | 2017-12-28 | 2019-07-05 | 夏普株式会社 | Image processing apparatus, image processing program, image processing method and image forming apparatus |
CN109993112A (en) * | 2019-03-29 | 2019-07-09 | 杭州睿琪软件有限公司 | The recognition methods of table and device in a kind of picture |
CN110188758A (en) * | 2019-06-03 | 2019-08-30 | 中国热带农业科学院农业机械研究所 | Area-of-interest acquisition methods, device, equipment and storage medium in sugarcane image |
CN110569835A (en) * | 2018-06-06 | 2019-12-13 | 北京搜狗科技发展有限公司 | Image identification method and device and electronic equipment |
CN110942004A (en) * | 2019-11-20 | 2020-03-31 | 深圳追一科技有限公司 | Handwriting recognition method and device based on neural network model and electronic equipment |
CN111507250A (en) * | 2020-04-16 | 2020-08-07 | 北京世纪好未来教育科技有限公司 | Image recognition method, device and storage medium |
CN112364790A (en) * | 2020-11-16 | 2021-02-12 | 中国民航大学 | Airport work order information identification method and system based on convolutional neural network |
CN114973291A (en) * | 2022-07-28 | 2022-08-30 | 北京和人广智科技有限公司 | Text line image text segmentation method and device |
CN115063667A (en) * | 2022-05-12 | 2022-09-16 | 吉林省吉林祥云信息技术有限公司 | Parallel recognition processing method for document scanning PDF file |
Citations (4)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
CN103488986A (en) * | 2013-09-18 | 2014-01-01 | 西安理工大学 | Method for segmenting and extracting characters in self-adaptation mode |
CN104484643A (en) * | 2014-10-27 | 2015-04-01 | 中国科学技术大学 | Intelligent identification method and system for hand-written table |
JP2015115025A (en) * | 2013-12-16 | 2015-06-22 | 三菱電機株式会社 | Document identification device and document identification method |
CN105160342A (en) * | 2015-08-11 | 2015-12-16 | 成都数联铭品科技有限公司 | HMM-GMM-based automatic word picture splitting method and system |
-
2015
- 2015-11-25 CN CN201510831651.XA patent/CN105447522A/en active Pending
Patent Citations (4)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
CN103488986A (en) * | 2013-09-18 | 2014-01-01 | 西安理工大学 | Method for segmenting and extracting characters in self-adaptation mode |
JP2015115025A (en) * | 2013-12-16 | 2015-06-22 | 三菱電機株式会社 | Document identification device and document identification method |
CN104484643A (en) * | 2014-10-27 | 2015-04-01 | 中国科学技术大学 | Intelligent identification method and system for hand-written table |
CN105160342A (en) * | 2015-08-11 | 2015-12-16 | 成都数联铭品科技有限公司 | HMM-GMM-based automatic word picture splitting method and system |
Non-Patent Citations (1)
Title |
---|
仲小挺: "基于自学习的手写表格数字字符串快速识别方法的研究", 《中国优秀硕士学位论文全文数据库 信息科技辑》 * |
Cited By (24)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
CN106778759A (en) * | 2016-12-29 | 2017-05-31 | 成都数联铭品科技有限公司 | For the feature image automatic creation system of pictograph identification |
CN106682671A (en) * | 2016-12-29 | 2017-05-17 | 成都数联铭品科技有限公司 | Image character recognition system |
CN109981920B (en) * | 2017-12-28 | 2021-10-01 | 夏普株式会社 | Image processing apparatus, image processing program, image processing method, and image forming apparatus |
CN109981920A (en) * | 2017-12-28 | 2019-07-05 | 夏普株式会社 | Image processing apparatus, image processing program, image processing method and image forming apparatus |
CN110569835A (en) * | 2018-06-06 | 2019-12-13 | 北京搜狗科技发展有限公司 | Image identification method and device and electronic equipment |
CN110569835B (en) * | 2018-06-06 | 2024-03-05 | 北京搜狗科技发展有限公司 | Image recognition method and device and electronic equipment |
CN109284712A (en) * | 2018-09-20 | 2019-01-29 | 浙江口碑网络技术有限公司 | The configuration method and device of merchandise news |
CN109284712B (en) * | 2018-09-20 | 2021-04-27 | 浙江口碑网络技术有限公司 | Configuration method and device of commodity information |
CN109522805A (en) * | 2018-10-18 | 2019-03-26 | 成都中科信息技术有限公司 | A kind of form processing method for Form ballot paper in community election |
CN109522805B (en) * | 2018-10-18 | 2020-12-08 | 成都中科信息技术有限公司 | Form processing method for form vote in community election |
CN109492549A (en) * | 2018-10-24 | 2019-03-19 | 杭州睿琪软件有限公司 | A kind of processing of training sample set, model training method and system |
CN109614923A (en) * | 2018-12-07 | 2019-04-12 | 上海智臻智能网络科技股份有限公司 | The recognition methods of OCR document and its device |
CN109871910A (en) * | 2019-03-12 | 2019-06-11 | 成都工业学院 | A kind of hand-written character recognition method and device |
CN109871910B (en) * | 2019-03-12 | 2021-06-22 | 成都工业学院 | Handwritten character recognition method and device |
CN109993112A (en) * | 2019-03-29 | 2019-07-09 | 杭州睿琪软件有限公司 | The recognition methods of table and device in a kind of picture |
CN110188758A (en) * | 2019-06-03 | 2019-08-30 | 中国热带农业科学院农业机械研究所 | Area-of-interest acquisition methods, device, equipment and storage medium in sugarcane image |
CN110942004A (en) * | 2019-11-20 | 2020-03-31 | 深圳追一科技有限公司 | Handwriting recognition method and device based on neural network model and electronic equipment |
CN111507250A (en) * | 2020-04-16 | 2020-08-07 | 北京世纪好未来教育科技有限公司 | Image recognition method, device and storage medium |
CN112364790B (en) * | 2020-11-16 | 2022-10-25 | 中国民航大学 | Airport work order information identification method and system based on convolutional neural network |
CN112364790A (en) * | 2020-11-16 | 2021-02-12 | 中国民航大学 | Airport work order information identification method and system based on convolutional neural network |
CN115063667A (en) * | 2022-05-12 | 2022-09-16 | 吉林省吉林祥云信息技术有限公司 | Parallel recognition processing method for document scanning PDF file |
CN115063667B (en) * | 2022-05-12 | 2024-01-02 | 吉林省吉林祥云信息技术有限公司 | Parallel identification processing method for document scanning PDF (Portable document Format) file |
CN114973291A (en) * | 2022-07-28 | 2022-08-30 | 北京和人广智科技有限公司 | Text line image text segmentation method and device |
CN114973291B (en) * | 2022-07-28 | 2022-11-04 | 北京和人广智科技有限公司 | Text line image character segmentation method and device |
Similar Documents
Publication | Publication Date | Title |
---|---|---|
CN105447522A (en) | Complex image character identification system | |
CN105512611A (en) | Detection and identification method for form image | |
CN109902622B (en) | Character detection and identification method for boarding check information verification | |
CN109726643B (en) | Method and device for identifying table information in image, electronic equipment and storage medium | |
CN105426856A (en) | Image table character identification method | |
CN106447658B (en) | Conspicuousness object detection method based on global and local convolutional network | |
CN109948510B (en) | Document image instance segmentation method and device | |
CN107093172B (en) | Character detection method and system | |
CN103942550B (en) | A kind of scene text recognition methods based on sparse coding feature | |
US20200311460A1 (en) | Character identification method and device | |
CN111428723B (en) | Character recognition method and device, electronic equipment and storage medium | |
CN102332096B (en) | Video caption text extraction and identification method | |
CN103049763B (en) | Context-constraint-based target identification method | |
CN105469053A (en) | Bayesian optimization-based image table character segmentation method | |
CN105469047A (en) | Chinese detection method based on unsupervised learning and deep learning network and system thereof | |
CN111950528B (en) | Graph recognition model training method and device | |
CN106203539B (en) | Method and device for identifying container number | |
CN105608454A (en) | Text structure part detection neural network based text detection method and system | |
CN113537227B (en) | Structured text recognition method and system | |
CN113591866B (en) | Special operation certificate detection method and system based on DB and CRNN | |
Salvi et al. | Handwritten text segmentation using average longest path algorithm | |
CN106682671A (en) | Image character recognition system | |
CN109800756A (en) | A kind of text detection recognition methods for the intensive text of Chinese historical document | |
CN113221711A (en) | Information extraction method and device | |
Kumar | Performance comparison of features on Devanagari hand-printed dataset |
Legal Events
Date | Code | Title | Description |
---|---|---|---|
C06 | Publication | ||
PB01 | Publication | ||
C10 | Entry into substantive examination | ||
SE01 | Entry into force of request for substantive examination | ||
WD01 | Invention patent application deemed withdrawn after publication |
Application publication date: 20160330 |
|
WD01 | Invention patent application deemed withdrawn after publication |