CN108364036A - A kind of modeling method, recognition methods, device, storage medium and equipment - Google Patents

A kind of modeling method, recognition methods, device, storage medium and equipment Download PDF

Info

Publication number
CN108364036A
CN108364036A CN201711459765.1A CN201711459765A CN108364036A CN 108364036 A CN108364036 A CN 108364036A CN 201711459765 A CN201711459765 A CN 201711459765A CN 108364036 A CN108364036 A CN 108364036A
Authority
CN
China
Prior art keywords
waybill
written
hand
sample
model
Prior art date
Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
Pending
Application number
CN201711459765.1A
Other languages
Chinese (zh)
Inventor
武晨
Current Assignee (The listed assignees may be inaccurate. Google has not performed a legal analysis and makes no representation or warranty as to the accuracy of the list.)
SF Technology Co Ltd
SF Tech Co Ltd
Original Assignee
SF Technology Co Ltd
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
Filing date
Publication date
Application filed by SF Technology Co Ltd filed Critical SF Technology Co Ltd
Priority to CN201711459765.1A priority Critical patent/CN108364036A/en
Publication of CN108364036A publication Critical patent/CN108364036A/en
Pending legal-status Critical Current

Links

Classifications

    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06VIMAGE OR VIDEO RECOGNITION OR UNDERSTANDING
    • G06V30/00Character recognition; Recognising digital ink; Document-oriented image-based pattern recognition
    • G06V30/10Character recognition
    • G06V30/24Character recognition characterised by the processing or recognition method
    • G06V30/242Division of the character sequences into groups prior to recognition; Selection of dictionaries
    • G06V30/244Division of the character sequences into groups prior to recognition; Selection of dictionaries using graphical properties, e.g. alphabet type or font
    • G06V30/2455Discrimination between machine-print, hand-print and cursive writing
    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06FELECTRIC DIGITAL DATA PROCESSING
    • G06F18/00Pattern recognition
    • G06F18/20Analysing
    • G06F18/21Design or setup of recognition systems or techniques; Extraction of features in feature space; Blind source separation
    • G06F18/214Generating training patterns; Bootstrap methods, e.g. bagging or boosting
    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06NCOMPUTING ARRANGEMENTS BASED ON SPECIFIC COMPUTATIONAL MODELS
    • G06N3/00Computing arrangements based on biological models
    • G06N3/02Neural networks
    • G06N3/04Architecture, e.g. interconnection topology
    • G06N3/045Combinations of networks

Abstract

The invention discloses a kind of modeling method, recognition methods, device, storage medium and equipment, the modeling method includes:Sample is subjected to path scanning, the processing of time recurrent neural network, convolutional neural networks processing, generates full connection layer data;The full connection layer data obtains the model by CTC graders, optimization.The model realization that the identification usage is built with the modeling method, the storage medium and equipment are respectively used to realize the method.The process that the technology of the present invention can replace artificial typewriting record single, reduces a large amount of human capital, and the single accuracy rate of record is greatly improved, and technical solution can not only identify the handwritten text on waybill.

Description

A kind of modeling method, recognition methods, device, storage medium and equipment
Technical field
The present invention relates to handwritten Kanji recognition technical field more particularly to a kind of modeling method, recognition methods, device, deposit Storage media and equipment.
Background technology
The accurate identification to single handwritten Chinese character has may be implemented in current manual's smart field, so by hand-written Address text dividing the identification that full address originally may be implemented is identified at individual Chinese character and successively, still, at present " cutting The technology of Chinese character " can ensure the degree of accuracy there are no developing to, and then cause recognition result accuracy very low.
In addition, express delivery, before being transported on road, each express delivery will undergo the single process of record, it is therefore an objective to will be on waybill The correspondence of Quick Response Code and the hand-written address information of sender be stored in computer, and then transmitting-receiving node later only leads to Quick Response Code is over-scanned to obtain the destination address of the express mail.Moreover, our record list process is all to rely on manually to strike at present Keystroke, in the hand-written address typing computer system on waybill, not only to consume a large amount of human cost in this way, can not also protect The single accuracy of card record.Once the address error of typing, what corresponding express delivery will have no suspense is mailed to the place of mistake.
As stated above, problem of the existing technology is:" technology of cutting Chinese character " there are no develop to ensure just The degree of true rate, and then cause recognition result accuracy very low, and in delivery industry, the single process of artificial typewriting record can consume A large amount of human cost, and the work of uninteresting repetition, can allow manual identified address often to be malfunctioned, and since hand-written address is Do not have spaced Chinese character for a string, while Chinese character is often left and right or up-down structure, so address is cut into one by one Chinese character be the process got half the result with twice the effort.
Invention content
In order to solve above-mentioned deficiency in the prior art, the purpose of the present invention is to provide a kind of modeling method, identification sides Method, device, storage medium and equipment.It improves the identification accuracy of hand-written waybill, improves recognition efficiency.
To achieve the goals above, the technical solution adopted in the present invention is:
A method of hand-written waybill text identification model is established, including:
Sample is subjected to path scanning, the processing of time recurrent neural network, convolutional neural networks processing, generates full articulamentum Data;
The full connection layer data obtains the model by CTC graders, optimization.
The path scanning is that four direction paths are scanned.
The time recurrent neural network processing, convolutional neural networks are handled in the process of implementation, including:
Sample set is equally divided into several sample sets;
Each sample in the sample set, duplicate paths scanning, LSTM processing and convolution summation process several times, Generate full connection layer data.
It is described to optimize in the process of implementation, including:
The full connection layer data is compared and is changed with the truthful data of sample by the data that CTC graders obtain In generation, carries out the optimization of whole network parameter, recycles training set, test set and verification collection, is optimized using gradient descent method, and be based on GPU accelerator iteration carries out model training.
A kind of device for establishing hand-written waybill text identification model, including:
Data generating unit is configured to sample carrying out path scanning, the processing of time recurrent neural network, convolutional Neural Network processes generate full connection layer data;
Optimize unit, is configured to the full connection layer data and obtains the model by CTC graders, optimization.
A kind of equipment, the equipment include:
One or more processors;
Memory, for storing one or more programs,
When one or more of programs are executed by one or more of processors so that one or more of places It manages device and executes the method for establishing hand-written waybill text identification model.
A kind of computer readable storage medium being stored with computer program, when which is executed by processor described in realization The method for establishing hand-written waybill text identification model.
A kind of hand-written waybill text recognition method, including:
The handwritten text region in hand-written waybill to be identified is extracted, several letters are extracted in the handwritten text region Cease picture;
Described information picture is identified by hand-written waybill text identification model, and the model establishes hand using described The method for writing waybill text identification model is established.
A kind of hand-written waybill text identification device, which is characterized in that including:
Extraction unit is configured to extract the handwritten text region in hand-written waybill to be identified, in the handwritten text area Several information pictures are extracted in domain;
Recognition unit is configured to described information picture and is identified by hand-written waybill text identification model, the mould Type is established using the method for establishing hand-written waybill text identification model.
A kind of equipment, the equipment include:
One or more processors;
Memory, for storing one or more programs,
When one or more of programs are executed by one or more of processors so that one or more of places It manages device and executes the hand-written waybill text recognition method.
A kind of computer readable storage medium being stored with computer program, when which is executed by processor described in realization Hand-written waybill text recognition method.
Compared with prior art, the invention has the advantages that:
1, the technical program can not only identify the handwritten text on waybill, by preparing different training samples, the party Case can also be transplanted to the identification of the handwritten Chinese character under other scenes.
2, the single process of traditional artificial typewriting record can consume a large amount of human capitals, and in the work of uninteresting repetition, Manual identified address is often malfunctioned, and the technical program can partly replace artificial typewriting record list by extraction target information picture Process not only reduces a large amount of human capital, but also the single accuracy rate of record is greatly improved.
3, hand-written address be it is a string do not have a spaced Chinese character, while Chinese character is often left and right or up-down structure, institute It is the process got half the result with twice the effort so that address is cut into Chinese character one by one, the technical program has evaded the recognition methods of traditional OCR, A kind of text fragment recognition methods that exempting from cutting used, i.e., need not carry out Chinese character paragraph the cutting of individual Chinese character, therefore The exemplary technical solution of the present invention can ensure Chinese Character Recognition accuracy.
Description of the drawings
Fig. 1 is the method flow diagram for establishing hand-written waybill text identification model shown in the present embodiment.
Fig. 2 is to carry out the structural schematic diagram that four direction scanning obtains one-dimensional sequence to every pictures.
Fig. 3 is the hand-written waybill text recognition method flow chart shown in the present embodiment.
Fig. 4 is the schematic diagram that hand-written address area is extracted from whole Zhang Yundan address informations picture.
Fig. 5 is the effective information region of the waybill of interception.
Fig. 6 is blank waybill.
Fig. 7 is the waybill picture after comparison, counteracting.
Specific implementation mode
The application is described in further detail with reference to the accompanying drawings and examples.It is understood that this place is retouched The specific embodiment stated is used only for explaining related invention, rather than the restriction to the invention.It also should be noted that in order to Convenient for description, is illustrated only in attached drawing and invent relevant part.
It should be noted that in the absence of conflict, the features in the embodiments and the embodiments of the present application can phase Mutually combination.The application is described in detail below with reference to the accompanying drawings and in conjunction with the embodiments.
Embodiment:
As shown in Figure 1, a kind of method for establishing hand-written waybill text identification model, including:
Sample is subjected to path scanning, the processing of time recurrent neural network, convolutional neural networks processing, generates full articulamentum Data.
S01 extracts handwritten Chinese character region from hand-written waybill picture, then extracts from the handwritten Chinese character region several Open information picture;Using information picture as sample, sample set is created.This sample set is made of correspondence a group by a group, That is { (picture 1, the content on picture 1), (picture 2, the content on picture 2) ... }, then by all combinations of these sample sets It is grouped as three sample sets according to a certain percentage, the embodiment of the present invention is according to 8:1:1 ratio is grouped as training set, test Collection and verification collection.
The step S01 includes step:
S011 is named information picture whole in step S01, and each described information picture corresponds to unique life Name.Naming method is:Such as " 00000001.jpg ", " 00000356.jpg " etc., i.e. number total bit 8, insufficient front Zero padding.
S012, using the word sequence of every described information picture as the label of the pictures;
Whole labels is stored in a text file by S013 in sequence, and each label accounts for a line;
S014 creates the sample set of handwritten Kanji recognition using described information picture and its label as sample.
The sample set is equally divided into several sample sets by S02;
S03 carries out path scanning to every information picture in the sample set and obtains corresponding subset with LSTM processing;
S04 carries out convolution and summation process to every pictures in the corresponding subset, obtains fisrt feature image;
S05 carries out fisrt feature image caused by same information picture at path scanning and LSTM successively respectively Reason, convolution and summation process obtain second feature image;
S06 carries out second feature image caused by same information picture at path scanning and LSTM successively respectively Reason, obtains intermediate image subset;
The intermediate image subset is established parameter matrix, then by relevant parameter matrix phase by S07 by full articulamentum Add summation, finally obtains the full connection layer data of every information picture;
S08 carries out parameter normalization processing to each full connection layer data, obtains normalized parameter matrix;
S09 passes through CTC graders to each normalized parameter matrix, obtains feature vector;
Whole sample sets is carried out the processing of S03-S09 by S10, and the result of acquisition and true result are carried out Compare the optimization that simultaneously iteration carries out whole network parameter.
The path scanning and LSTM processing include step:Four kinds of paths are carried out to every pictures and scan (four kinds of paths Scanning is as shown in Figure 2), obtain the scanned picture of four order informations per pictures;Every scanned picture is passed through one layer LSTM layers containing N cores respectively obtain N corresponding pictures.
LSTM (Long-Short Term Memory, shot and long term memory) is a kind of time recurrent neural network.Usually LSTM is suitable for the critical event being spaced in processing and predicted time sequence and delay is very long.
The performance of LSTM usually more preferably than time recurrent neural network and Hidden Markov Model (HMM), for example is used in not On zonal cooling handwriting recognition.2009, the match of ICDAR handwriting recognitions was won with the LSTM artificial nerve network models built Champion.LSTM is also commonly used for autonomous speech recognition, and database of giving a lecture naturally with TIMIT for 2013 reaches 17.7% error rate Record.As nonlinear model, LSTM can be used as complicated non-linear unit for constructing larger deep neural network.
The convolution and summation process include step:Every pictures are by the convolutional layer that one includes M convolution kernel Eigenmatrix is obtained by filtration, the eigenmatrix of the picture corresponding to same information picture is correlation eigen matrix, the correspondence It refers to by same information picture the obtained picture after above-mentioned processing, the correlation eigen matrix is summed simultaneously It carries out nonlinear operation and obtains characteristic image.
In the step S03, N=2;In the step S05, N=10;N=50 in the step S35.
In the step S03, M=6;In the step S05, M=20.
The full connection layer data obtains initial model by CTC graders.
This programme uses recurrent neural network (Recurrent Neural on tensorflow deep learning platforms Network LSTM (the Long-Short Term Memory) algorithms in) and CTC (Connectionist Temporal Classification) method that algorithm combines builds deep learning network.By taking specific implementation mode as an example, initial model is established Comprise the concrete steps that:
A. all sample decompositions are divided into 20 and separately included at the set of several same sizes, such as 10000 pictures Then the set of 500 samples is each gathered while for carrying out parameter operation below.
B. four kinds of path scannings are carried out to each pictures in set, and then obtains four scannings for carrying order information Then all scanned pictures are passed through the LSTM layers of one layer of core containing there are two by picture respectively, have one to carry order information respectively Picture become two corresponding pictures.
C. each set from 500 pictures becomes 4000 pictures (this set is divided into four according to four kinds of scan paths A subclass), these pictures are passed through to a convolutional layer comprising 6 convolution kernels respectively, corresponding eigenmatrix is obtained by filtration, By relevant four eigenmatrixes, (four eigenmatrixes herein are to be directed in four subclass of same information picture relatively The eigenmatrix answered, for example first, second, third, fourth subclass their first figure is exactly so-called relevant four Eigenmatrix) it is summed and carries out nonlinear operation, and then each set is converted into 500*6 characteristic images (first again Characteristic image).
D. it opens image by each group 6 and repeats step b as object, wherein each LSTM layers of check figure becomes 10, then Step c is repeated, wherein the check figure of each convolutional layer is set as 20, each collection credit union obtains 500*20 characteristic images (the at this time Two characteristic images).
E. each group of 20 images being repeated into step b as an object, wherein each LSTM layers of check figure is 50, Then it is one group by every 50 obtained, it is four groups every (to be derived from the image with group by four acquired in four kinds of scanpaths Group) object pass through a full articulamentum, establish 50:Then relevant four groups of additions are summed, are obtained by 3755 parameter matrix Each original image corresponds to a full connection layer parameter.
F. flattening is carried out to the data of each full articulamentum and passes through softmax progress parameter normalizations respectively.
G. each normalized parameter matrix is passed through into a CTC grader, obtains one-dimensional including 3755 variables Feature vector, it illustrates all words shown in this pictures.
H. all pictures are all handled by sequence described above, and the result of all acquisitions and true result are compared Pair and iteration carry out whole network parameter optimization.
Initial model obtains the model by optimization.Network structure is mainly the final one layer of CTC compositions of three layers of LSTM, phase Feature is extracted by one layer of CNN between two layers of LSTM of neighbour and carries out down-sampling.The input of wherein first layer LSTM is by each It opens sample and carries out the sequence composition that four kinds of scanning directions scannings generate, such as Fig. 1.
By in the sample information picture and corresponding label according to 8:1:1 ratio is established training library, test library and is tested Demonstrate,prove library.
The flow of training pattern is to be made iteratively to operate:1. taking a certain amount of training set sample by matching some ginsengs It counts to establish a grader;2. taking model of a certain amount of verification collection sample to study out, the parameter of grader is adjusted;3. taking The recognition capability of a certain amount of trained model of test set test sample is known to decide whether to carry out next iteration to improve Not rate.
Training set, test set and verification collection based on acquisition optimize handwritten Chinese character text identification model using gradient descent method Parameter, and model training is carried out based on GPU accelerator iteration, final obtain establishes hand-written waybill text identification model.
A kind of device for establishing hand-written waybill text identification model, including:
Data generating unit is configured to sample carrying out path scanning, the processing of time recurrent neural network, convolutional Neural Network processes generate full connection layer data;
Optimize unit, is configured to the full connection layer data and obtains the model by CTC graders, optimization.
A kind of equipment, the equipment include:
One or more processors;
Memory, for storing one or more programs,
When one or more of programs are executed by one or more of processors so that one or more of places It manages device and executes the method for establishing hand-written waybill text identification model.
A kind of computer readable storage medium being stored with computer program, when which is executed by processor described in realization The method for establishing hand-written waybill text identification model.
As shown in Figure 3-4, a kind of hand-written waybill text recognition method, includes the following steps:
S1 extracts handwritten Chinese character region from hand-written waybill picture to be identified, then is extracted from the handwritten Chinese character region Go out several information pictures;
The step S1 includes step:
S11 obtains hand-written waybill picture to be identified;
S12 adjusts the hand-written waybill picture to be identified, makes the hand-written waybill picture to be identified and object reference picture Between angle of inclination in preset error range;The object reference picture is by hand-written waybill picture to be identified according to system The picture that predetermined manner is put.The concept at angle of inclination is:Appoint in hand-written waybill picture to be identified and takes (A, B) at 2 points even It is in alignment to be defined as first straight line, since hand-written waybill picture to be identified is that the same picture is with object reference picture Disposing way is different, so can find (A1, B1) with corresponding A in images to be recognized, B, A1 in object reference picture at 2 points It draws a straight line with 2 points of B1 and is defined as target line, which is target line defined in object reference picture, described Angle between first straight line and target line is angle of inclination.
S13 matches adjusted images to be recognized in step S12 with background picture, obtains in images to be recognized Handwritten Chinese character region;The background image is the image behind hand-written waybill picture removal handwriting region to be identified;It will the back of the body In scape image white space (i.e. blank fortune is defined as locking the region in handwritten Chinese character region in hand-written waybill picture to be identified Destination address region in list), the handwritten Chinese character area in target image is gone out using the feature recognition of white space in background image Domain.I.e. according in blank waybill picture the characteristics of destination address region, the characteristics of with destination address region upper and lower, left and right edge It is matched with waybill picture to be identified, to lock the destination address region in waybill.
S14 obtains the picture in effective information region from handwritten Chinese character region;
S15 extracts several information pictures from the picture in the effective information region.
As shown in Fig. 2, adjust the waybill picture, make the bottom edge of the waybill picture with horizontal misalignment angle pre- If error range in;
As shown in fig. 6-7, adjusted waybill picture in step S12 compared with blank waybill picture, offset, obtained Obtain the fill substance in waybill picture.
As shown in figure 5, intercepting the 45%-70% of whole Zhang Yundan pictures using the waybill picture left side edge as initial edge Region obtains the picture in effective information region as effective information region (value 60% when specific implementation).
S15 extracts several information pictures from the picture in the effective information region.
S2, described information picture are identified by hand-written waybill text identification model, and the model is built using described The method for founding hand-written waybill text identification model is established.
The step S2 includes step:
Whole information pictures is sequentially input the hand-written waybill text identification model to be identified;It will be above-mentioned whole Recognition result, which is spliced and rectified a deviation, obtains final recognition result.
A kind of hand-written waybill text identification device, which is characterized in that including:
Extraction unit is configured to extract the handwritten text region in hand-written waybill to be identified, in the handwritten text area Several information pictures are extracted in domain;
Recognition unit is configured to described information picture and is identified by hand-written waybill text identification model, the mould Type is established using the method for establishing hand-written waybill text identification model.
A kind of equipment, the equipment include:
One or more processors;
Memory, for storing one or more programs,
When one or more of programs are executed by one or more of processors so that one or more of places It manages device and executes the hand-written waybill text recognition method.
A kind of computer readable storage medium being stored with computer program, when which is executed by processor described in realization Hand-written waybill text recognition method.
The single process of traditional artificial typewriting record can consume a large amount of human capitals, and in the work of uninteresting repetition, people Work identification address is often malfunctioned, and the process that technical solution of the present invention can partly replace artificial typewriting record single not only reduces a large amount of Human capital, and the single accuracy rate of record is greatly improved.Moreover, hand-written address be it is a string do not have spaced Chinese character, Chinese character is often left and right or up-down structure simultaneously, so it is the mistake got half the result with twice the effort that address, which is cut into Chinese character one by one, Journey, the technical program have evaded the recognition methods of traditional OCR, a kind of text fragment recognition methods for exempting from cutting of use, i.e., not The cutting to Chinese character paragraph progress individual Chinese character is needed, therefore this programme can ensure Chinese Character Recognition accuracy.
By preparing different training samples, the program can also be transplanted to the identification of the handwritten Chinese character under other scenes
Above description is only the preferred embodiment of the application and the explanation to institute's application technology principle.People in the art Member should be appreciated that invention scope involved in the application, however it is not limited to technology made of the specific combination of above-mentioned technical characteristic Scheme, while should also cover in the case where not departing from the inventive concept, it is carried out by above-mentioned technical characteristic or its equivalent feature Other technical solutions of arbitrary combination and formation.Such as features described above has similar work(with (but not limited to) disclosed herein Can technical characteristic replaced mutually and the technical solution that is formed.
In addition to the technical characteristic described in specification, remaining technical characteristic is the known technology of those skilled in the art, is prominent Go out the innovative characteristics of the present invention, details are not described herein for remaining technical characteristic.

Claims (11)

1. a kind of method for establishing hand-written waybill text identification model, which is characterized in that including:
Sample is subjected to path scanning, the processing of time recurrent neural network, convolutional neural networks processing, generates the full connection number of plies According to;
The full connection layer data obtains the model by CTC graders, optimization.
2. according to the method described in claim 1, it is characterized in that, path scanning is the scanning of four direction paths.
3. according to the method described in claim 1, it is characterized in that, time recurrent neural network processing, convolutional Neural net Network is handled in the process of implementation, including:
Sample set is equally divided into several sample sets;
Each sample in the sample set, duplicate paths scanning, LSTM processing and convolution summation process several times, generate Full connection layer data.
4. according to the method described in claim 1, it is characterized in that, it is described optimization in the process of implementation, including:
By the data that the full connection layer data is obtained by CTC graders be compared with the truthful data of sample and iteration into The optimization of row whole network parameter is recycled training set, test set and verification collection, is optimized using gradient descent method, and added based on GPU Fast device iteration carries out model training.
5. a kind of device for establishing hand-written waybill text identification model, which is characterized in that including:
Data generating unit is configured to sample carrying out path scanning, the processing of time recurrent neural network, convolutional neural networks Processing generates full connection layer data;
Optimize unit, is configured to the full connection layer data and obtains the model by CTC graders, optimization.
6. a kind of equipment, which is characterized in that the equipment includes:
One or more processors;
Memory, for storing one or more programs,
When one or more of programs are executed by one or more of processors so that one or more of processors Execute the method as described in any one of claim 1-4.
7. a kind of computer readable storage medium being stored with computer program, which is characterized in that the program is executed by processor Methods of the Shi Shixian as described in any one of claim 1-4.
8. a kind of hand-written waybill text recognition method, which is characterized in that including:
The handwritten text region in hand-written waybill to be identified is extracted, several hum patterns are extracted in the handwritten text region Piece;
Described information picture is identified by hand-written waybill text identification model, and the model is any using claim 1-4 The method is established.
9. a kind of hand-written waybill text identification device, which is characterized in that including:
Extraction unit is configured to extract the handwritten text region in hand-written waybill to be identified, in the handwritten text region Extract several information pictures;
Recognition unit is configured to described information picture and is identified by hand-written waybill text identification model, the model profit It is established with any methods of claim 1-4.
10. a kind of equipment, which is characterized in that the equipment includes:
One or more processors;
Memory, for storing one or more programs,
When one or more of programs are executed by one or more of processors so that one or more of processors Execute method as claimed in claim 8.
11. a kind of computer readable storage medium being stored with computer program, which is characterized in that the program is executed by processor Shi Shixian methods as claimed in claim 8.
CN201711459765.1A 2017-12-28 2017-12-28 A kind of modeling method, recognition methods, device, storage medium and equipment Pending CN108364036A (en)

Priority Applications (1)

Application Number Priority Date Filing Date Title
CN201711459765.1A CN108364036A (en) 2017-12-28 2017-12-28 A kind of modeling method, recognition methods, device, storage medium and equipment

Applications Claiming Priority (1)

Application Number Priority Date Filing Date Title
CN201711459765.1A CN108364036A (en) 2017-12-28 2017-12-28 A kind of modeling method, recognition methods, device, storage medium and equipment

Publications (1)

Publication Number Publication Date
CN108364036A true CN108364036A (en) 2018-08-03

Family

ID=63010481

Family Applications (1)

Application Number Title Priority Date Filing Date
CN201711459765.1A Pending CN108364036A (en) 2017-12-28 2017-12-28 A kind of modeling method, recognition methods, device, storage medium and equipment

Country Status (1)

Country Link
CN (1) CN108364036A (en)

Citations (5)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN105512692A (en) * 2015-11-30 2016-04-20 华南理工大学 BLSTM-based online handwritten mathematical expression symbol recognition method
CN106570456A (en) * 2016-10-13 2017-04-19 华南理工大学 Handwritten Chinese character recognition method based on full-convolution recursive network
CN106991374A (en) * 2017-03-07 2017-07-28 中国矿业大学 Handwritten Digit Recognition method based on convolutional neural networks and random forest
CN107220655A (en) * 2016-03-22 2017-09-29 华南理工大学 A kind of hand-written, printed text sorting technique based on deep learning
CN107239733A (en) * 2017-04-19 2017-10-10 上海嵩恒网络科技有限公司 Continuous hand-written character recognizing method and system

Patent Citations (5)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN105512692A (en) * 2015-11-30 2016-04-20 华南理工大学 BLSTM-based online handwritten mathematical expression symbol recognition method
CN107220655A (en) * 2016-03-22 2017-09-29 华南理工大学 A kind of hand-written, printed text sorting technique based on deep learning
CN106570456A (en) * 2016-10-13 2017-04-19 华南理工大学 Handwritten Chinese character recognition method based on full-convolution recursive network
CN106991374A (en) * 2017-03-07 2017-07-28 中国矿业大学 Handwritten Digit Recognition method based on convolutional neural networks and random forest
CN107239733A (en) * 2017-04-19 2017-10-10 上海嵩恒网络科技有限公司 Continuous hand-written character recognizing method and system

Non-Patent Citations (2)

* Cited by examiner, † Cited by third party
Title
RONALDO MESSINA ET AL.: "Segmentation Handwritten Chinese Text Recognition with LSTM-RNN", 《2015 13TH INTERNATIONAL CONFERENCE ON DOCUMENT ANALYSIS AND RECOGNITION(ICDAR)》 *
金连文 等: "深度学习在手写汉字识别中的应用综述", 《自动化学报》 *

Similar Documents

Publication Publication Date Title
RU2737720C1 (en) Retrieving fields using neural networks without using templates
CN109543690B (en) Method and device for extracting information
CN108364037A (en) Method, system and the equipment of Handwritten Chinese Character Recognition
US10963685B2 (en) Generating variations of a known shred
Davis et al. Text and style conditioned GAN for generation of offline handwriting lines
US20090116755A1 (en) Systems and methods for enabling manual classification of unrecognized documents to complete workflow for electronic jobs and to assist machine learning of a recognition system using automatically extracted features of unrecognized documents
CN110619059B (en) Building marking method based on transfer learning
US20170076152A1 (en) Determining a text string based on visual features of a shred
CN108681735A (en) Optical character recognition method based on convolutional neural networks deep learning model
CN112508011A (en) OCR (optical character recognition) method and device based on neural network
CN110114776A (en) Use the system and method for the character recognition of full convolutional neural networks
Thompson et al. finFindR: Automated recognition and identification of marine mammal dorsal fins using residual convolutional neural networks
CN111767390A (en) Skill word evaluation method and device, electronic equipment and computer readable medium
CN110197140A (en) Material checking method and equipment based on Text region
CN104899551B (en) A kind of form image sorting technique
WO2022134580A1 (en) Method and apparatus for acquiring certificate information, and storage medium and computer device
US20230206676A1 (en) Systems and Methods for Generating Document Numerical Representations
CN108364036A (en) A kind of modeling method, recognition methods, device, storage medium and equipment
Brewer et al. Reading PDFs using Adversarially trained Convolutional Neural Network based optical character recognition
Bhagat et al. Complex document classification and integration with indexing
US20230409644A1 (en) Systems and method for generating labelled datasets
CN112966685B (en) Attack network training method and device for scene text recognition and related equipment
Thambi et al. Offline text document authorization on the basis SIFT and SURF
Grieggs et al. The Paleographer's Eye ex machina: Using Computer Vision To Assist Humanists in Scribal Hand Identification
Koushik et al. Equation Detection in the Camera Captured Handwritten Document

Legal Events

Date Code Title Description
PB01 Publication
PB01 Publication
SE01 Entry into force of request for substantive examination
SE01 Entry into force of request for substantive examination
RJ01 Rejection of invention patent application after publication

Application publication date: 20180803

RJ01 Rejection of invention patent application after publication