CN108595544A

CN108595544A - A kind of document picture classification method

Info

Publication number: CN108595544A
Application number: CN201810309072.2A
Authority: CN
Inventors: 赖荣凤; 黄贤俊
Original assignee: Shenzhen Yuan Heng Technology Co Ltd
Current assignee: Shenzhen Yuan Heng Technology Co Ltd
Priority date: 2018-04-09
Filing date: 2018-04-09
Publication date: 2018-09-28

Abstract

The invention discloses a kind of document picture classification methods, use object detection method, visually judge whether identity card occur in picture, bank card, driving license, driver's license, passport, business license etc. has very strong feature, the Doctype having a long way to go between classification, the method of target detection can fast and accurately handle the document picture of these classifications, the document picture of other corresponding classifications, picture is first converted into word with the Text region algorithm based on deep neural network, then the word for sorting out identification is handled using file classification method, the method of text classification can distinguish nuance, accuracy rate is high.

Description

A kind of document picture classification method

Technical field

The present invention relates to a kind of sorting technique, specifically a kind of document picture classification method.

Background technology

Insurance company is when establishing declaration form archives, to need to compile a large amount of document, and the management that classifies stores.With Digitized revolution, current all documents are required for shooting at digital picture.The present invention is exactly for these document pictures Automatic classification.The common document classification of insurance company is more than hundreds of, and the gap between some classifications is also very small, such as：Outpatient service The difference of invoice and in hospital invoice is withdrawn deposit not often between several different words.Document classification is more, and difference is small between classification, leads Cause this task extremely difficult.For this purpose, combination picture classification, target detection, Text region and the text classification etc. of our creativeness Method obtains high classification accuracy.

The defect of the prior art

1. the method for picture classification：It is achieved currently based on the picture classification method of depth convolutional neural networks prodigious prominent It is broken, the level of the mankind has even been surmounted in the task of some picture classifications.But existing picture classification technology is for spy It seeks peace the classification of significant difference, such as：Distinguish cat and dog, the accuracy rate that it can not also be determined on sophisticated category.Thus, Existing picture classification technology can not accurately distinguish the small Doctype of certain difference.

2. the method for target detection：The method of target detection based on deep learning has good standard under general task True rate.Such as：It can accurately judge whether there is the targets such as identity card, bank card from document picture.However, in face of subtle The outpatient service invoice of difference and invoice, object detection method are also helpless in hospital.

3. the method for text classification：The method development with a long history of text classification also will be ripe, can distinguish subtle text Word difference.But it cannot be used directly for the classification of document picture.

Invention content

The purpose of the present invention is to provide a kind of document picture classification methods, to solve mentioned above in the background art ask Topic.

To achieve the above object, the present invention provides the following technical solutions：

A kind of document picture classification method, includes the following steps：(1) it is examined from document picture with algorithm of target detection first Survey identity card, bank card, driver's license, driving license, business license, working qualification card, Road Transportation demonstrate,prove certificate, if detection at Work(then directly differentiates document classification；(2) if detection failure, enters the process flow with text classification：2.1 are examined with word Method of determining and calculating detected the location information of the text strings in picture；2.2 texts that detected using Text region Model Identification Word string, then all text strings are combined into document by there is sequence of positions；2.3 use Algorithm of documents categorization, and identification document is returned Class, the category, that is, document picture generic.

As a further solution of the present invention：The algorithm of target detection includes Faster RCNN, SSD, YOLO.

As a further solution of the present invention：The text detection algorithm can either use general algorithm of target detection, Also the algorithm after optimizing exclusively for text detection can be used.

As a further solution of the present invention：The general algorithm of target detection, including：Faster RCNN、SSD、 YOLO。

As further scheme of the invention：Algorithm after the optimization exclusively for text detection, including：EAST、 RRCNN、TextBoxes、CTPN。

Compared with prior art, the beneficial effects of the invention are as follows：The present invention uses object detection method, visually judges Whether occurring identity card, bank card, driving license, driver's license, passport, business license etc. in picture has very strong feature, between classification The Doctype having a long way to go, the method for target detection can fast and accurately handle the document picture of these classifications, correspond to it Picture is first converted into word with the Text region algorithm based on deep neural network, then used by the document picture of his classification File classification method sorts out the word of identification to handle, and the method for text classification can distinguish nuance, and accuracy rate is high.

Description of the drawings

Fig. 1 is the flow chart of document picture classification method.

Specific implementation mode

Following will be combined with the drawings in the embodiments of the present invention, and technical solution in the embodiment of the present invention carries out clear, complete Site preparation describes, it is clear that described embodiments are only a part of the embodiments of the present invention, instead of all the embodiments.It is based on Embodiment in the present invention, it is obtained by those of ordinary skill in the art without making creative efforts every other Embodiment shall fall within the protection scope of the present invention.

Referring to Fig. 1, in the embodiment of the present invention, a kind of document picture classification method includes the following steps：(1) it uses first Algorithm of target detection detected from document picture identity card, bank card, driver's license, driving license, business license, working qualification card, Road Transportation demonstrate,proves certificate, if detecting successfully, directly differentiates document classification；(2) if detection failure, entering has text The process flow of classification：2.1 detected the location information of the text strings in picture with text detection algorithm；2.2 use text The text strings that word identification model recognition detection comes out, then all text strings are combined into document by there is sequence of positions；2.3 using Algorithm of documents categorization will identify document classification, the category, that is, document picture generic.

The algorithm of target detection includes Faster RCNN, SSD, YOLO.

Below its principle is illustrated by example of Faster RCNN：

1) depth convolutional neural networks (conv layers) extraction picture abstract characteristics (feature maps)；

2) using area candidate network recommended candidate certificate region (proposal generator)；

3) the accurate region (Box Classifier) of certificate is returned from candidate region.

The text detection algorithm can either use general algorithm of target detection, can also use exclusively for text detection Algorithm after optimization, the general algorithm of target detection, including：Faster RCNN, SSD, YOLO, it is described exclusively for text Algorithm after word inspection optimization, including：EAST、RRCNN、TextBoxes、CTPN.

It is row that this, which sentences EAST, illustrates how to detect text strings from picture：

1) abstract characteristics for first using convolutional neural networks extraction picture, can use PVANet, MobileNet herein, The arbitrary convolutional neural networks such as VGG, ResNet.Pay attention to preserving each layer of feature：F1, f2, f3, f4；

2) each layer output feature is up-sampled using transposition convolution technique, and splices convolutional layer feature, obtain h1, H2, h3, h4；

3) it after above-mentioned two step, then carries out a convolution and obtains：Score map, text boxes or text Quadrangle coordinates

4) non-maxima suppression algorithm (NMS) screening is used to be most likely to be the region of text strings.

Text region algorithm combines depth convolutional neural networks and Recognition with Recurrent Neural Network, realizes that picture turns to word It changes.Its algorithm principle and steps are as follows：

1) convolutional network is used to extract ear tag picture feature；

2) the bidirectional circulating neural network for constituting features described above input LSTM；

3) CTC algorithms are used to merge reduplicated word and placeholder, the maximum word sequence of output probability；

The method of text classification has very much.In general, it can undergo：Text segments, and term vector indicates, the steps such as document representation Suddenly.Thereafter, text classification can be carried out using arbitrary sorting technique.Such as：Support vector machines (SVM), naive Bayesian Grader, K- neighbours (KNN), decision tree, random forest etc..Or by document representation at term vector matrix after, can use volume Product neural network or Recognition with Recurrent Neural Network are classified.Below text classification is carried out using depth convolutional neural networks with regard to introducing Method：

1) by each word or word (w0, w1, w2, w3 etc.), it is expressed as term vector (embedding).Can be with method The arbitrary term vector algorithm such as one-hot, skip-word, glovec, fastText；

2) all term vectors are spliced into matrix, then convolutional neural networks (CNN) are used to extract feature；

3) further text feature calculate with two layers of full articulamentum again and be abstracted；

4) classified to file characteristics using softmax layers.

It is obvious to a person skilled in the art that invention is not limited to the details of the above exemplary embodiments, Er Qie In the case of without departing substantially from spirit or essential attributes of the invention, the present invention can be realized in other specific forms.Therefore, no matter From the point of view of which point, the present embodiments are to be considered as illustrative and not restrictive, and the scope of the present invention is by appended power Profit requires rather than above description limits, it is intended that all by what is fallen within the meaning and scope of the equivalent requirements of the claims Variation is included within the present invention.Any reference signs in the claims should not be construed as limiting the involved claims.

In addition, it should be understood that although this specification is described in terms of embodiments, but not each embodiment is only wrapped Containing an independent technical solution, this description of the specification is merely for the sake of clarity, and those skilled in the art should It considers the specification as a whole, the technical solutions in the various embodiments may also be suitably combined, forms those skilled in the art The other embodiment being appreciated that.

Claims

1. a kind of document picture classification method, which is characterized in that include the following steps：（1）Use algorithm of target detection from text first Detection identity card, bank card, driver's license, driving license, business license, working qualification's card, Road Transportation card card in shelves picture Part directly differentiates document classification if detecting successfully；（2）If detection failure, enters the process flow with text classification： 2.1 detected the location information of the text strings in picture with text detection algorithm；2.2 are examined using Text region Model Identification The text strings come are measured, then all text strings are combined into document by there is sequence of positions；2.3 use Algorithm of documents categorization, will Identify document classification, the category, that is, document picture generic.

2. document picture classification method according to claim 1, which is characterized in that the algorithm of target detection includes Faster RCNN、SSD、YOLO。

3. document picture classification method according to claim 1, which is characterized in that the text detection algorithm can either make With general algorithm of target detection, the algorithm after optimizing exclusively for text detection can be also used.

4. document picture classification method according to claim 3, which is characterized in that the general algorithm of target detection, Including：Faster RCNN、SSD、YOLO.

5. document picture classification method according to claim 3, which is characterized in that described to optimize exclusively for text detection Algorithm afterwards, including：EAST、RRCNN、TextBoxes、CTPN.