CN109635627A - Pictorial information extracting method, device, computer equipment and storage medium - Google Patents

Pictorial information extracting method, device, computer equipment and storage medium Download PDF

Info

Publication number
CN109635627A
CN109635627A CN201811236301.9A CN201811236301A CN109635627A CN 109635627 A CN109635627 A CN 109635627A CN 201811236301 A CN201811236301 A CN 201811236301A CN 109635627 A CN109635627 A CN 109635627A
Authority
CN
China
Prior art keywords
text
sequence
bill picture
picture
text filed
Prior art date
Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
Pending
Application number
CN201811236301.9A
Other languages
Chinese (zh)
Inventor
邢欣来
Current Assignee (The listed assignees may be inaccurate. Google has not performed a legal analysis and makes no representation or warranty as to the accuracy of the list.)
Ping An Property and Casualty Insurance Company of China Ltd
Original Assignee
Ping An Property and Casualty Insurance Company of China Ltd
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
Filing date
Publication date
Application filed by Ping An Property and Casualty Insurance Company of China Ltd filed Critical Ping An Property and Casualty Insurance Company of China Ltd
Priority to CN201811236301.9A priority Critical patent/CN109635627A/en
Publication of CN109635627A publication Critical patent/CN109635627A/en
Pending legal-status Critical Current

Links

Classifications

    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06VIMAGE OR VIDEO RECOGNITION OR UNDERSTANDING
    • G06V30/00Character recognition; Recognising digital ink; Document-oriented image-based pattern recognition
    • G06V30/40Document-oriented image-based pattern recognition
    • G06V30/41Analysis of document content
    • G06V30/412Layout analysis of documents structured with printed lines or input boxes, e.g. business forms or tables
    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06VIMAGE OR VIDEO RECOGNITION OR UNDERSTANDING
    • G06V10/00Arrangements for image or video recognition or understanding
    • G06V10/20Image preprocessing
    • G06V10/22Image preprocessing by selection of a specific region containing or referencing a pattern; Locating or processing of specific regions to guide the detection or recognition

Abstract

The invention discloses pictorial information extracting method, device, computer equipment and storage mediums.It goes forward side by side the adjustment of line skew and illumination this method comprises: obtaining bill picture to be identified, bill picture after being pre-processed;Identification obtains after pretreatment included multiple text filed in bill picture;Obtain it is multiple it is text filed in each text filed space coordinate, concatenated its corresponding vector to obtain text box sequence by splicing sequence;The input that text box sequence is made to sequence marking model obtains the corresponding subsequence in region to be extracted;By text filed carry out text identification corresponding to the corresponding subsequence in region to be extracted, the corresponding text information in region to be extracted is obtained.This method is not necessarily to carry out text extraction and identification to all text information frames of complicated bill, without the association successively calculated between text, reduce calculation amount, and the training of the bill picture labeled data of various angles, distortion can be had good robustness using image recognition technology.

Description

Pictorial information extracting method, device, computer equipment and storage medium
Technical field
The present invention relates to image identification technical fields more particularly to a kind of pictorial information extracting method, device, computer to set Standby and storage medium.
Background technique
Currently, because of the complicated multiplicity of its page text of complicated bill arrangement relationship, frequently with following extraction process:
1) text filed mark: all text filed in mark bill picture;
2) content of text is extracted and is identified: to all text filed carry out Text regions, extracting text;
3) content of text is associated with: in conjunction with the position of text information and text in picture, carrying out content pass to text information Connection.
The above process, which has the following deficiencies:, to be needed to extract text information one by one to text informations all in bill region, and It is associated with one by one, leads to that computationally intensive, execution efficiency is low.And in a practical situation, the large amount of text information pair in bill It is unrelated in task.
Summary of the invention
The embodiment of the invention provides a kind of pictorial information extracting method, device, computer equipment and storage mediums, it is intended to Solution extracts text information to text informations all in bill region in the prior art one by one, and is associated with one by one, causes to locate A large amount of unrelated content of text are managed, so that the entire problem that treatment process is computationally intensive, execution efficiency is low.
In a first aspect, the embodiment of the invention provides a kind of pictorial information extracting methods comprising:
Bill picture to be identified is obtained, the adjustment of deflection and illumination is carried out to bill picture to be identified, obtains pre- place Bill picture after reason;
Identification obtains after pretreatment included multiple text filed in bill picture;
Obtain it is multiple it is text filed in each text filed space coordinate, will be by each text filed space coordinate pair The vector answered is concatenated by preset splicing sequence, obtains text box sequence;
Sequence labelling model trained in advance is obtained, text box sequence is made to the input of sequence marking model, is obtained wait mention Take the corresponding subsequence in region;
By text filed carry out text identification corresponding to the corresponding subsequence in region to be extracted, region pair to be extracted is obtained The text information answered.
Second aspect, the embodiment of the invention provides a kind of pictorial information extraction elements comprising:
Picture pretreatment unit, for obtaining bill picture to be identified, to bill picture to be identified carry out deflection and The adjustment of illumination, bill picture after being pre-processed;
Text filed recognition unit obtains after pretreatment included multiple text filed in bill picture for identification;
Text box retrieval unit, for obtain it is multiple it is text filed in each text filed space coordinate, will be by The corresponding vector of each text filed space coordinate is concatenated by preset splicing sequence, obtains text box sequence;
Text box sequence is made sequence for obtaining sequence labelling model trained in advance by target subsequences acquiring unit The input of marking model obtains the corresponding subsequence in region to be extracted;
Text information extraction unit, for by text filed carry out text corresponding to the corresponding subsequence in region to be extracted Identification, obtains the corresponding text information in region to be extracted.
The third aspect, the embodiment of the present invention provide a kind of computer equipment again comprising memory, processor and storage On the memory and the computer program that can run on the processor, the processor execute the computer program Pictorial information extracting method described in the above-mentioned first aspect of Shi Shixian.
Fourth aspect, the embodiment of the invention also provides a kind of storage mediums, wherein the storage medium is stored with calculating Machine program, the computer program make the processor execute the letter of picture described in above-mentioned first aspect when being executed by a processor Cease extracting method.
The embodiment of the invention provides a kind of pictorial information extracting method, device, computer equipment and storage mediums.The party Method by bill picture to be identified carry out deflection and illumination pretreatment, then identify it is therein multiple text filed, will be more It is a it is text filed be converted into text box sequence after, text box sequence is made to the input of sequence marking model, obtains region to be extracted Corresponding subsequence finally only needs the corresponding text filed progress text identification of sub-sequences that can obtain user's text of interest This information.Without carrying out text extraction and identification to all text information frames of complicated bill in this method, without successively counting The association between text is calculated, reduces calculation amount, and can have to the training of the bill picture labeled data of various angles, distortion Good robustness.
Detailed description of the invention
Technical solution in order to illustrate the embodiments of the present invention more clearly, below will be to needed in embodiment description Attached drawing is briefly described, it should be apparent that, drawings in the following description are some embodiments of the invention, general for this field For logical technical staff, without creative efforts, it is also possible to obtain other drawings based on these drawings.
Fig. 1 is the flow diagram of pictorial information extracting method provided in an embodiment of the present invention;
Fig. 2 is the sub-process schematic diagram of pictorial information extracting method provided in an embodiment of the present invention;
Fig. 3 is another sub-process schematic diagram of pictorial information extracting method provided in an embodiment of the present invention;
Fig. 4 is another sub-process schematic diagram of pictorial information extracting method provided in an embodiment of the present invention;
Fig. 5 is another sub-process schematic diagram of pictorial information extracting method provided in an embodiment of the present invention;
Fig. 6 is the schematic block diagram of pictorial information extraction element provided in an embodiment of the present invention;
Fig. 7 is the subelement schematic block diagram of pictorial information extraction element provided in an embodiment of the present invention;
Fig. 8 is another subelement schematic block diagram of pictorial information extraction element provided in an embodiment of the present invention;
Fig. 9 is another subelement schematic block diagram of pictorial information extraction element provided in an embodiment of the present invention;
Figure 10 is another subelement schematic block diagram of pictorial information extraction element provided in an embodiment of the present invention;
Figure 11 is the schematic block diagram of computer equipment provided in an embodiment of the present invention.
Specific embodiment
Following will be combined with the drawings in the embodiments of the present invention, and technical solution in the embodiment of the present invention carries out clear, complete Site preparation description, it is clear that described embodiments are some of the embodiments of the present invention, instead of all the embodiments.Based on this hair Embodiment in bright, every other implementation obtained by those of ordinary skill in the art without making creative efforts Example, shall fall within the protection scope of the present invention.
It should be appreciated that ought use in this specification and in the appended claims, term " includes " and "comprising" instruction Described feature, entirety, step, operation, the presence of element and/or component, but one or more of the other feature, whole is not precluded Body, step, operation, the presence or addition of element, component and/or its set.
It is also understood that mesh of the term used in this description of the invention merely for the sake of description specific embodiment And be not intended to limit the present invention.As description of the invention and it is used in the attached claims, unless on Other situations are hereafter clearly indicated, otherwise " one " of singular, "one" and "the" are intended to include plural form.
It will be further appreciated that the term "and/or" used in description of the invention and the appended claims is Refer to any combination and all possible combinations of one or more of associated item listed, and including these combinations.
Referring to Fig. 1, Fig. 1 is the flow diagram of pictorial information extracting method provided in an embodiment of the present invention, the picture Information extracting method is applied in management server, and this method is held by the application software being installed in management server Row, management server is the enterprise terminal for carrying out pictorial information extraction.
As shown in Figure 1, the method comprising the steps of S101~S105.
S101, bill picture to be identified is obtained, the adjustment of deflection and illumination is carried out to bill picture to be identified, is obtained Bill picture after pretreatment.
In the present embodiment, bill picture to be identified is provided by business end, and is uploaded to management server by managing The received carry out identifying processing of server.I.e. then business end obtains bill picture to be identified by the modes such as taking pictures or scanning It is uploaded to management server, such as business personnel and the complicated bill of insurance contract, the vehicle new car quality certification of client's signature of company, These complicated bills and simple bill (such as identity card picture can be considered a kind of simple bill, during identified point Also the limited information such as name, identification card number, identity card validity period need to only be extracted) difference be that text information is more, it is required The information of identification is also more.
Since business end obtains bill picture to be identified by the modes such as taking pictures or scanning, may because of shooting angle or The problems such as light, causes the identification degree of picture not high, needs to pre-process picture to be identified at this time.
In one embodiment, as shown in Fig. 2, step S101 includes:
S1011, deflection adjustment is carried out to bill picture to be identified by Hough straight-line detection, schemed after obtaining deflection adjustment Piece;
S1012, illumination adjustment, bill after being pre-processed are carried out to picture after deflection adjustment by histogram equalization Piece.
In the present embodiment, Hough straight-line detection is realized using Hough transformation.Hough transformation is the warp in image transformation One of allusion quotation means are mainly used to isolate the geometry (e.g., straight line, circle etc.) with certain same characteristic features from image.Suddenly Husband converts the method for finding straight line and circle compared to other methods, can preferably reduce noise jamming.Classical Hough transformation It is commonly used to detection straight line, circle, ellipse etc..
However in field of image processing, the pixel coordinate P (x, y) of image is known, and the corresponding pole of image is sat R in mark, theta are then the variables that need to be found.Each (r, theta) value is if possible drawn, according to pixel coordinate P (x, y) If value, then being just transformed into polar coordinates hough space system, this change from point to curve from image cartesian coordinate system It is referred to as the Hough transformation of straight line.Transformation is that limited value is spaced equal part or cumulative grid by quantization Hough parameter space. When Hough transformation algorithm starts, each pixel coordinate point P (x, y) is switched to above the curve point of (r, theta), is added to pair The grid data point answered illustrates when a wave crest occurs with the presence of straight line.By Hough straight-line detection to bill picture into The adjustment of line skew corrects it in order to subsequent Text region.
" central idea " of histogram equalization processing is the grey level histogram of original image from some for comparing concentration Gray scale interval becomes being uniformly distributed in whole tonal ranges.Histogram equalization is exactly to carry out Nonlinear extension to image, Image pixel value is redistributed, keeps the pixel quantity in certain tonal range roughly the same.Histogram equalization is exactly given The histogram distribution of image is changed to the distribution of " uniform " distribution histogram.
The basic thought of histogram equalization is that the histogram of original graph is transformed to equally distributed form, is thus increased The dynamic range of pixel gray value is added to can reach the effect of enhancing image overall contrast ratio.If original image is at (x, y) The gray scale at place is f, and the image after changing is g, then can be expressed as reflecting the gray scale f at (x, y) to the method for image enhancement It penetrates as g.The mapping function of image may be defined as in gray-level histogram equalizationization processing: g=EQ (f), this mapping function EQ (f) must satisfy two conditions (the wherein number of greyscale levels that L is image):
A) EQ (f) is a monodrome single-increasing function within the scope of 0≤f≤L-1.This is to guarantee enhancing processing without beating The gray scale arrangement order of random original image, each gray level of original image still keep the row from black to white (or from white to black) after the conversion Column.
B) there is 0≤g≤L-1 for 0≤f≤L-1, this condition ensure that the consistent of transformation front and back gray value dynamic range Property.
By carrying out the adjustment of deflection and illumination to bill picture to be identified, the identification success of picture can be effectively improved Rate, and have good robustness.
S102, identification obtain after pretreatment included multiple text filed in bill picture.
In the present embodiment, after completing the pretreatment of bill picture to be identified, to after pretreatment in bill picture It is included it is text filed identified one by one, carry out text filed identification at this time only to judge which region is including text , the region for including text is carried out without specifically including which text identifies in text filed to identification, namely only Positioning, and text filed rectangle frame is drawn for each.Due to only positioning the region of text, and all texts are not carried out Identification, reduces calculation amount, improves data-handling efficiency.
It is in one embodiment, multiple text filed included by after identification acquisition pretreatment in bill picture, comprising:
Propose that network algorithm is examined bill picture after pretreatment by the spatial window of pre-set dimension by connection text Survey, after pre-process in bill picture included by it is multiple text filed.
In the present embodiment, propose that (CTPN algorithm, full name are Connectionist to network algorithm by connection text Text Proposal Network) text filed positioning is carried out to bill picture after pretreatment, the treatment process of CTPN algorithm is such as Under:
A1) firstly, (VGG16 is the VGG convolutional neural networks model that Oxford University put forward in 2014 using VGG16 A mutation, possess 16 layers of model for one) extract feature as base net (i.e. base net), obtain conv5_3 The feature of (conv5_3 refers to the third convolutional layer inside the 5th convolution block) as characteristics of image (i.e. feature map), The size of characteristics of image is W × H × C;
A2 sliding window then) is done on this feature map, window size is 3 × 3.Namely each window can obtain The feature vector for being 3 × 3 × C to length, these feature vectors will be used to predict the anchor of position k (anchor's Define similar with Faster RCNN) corresponding classification information, location information;
A3) by feature obtained in the previous step, i.e. the feature of 3*3*C (W*3*3*C) is input in a two-way LSTM, obtains It is the output of W × 256 to length, then connects one 512 full articulamentum (fc layers), prepares output.
A4) mainly there are three outputs for output layer part.2k vertical coordinate (rectangular coordinate system), because One anchor's is that the height (y-coordinate) of center and two values of height of rectangle frame indicate, so one with 2k Output (notices that is exported here is the offset relative to anchor).
A5) fc layers of feature are input to three classification or return in layer.What second 2k scores was indicated is k The classification information (be character or be not character) of anchor.First 2k vertical coordinate and third k Side-refinement is the location information for returning k anchor.2k vertical coordinate indicate be The height of bounding box and the y-axis coordinate (can determine up-and-down boundary) at center, what k side-refinement was indicated The horizontal translation amount of bounding box.
A6 the textproposal (a part of line of text, it can be understood as an elongated rectangle) that) classification is obtained It is merged into line of text.
S103, obtain it is multiple it is text filed in each text filed space coordinate, will be by each text filed space The corresponding vector of coordinate is concatenated by preset splicing sequence, obtains text box sequence.
In the present embodiment, obtain it is multiple it is text filed in each text filed space coordinate when, due to each text Region is the region of rectangle, therefore need to only obtain the coordinate and concatenation on text filed four vertex of each rectangle, can be obtained To vector corresponding with this article one's respective area.It is identified by the coordinate on text filed four vertex to each rectangle It realizes to all text filed positioning.
In one embodiment, as shown in figure 3, step S103 includes:
S1031, obtain it is multiple it is text filed in each text filed corresponding rectangular area four apex coordinates;
S1032, four apex coordinates of each rectangular area are pressed into preset sequential concatenation, obtained and each rectangular area Corresponding vector;
S1033, the sequencing according to each rectangular area in bill picture to be identified, by each rectangular area pair The vector answered sequentially is concatenated, and text box sequence is obtained.
In the present embodiment, obtain it is multiple it is text filed in each text filed corresponding rectangular area, can be denoted as respectively First text filed-the N is text filed, and the coordinate of the first text filed top left corner apex is (x11, y11), the first text area The coordinate of the upper right angular vertex in domain is (x12, y12), and the coordinate of the first text filed lower-left angular vertex is (x13, y13), the The coordinate of one text filed bottom right angular vertex be (x14, y14), by (x11, y11), (x12, y12), (x13, y13) and (x14, y14) sequentially concatenation obtains primary vector box1=[x11y11x12y12x13y13x14y14].
And so on the text filed corresponding N vector boxN=[xN1yN1xN2yN2xN3yN3xN4yN4] of N.By One vector box1, secondary vector box2 ..., N vector boxN concatenated, obtain text box sequence box1box2box3……boxN.By above-mentioned processing, simple text box sequence has been converted by bill picture to be processed, Indicate after pretreatment included multiple text filed in bill picture by text box sequence.
S104, sequence labelling model trained in advance is obtained, text box sequence is made to the input of sequence marking model, obtained The corresponding subsequence in region to be extracted.
In the present embodiment, sequence labelling model trained in advance is used for the interested region of user in text box sequence It is extracted.Due to each type bill picture to be identified (insurance contract, vehicle of the business personnel of such as company and client's signature The complexity such as new car quality certification bill) its text template for using is all the same, and the region that only some users fill in, which is only, needs weight The region of point concern.
In one embodiment, as shown in figure 4, step S104 includes:
S1041, the history bill picture for obtaining multiple types classify according to the type of history bill picture is corresponding, obtain Bill picture after classification;
S1042, by the bill picture of each classification carries out the adjustment of deflection and illumination in bill picture after classification, obtain Bill picture after training data pretreatment;
It is text filed that S1043, identification obtain multiple training datas included in bill picture after training data pre-processes;
The text filed space coordinate of each training data during S1044, the multiple training datas of acquisition are text filed, will be by every The corresponding vector of the text filed space coordinate of one training data is concatenated by preset splicing sequence, obtains training data text This frame sequence;
S1045, selected vector in training data text box sequence is labeled, is obtained and training data text box The corresponding subsequence to be extracted of sequence;
S1046, will make with the one-to-one training data text box sequence of bill picture after the pretreatment of multiple training datas For the input of initiation sequence marking model, the one-to-one sub- sequence to be extracted of bill picture after being pre-processed with multiple training datas Column are trained initiation sequence marking model, obtain sequence labelling model as output.
Wherein, sequence labelling model can be using any one in RNN, LSTM, bi-LSTM+crf model;Wherein, RNN model is Recognition with Recurrent Neural Network model, and LSTM model is shot and long term memory models, and bi-LSTM+crf model is condition random The composite model of field and bidirectional circulating neural network.
In the training sequence labelling model, a large amount of history bill picture can be first collected, it is advanced according to bill type Row classification, such as it is divided into insurance contract class, vehicle new car quality certification class, later:
B1 it) first carries out such as the step in S101-S103, obtains text box sequence corresponding with history bill picture;
B2) user's vector interested that need to be extracted in each text box sequence is labeled, is obtained and text frame sequence Arrange the corresponding subsequence as composed by the vector that need to be extracted;
For example, text box sequence box1box2box3 ... boxN is indicated, it is corresponding with text frame sequence by that need to mention Subsequence composed by the vector taken can be indicated with box6box7 ... boxN-1 namely subsequence is the son of text box sequence Collection.
It B3, will be with) using text box sequence corresponding with history bill picture as the input of initiation sequence marking model Output of the corresponding subsequence of text box sequence as initiation sequence marking model, is trained initiation sequence marking model, Obtain sequence labelling model.
By the above process, sequence labelling model corresponding with type bill picture to be identified can be obtained.Later The sequence labelling model that training obtains may be used for the extraction that area-of-interest is carried out to bill picture to be identified, without complete Text carries out text identification, reduces calculation amount, improves the efficiency of text identification.
S105, by text filed carry out text identification corresponding to the corresponding subsequence in region to be extracted, obtain to be extracted The corresponding text information in region.
In the present embodiment, when obtaining the corresponding subsequence in region to be extracted, need to correspond to lookup will be in the subsequence Each vector is corresponding text filed, by it is above-mentioned it is text filed using Text region model carry out text extraction, can be obtained to Extract the corresponding text information in region.
For example, can (this article one's respective area can to text filed using CRNN model (i.e. convolution loop neural network model) Picture with the part being considered as in bill picture to be identified) it is identified, it obtains that the corresponding text information in region need to be extracted.
In one embodiment, as shown in figure 5, step S105 includes:
S1051, vector included in the corresponding subsequence in region to be extracted is obtained;
S1052, each vector corresponding rectangular area in bill picture is obtained;
S1053, text in rectangular area is identified by the convolution loop neural network model for text identification, Obtain the corresponding text information in region to be extracted.
In the present embodiment, after the corresponding subsequence in acquisition region to be extracted, you can learn that bill picture to be identified In which region it is corresponding it is text filed in content of text need to be extracted, only need at this time for these it is specified it is text filed into Row text identification is avoided to all text filed carry out Text regions with extracting text information to extract text, drop Low calculation amount, improves treatment effeciency.
Without carrying out text extraction and identification to all text information frames of complicated bill in this method, without successively counting The association between text is calculated, reduces calculation amount, and can have to the training of the bill picture labeled data of various angles, distortion Good robustness.
The embodiment of the present invention also provides a kind of pictorial information extraction element, and the pictorial information extraction element is aforementioned for executing Any embodiment of pictorial information extracting method.Specifically, referring to Fig. 6, Fig. 6 is pictorial information provided in an embodiment of the present invention The schematic block diagram of extraction element.The pictorial information extraction element 100 can be configured in management server or terminal.
As shown in fig. 6, pictorial information extraction element 100 includes picture pretreatment unit 101, text filed recognition unit 102, text box retrieval unit 103, target subsequences acquiring unit 104 and text information extraction unit 105.
Picture pretreatment unit 101 carries out deflection to bill picture to be identified for obtaining bill picture to be identified With the adjustment of illumination, bill picture after being pre-processed.
In the present embodiment, bill picture to be identified is provided by business end, and is uploaded to management server by managing The received carry out identifying processing of server.I.e. then business end obtains bill picture to be identified by the modes such as taking pictures or scanning It is uploaded to management server, such as business personnel and the complicated bill of insurance contract, the vehicle new car quality certification of client's signature of company, These complicated bills and simple bill (such as identity card picture can be considered a kind of simple bill, during identified point Also the limited information such as name, identification card number, identity card validity period need to only be extracted) difference be that text information is more, it is required The information of identification is also more.
Since business end obtains bill picture to be identified by the modes such as taking pictures or scanning, may because of shooting angle or The problems such as light, causes the identification degree of picture not high, needs to pre-process picture to be identified at this time.
In one embodiment, as shown in fig. 7, picture pretreatment unit 101 includes:
Picture deflection adjustment unit 1011, for carrying out deflection tune to bill picture to be identified by Hough straight-line detection It is whole, obtain picture after deflection adjustment;
Picture illumination adjustment 1012 is obtained for carrying out illumination adjustment to picture after deflection adjustment by histogram equalization Bill picture after to pretreatment.
In the present embodiment, Hough straight-line detection is realized using Hough transformation.Hough transformation is the warp in image transformation One of allusion quotation means are mainly used to isolate the geometry (e.g., straight line, circle etc.) with certain same characteristic features from image.Suddenly Husband converts the method for finding straight line and circle compared to other methods, can preferably reduce noise jamming.Classical Hough transformation It is commonly used to detection straight line, circle, ellipse etc..
However in field of image processing, the pixel coordinate P (x, y) of image is known, and the corresponding pole of image is sat R in mark, theta are then the variables that need to be found.Each (r, theta) value is if possible drawn, according to pixel coordinate P (x, y) If value, then being just transformed into polar coordinates hough space system, this change from point to curve from image cartesian coordinate system It is referred to as the Hough transformation of straight line.Transformation is that limited value is spaced equal part or cumulative grid by quantization Hough parameter space. When Hough transformation algorithm starts, each pixel coordinate point P (x, y) is switched to above the curve point of (r, theta), is added to pair The grid data point answered illustrates when a wave crest occurs with the presence of straight line.By Hough straight-line detection to bill picture into The adjustment of line skew corrects it in order to subsequent Text region.
" central idea " of histogram equalization processing is the grey level histogram of original image from some for comparing concentration Gray scale interval becomes being uniformly distributed in whole tonal ranges.Histogram equalization is exactly to carry out Nonlinear extension to image, Image pixel value is redistributed, keeps the pixel quantity in certain tonal range roughly the same.Histogram equalization is exactly given The histogram distribution of image is changed to the distribution of " uniform " distribution histogram.
The basic thought of histogram equalization is that the histogram of original graph is transformed to equally distributed form, is thus increased The dynamic range of pixel gray value is added to can reach the effect of enhancing image overall contrast ratio.If original image is at (x, y) The gray scale at place is f, and the image after changing is g, then can be expressed as reflecting the gray scale f at (x, y) to the method for image enhancement It penetrates as g.The mapping function of image may be defined as in gray-level histogram equalizationization processing: g=EQ (f), this mapping function EQ (f) must satisfy two conditions (the wherein number of greyscale levels that L is image):
A) EQ (f) is a monodrome single-increasing function within the scope of 0≤f≤L-1.This is to guarantee enhancing processing without beating The gray scale arrangement order of random original image, each gray level of original image still keep the row from black to white (or from white to black) after the conversion Column.
B) there is 0≤g≤L-1 for 0≤f≤L-1, this condition ensure that the consistent of transformation front and back gray value dynamic range Property.
By carrying out the adjustment of deflection and illumination to bill picture to be identified, the identification success of picture can be effectively improved Rate, and have good robustness.
Text filed recognition unit 102 obtains multiple text areas included in bill picture after pre-processing for identification Domain.
In the present embodiment, after completing the pretreatment of bill picture to be identified, to after pretreatment in bill picture It is included it is text filed identified one by one, carry out text filed identification at this time only to judge which region is including text , the region for including text is carried out without specifically including which text identifies in text filed to identification, namely only Positioning, and text filed rectangle frame is drawn for each.Due to only positioning the region of text, and all texts are not carried out Identification, reduces calculation amount, improves data-handling efficiency.
It is in one embodiment, multiple text filed included by after identification acquisition pretreatment in bill picture, comprising:
Propose that network algorithm is examined bill picture after pretreatment by the spatial window of pre-set dimension by connection text Survey, after pre-process in bill picture included by it is multiple text filed.
In the present embodiment, propose that (CTPN algorithm, full name are network algorithm by connection text ConnectionistTextProposal Network) text filed positioning is carried out to bill picture after pretreatment.
Text box retrieval unit 103, for obtain it is multiple it is text filed in each text filed space coordinate, will It is concatenated by the corresponding vector of each text filed space coordinate by preset splicing sequence, obtains text box sequence.
In the present embodiment, obtain it is multiple it is text filed in each text filed space coordinate when, due to each text Region is the region of rectangle, therefore need to only obtain the coordinate and concatenation on text filed four vertex of each rectangle, can be obtained To vector corresponding with this article one's respective area.It is identified by the coordinate on text filed four vertex to each rectangle It realizes to all text filed positioning.
In one embodiment, as shown in figure 8, text box retrieval unit 103 includes:
Rectangular area vertex acquiring unit 1031, for obtain it is multiple it is text filed in each text filed corresponding rectangle Four apex coordinates in region;
Region vector acquiring unit 1032, for four apex coordinates of each rectangular area to be pressed preset sequence string It connects, obtains vector corresponding with each rectangular area;
Vector concatenation unit 1033, for the sequencing according to each rectangular area in bill picture to be identified, The corresponding vector in each rectangular area is sequentially concatenated, text box sequence is obtained.
In the present embodiment, obtain it is multiple it is text filed in each text filed corresponding rectangular area, can be denoted as respectively First text filed-the N is text filed, and the coordinate of the first text filed top left corner apex is (x11, y11), the first text area The coordinate of the upper right angular vertex in domain is (x12, y12), and the coordinate of the first text filed lower-left angular vertex is (x13, y13), the The coordinate of one text filed bottom right angular vertex be (x14, y14), by (x11, y11), (x12, y12), (x13, y13) and (x14, y14) sequentially concatenation obtains primary vector box1=[x11y11x12y12x13y13x14y14].
And so on the text filed corresponding N vector boxN=[xN1yN1xN2yN2xN3yN3xN4yN4] of N.By One vector box1, secondary vector box2 ..., N vector boxN concatenated, obtain text box sequence box1box2box3……boxN.By above-mentioned processing, simple text box sequence has been converted by bill picture to be processed, Indicate after pretreatment included multiple text filed in bill picture by text box sequence.
Target subsequences acquiring unit 104 contributes a foreword text box sequence for obtaining sequence labelling model trained in advance The input of column marking model obtains the corresponding subsequence in region to be extracted.
In the present embodiment, sequence labelling model trained in advance is used for the interested region of user in text box sequence It is extracted.Due to each type bill picture to be identified (insurance contract, vehicle of the business personnel of such as company and client's signature The complexity such as new car quality certification bill) its text template for using is all the same, and the region that only some users fill in, which is only, needs weight The region of point concern.
In one embodiment, as shown in figure 9, target subsequences acquiring unit 104 includes:
History picture classification unit 1041, for obtaining the history bill picture of multiple types, according to history bill picture The corresponding classification of type, bill picture after being classified;
History picture pretreatment unit 1042, for carrying out the bill picture of each classification in bill picture after classification The adjustment of deflection and illumination obtains bill picture after training data pretreatment;
History text area acquisition unit 1043 obtains included in bill picture after training data pre-processes for identification Multiple training datas it is text filed;
History text frame retrieval unit 1044, for obtain multiple training datas it is text filed in each training data Text filed space coordinate, will be suitable by preset splicing by the corresponding vector of the text filed space coordinate of each training data Sequence is concatenated, and training data text box sequence is obtained;
History mark unit 1045 is obtained for selected vector in training data text box sequence to be labeled Subsequence to be extracted corresponding with training data text box sequence;
Sequence labelling model training unit 1046 is corresponded for bill picture after pre-processing with multiple training datas Input of the training data text box sequence as initiation sequence marking model, bill after being pre-processed with multiple training datas Subsequence to be extracted is trained initiation sequence marking model, obtains sequence labelling model piece as output correspondingly.
Wherein, sequence labelling model can be using any one in RNN, LSTM, bi-LSTM+crf model;Wherein, RNN model is Recognition with Recurrent Neural Network model, and LSTM model is shot and long term memory models, and bi-LSTM+crf model is condition random The composite model of field and bidirectional circulating neural network.
By the above process, sequence labelling model corresponding with type bill picture to be identified can be obtained.Later The sequence labelling model that training obtains may be used for the extraction that area-of-interest is carried out to bill picture to be identified, without complete Text carries out text identification, reduces calculation amount, improves the efficiency of text identification.
Text information extraction unit 105, for by text filed progress corresponding to the corresponding subsequence in region to be extracted Text identification obtains the corresponding text information in region to be extracted.
In the present embodiment, when obtaining the corresponding subsequence in region to be extracted, need to correspond to lookup will be in the subsequence Each vector is corresponding text filed, by it is above-mentioned it is text filed using Text region model carry out text extraction, can be obtained to Extract the corresponding text information in region.
For example, can (this article one's respective area can to text filed using CRNN model (i.e. convolution loop neural network model) Picture with the part being considered as in bill picture to be identified) it is identified, it obtains that the corresponding text information in region need to be extracted.
In one embodiment, as shown in Figure 10, text information extraction unit 105 includes:
Subsequence vector extraction unit 1051, for obtaining vector included in the corresponding subsequence in region to be extracted;
Rectangular area extraction unit 1052, for obtaining each vector corresponding rectangular area in bill picture;
Rectangular area Text Feature Extraction unit 1053, for passing through the convolution loop neural network model pair for text identification Text is identified in rectangular area, obtains the corresponding text information in region to be extracted.
In the present embodiment, after the corresponding subsequence in acquisition region to be extracted, you can learn that bill picture to be identified In which region it is corresponding it is text filed in content of text need to be extracted, only need at this time for these it is specified it is text filed into Row text identification is avoided to all text filed carry out Text regions with extracting text information to extract text, drop Low calculation amount, improves treatment effeciency.
Without carrying out text extraction and identification to all text information frames of complicated bill in the device, without successively counting The association between text is calculated, reduces calculation amount, and can have to the training of the bill picture labeled data of various angles, distortion Good robustness.
Above-mentioned pictorial information extraction element can be implemented as the form of computer program, which can such as scheme It is run in computer equipment shown in 11.
Figure 11 is please referred to, Figure 11 is the schematic block diagram of computer equipment provided in an embodiment of the present invention.
Refering to fig. 11, which includes processor 502, memory and the net connected by system bus 501 Network interface 505, wherein memory may include non-volatile memory medium 503 and built-in storage 504.
The non-volatile memory medium 503 can storage program area 5031 and computer program 5032.The computer program 5032 are performed, and processor 502 may make to execute pictorial information extracting method.
The processor 502 supports the operation of entire computer equipment 500 for providing calculating and control ability.
The built-in storage 504 provides environment for the operation of the computer program 5032 in non-volatile memory medium 503, should When computer program 5032 is executed by processor 502, processor 502 may make to execute pictorial information extracting method.
The network interface 505 is for carrying out network communication, such as the transmission of offer data information.Those skilled in the art can To understand, structure shown in Figure 11, only the block diagram of part-structure relevant to the present invention program, is not constituted to this hair The restriction for the computer equipment 500 that bright scheme is applied thereon, specific computer equipment 500 may include than as shown in the figure More or fewer components perhaps combine certain components or with different component layouts.
Wherein, the processor 502 is for running computer program 5032 stored in memory, to realize following function Can: bill picture to be identified is obtained, the adjustment of deflection and illumination, ticket after being pre-processed are carried out to bill picture to be identified According to picture;Identification obtains after pretreatment included multiple text filed in bill picture;Obtain it is multiple it is text filed in it is each Text filed space coordinate will be gone here and there by the corresponding vector of each text filed space coordinate by preset splicing sequence It connects, obtains text box sequence;Sequence labelling model trained in advance is obtained, text box sequence is made into the defeated of sequence marking model Enter, obtains the corresponding subsequence in region to be extracted;By the text of text filed progress corresponding to the corresponding subsequence in region to be extracted This identification obtains the corresponding text information in region to be extracted.
In one embodiment, processor 502 is executing the adjustment that deflection and illumination are carried out to bill picture to be identified, obtains After to pretreatment when the step of bill picture, perform the following operations: by Hough straight-line detection to bill picture to be identified into Line skew adjustment obtains picture after deflection adjustment;Illumination adjustment is carried out to picture after deflection adjustment by histogram equalization, is obtained Bill picture after to pretreatment.
In one embodiment, the multiple texts included in bill picture after executing identification acquisition pretreatment of processor 502 It when the step of one's respective area, performs the following operations: proposing network algorithm to bill picture after pretreatment by default by connection text The spatial window of size is detected, included multiple text filed in bill picture after being pre-processed.
In one embodiment, processor 502 execute obtain it is multiple it is text filed in each text filed space coordinate, It will be concatenated by the corresponding vector of each text filed space coordinate by preset splicing sequence, obtain text box sequence When step, perform the following operations: obtain it is multiple it is text filed in four vertex of each text filed corresponding rectangular area sit Mark;Four apex coordinates of each rectangular area are pressed into preset sequential concatenation, obtain vector corresponding with each rectangular area; According to sequencing of each rectangular area in bill picture to be identified, by the corresponding vector in each rectangular area sequentially into Row concatenation, obtains text box sequence.
In one embodiment, processor 502 is also held before executing the step of obtaining sequence labelling model trained in advance The following operation of row: obtaining the history bill picture of multiple types, classifies according to the type of history bill picture is corresponding, is classified Bill picture afterwards;By the bill picture of each classification carries out the adjustment of deflection and illumination in bill picture after classification, instructed Bill picture after white silk data prediction;Identification obtains multiple training datas included in bill picture after training data pre-processes It is text filed;Obtain multiple training datas it is text filed in the text filed space coordinate of each training data, will be by each instruction The corresponding vector of space coordinate for practicing data text region is concatenated by preset splicing sequence, obtains training data text box Sequence;Selected vector in training data text box sequence is labeled, is obtained corresponding with training data text box sequence Subsequence to be extracted;The one-to-one training data text box sequence of bill picture is made after pre-processing with multiple training datas For the input of initiation sequence marking model, the one-to-one sub- sequence to be extracted of bill picture after being pre-processed with multiple training datas Column are trained initiation sequence marking model, obtain sequence labelling model as output;Wherein the sequence labelling model is The composite model of Recognition with Recurrent Neural Network model or shot and long term memory models or condition random field and bidirectional circulating neural network.
In one embodiment, processor 502 execute will be text filed corresponding to the corresponding subsequence in region to be extracted It carries out text identification to perform the following operations when obtaining the step of the corresponding text information in region to be extracted: obtaining region to be extracted Included vector in corresponding subsequence;Obtain each vector corresponding rectangular area in bill picture;By for text The convolution loop neural network model of this identification identifies text in rectangular area, obtains the corresponding text in region to be extracted Information.
It will be understood by those skilled in the art that the embodiment of computer equipment shown in Figure 11 is not constituted to computer The restriction of equipment specific composition, in other embodiments, computer equipment may include components more more or fewer than diagram, or Person combines certain components or different component layouts.For example, in some embodiments, computer equipment can only include depositing Reservoir and processor, in such embodiments, the structure and function of memory and processor are consistent with embodiment illustrated in fig. 11, Details are not described herein.
It should be appreciated that in embodiments of the present invention, processor 502 can be central processing unit (Central Processing Unit, CPU), which can also be other general processors, digital signal processor (Digital Signal Processor, DSP), specific integrated circuit (Application Specific Integrated Circuit, ASIC), ready-made programmable gate array (Field-Programmable GateArray, FPGA) or other programmable logic devices Part, discrete gate or transistor logic, discrete hardware components etc..Wherein, general processor can be microprocessor or The processor is also possible to any conventional processor etc..
Storage medium is provided in another embodiment of the invention.The storage medium can be that non-volatile computer can Read storage medium.The storage medium is stored with computer program, and following step is realized when wherein computer program is executed by processor It is rapid: to obtain bill picture to be identified, the adjustment of deflection and illumination, ticket after being pre-processed are carried out to bill picture to be identified According to picture;Identification obtains after pretreatment included multiple text filed in bill picture;Obtain it is multiple it is text filed in it is each Text filed space coordinate will be gone here and there by the corresponding vector of each text filed space coordinate by preset splicing sequence It connects, obtains text box sequence;Sequence labelling model trained in advance is obtained, text box sequence is made into the defeated of sequence marking model Enter, obtains the corresponding subsequence in region to be extracted;By the text of text filed progress corresponding to the corresponding subsequence in region to be extracted This identification obtains the corresponding text information in region to be extracted.
In one embodiment, the adjustment that deflection and illumination are carried out to bill picture to be identified, after obtaining pretreatment Bill picture, comprising: deflection adjustment is carried out to bill picture to be identified by Hough straight-line detection, is schemed after obtaining deflection adjustment Piece;Illumination adjustment, bill picture after being pre-processed are carried out to picture after deflection adjustment by histogram equalization.
It is in one embodiment, multiple text filed included by after the identification acquisition pretreatment in bill picture, comprising: Propose that network algorithm is detected bill picture after pretreatment by the spatial window of pre-set dimension by connection text, obtains pre- It is included multiple text filed in bill picture after processing.
In one embodiment, it is described obtain it is multiple it is text filed in each text filed space coordinate, will be by each text The corresponding vector of the space coordinate of one's respective area is concatenated by preset splicing sequence, obtains text box sequence, comprising: obtain more It is a it is text filed in each text filed corresponding rectangular area four apex coordinates;By four vertex of each rectangular area Coordinate presses preset sequential concatenation, obtains vector corresponding with each rectangular area;According to each rectangular area to be identified The corresponding vector in each rectangular area is sequentially concatenated, obtains text box sequence by the sequencing in bill picture.
In one embodiment, before acquisition sequence labelling model trained in advance, further includes: obtain multiple types History bill picture is classified according to the type of history bill picture is corresponding, bill picture after being classified;Bill after classifying The bill picture of each classification carries out the adjustment of deflection and illumination in piece, obtains bill picture after training data pretreatment;Know Multiple training datas that Huo Qu be not included in bill picture after training data pretreatment are text filed;Obtain multiple training datas The text filed space coordinate of each training data in text filed, by the space coordinate pair text filed by each training data The vector answered is concatenated by preset splicing sequence, obtains training data text box sequence;By training data text box sequence In selected vector be labeled, obtain subsequence to be extracted corresponding with training data text box sequence;It will be with multiple instructions Input of the one-to-one training data text box sequence of bill picture as initiation sequence marking model after white silk data prediction, Subsequence to be extracted marks initiation sequence as exporting bill picture correspondingly after pre-processing with multiple training datas Model is trained, and obtains sequence labelling model;Wherein the sequence labelling model is Recognition with Recurrent Neural Network model or shot and long term The composite model of memory models or condition random field and bidirectional circulating neural network.
In one embodiment, described by text filed carry out text knowledge corresponding to the corresponding subsequence in region to be extracted , do not obtain the corresponding text information in region to be extracted, comprising: obtain in the corresponding subsequence in region to be extracted it is included to Amount;Obtain each vector corresponding rectangular area in bill picture;Pass through the convolution loop neural network for text identification Model identifies text in rectangular area, obtains the corresponding text information in region to be extracted.
It is apparent to those skilled in the art that for convenience of description and succinctly, foregoing description is set The specific work process of standby, device and unit, can refer to corresponding processes in the foregoing method embodiment, and details are not described herein. Those of ordinary skill in the art may be aware that unit described in conjunction with the examples disclosed in the embodiments of the present disclosure and algorithm Step can be realized with electronic hardware, computer software, or a combination of the two, in order to clearly demonstrate hardware and software Interchangeability generally describes each exemplary composition and step according to function in the above description.These functions are studied carefully Unexpectedly the specific application and design constraint depending on technical solution are implemented in hardware or software.Professional technician Each specific application can be used different methods to achieve the described function, but this realization is it is not considered that exceed The scope of the present invention.
In several embodiments provided by the present invention, it should be understood that disclosed unit and method, it can be with It realizes by another way.For example, the apparatus embodiments described above are merely exemplary, for example, the unit It divides, only logical function partition, there may be another division manner in actual implementation, can also will be with the same function Unit set is at a unit, such as multiple units or components can be combined or can be integrated into another system or some Feature can be ignored, or not execute.In addition, shown or discussed mutual coupling, direct-coupling or communication connection can Be through some interfaces, the indirect coupling or communication connection of device or unit, be also possible to electricity, mechanical or other shapes Formula connection.
The unit as illustrated by the separation member may or may not be physically separated, aobvious as unit The component shown may or may not be physical unit, it can and it is in one place, or may be distributed over multiple In network unit.Some or all of unit therein can be selected to realize the embodiment of the present invention according to the actual needs Purpose.
It, can also be in addition, the functional units in various embodiments of the present invention may be integrated into one processing unit It is that each unit physically exists alone, is also possible to two or more units and is integrated in one unit.It is above-mentioned integrated Unit both can take the form of hardware realization, can also realize in the form of software functional units.
If the integrated unit is realized in the form of SFU software functional unit and sells or use as independent product When, it can store in one storage medium.Based on this understanding, technical solution of the present invention is substantially in other words to existing The all or part of part or the technical solution that technology contributes can be embodied in the form of software products, should Computer software product is stored in a storage medium, including some instructions are used so that a computer equipment (can be Personal computer, server or network equipment etc.) execute all or part of step of each embodiment the method for the present invention Suddenly.And storage medium above-mentioned include: USB flash disk, mobile hard disk, read-only memory (ROM, Read-Only Memory), magnetic disk or The various media that can store program code such as person's CD.
The above description is merely a specific embodiment, but scope of protection of the present invention is not limited thereto, any Those familiar with the art in the technical scope disclosed by the present invention, can readily occur in various equivalent modifications or replace It changes, these modifications or substitutions should be covered by the protection scope of the present invention.Therefore, protection scope of the present invention should be with right It is required that protection scope subject to.

Claims (10)

1. a kind of pictorial information extracting method characterized by comprising
Bill picture to be identified is obtained, the adjustment of deflection and illumination is carried out to bill picture to be identified, after obtaining pretreatment Bill picture;
Identification obtains after pretreatment included multiple text filed in bill picture;
Obtain it is multiple it is text filed in each text filed space coordinate, will be corresponding by each text filed space coordinate Vector is concatenated by preset splicing sequence, obtains text box sequence;
Sequence labelling model trained in advance is obtained, text box sequence is made to the input of sequence marking model, obtains area to be extracted The corresponding subsequence in domain;
By text filed carry out text identification corresponding to the corresponding subsequence in region to be extracted, it is corresponding to obtain region to be extracted Text information.
2. pictorial information extracting method according to claim 1, which is characterized in that it is described to bill picture to be identified into The adjustment of line skew and illumination, bill picture after being pre-processed, comprising:
Deflection adjustment is carried out to bill picture to be identified by Hough straight-line detection, obtains picture after deflection adjustment;
Illumination adjustment, bill picture after being pre-processed are carried out to picture after deflection adjustment by histogram equalization.
3. pictorial information extracting method according to claim 1, which is characterized in that the identification obtains bill after pretreatment It is multiple text filed included by picture, comprising:
Propose that network algorithm is detected bill picture after pretreatment by the spatial window of pre-set dimension by connection text, obtains It is included multiple text filed in bill picture after to pretreatment.
4. pictorial information extracting method according to claim 1, which is characterized in that it is described obtain it is multiple it is text filed in it is every One text filed space coordinate will be carried out by the corresponding vector of each text filed space coordinate by preset splicing sequence Concatenation, obtains text box sequence, comprising:
Obtain it is multiple it is text filed in each text filed corresponding rectangular area four apex coordinates;
By four apex coordinates of each rectangular area press preset sequential concatenation, obtain it is corresponding with each rectangular area to Amount;
According to sequencing of each rectangular area in bill picture to be identified, by the corresponding vector in each rectangular area according to Sequence is concatenated, and text box sequence is obtained.
5. pictorial information extracting method according to claim 1, which is characterized in that the sequence labelling model is circulation mind Composite model through network model or shot and long term memory models or condition random field and bidirectional circulating neural network;
Before acquisition sequence labelling model trained in advance, further includes:
The history bill picture for obtaining multiple types is classified according to the type of history bill picture is corresponding, bill after being classified Picture;
By the bill picture of each classification carries out the adjustment of deflection and illumination in bill picture after classification, it is pre- to obtain training data Bill picture after processing;
It is text filed that identification obtains multiple training datas included in bill picture after training data pre-processes;
Obtain multiple training datas it is text filed in the text filed space coordinate of each training data, will be by each training data The corresponding vector of text filed space coordinate is concatenated by preset splicing sequence, obtains training data text box sequence;
Selected vector in training data text box sequence is labeled, is obtained corresponding with training data text box sequence Subsequence to be extracted;
The one-to-one training data text box sequence of bill picture is as initiation sequence after pre-processing with multiple training datas The input of marking model, will with multiple training datas pre-process after bill picture correspondingly subsequence to be extracted as defeated Out, initiation sequence marking model is trained, obtains sequence labelling model.
6. pictorial information extracting method according to claim 4, which is characterized in that described by the corresponding son in region to be extracted Text filed carry out text identification, obtains the corresponding text information in region to be extracted corresponding to sequence, comprising:
Obtain vector included in the corresponding subsequence in region to be extracted;
Obtain each vector corresponding rectangular area in bill picture;
Text in rectangular area is identified by the convolution loop neural network model for text identification, is obtained to be extracted The corresponding text information in region.
7. a kind of pictorial information extraction element characterized by comprising
Picture pretreatment unit carries out deflection and illumination to bill picture to be identified for obtaining bill picture to be identified Adjustment, bill picture after being pre-processed;
Text filed recognition unit obtains after pretreatment included multiple text filed in bill picture for identification;
Text box retrieval unit, for obtain it is multiple it is text filed in each text filed space coordinate, will be by each The corresponding vector of text filed space coordinate is concatenated by preset splicing sequence, obtains text box sequence;
Text box sequence is made sequence labelling for obtaining sequence labelling model trained in advance by target subsequences acquiring unit The input of model obtains the corresponding subsequence in region to be extracted;
Text information extraction unit, for by text filed carry out text knowledge corresponding to the corresponding subsequence in region to be extracted Not, the corresponding text information in region to be extracted is obtained.
8. pictorial information extraction element according to claim 7, which is characterized in that the text box retrieval unit, Include:
Rectangular area positioning unit, for obtain it is multiple it is text filed in each text filed corresponding rectangular area four tops Point coordinate;
Region vector acquiring unit, for by four apex coordinates of each rectangular area press preset sequential concatenation, obtain with The corresponding vector in each rectangular area;
Vector concatenation unit, for the sequencing according to each rectangular area in bill picture to be identified, by each square The corresponding vector in shape region is sequentially concatenated, and text box sequence is obtained.
9. a kind of computer equipment, including memory, processor and it is stored on the memory and can be on the processor The computer program of operation, which is characterized in that the processor realizes such as claim 1 to 6 when executing the computer program Any one of described in pictorial information extracting method.
10. a kind of storage medium, which is characterized in that the storage medium is stored with computer program, and the computer program is worked as The processor is set to execute such as pictorial information extracting method as claimed in any one of claims 1 to 6 when being executed by processor.
CN201811236301.9A 2018-10-23 2018-10-23 Pictorial information extracting method, device, computer equipment and storage medium Pending CN109635627A (en)

Priority Applications (1)

Application Number Priority Date Filing Date Title
CN201811236301.9A CN109635627A (en) 2018-10-23 2018-10-23 Pictorial information extracting method, device, computer equipment and storage medium

Applications Claiming Priority (1)

Application Number Priority Date Filing Date Title
CN201811236301.9A CN109635627A (en) 2018-10-23 2018-10-23 Pictorial information extracting method, device, computer equipment and storage medium

Publications (1)

Publication Number Publication Date
CN109635627A true CN109635627A (en) 2019-04-16

Family

ID=66066531

Family Applications (1)

Application Number Title Priority Date Filing Date
CN201811236301.9A Pending CN109635627A (en) 2018-10-23 2018-10-23 Pictorial information extracting method, device, computer equipment and storage medium

Country Status (1)

Country Link
CN (1) CN109635627A (en)

Cited By (29)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN110162757A (en) * 2019-04-29 2019-08-23 北京百度网讯科技有限公司 A kind of tableau format extracting method and system
CN110210400A (en) * 2019-06-03 2019-09-06 上海眼控科技股份有限公司 A kind of form document detection method and equipment
CN110263790A (en) * 2019-04-18 2019-09-20 汕头大学 A kind of power plant's ammeter character locating and recognition methods based on convolutional neural networks
CN110751143A (en) * 2019-09-26 2020-02-04 中电万维信息技术有限责任公司 Electronic invoice information extraction method and electronic equipment
CN110866495A (en) * 2019-11-14 2020-03-06 杭州睿琪软件有限公司 Bill image recognition method, bill image recognition device, bill image recognition equipment, training method and storage medium
CN110866457A (en) * 2019-10-28 2020-03-06 世纪保众(北京)网络科技有限公司 Electronic insurance policy obtaining method and device, computer equipment and storage medium
CN110909733A (en) * 2019-10-28 2020-03-24 世纪保众(北京)网络科技有限公司 Template positioning method and device based on OCR picture recognition and computer equipment
CN111209865A (en) * 2020-01-06 2020-05-29 中科鼎富(北京)科技发展有限公司 File content extraction method and device, electronic equipment and storage medium
CN111241974A (en) * 2020-01-07 2020-06-05 深圳追一科技有限公司 Bill information acquisition method and device, computer equipment and storage medium
CN111291742A (en) * 2020-02-10 2020-06-16 北京百度网讯科技有限公司 Object recognition method and device, electronic equipment and storage medium
CN111353458A (en) * 2020-03-10 2020-06-30 腾讯科技(深圳)有限公司 Text box marking method and device and storage medium
CN111582085A (en) * 2020-04-26 2020-08-25 中国工商银行股份有限公司 Document shooting image identification method and device
CN111652232A (en) * 2020-05-29 2020-09-11 泰康保险集团股份有限公司 Bill identification method and device, electronic equipment and computer readable storage medium
CN111666937A (en) * 2020-04-17 2020-09-15 广州多益网络股份有限公司 Method and system for recognizing text in image
CN111709339A (en) * 2020-06-09 2020-09-25 北京百度网讯科技有限公司 Bill image recognition method, device, equipment and storage medium
CN111832423A (en) * 2020-06-19 2020-10-27 北京邮电大学 Bill information identification method, device and system
CN111858994A (en) * 2019-04-26 2020-10-30 深圳市蓝灯鱼智能科技有限公司 Character retrieval method and device
CN111950353A (en) * 2020-06-30 2020-11-17 深圳市雄帝科技股份有限公司 Seal text recognition method and device and electronic equipment
CN112036406A (en) * 2020-11-05 2020-12-04 北京智源人工智能研究院 Text extraction method and device for image document and electronic equipment
CN112101356A (en) * 2020-09-27 2020-12-18 中国建设银行股份有限公司 Method and device for positioning specific text in picture and storage medium
CN112257712A (en) * 2020-10-29 2021-01-22 湖南星汉数智科技有限公司 Train ticket image rectification method and device, computer device and computer readable storage medium
CN112541443A (en) * 2020-12-16 2021-03-23 平安科技(深圳)有限公司 Invoice information extraction method and device, computer equipment and storage medium
CN112818823A (en) * 2021-01-28 2021-05-18 建信览智科技(北京)有限公司 Text extraction method based on bill content and position information
CN113033534A (en) * 2021-03-10 2021-06-25 北京百度网讯科技有限公司 Method and device for establishing bill type identification model and identifying bill type
CN113112567A (en) * 2021-04-16 2021-07-13 中国工商银行股份有限公司 Method and device for generating editable flow chart, electronic equipment and storage medium
CN113657279A (en) * 2021-08-18 2021-11-16 北京玖安天下科技有限公司 Bill image layout analysis method and device
CN113688268A (en) * 2021-08-31 2021-11-23 中国平安人寿保险股份有限公司 Picture information extraction method and device, computer equipment and storage medium
WO2022142627A1 (en) * 2020-12-28 2022-07-07 深圳壹账通智能科技有限公司 Address information extraction method and apparatus, device and medium
CN114821568A (en) * 2022-06-27 2022-07-29 深圳前海环融联易信息科技服务有限公司 Menu element extraction method and device, computer equipment and storage medium

Citations (2)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN108446621A (en) * 2018-03-14 2018-08-24 平安科技(深圳)有限公司 Bank slip recognition method, server and computer readable storage medium
WO2018157862A1 (en) * 2017-03-02 2018-09-07 腾讯科技(深圳)有限公司 Vehicle type recognition method and device, storage medium and electronic device

Patent Citations (2)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
WO2018157862A1 (en) * 2017-03-02 2018-09-07 腾讯科技(深圳)有限公司 Vehicle type recognition method and device, storage medium and electronic device
CN108446621A (en) * 2018-03-14 2018-08-24 平安科技(深圳)有限公司 Bank slip recognition method, server and computer readable storage medium

Non-Patent Citations (1)

* Cited by examiner, † Cited by third party
Title
ZHI TIAN ET.AL: "Detecting Text in Natural Image with Connectionist Text Proposal Network", ARXIV, pages 1 - 16 *

Cited By (53)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN110263790A (en) * 2019-04-18 2019-09-20 汕头大学 A kind of power plant's ammeter character locating and recognition methods based on convolutional neural networks
CN111858994A (en) * 2019-04-26 2020-10-30 深圳市蓝灯鱼智能科技有限公司 Character retrieval method and device
CN110162757A (en) * 2019-04-29 2019-08-23 北京百度网讯科技有限公司 A kind of tableau format extracting method and system
CN110162757B (en) * 2019-04-29 2023-08-18 北京百度网讯科技有限公司 Table structure extraction method and system
CN110210400A (en) * 2019-06-03 2019-09-06 上海眼控科技股份有限公司 A kind of form document detection method and equipment
CN110751143A (en) * 2019-09-26 2020-02-04 中电万维信息技术有限责任公司 Electronic invoice information extraction method and electronic equipment
CN110909733A (en) * 2019-10-28 2020-03-24 世纪保众(北京)网络科技有限公司 Template positioning method and device based on OCR picture recognition and computer equipment
CN110866457A (en) * 2019-10-28 2020-03-06 世纪保众(北京)网络科技有限公司 Electronic insurance policy obtaining method and device, computer equipment and storage medium
CN110866495B (en) * 2019-11-14 2022-06-28 杭州睿琪软件有限公司 Bill image recognition method, bill image recognition device, bill image recognition equipment, training method and storage medium
CN110866495A (en) * 2019-11-14 2020-03-06 杭州睿琪软件有限公司 Bill image recognition method, bill image recognition device, bill image recognition equipment, training method and storage medium
CN111209865A (en) * 2020-01-06 2020-05-29 中科鼎富(北京)科技发展有限公司 File content extraction method and device, electronic equipment and storage medium
CN111241974A (en) * 2020-01-07 2020-06-05 深圳追一科技有限公司 Bill information acquisition method and device, computer equipment and storage medium
CN111241974B (en) * 2020-01-07 2023-10-27 深圳追一科技有限公司 Bill information acquisition method, device, computer equipment and storage medium
KR102610456B1 (en) * 2020-02-10 2023-12-06 베이징 바이두 넷컴 사이언스 테크놀로지 컴퍼니 리미티드 Object recognition methods and devices, electronic devices, and storage media
JP7389824B2 (en) 2020-02-10 2023-11-30 ベイジン バイドゥ ネットコム サイエンス テクノロジー カンパニー リミテッド Object identification method and device, electronic equipment and storage medium
EP3979129A4 (en) * 2020-02-10 2023-07-12 Beijing Baidu Netcom Science Technology Co., Ltd. Object recognition method and apparatus, and electronic device and storage medium
WO2021159843A1 (en) * 2020-02-10 2021-08-19 北京百度网讯科技有限公司 Object recognition method and apparatus, and electronic device and storage medium
JP2022536320A (en) * 2020-02-10 2022-08-15 ベイジン バイドゥ ネットコム サイエンス テクノロジー カンパニー リミテッド Object identification method and device, electronic device and storage medium
CN111291742A (en) * 2020-02-10 2020-06-16 北京百度网讯科技有限公司 Object recognition method and device, electronic equipment and storage medium
KR20210113620A (en) * 2020-02-10 2021-09-16 베이징 바이두 넷컴 사이언스 테크놀로지 컴퍼니 리미티드 Object recognition method and device, electronic device, storage medium
CN111353458B (en) * 2020-03-10 2023-08-18 腾讯科技(深圳)有限公司 Text box labeling method, device and storage medium
CN111353458A (en) * 2020-03-10 2020-06-30 腾讯科技(深圳)有限公司 Text box marking method and device and storage medium
CN111666937A (en) * 2020-04-17 2020-09-15 广州多益网络股份有限公司 Method and system for recognizing text in image
CN111582085A (en) * 2020-04-26 2020-08-25 中国工商银行股份有限公司 Document shooting image identification method and device
CN111582085B (en) * 2020-04-26 2023-10-10 中国工商银行股份有限公司 Document shooting image recognition method and device
CN111652232A (en) * 2020-05-29 2020-09-11 泰康保险集团股份有限公司 Bill identification method and device, electronic equipment and computer readable storage medium
CN111652232B (en) * 2020-05-29 2023-08-22 泰康保险集团股份有限公司 Bill identification method and device, electronic equipment and computer readable storage medium
CN111709339A (en) * 2020-06-09 2020-09-25 北京百度网讯科技有限公司 Bill image recognition method, device, equipment and storage medium
US11854246B2 (en) 2020-06-09 2023-12-26 Beijing Baidu Netcom Science And Technology Co., Ltd. Method, apparatus, device and storage medium for recognizing bill image
EP3836016A1 (en) * 2020-06-09 2021-06-16 Beijing Baidu Netcom Science And Technology Co., Ltd. Recognizing invoice images
KR102612295B1 (en) * 2020-06-09 2023-12-12 베이징 바이두 넷컴 사이언스 앤 테크놀로지 코., 엘티디. Bill image recognition method, device, apparatus, and storage medium
CN111709339B (en) * 2020-06-09 2023-09-19 北京百度网讯科技有限公司 Bill image recognition method, device, equipment and storage medium
KR20210152931A (en) * 2020-06-09 2021-12-16 베이징 바이두 넷컴 사이언스 앤 테크놀로지 코., 엘티디. Bill image recognition method, device, apparatus, and storage medium
CN111832423A (en) * 2020-06-19 2020-10-27 北京邮电大学 Bill information identification method, device and system
CN111950353A (en) * 2020-06-30 2020-11-17 深圳市雄帝科技股份有限公司 Seal text recognition method and device and electronic equipment
CN111950353B (en) * 2020-06-30 2024-04-19 深圳市雄帝科技股份有限公司 Seal text recognition method and device and electronic equipment
CN112101356A (en) * 2020-09-27 2020-12-18 中国建设银行股份有限公司 Method and device for positioning specific text in picture and storage medium
CN112257712B (en) * 2020-10-29 2024-02-27 湖南星汉数智科技有限公司 Train ticket image alignment method and device, computer device and computer readable storage medium
CN112257712A (en) * 2020-10-29 2021-01-22 湖南星汉数智科技有限公司 Train ticket image rectification method and device, computer device and computer readable storage medium
CN112036406A (en) * 2020-11-05 2020-12-04 北京智源人工智能研究院 Text extraction method and device for image document and electronic equipment
CN112541443A (en) * 2020-12-16 2021-03-23 平安科技(深圳)有限公司 Invoice information extraction method and device, computer equipment and storage medium
WO2022142627A1 (en) * 2020-12-28 2022-07-07 深圳壹账通智能科技有限公司 Address information extraction method and apparatus, device and medium
CN112818823B (en) * 2021-01-28 2024-04-12 金科览智科技(北京)有限公司 Text extraction method based on bill content and position information
CN112818823A (en) * 2021-01-28 2021-05-18 建信览智科技(北京)有限公司 Text extraction method based on bill content and position information
CN113033534B (en) * 2021-03-10 2023-07-25 北京百度网讯科技有限公司 Method and device for establishing bill type recognition model and recognizing bill type
CN113033534A (en) * 2021-03-10 2021-06-25 北京百度网讯科技有限公司 Method and device for establishing bill type identification model and identifying bill type
CN113112567A (en) * 2021-04-16 2021-07-13 中国工商银行股份有限公司 Method and device for generating editable flow chart, electronic equipment and storage medium
CN113657279A (en) * 2021-08-18 2021-11-16 北京玖安天下科技有限公司 Bill image layout analysis method and device
CN113657279B (en) * 2021-08-18 2023-05-05 北京玖安天下科技有限公司 Bill image layout analysis method and device
CN113688268A (en) * 2021-08-31 2021-11-23 中国平安人寿保险股份有限公司 Picture information extraction method and device, computer equipment and storage medium
CN113688268B (en) * 2021-08-31 2024-04-02 中国平安人寿保险股份有限公司 Picture information extraction method, device, computer equipment and storage medium
CN114821568A (en) * 2022-06-27 2022-07-29 深圳前海环融联易信息科技服务有限公司 Menu element extraction method and device, computer equipment and storage medium
CN114821568B (en) * 2022-06-27 2022-09-16 深圳前海环融联易信息科技服务有限公司 Menu element extraction method and device, computer equipment and storage medium

Similar Documents

Publication Publication Date Title
CN109635627A (en) Pictorial information extracting method, device, computer equipment and storage medium
CN109508681B (en) Method and device for generating human body key point detection model
KR102629380B1 (en) Method for Distinguishing a Real Three-Dimensional Object from a Two-Dimensional Spoof of the Real Object
Fang et al. Bottom-up saliency detection model based on human visual sensitivity and amplitude spectrum
US8792722B2 (en) Hand gesture detection
CN111860398B (en) Remote sensing image target detection method and system and terminal equipment
CN111291629A (en) Method and device for recognizing text in image, computer equipment and computer storage medium
CN111524106A (en) Skull fracture detection and model training method, device, equipment and storage medium
CN110363084A (en) A kind of class state detection method, device, storage medium and electronics
CN109472193A (en) Method for detecting human face and device
CN115457531A (en) Method and device for recognizing text
CN108846385B (en) Image identification and correction method and device based on convolution-deconvolution neural network
CN109242796A (en) Character image processing method, device, electronic equipment and computer storage medium
CN108288064B (en) Method and device for generating pictures
CN110599455A (en) Display screen defect detection network model, method and device, electronic equipment and storage medium
CN109816694A (en) Method for tracking target, device and electronic equipment
CN117597703A (en) Multi-scale converter for image analysis
CN111931783A (en) Training sample generation method, machine-readable code identification method and device
CN113160231A (en) Sample generation method, sample generation device and electronic equipment
CN109492601A (en) Face comparison method and device, computer-readable medium and electronic equipment
CN112085094A (en) Document image reproduction detection method and device, computer equipment and storage medium
CN109285181A (en) The method and apparatus of image for identification
CN116311214A (en) License plate recognition method and device
CN112651333B (en) Silence living body detection method, silence living body detection device, terminal equipment and storage medium
CN113516697A (en) Image registration method and device, electronic equipment and computer-readable storage medium

Legal Events

Date Code Title Description
PB01 Publication
PB01 Publication
SE01 Entry into force of request for substantive examination
SE01 Entry into force of request for substantive examination