CN109447015A - Method and device for processing box-selected text in table images - Google Patents

Method and device for processing box-selected text in table images

Info

Publication number
CN109447015A
Authority
CN
China
Prior art keywords
text strip
table image
position coordinates
image layer
text
Prior art date
Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
Pending
Application number
CN201811317237.7A
Other languages
Chinese (zh)
Inventor
李鹏辉
竺晨曦
邱锡鹏
Current Assignee (The listed assignees may be inaccurate. Google has not performed a legal analysis and makes no representation or warranty as to the accuracy of the list.)
Shanghai Rhinoceros Technology Co Ltd
Original Assignee
Shanghai Rhinoceros Technology Co Ltd
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
Filing date
Publication date
Application filed by Shanghai Rhinoceros Technology Co Ltd
Priority to CN201811317237.7A
Publication of CN109447015A
Legal status: Pending

Classifications

    • G PHYSICS
    • G06 COMPUTING; CALCULATING OR COUNTING
    • G06V IMAGE OR VIDEO RECOGNITION OR UNDERSTANDING
    • G06V30/00 Character recognition; Recognising digital ink; Document-oriented image-based pattern recognition
    • G06V30/40 Document-oriented image-based pattern recognition
    • G PHYSICS
    • G06 COMPUTING; CALCULATING OR COUNTING
    • G06V IMAGE OR VIDEO RECOGNITION OR UNDERSTANDING
    • G06V30/00 Character recognition; Recognising digital ink; Document-oriented image-based pattern recognition
    • G06V30/10 Character recognition
    • G06V30/14 Image acquisition
    • G06V30/148 Segmentation of character regions
    • G06V30/153 Segmentation of character regions using recognition of characters or words
    • G PHYSICS
    • G06 COMPUTING; CALCULATING OR COUNTING
    • G06V IMAGE OR VIDEO RECOGNITION OR UNDERSTANDING
    • G06V30/00 Character recognition; Recognising digital ink; Document-oriented image-based pattern recognition
    • G06V30/10 Character recognition

Landscapes

  • Engineering & Computer Science (AREA)
  • Computer Vision & Pattern Recognition (AREA)
  • Physics & Mathematics (AREA)
  • General Physics & Mathematics (AREA)
  • Multimedia (AREA)
  • Theoretical Computer Science (AREA)
  • Artificial Intelligence (AREA)
  • Information Retrieval, Db Structures And Fs Structures Therefor (AREA)

Abstract

The present invention provides a method for processing box-selected text in a table image, comprising: removing the items in the table image that interfere with recognition; using a jointly trained model to box-select text strips in the table image, obtain the position coordinates of each text strip in the table image, and recognize the text content corresponding to each text strip; and restoring the table from the text strips, their position coordinates, and the text content by using table features. A device implementing the method comprises: a preprocessing module for removing the items in the table image that interfere with recognition; a recognition module that uses the jointly trained model to box-select text strips in the table image, obtain the position coordinates of each text strip, and recognize the corresponding text content; and a table restoration module that restores the table from the text strips, their position coordinates, and the text content by using table features. The invention can improve the accuracy of text recognition and table restoration for table images.

Description

Method and device for processing box-selected text in table images
Technical field
The present invention relates to a table processing method, and in particular to a method and device for processing box-selected text in table images.
Background
In the field of OCR, recognition accuracy is relatively high for large blocks of text such as an A4 page. For tables, however, recognition accuracy in the industry is currently not very high, because the existing approach of segmenting characters before recognizing them makes layout restoration difficult and cannot make use of the information in the table.
Summary of the invention
In view of the above shortcomings, the present invention provides a method and device for processing box-selected text in table images that can improve the accuracy of text recognition and table restoration for table images.
To achieve the above object, the present invention provides a method for processing box-selected text in table images, comprising the following steps:
Step 1: remove the items in the table image that interfere with recognition;
Step 2: using a jointly trained model, box-select text strips in the table image, obtain the position coordinates of each text strip in the table image, and recognize the text content corresponding to each text strip;
Step 3: restore the table from the text strips, their position coordinates, and the text content by using table features.
In the above method, in step 1 the table image is preprocessed to remove the items that interfere with recognition; the preprocessing includes an image angle correction operation or a watermark and seal removal operation.
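As an illustration of the seal and watermark removal mentioned above, the following is a minimal sketch and not the method of the patent, which does not specify an algorithm. It assumes that seals are predominantly red and that watermarks are much lighter than printed text; the function name and both thresholds are hypothetical.

```python
from typing import List, Tuple

Pixel = Tuple[int, int, int]  # (R, G, B), each channel in 0..255


def remove_seal_and_watermark(
    image: List[List[Pixel]],
    red_margin: int = 60,        # assumed: how much R must exceed G and B
    light_threshold: int = 180,  # assumed: faint pixels are at least this bright
) -> List[List[Pixel]]:
    """Whiten pixels that look like a red seal or a faint watermark."""
    white: Pixel = (255, 255, 255)
    cleaned = []
    for row in image:
        new_row = []
        for r, g, b in row:
            # A seal pixel is assumed to be strongly red.
            is_red_seal = r - max(g, b) >= red_margin
            # A watermark pixel is assumed to be light in every channel.
            is_faint = min(r, g, b) >= light_threshold
            new_row.append(white if (is_red_seal or is_faint) else (r, g, b))
        cleaned.append(new_row)
    return cleaned
```

Dark text pixels pass through unchanged, so the recognizer still sees the strokes; a production system would more likely rely on a dedicated image library and tuned thresholds.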
In the above method, step 2 comprises the following sub-steps:
Step 21: apply RGB three-channel processing to the table image from which the interfering items have been removed, to form at least two table image layers;
Step 22: extract features from each table image layer by convolution;
Step 23: in the first table image layer, predict the position coordinates of each text strip in that layer;
Step 24: in the second table image layer, obtain the text content corresponding to each text strip from the image information and by applying a language model.
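Sub-steps 21 and 22 can be sketched as follows. This is only an illustration under assumptions: the patent does not say how the layers are derived from the RGB channels or which kernels are used, so the one-layer-per-channel split, the function names, and the kernel below are hypothetical.

```python
from typing import List, Tuple

Pixel = Tuple[int, int, int]
Layer = List[List[float]]


def split_channels(image: List[List[Pixel]]) -> Tuple[Layer, Layer, Layer]:
    """Split an RGB image into three single-channel layers."""
    red = [[float(p[0]) for p in row] for row in image]
    green = [[float(p[1]) for p in row] for row in image]
    blue = [[float(p[2]) for p in row] for row in image]
    return red, green, blue


def convolve2d(layer: Layer, kernel: Layer) -> Layer:
    """Valid (no padding) 2-D cross-correlation, the operation used in CNN layers."""
    kh, kw = len(kernel), len(kernel[0])
    out_h = len(layer) - kh + 1
    out_w = len(layer[0]) - kw + 1
    return [
        [
            sum(layer[y + i][x + j] * kernel[i][j]
                for i in range(kh) for j in range(kw))
            for x in range(out_w)
        ]
        for y in range(out_h)
    ]
```

In a trained network the kernel weights are learned rather than hand-written, and the resulting feature maps would feed the position-prediction and text-recognition branches of sub-steps 23 and 24.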
In the above method, in step 23 the text strip position coordinates comprise the upper-left coordinate (x0, y0), the upper-right coordinate (x1, y1), the lower-right coordinate (x2, y2), and the lower-left coordinate (x3, y3).
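The four-corner representation could be held in a small data structure such as the one below. The class name and helper methods are hypothetical; only the corner order (upper-left, upper-right, lower-right, lower-left) comes from the text.

```python
from dataclasses import dataclass
from typing import Tuple


@dataclass(frozen=True)
class StripQuad:
    """Four corners of a text strip, clockwise from the upper-left corner."""
    x0: float
    y0: float  # upper-left
    x1: float
    y1: float  # upper-right
    x2: float
    y2: float  # lower-right
    x3: float
    y3: float  # lower-left

    def bounding_box(self) -> Tuple[float, float, float, float]:
        """Axis-aligned box (x_min, y_min, x_max, y_max) enclosing the quad."""
        xs = (self.x0, self.x1, self.x2, self.x3)
        ys = (self.y0, self.y1, self.y2, self.y3)
        return min(xs), min(ys), max(xs), max(ys)

    def center(self) -> Tuple[float, float]:
        """Mean of the four corners."""
        return (
            (self.x0 + self.x1 + self.x2 + self.x3) / 4,
            (self.y0 + self.y1 + self.y2 + self.y3) / 4,
        )
```

Keeping all four corners, rather than only an axis-aligned box, lets a slightly rotated or skewed strip be described without losing its shape.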
In the above method, in step 3 the table rows and table columns are split according to the text strip position coordinates, the text content is filled into the corresponding text strip positions, and cells are merged according to semantic judgment, thereby completing the restoration of the whole table.
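One possible way to split rows and columns from strip coordinates is to cluster the strip centers along each axis. The sketch below is an assumption, not the patent's procedure: the clustering rule, the tolerance, and all names are hypothetical, and the semantic merging of cells is left out because the text does not describe it.

```python
from typing import List, Tuple

# (x_min, y_min, x_max, y_max, text) for one recognized text strip
Box = Tuple[float, float, float, float, str]


def cluster_1d(values: List[float], tolerance: float) -> List[float]:
    """Group sorted values whose neighbor gap is within tolerance; return group means."""
    groups: List[List[float]] = []
    for v in sorted(values):
        if groups and v - groups[-1][-1] <= tolerance:
            groups[-1].append(v)
        else:
            groups.append([v])
    return [sum(g) / len(g) for g in groups]


def nearest_index(value: float, centers: List[float]) -> int:
    """Index of the cluster center closest to value."""
    return min(range(len(centers)), key=lambda i: abs(centers[i] - value))


def restore_table(boxes: List[Box], tolerance: float = 10.0) -> List[List[str]]:
    """Place each strip's text into a row/column grid inferred from its center."""
    if not boxes:
        return []
    xs = [(b[0] + b[2]) / 2 for b in boxes]
    ys = [(b[1] + b[3]) / 2 for b in boxes]
    col_centers = cluster_1d(xs, tolerance)
    row_centers = cluster_1d(ys, tolerance)
    grid = [["" for _ in col_centers] for _ in row_centers]
    for box, cx, cy in zip(boxes, xs, ys):
        grid[nearest_index(cy, row_centers)][nearest_index(cx, col_centers)] = box[4]
    return grid
```

Cells that receive no strip stay empty, which is where a later merging step could decide whether an empty cell belongs to a neighboring merged cell.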
The above method further comprises step 4: presenting the restored table.
The present invention also provides a device for processing box-selected text in table images, comprising a preprocessing module, a recognition module, and a table restoration module;
the preprocessing module removes the items in the table image that interfere with recognition;
the recognition module uses a jointly trained model to box-select text strips in the table image, obtain the position coordinates of each text strip in the table image, and recognize the text content corresponding to each text strip;
the table restoration module restores the table from the text strips, their position coordinates, and the text content by using table features.
In the above device, the preprocessing that the preprocessing module performs on the table image includes an image angle correction operation or a watermark and seal removal operation.
In the above device, the recognition module operates as follows:
apply RGB three-channel processing to the table image, to form at least two table image layers;
extract features from each table image layer by convolution;
in the first table image layer, predict the position coordinates of each text strip in that layer;
in the second table image layer, obtain the text content corresponding to each text strip from the image information and by applying a language model.
In the above device, the table restoration module splits the table rows and table columns according to the text strip position coordinates, fills the text content into the corresponding text strip positions, and merges cells according to semantic judgment, thereby completing the restoration of the whole table.
Compared with the prior art, the present invention has the following advantages:
A deep learning model is trained jointly on table-based text box selection and text recognition, so that the two tasks of box selection and recognition share the image information of the table. This makes the final table text recognition more accurate without losing the layout information of the table itself, and improves the accuracy of table layout restoration.
Brief description of the drawings
Fig. 1 is a flowchart of the method of the present invention;
Fig. 2 is a structural block diagram of the device of the present invention.
The main reference numerals are as follows:
1 - preprocessing module; 2 - recognition module; 3 - table restoration module; 4 - presentation module
Detailed description
As shown in Fig. 1, the present invention provides a method for processing box-selected text in table images, comprising the following steps:
Step 1: remove the items in the table image that interfere with recognition.
In step 1, the table image is preprocessed to remove the items that interfere with recognition; the preprocessing includes an image angle correction operation or a watermark and seal removal operation.
Step 2: using a jointly trained model, box-select text strips in the table image, obtain the position coordinates of each text strip in the table image, and recognize the text content corresponding to each text strip.
Step 2 comprises the following sub-steps:
Step 21: apply RGB three-channel processing to the table image from which the interfering items have been removed, to form at least two table image layers;
Step 22: extract features from each table image layer by convolution;
Step 23: in the first table image layer, predict the position coordinates of each text strip in that layer;
wherein the text strip position coordinates comprise the upper-left coordinate (x0, y0), the upper-right coordinate (x1, y1), the lower-right coordinate (x2, y2), and the lower-left coordinate (x3, y3).
Step 24: in the second table image layer, obtain the text content corresponding to each text strip from the image information and by applying a language model.
Step 3: restore the table from the text strips, their position coordinates, and the text content by using table features.
In step 3, the table rows and table columns are split according to the text strip position coordinates, the text content is filled into the corresponding text strip positions, and cells are merged according to semantic judgment, thereby completing the restoration of the whole table.
Step 4: present the restored table.
The jointly trained model is trained as follows:
1. Generate tables in different fonts and of different table types, together with the corresponding annotations of text strips and text;
2. Add noise to the generated tables to ensure the robustness of the model;
3. Feed the samples into the joint training model for training;
4. Use the trained model for OCR recognition.
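Items 1 and 2 of the training procedure can be sketched as a tiny synthetic-data generator. Everything here is a hypothetical illustration: the binary image format, the cell sizes, the vocabulary, and the bit-flip noise are assumptions, and glyph rendering is omitted (a real pipeline would draw each text in varied fonts with an imaging library).

```python
import random
from typing import Dict, List, Tuple

VOCAB = ["name", "qty", "price", "total", "date"]  # hypothetical cell texts


def make_sample(rows: int, cols: int, cell_w: int = 8, cell_h: int = 4,
                noise_rate: float = 0.02, seed: int = 0
                ) -> Tuple[List[List[int]], List[Dict]]:
    """Return (image, annotations) for one synthetic ruled table."""
    rng = random.Random(seed)
    height, width = rows * cell_h + 1, cols * cell_w + 1
    # 1 marks a ruling-line pixel, 0 marks background.
    image = [[1 if (y % cell_h == 0 or x % cell_w == 0) else 0
              for x in range(width)] for y in range(height)]
    annotations = []
    for r in range(rows):
        for c in range(cols):
            # Strip quad just inside the cell borders, clockwise from upper-left.
            x0, y0 = c * cell_w + 1, r * cell_h + 1
            x1, y1 = (c + 1) * cell_w - 1, (r + 1) * cell_h - 1
            annotations.append({
                "quad": [(x0, y0), (x1, y0), (x1, y1), (x0, y1)],
                "text": rng.choice(VOCAB),
            })
    # Flip a random fraction of pixels so the model also sees imperfect scans.
    for y in range(height):
        for x in range(width):
            if rng.random() < noise_rate:
                image[y][x] ^= 1
    return image, annotations
```

Because the generator knows where every cell is, the position and text labels come for free, which is the practical appeal of training on generated tables.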
As shown in Fig. 2, the present invention provides a device for processing box-selected text in table images, comprising a preprocessing module 1, a recognition module 2, and a table restoration module 3.
The preprocessing module 1 removes the items in the table image that interfere with recognition.
The preprocessing module preprocesses the table image to remove the items that interfere with recognition; the preprocessing includes an image angle correction operation or a watermark and seal removal operation.
The recognition module 2 uses a jointly trained model to box-select text strips in the table image, obtain the position coordinates of each text strip in the table image, and recognize the text content corresponding to each text strip.
The recognition module operates as follows:
apply RGB three-channel processing to the table image from which the interfering items have been removed, to form at least two table image layers;
extract features from each table image layer by convolution;
in the first table image layer, predict the position coordinates of each text strip in that layer;
in the second table image layer, obtain the text content corresponding to each text strip from the image information and by applying a language model.
The text strip position coordinates comprise the upper-left coordinate (x0, y0), the upper-right coordinate (x1, y1), the lower-right coordinate (x2, y2), and the lower-left coordinate (x3, y3).
The table restoration module 3 restores the table from the text strips, their position coordinates, and the text content by using table features.
The table restoration module splits the table rows and table columns according to the text strip position coordinates, fills the text content into the corresponding text strip positions, and merges cells according to semantic judgment, thereby completing the restoration of the whole table.
The device further comprises a presentation module 4 for presenting the restored table.
The joint training model as a whole is based on the deep-learning CTPN model. In addition, CTC is attached to the CTPN model to recognize the features around each text strip, so that the recognition process can make use of table features and accuracy is substantially improved.
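For readers unfamiliar with CTC (connectionist temporal classification), the sketch below shows standard greedy (best-path) CTC decoding: take the highest-scoring label per frame, collapse consecutive repeats, then drop blanks. The patent does not state which decoding it uses, so this is an assumed illustration; the function name and the convention that label 0 is the blank are hypothetical choices.

```python
from typing import List, Sequence

BLANK = 0  # assumed convention: label 0 is the CTC blank


def ctc_greedy_decode(frame_scores: Sequence[Sequence[float]], alphabet: str) -> str:
    """Best-path CTC decoding; label k (k >= 1) maps to alphabet[k - 1]."""
    # Pick the highest-scoring label in every frame.
    best_path = [max(range(len(f)), key=lambda k: f[k]) for f in frame_scores]
    chars: List[str] = []
    previous = BLANK
    for label in best_path:
        # Emit a character only when it is not blank and differs from the
        # previous frame's label, which collapses repeated predictions.
        if label != BLANK and label != previous:
            chars.append(alphabet[label - 1])
        previous = label
    return "".join(chars)
```

The blank label is what allows a genuinely doubled character to survive: the path a, a, blank, a decodes to "aa", while a, a, a decodes to "a".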
The foregoing describes merely preferred embodiments of the present invention and is illustrative rather than restrictive. Those skilled in the art will understand that many changes, modifications, and even equivalents may be made within the spirit and scope defined by the claims, all of which fall within the protection scope of the present invention.

Claims (10)

1. A method for processing box-selected text in table images, comprising the following steps:
Step 1: remove the items in the table image that interfere with recognition;
Step 2: using a jointly trained model, box-select text strips in the table image, obtain the position coordinates of each text strip in the table image, and recognize the text content corresponding to each text strip;
Step 3: restore the table from the text strips, their position coordinates, and the text content by using table features.
2. The method for processing box-selected text in table images according to claim 1, characterized in that in step 1 the table image is preprocessed to remove the items that interfere with recognition, and the preprocessing includes an image angle correction operation or a watermark and seal removal operation.
3. The method for processing box-selected text in table images according to claim 1, characterized in that step 2 comprises the following sub-steps:
Step 21: apply RGB three-channel processing to the table image from which the interfering items have been removed, to form at least two table image layers;
Step 22: extract features from each table image layer by convolution;
Step 23: in the first table image layer, predict the position coordinates of each text strip in that layer;
Step 24: in the second table image layer, obtain the text content corresponding to each text strip from the image information and by applying a language model.
4. The method for processing box-selected text in table images according to claim 3, characterized in that in step 23 the text strip position coordinates comprise the upper-left coordinate (x0, y0), the upper-right coordinate (x1, y1), the lower-right coordinate (x2, y2), and the lower-left coordinate (x3, y3).
5. The method for processing box-selected text in table images according to claim 1, characterized in that in step 3 the table rows and table columns are split according to the text strip position coordinates, the text content is filled into the corresponding text strip positions, and cells are merged according to semantic judgment, thereby completing the restoration of the whole table.
6. The method for processing box-selected text in table images according to claim 1, characterized by further comprising step 4: presenting the restored table.
7. A device for implementing the method for processing box-selected text in table images according to claim 1, characterized by comprising a preprocessing module, a recognition module, and a table restoration module;
the preprocessing module removes the items in the table image that interfere with recognition;
the recognition module uses a jointly trained model to box-select text strips in the table image, obtain the position coordinates of each text strip in the table image, and recognize the text content corresponding to each text strip;
the table restoration module restores the table from the text strips, their position coordinates, and the text content by using table features.
8. The device according to claim 7, characterized in that the preprocessing that the preprocessing module performs on the table image includes an image angle correction operation or a watermark and seal removal operation.
9. The device according to claim 7, characterized in that the recognition module operates as follows:
apply RGB three-channel processing to the table image, to form at least two table image layers;
extract features from each table image layer by convolution;
in the first table image layer, predict the position coordinates of each text strip in that layer;
in the second table image layer, obtain the text content corresponding to each text strip from the image information and by applying a language model.
10. The device according to claim 7, characterized in that the table restoration module splits the table rows and table columns according to the text strip position coordinates, fills the text content into the corresponding text strip positions, and merges cells according to semantic judgment, thereby completing the restoration of the whole table.
CN201811317237.7A 2018-11-03 2018-11-03 Method and device for processing box-selected text in table images Pending CN109447015A (en)

Priority Applications (1)

Application Number Priority Date Filing Date Title
CN201811317237.7A CN109447015A (en) 2018-11-03 2018-11-03 Method and device for processing box-selected text in table images

Publications (1)

Publication Number Publication Date
CN109447015A true CN109447015A (en) 2019-03-08

Family

ID=65551246

Family Applications (1)

Application Number Title Priority Date Filing Date
CN201811317237.7A Pending CN109447015A (en) 2018-11-03 2018-11-03 Method and device for processing box-selected text in table images

Country Status (1)

Country Link
CN (1) CN109447015A (en)

Citations (4)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
JPH07220023A (en) * 1994-01-31 1995-08-18 Hitachi Ltd Method and device for table recognition
CN105574486A (en) * 2015-11-25 2016-05-11 成都数联铭品科技有限公司 Image table character segmenting method
CN107704857A (en) * 2017-09-25 2018-02-16 北京邮电大学 A kind of lightweight licence plate recognition method and device end to end
CN108416279A (en) * 2018-02-26 2018-08-17 阿博茨德(北京)科技有限公司 Form analysis method and device in file and picture

Cited By (5)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN109948507A (en) * 2019-03-14 2019-06-28 北京百度网讯科技有限公司 Method and apparatus for detecting table
CN109948507B (en) * 2019-03-14 2021-05-07 北京百度网讯科技有限公司 Method and device for detecting table
CN111340023A (en) * 2020-02-24 2020-06-26 创新奇智(上海)科技有限公司 Text recognition method and device, electronic equipment and storage medium
CN111340023B (en) * 2020-02-24 2022-09-09 创新奇智(上海)科技有限公司 Text recognition method and device, electronic equipment and storage medium
CN112861736A (en) * 2021-02-10 2021-05-28 上海大学 Document table content identification and information extraction method based on image processing

Legal Events

Date Code Title Description
PB01 Publication
SE01 Entry into force of request for substantive examination
RJ01 Rejection of invention patent application after publication

Application publication date: 20190308