CN111539312A

CN111539312A - Method for extracting table from image

Info

Publication number: CN111539312A
Application number: CN202010318730.1A
Authority: CN
Inventors: 罗嘉杰
Original assignee: Individual
Current assignee: Individual
Priority date: 2020-04-21
Filing date: 2020-04-21
Publication date: 2020-08-14

Abstract

The invention discloses a method for extracting a table from an image, comprising the following steps: S1, converting the image to grayscale; S2, calculating the pixel intensity gradient of the image in the vertical and horizontal directions and performing edge detection; S3, enhancing the edge pixels of the resulting image and binarizing it; S4, applying morphological opening with different structuring elements in the vertical and horizontal directions to find the strip-shaped objects in the image; S5, performing the operation of step S4 in the horizontal and vertical directions respectively and superimposing the two results as the output; S6, finding closed rectangular frames; S7, extracting and marking the position and content of each table in the image; S8, correcting the table borders to obtain complete information on all tables in the image. The method automatically segments the image and extracts the tables, improving the efficiency of preprocessing and the accuracy of downstream processing in image text recognition tasks such as OCR (optical character recognition).

Description

Method for extracting table from image

Technical Field

The invention relates to the technical field of image processing methods, in particular to a method for extracting a table from an image.

Background

Domestic OCR technology for Chinese character recognition has produced good research results in recent years, and general recognition can reach an accuracy of more than 95%. However, the general-purpose models used for layout analysis of pictures neither perform well nor generalize well, and most such models are developed and customized for a specific purpose.

If the accuracy and flexibility of Chinese character recognition are to be improved, layout analysis is a very important link. In particular, financial statements, business documents, engineering design drawings and the like mix text with tables, so accurately extracting each table and processing it with a different model is necessary work.

Tables in general have no fixed format: the number of rows and columns is not fixed, the orientation is not fixed, and the border style is not fixed. All of this increases the complexity and difficulty of extracting a table.

Disclosure of Invention

In view of the above technical shortcomings, the present invention provides a method for extracting a table from an image, which aims to solve the problems in the background art.

In order to solve the technical problems, the invention adopts the following technical scheme:

the invention provides a method for extracting a table from an image, which comprises the following steps:

s1, converting the original color image into a grayscale image;

s2, performing image pixel intensity gradient calculation in the vertical and horizontal directions by using a convolution method, and performing edge detection on the processed gray-scale image;

s3, using morphological dilation to enhance the edge pixels of the image, and binarizing the image according to a specific threshold value;

s4, applying morphological opening to the processed image in the vertical and horizontal directions respectively, using a different structuring element for each direction;

s5, overlapping the results of the vertical direction and the horizontal direction as output;

s6, through analysis of the topological structure, performing a secondary judgment on the area enclosed by each frame obtained in step S5 to determine whether the frame is retained as a table, and finding the closed rectangular frames, namely the complete borders of the tables;

s7, extracting and marking the position and the content of the table in the picture according to the result obtained in the step S6, namely extracting the table from the picture;

s8, performing table border correction on the outline characteristics of the table to obtain the complete information of all tables in the image.
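Steps S4 and S5 above can be sketched as follows. This is a minimal illustration in plain Python on a binary image stored as nested lists, not the inventor's implementation: the function names, the line length parameter, and the treatment of pixels outside the image as background are all assumptions made for the example.

```python
def erode(img, offsets):
    """Binary erosion: a pixel stays 1 only if every offset neighbour is 1.
    Neighbours outside the image count as 0 (an assumption)."""
    h, w = len(img), len(img[0])
    out = [[0] * w for _ in range(h)]
    for y in range(h):
        for x in range(w):
            out[y][x] = int(all(
                0 <= y + dy < h and 0 <= x + dx < w and img[y + dy][x + dx]
                for dy, dx in offsets))
    return out


def dilate(img, offsets):
    """Binary dilation: a pixel becomes 1 if any reflected-offset neighbour is 1."""
    h, w = len(img), len(img[0])
    out = [[0] * w for _ in range(h)]
    for y in range(h):
        for x in range(w):
            out[y][x] = int(any(
                0 <= y - dy < h and 0 <= x - dx < w and img[y - dy][x - dx]
                for dy, dx in offsets))
    return out


def opening(img, offsets):
    """Morphological opening: erosion followed by dilation."""
    return dilate(erode(img, offsets), offsets)


def line_element(length, horizontal):
    """Offsets (dy, dx) of a 1-pixel-thick line structuring element."""
    half = length // 2
    return [(0, d) if horizontal else (d, 0)
            for d in range(-half, length - half)]


def extract_lines(binary, length):
    """S4: open with a horizontal and a vertical line element.
    S5: superimpose (logical OR) the two results."""
    horiz = opening(binary, line_element(length, True))
    vert = opening(binary, line_element(length, False))
    combined = [[a | b for a, b in zip(hr, vr)]
                for hr, vr in zip(horiz, vert)]
    return horiz, vert, combined
```

Objects shorter than the chosen line length in both directions, such as characters, do not survive either opening, which is why only the strip-shaped table rules remain in the combined output.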

Preferably, step S2 is specifically: first, the image is convolved with a 5x5 Gaussian filter to reduce noise, and the gradient intensity is then calculated with a 3x3 kernel; the gradient calculation is divided into horizontal and vertical difference equations, wherein the horizontal equation is:

G_x(i,j) = I_{i+1,j-1} - I_{i-1,j-1} + 2I_{i+1,j} - 2I_{i-1,j} + I_{i+1,j+1} - I_{i-1,j+1}

the equation for the vertical direction is:

G_y(i,j) = I_{i-1,j+1} - I_{i-1,j-1} + 2I_{i,j+1} - 2I_{i,j-1} + I_{i+1,j+1} - I_{i+1,j-1}

finally, the required gradient magnitude is obtained using the L2 norm, G = sqrt(G_x^2 + G_y^2).
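The gradient calculation can be sketched in plain Python as follows. The sketch assumes the two difference equations take the standard Sobel form, reading the second term of each as I_{i-1,j-1}, with i as the column index and j as the row index. The function name and the choice to leave border pixels at zero are assumptions, and the 5x5 Gaussian smoothing that precedes this step is omitted.

```python
import math


def sobel_gradients(img):
    """Return (gx, gy, magnitude) for a 2-D list of grey levels.

    img[j][i] is the intensity at column i, row j, matching I_{i,j}.
    Border pixels are left at zero (an assumption; the source does not
    say how borders are handled).
    """
    h, w = len(img), len(img[0])
    gx = [[0.0] * w for _ in range(h)]
    gy = [[0.0] * w for _ in range(h)]
    mag = [[0.0] * w for _ in range(h)]
    for j in range(1, h - 1):
        for i in range(1, w - 1):
            # Horizontal difference: right column minus left column.
            gx[j][i] = (img[j - 1][i + 1] - img[j - 1][i - 1]
                        + 2 * img[j][i + 1] - 2 * img[j][i - 1]
                        + img[j + 1][i + 1] - img[j + 1][i - 1])
            # Vertical difference: lower row minus upper row.
            gy[j][i] = (img[j + 1][i - 1] - img[j - 1][i - 1]
                        + 2 * img[j + 1][i] - 2 * img[j - 1][i]
                        + img[j + 1][i + 1] - img[j - 1][i + 1])
            # L2 norm of the two components.
            mag[j][i] = math.hypot(gx[j][i], gy[j][i])
    return gx, gy, mag
```

For a vertical step edge only the horizontal component responds, and for a horizontal step edge only the vertical one, so the L2 norm gives a single edge strength regardless of line orientation.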

The beneficial effects of the invention are as follows: the method automatically segments the image and extracts the tables, improving the efficiency of preprocessing and the accuracy of downstream processing in image text recognition tasks such as OCR (optical character recognition).

Drawings

To illustrate the embodiments of the present invention and the prior-art solutions more clearly, the drawings used in describing them are briefly introduced below. The drawings described below show only some embodiments of the present invention; those skilled in the art can derive other drawings from them without creative effort.

FIG. 1 is a flow chart of a method for extracting a table from an image according to the present invention;

FIG. 2 is the original image in Example 1;

FIG. 3 is the converted grayscale image in Example 1;

FIG. 4 is the edge detection result in Example 1;

FIG. 5 is the image after binarization in Example 1;

FIG. 6 is the image in Example 1 after morphological opening with the structuring elements;

FIG. 7 is the output image of the horizontal-direction result in Example 1;

FIG. 8 is the output image of the vertical-direction result in Example 1;

FIG. 9 is the output image in Example 1 with the vertical and horizontal results superimposed;

FIG. 10 is the table image extracted in Example 1;

FIG. 11 is the table image after border correction in Example 1.

Detailed Description

The technical solutions in the embodiments of the present invention will be clearly and completely described below with reference to the drawings in the embodiments of the present invention, and it is obvious that the described embodiments are only a part of the embodiments of the present invention, and not all of the embodiments. All other embodiments, which can be derived by a person skilled in the art from the embodiments given herein without making any creative effort, shall fall within the protection scope of the present invention.

Example 1

As shown in FIG. 1, a method for extracting a table from an image includes the following steps:

s1, converting the original color image into a grayscale image, as shown in FIGS. 2-3;

s2, performing image pixel intensity gradient calculation in the vertical and horizontal directions by using a convolution method, and performing edge detection on the processed grayscale image, as shown in fig. 4:

first, the image is convolved with a 5x5 Gaussian filter to reduce noise, and the gradient intensity is then calculated with a 3x3 kernel; the gradient calculation is divided into horizontal and vertical difference equations, wherein the horizontal equation is:

G_x(i,j) = I_{i+1,j-1} - I_{i-1,j-1} + 2I_{i+1,j} - 2I_{i-1,j} + I_{i+1,j+1} - I_{i-1,j+1}

the equation for the vertical direction is:

G_y(i,j) = I_{i-1,j+1} - I_{i-1,j-1} + 2I_{i,j+1} - 2I_{i,j-1} + I_{i+1,j+1} - I_{i+1,j-1}

finally, the required gradient magnitude is obtained using the L2 norm, G = sqrt(G_x^2 + G_y^2);

s3, using morphological dilation to enhance the edge pixels of the acquired image, and binarizing the image according to a specific threshold, see FIG. 5;

s4, applying morphological opening to the processed image in the vertical and horizontal directions respectively, using a different structuring element for each direction, as shown in FIG. 6;

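s5, superimposing the results of the vertical and horizontal directions as the output, see FIGS. 7-9;

s6, through analysis of the topological structure, performing a secondary judgment on the area enclosed by each frame obtained in step S5 to determine whether the frame is retained as a table, and finding the closed rectangular frames, namely the complete borders of the tables;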

s7, extracting and marking the position and content of the table in the picture according to the result obtained in the step S6, namely extracting the table from the picture, as shown in figure 10;

s8, performing table border correction on the outline characteristics of the table to obtain the complete information of all the tables in the image, as shown in FIG. 11.
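Steps S1 and S3 of the example can be sketched in plain Python as follows. This is only an illustration under stated assumptions: the ITU-R BT.601 luma weights, the square dilation window, and the function names are not taken from the source, which specifies only that dilation and a "specific threshold" are used.

```python
def to_gray(rgb):
    """S1: convert rows of (R, G, B) tuples to grey levels.
    The BT.601 luma weights are an assumption."""
    return [[0.299 * r + 0.587 * g + 0.114 * b for r, g, b in row]
            for row in rgb]


def dilate_gray(img, radius=1):
    """S3, first half: grey-level dilation. Each pixel takes the maximum
    over a (2*radius+1) square window clipped to the image."""
    h, w = len(img), len(img[0])
    out = [[0] * w for _ in range(h)]
    for y in range(h):
        for x in range(w):
            out[y][x] = max(
                img[yy][xx]
                for yy in range(max(0, y - radius), min(h, y + radius + 1))
                for xx in range(max(0, x - radius), min(w, x + radius + 1)))
    return out


def binarize(img, threshold):
    """S3, second half: pixels above the threshold become 1, the rest 0."""
    return [[1 if v > threshold else 0 for v in row] for row in img]
```

Dilating before thresholding thickens thin or broken edge responses, so that faint table rules are more likely to survive the binarization and the subsequent opening.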

It will be apparent to those skilled in the art that various changes and modifications may be made in the present invention without departing from the spirit and scope of the invention. Thus, if such modifications and variations of the present invention fall within the scope of the claims of the present invention and their equivalents, the present invention is also intended to include such modifications and variations.

Claims

1. A method for extracting a table from an image, comprising the steps of:

s1, converting the original color image into a grayscale image;

s2, calculating the pixel intensity gradient of the image in the vertical and horizontal directions by convolution, performing edge detection on the processed grayscale image, and applying non-maximum suppression, based on the calculated gradient magnitude and direction, to identify spurious edge responses;

s3, using morphological dilation to enhance the edge pixels of the acquired image, and binarizing the image according to a specific threshold value so as to remove noise and strengthen edge features, wherein pixels whose edge value is greater than the threshold are set to 1 and the remaining pixels are set to 0;

s4, applying morphological opening to the processed image in the vertical and horizontal directions respectively, using a different structuring element for each direction, so as to find the strip-shaped objects in the image and remove characters and other symbol objects;

s5, performing the operation of step S4 in the horizontal and vertical directions respectively, and superimposing the two results as the output so as to locate the position and size of each table in the image;

s6, finding the closed rectangular frames, namely the complete borders of the tables, through analysis of the topological structure;

s7, extracting and marking the position and content of the table in the image according to the result obtained in step S6, namely extracting the table from the image.
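Steps S6 and S7 can be illustrated with the sketch below. The source does not describe the topological analysis in detail, so this example substitutes a simple stand-in: an 8-connected flood fill labels each component of the line mask, and a component is kept as a table frame only if the whole perimeter of its bounding box is foreground. The function names and this closedness criterion are assumptions.

```python
from collections import deque


def find_closed_frames(mask):
    """Return bounding boxes (top, left, bottom, right) of components whose
    bounding-box perimeter is entirely foreground, i.e. closed rectangles."""
    h, w = len(mask), len(mask[0])
    seen = [[False] * w for _ in range(h)]
    boxes = []
    for sy in range(h):
        for sx in range(w):
            if not mask[sy][sx] or seen[sy][sx]:
                continue
            # Breadth-first flood fill over 8-connected neighbours.
            queue = deque([(sy, sx)])
            seen[sy][sx] = True
            top, left, bottom, right = sy, sx, sy, sx
            while queue:
                y, x = queue.popleft()
                top, bottom = min(top, y), max(bottom, y)
                left, right = min(left, x), max(right, x)
                for dy in (-1, 0, 1):
                    for dx in (-1, 0, 1):
                        ny, nx = y + dy, x + dx
                        if (0 <= ny < h and 0 <= nx < w
                                and mask[ny][nx] and not seen[ny][nx]):
                            seen[ny][nx] = True
                            queue.append((ny, nx))
            # Keep the component only if all four sides of its box are solid.
            closed = (
                all(mask[top][x] and mask[bottom][x]
                    for x in range(left, right + 1))
                and all(mask[y][left] and mask[y][right]
                        for y in range(top, bottom + 1)))
            if closed and bottom > top and right > left:
                boxes.append((top, left, bottom, right))
    return boxes


def crop(image, box):
    """S7: cut the region of one table out of an image of the same size."""
    top, left, bottom, right = box
    return [row[left:right + 1] for row in image[top:bottom + 1]]
```

Under this criterion an isolated rule or underline is discarded because its bounding box has no height or width, while a fully bordered table is returned with the coordinates needed to mark and crop it.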

2. The method for extracting a table from an image as claimed in claim 1, wherein step S2 is specifically: first, the image is convolved with a 5x5 Gaussian filter to reduce noise, and the gradient intensity is then calculated with a 3x3 kernel; the gradient calculation is divided into horizontal and vertical difference equations, wherein the horizontal equation is:

G_x(i,j) = I_{i+1,j-1} - I_{i-1,j-1} + 2I_{i+1,j} - 2I_{i-1,j} + I_{i+1,j+1} - I_{i-1,j+1}

the equation for the vertical direction is:

G_y(i,j) = I_{i-1,j+1} - I_{i-1,j-1} + 2I_{i,j+1} - 2I_{i,j-1} + I_{i+1,j+1} - I_{i+1,j-1}

finally, the required gradient magnitude is obtained using the L2 norm, G = sqrt(G_x^2 + G_y^2).
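The 5x5 Gaussian noise-reduction pass named in this claim can be sketched as below. The source gives only the filter size, so the binomial weights [1, 4, 6, 4, 1] / 16 used as a Gaussian approximation, the clamped border handling, and the function name are all assumptions made for illustration.

```python
def gaussian_blur_5x5(img):
    """Smooth a 2-D list of grey levels with a separable 5x5 kernel.

    The 1-D weights [1, 4, 6, 4, 1] / 16 approximate a Gaussian
    (assumed coefficients). Coordinates outside the image are clamped
    to the nearest edge pixel (also an assumption).
    """
    weights = [1, 4, 6, 4, 1]
    h, w = len(img), len(img[0])

    def clamp(v, size):
        return max(0, min(size - 1, v))

    # Horizontal pass: weighted sum over 5 neighbours in each row.
    tmp = [[sum(wt * img[y][clamp(x + k - 2, w)]
                for k, wt in enumerate(weights)) / 16
            for x in range(w)] for y in range(h)]
    # Vertical pass: same weights applied down each column.
    return [[sum(wt * tmp[clamp(y + k - 2, h)][x]
                 for k, wt in enumerate(weights)) / 16
             for x in range(w)] for y in range(h)]
```

Applying the two 1-D passes in sequence is equivalent to one 5x5 convolution with the outer product of the weights, and it leaves flat regions unchanged while spreading isolated noise pixels over their neighbourhood before the gradient is taken.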