CN111401312A

CN111401312A - PDF drawing character recognition method, system and equipment

Info

Publication number: CN111401312A
Application number: CN202010278085.5A
Authority: CN
Inventors: 张东锋; 曾雏鹏; 李俊波
Original assignee: Shenzhen Xinzhi Software Co ltd
Current assignee: Shenzhen Xinzhi Software Co ltd
Priority date: 2020-04-10
Filing date: 2020-04-10
Publication date: 2020-07-10
Anticipated expiration: 2040-04-10
Also published as: CN111401312B

Abstract

The invention provides a PDF drawing character recognition method, a system and equipment, wherein the PDF drawing character recognition method comprises the following steps: performing an optical character recognition step based on the deep learning; customized identification and universal identification; and a mobile device low quality image recognition step; wherein the step of performing optical character recognition based on deep learning includes the steps of: detecting a region with characters in a scene and identifying the characters in the region, wherein text detection is performed based on CTPN, Seglink, TextBox, FTSN, Pixellink and CRAFT algorithms; the characters are identified based on CNN and CRNN algorithms; wherein the step of customizing the identification comprises the steps of: identifying the type of a PDF drawing according to the table characters in the PDF or the frame content in the PDF; extracting content in the region according to the structural features; and extracting a key area, and identifying characters in the area or extracting key characters through a deep neural network.

Description

PDF drawing character recognition method, system and equipment

Technical Field

The invention relates to the field of image processing, in particular to a method, a system and equipment for identifying characters of a PDF drawing.

Background

The artificial intelligence has made rapid development in the aspects of data, algorithm and computing power, and a new round of development wave is met under the large background of global economy digital transformation. The influence of the artificial intelligence wave is far beyond the prior art, and the most remarkable characteristic is that the influence is diffused from the professional field to the popular field.

PDF high-precision identification is a rather mature technology in the current market, and a method based on traditional OCR and deep learning is also applied to various industries. Identification of bank notes, PDF forms and industrial drawings are all widely and mature technologies that have been used. Identification of formatted and templated PDFs has dramatic results in both accuracy and speed, thereby improving the work efficiency and capacity of practitioners in various industries.

The conventional PDF identifies PDF in a basic fixed form, and has certain requirements on PDF quality. With the popularization of smart phones, the traditional method does not have a good solution for low-quality PDF images shot by personal mobile phones.

Most of the current PDF identification is whole identification, and extraction and identification aiming at special areas are not provided. For some structured PDF drawings, extracting a region (POI) which is interested by a user and analyzing the content in the POI is also a characteristic of the scheme.

Disclosure of Invention

The invention aims to provide a method, a system and equipment for recognizing characters on a PDF drawing, which provide various recognition schemes such as customized and general scenes, and can provide a solution for recognizing low-quality images by a user based on a deep learning nearest OCR algorithm and a corresponding image processing technology.

Another object of the present invention is to provide a method, a system and a device for identifying characters on a PDF drawing, which can identify various scenes such as industrial drawings, bills, images shot by personal devices, and solve different requirements of different users.

In order to achieve at least one of the above objects, the present invention provides a method for identifying characters on a PDF drawing, wherein the method for identifying characters on a PDF drawing comprises the following steps:

performing an optical character recognition step based on the deep learning; customized identification and universal identification; and a mobile device low quality image recognition step;

wherein the performing optical character recognition based on deep learning includes: detecting a region with characters in a scene and identifying the characters in the region, wherein text detection is performed based on CTPN, Seglink, TextBox, FTSN, Pixellink and CRAFT algorithms; the characters are identified based on CNN and CRNN algorithms;

wherein the step of customizing identification comprises the steps of: identifying the type of a PDF drawing according to the table characters in the PDF or the frame content in the PDF; extracting content in the region according to the structural features; and extracting a key area, and identifying characters in the area or extracting key characters through a deep neural network.

In some embodiments, the step of extracting key regions of the step of customizing identification further comprises the steps of:

extracting a key area according to the proportion of the POI;

extracting all frames by using Hough transform and corner detection;

extracting a key area according to fuzzy matching and accurate matching of characters;

extracting a key area according to the edge characteristics of the area; and

and extracting a key area according to the character characteristics in the area.

In some embodiments, in the step of extracting key regions in the step of customizing identification, key regions are extracted according to edge characteristics of shapes or symmetries or angles or edge granularities of the regions, wherein the key regions are extracted according to characteristics of fonts or sizes of characters in the regions or character types.

In some embodiments, wherein the mobile device low quality image recognition step further comprises the steps of:

performing filtering processing on the image;

performing image enhancement processing on the image;

performing image marginalization sharpening processing on the image;

performing image texture analysis processing on the image;

performing image segmentation processing on the image;

performing geometric analysis processing on the image;

performing image matching processing on the image; and

morphological processing is performed on the image.

In some embodiments, the filtering the image comprises performing image smoothing and image denoising on the image; wherein the performing image texture analysis processing on the image is performing skeleton removal and connectivity processing on the image; wherein the step of performing image matching processing on the image is to perform template matching and search matching processing on the image; wherein the step of performing morphological processing on the image is performing dilation, apparel, and opening and closing operations on the image.

According to another aspect of the present invention, a computer-readable storage medium is further provided, on which a computer program is stored, which, when being executed by a processor, performs the steps of the PDF drawing text recognition method.

According to another aspect of the present invention, there is also provided a PDF drawing text recognition apparatus, including: a software application, a memory for storing the software application, and a processor for executing the software application; and correspondingly executing the steps in the PDF drawing character recognition method by each program of the software application program.

According to another aspect of the present invention, there is also provided a PDF drawing text recognition system, comprising an optical character recognition unit, a customized recognition and general purpose recognition unit, and a mobile device low-quality image recognition unit, wherein the optical character recognition unit comprises a text detection module and a text recognition module, wherein the text detection module is configured to: detecting a region with characters in a scene, and executing text detection based on CTPN, Seglink, TextBox, FTSN, Pixellink and CRAFT algorithms; wherein the word recognition module is configured to: recognizing characters in the detected region, and recognizing the characters based on CRNN and CNN algorithms; the customized identification and general identification unit is provided with a customized identification module, the customized identification module is configured to structurally extract the content in the region, and identify characters in the region or extract key characters through a deep neural network; wherein the mobile device low quality image recognition unit is configured to: performing filtering processing, image enhancement processing, image marginalization sharpening processing, image texture analysis processing, image segmentation processing, geometric shape analysis processing, image matching processing, and morphological processing on the identified image

In some embodiments, the key region extraction module further comprises a POI scale extraction module, a hough transform corner detection extraction module, a text blurring and exact matching extraction module, a region edge characteristic extraction module, and a region text characteristic extraction module; wherein the POI scale extraction module is configured to extract key regions according to POI scale size; the Hough transform corner detection extraction module is configured to extract all frames by utilizing Hough transform and corner detection; the character fuzzy and precise matching extraction module is configured to extract a key area according to fuzzy matching and precise matching of characters; the region edge characteristic extraction module is configured to extract a key region according to edge characteristics of the region; the regional text characteristic extraction module is configured to extract key regions according to text characteristics in the regions.

In some embodiments, the edge characteristics of the region edge characteristic extraction module are shape, symmetry, angle and edge granularity, and the text characteristics in the region of the region text characteristic extraction module are font, size and text type.

Drawings

Fig. 1 is a flowchart illustrating steps of a method for recognizing characters on a PDF drawing according to an embodiment of the present invention.

Fig. 2 is a schematic structural diagram of a PDF drawing text recognition system according to an embodiment of the present invention.

Detailed Description

The following description is presented to disclose the invention so as to enable any person skilled in the art to practice the invention. The preferred embodiments in the following description are given by way of example only, and other obvious variations will occur to those skilled in the art. The basic principles of the invention, as defined in the following description, may be applied to other embodiments, variations, modifications, equivalents, and other technical solutions without departing from the spirit and scope of the invention.

It is understood that the terms "a" and "an" should be interpreted as meaning that a number of one element or element is one in one embodiment, while a number of other elements is one in another embodiment, and the terms "a" and "an" should not be interpreted as limiting the number.

The present invention relates to a computer program. Fig. 1 is a flow chart of a method for recognizing characters on a PDF drawing according to the present invention, which illustrates a solution for controlling or processing an external object or an internal object of a computer by executing a computer program compiled according to the above flow on the basis of a computer program processing flow to solve the problems of the present invention. By the PDF drawing character recognition method, a computer system can be utilized, artificial experience and machine learning results can be integrated, recognition of various scenes such as industrial drawings, bills and images shot by personal equipment can be achieved, and different requirements of different users can be met.

As shown in fig. 1, the method for recognizing characters on a PDF drawing includes the following steps:

s10: an optical character recognition step is performed based on the deep learning.

The optical character recognition step includes the steps of:

detecting a region with characters in a scene, wherein text detection is executed based on CTPN, Seglink, TextBoxes, FTSN, Pixellink and CRAFT algorithms; and

and recognizing characters in the detected region, wherein the characters are recognized based on the CRNN algorithm and the CNN algorithm.

Further, the method for recognizing characters on a PDF drawing further includes step S20: customized identification and universal identification.

The customized identification and universal identification steps comprise the following steps:

customizing and identifying; and

and (5) universal identification.

Wherein the step of customizing the identification further comprises the steps of:

identifying the type of a PDF drawing according to the table characters in the PDF or the frame content in the PDF;

extracting content in the region according to the structural features; and

and extracting a key area, and identifying characters in the area or extracting key characters through a deep neural network.

Wherein the step of extracting the key area further comprises the following steps:

extracting according to the proportion of POI;

extracting all frames by using Hough transform and corner detection;

fuzzy matching and accurate matching of characters;

according to the edge characteristics of the region, such as shape, symmetry, angle, edge granularity, and the like; and

depending on the character properties in the region, such as font, size, character type, etc.

Further, the method for recognizing characters on a PDF drawing further includes step S30: a mobile device low quality image recognition step.

The mobile device low quality image recognition step further comprises the steps of:

filtering, e.g., image smoothing, image denoising;

enhancing the image;

sharpening the edge of the image;

image texture analysis, e.g., de-framing, connectivity;

image segmentation;

analyzing the geometrical morphology;

image matching, e.g., template matching, search matching; and

morphological treatments such as inflation, apparel, opening and closing operations, and the like.

The enterprise provides an identification entrance, and the enterprise is provided with the capability of shooting PDF drawings and identifying individual users. However, the image taken by the user is often poor in quality due to the lighting condition, the shooting angle, and the like. The PDF drawing character recognition method can improve the quality of images, thereby improving the recognition precision. Aiming at the low-quality image provided by the personal equipment provided by the user, after the low-quality image recognition step of the mobile equipment is carried out, the image can have better expressive force, and the image quality approaches to the high-precision PDF, so that the recognition algorithm can be better carried out. But also can expand the applicable scenes and generalization of the whole algorithm.

As will be appreciated by one skilled in the art, embodiments of the present invention may be provided as a method, system, or computer program product. Accordingly, the present invention may take the form of an entirely hardware embodiment, an entirely software embodiment or an embodiment combining software and hardware aspects.

It can be understood by those skilled in the art that the method for identifying the text on the PDF drawing according to the present invention can be implemented by hardware, software, or a combination of hardware and software. The present invention can be realized in a centralized fashion in at least one computer system, or in a distributed fashion where different elements are spread across several interconnected computer systems. Any kind of computer system or other apparatus adapted for carrying out the methods described herein is suited. A typical combination of hardware and software could be a general purpose computer system with a computer program that, when being loaded and executed, controls the computer system such that it carries out the methods described herein.

The present invention can be embedded in a computer program product, which comprises all the features enabling the implementation of the methods described herein. The computer program product is embodied in one or more computer-readable storage media having computer-readable program code embodied therein. According to another aspect of the invention, there is also provided a computer-readable storage medium having stored thereon a computer program capable, when executed by a processor, of performing the steps of the method of the invention. Computer storage media is media in computer memory for storage of some discrete physical quantity. Computer storage media includes, but is not limited to, semiconductors, magnetic disk storage, magnetic cores, magnetic drums, magnetic tape, laser disks, and the like. It will be appreciated by persons skilled in the art that computer storage media are not limited by the foregoing examples, which are intended to be illustrative only and not limiting of the invention.

A typical combination of hardware and software could be a general purpose computer system with a computer program that, when being loaded and executed, controls the computer system such that it carries out the methods described herein. According to another aspect of the present invention, there is also provided a PDF drawing text recognition apparatus, including: a software application, a memory for storing the software application, and a processor for executing the software application. And each program of the software application program can correspondingly execute the steps in the PDF drawing character recognition method.

Corresponding to the embodiment of the method, according to another aspect of the invention, a PDF drawing character recognition system is also provided, and the PDF drawing character recognition system is an application of the PDF drawing character recognition method in the improvement of computer programs.

As shown in fig. 2, in this embodiment of the present invention, the PDF drawing text recognition system includes an optical character recognition unit 100, a customized recognition and general purpose recognition unit 200, and a mobile device low-quality image recognition unit 300.

Specifically, the optical character recognition unit 100 includes a text detection module 110 and a recognition module 120. Wherein the text detection module 110 detects a region with words in the scene, preferably, text detection is performed based on CTPN, Seglink, TextBoxes, FTSN, Pixellink, and CRAFT algorithms. The character recognition module 120 recognizes characters in the detected region, preferably, recognizes characters based on CRNN and CNN algorithms. It will be appreciated by those skilled in the art that in other embodiments of the invention, algorithms other than CRNN, CNN, etc. may be used, and the invention is not limited in this respect.

With the rapid development of neural networks in computer vision, the precision of Optical Character Recognition (OCR) is also greatly improved compared with the conventional technology. Under the large background of deep learning, character recognition is also expanded from the recognition of traditional scenes to the character recognition of general scenes, namely, the character recognition of natural scenes. The algorithm model of the optical character recognition unit ensures that the recognition accuracy of the user is ensured to the maximum extent. In addition, the speed and the precision may be different for different users, so according to different scenes and users, the PDF drawing text recognition system adapts to multiple algorithms, and the text recognition module 120 replaces some complex text recognition algorithms (such as CRNN) with CNN, which can provide a recognition scheme with a faster rate as possible for simple scenes.

The customized identification and general identification unit 200 is provided with a customized identification module 210, and the customized identification module 210 is configured to structure the content in the feature extraction area and identify the words in the area or extract key words through a deep neural network. Preferably, the customized identification module 210 is provided with a key region extraction module 220, and the key region extraction module 220 further includes a POI proportion extraction module 221, a hough transformation corner detection extraction module 222, a text blurring and exact matching extraction module 223, a region edge characteristic extraction module 224, and a region text characteristic extraction module 225. The POI proportion extraction module 221 extracts according to the POI proportion; the hough transform corner detection and extraction module 222 extracts all frames by using hough transform and corner detection; the text fuzzy and precise matching extraction module 223 extracts the content in the key area according to the fuzzy matching and precise matching of the text; the region edge characteristic extraction module 224 extracts contents in the key region, such as shape, symmetry, angle, edge granularity, and the like, according to the edge characteristics of the region); the regional text property extraction module 225 extracts content, such as font, size, text type, etc., in the key region according to the text properties in the region.

The mobile device low-quality image recognition unit 300 is configured to perform filtering processing, image enhancement processing, image marginalization sharpening processing, image texture analysis processing, image segmentation processing, geometric shape analysis processing, image matching processing, and morphology processing on the recognized image.

The enterprise provides an identification entrance, and the enterprise is provided with the capability of shooting PDF drawings and identifying individual users. However, the image taken by the user is often poor in quality due to the lighting condition, the shooting angle, and the like. Aiming at the problems, the low-quality image recognition unit 300 of the mobile equipment adopts a plurality of image processing schemes, so that the quality of the image is improved, and the recognition accuracy is improved.

Aiming at low-quality images provided by personal equipment provided by a user, after the low-quality images are processed by the PDF drawing character recognition system, the images can have better expressive force, and the image quality approaches to high-precision PDF, so that the recognition algorithm can be better carried out, and the application scene and the generalization of the whole algorithm can be expanded.

It will be appreciated by those skilled in the art that the present invention has been described with reference to flowchart illustrations and/or block diagrams of methods, systems and computer program products according to the invention. It will be understood that each block of the flowchart illustrations and/or block diagrams, and combinations of blocks in the flowchart illustrations and/or block diagrams, can be implemented by computer program instructions. These computer program instructions may be provided to a processor of a general purpose computer, special purpose computer, embedded processor, or other programmable data processing apparatus to produce a machine, such that the instructions, which execute via the processor of the computer or other programmable data processing apparatus, create means for implementing the functions specified in the flowchart and/or block diagram block or blocks.

It will be appreciated by persons skilled in the art that the embodiments of the invention described above and shown in the drawings are given by way of example only and are not limiting of the invention. The objects of the invention have been fully and effectively accomplished. The functional and structural principles of the present invention have been shown and described in the examples, and any variations or modifications of the embodiments of the present invention may be made without departing from the principles.

Claims

1. A PDF drawing character recognition method is characterized by comprising the following steps:

2. The method for recognizing characters on PDF drawing as recited in claim 1, wherein said step of extracting key regions in said step of customizing recognition further comprises the steps of:

extracting a key area according to the proportion of the POI;

extracting all frames by using Hough transform and corner detection;

extracting a key area according to the edge characteristics of the area; and

3. The method for recognizing characters on PDF drawing as claimed in claim 2, wherein in the step of extracting key regions in the customized recognition step, key regions are extracted according to the edge characteristics of the shapes or symmetries or angles or edge granularity of the regions, wherein the key regions are extracted according to the characters fonts or sizes or characters types in the regions.

4. The method of claim 1, wherein the mobile device low quality image recognition step further comprises the steps of:

performing filtering processing on the image;

performing image enhancement processing on the image;

performing image marginalization sharpening processing on the image;

performing image texture analysis processing on the image;

performing image segmentation processing on the image;

performing geometric analysis processing on the image;

performing image matching processing on the image; and

morphological processing is performed on the image.

5. The method for recognizing characters on PDF drawing as recited in claim 4, wherein said step of performing filtering process on image includes performing image smoothing and image denoising process on image; wherein the performing image texture analysis processing on the image is performing skeleton removal and connectivity processing on the image; wherein the step of performing image matching processing on the image is to perform template matching and search matching processing on the image; wherein the step of performing morphological processing on the image is performing dilation, apparel, and opening and closing operations on the image.

6. A computer-readable storage medium, on which a computer program is stored, which, when being executed by a processor, carries out the steps of the PDF drawing text recognition method according to any of claims 1 to 5.

7. The utility model provides a PDF drawing characters recognition equipment which characterized in that, PDF drawing characters recognition equipment includes: a software application, a memory for storing the software application, and a processor for executing the software application; the method for identifying characters on PDF drawings is characterized in that each program of the software application program correspondingly executes the steps in the method for identifying characters on PDF drawings as claimed in any one of claims 1 to 5.

8. A PDF drawing text recognition system, comprising an optical character recognition unit, a customized recognition and generic recognition unit, and a mobile device low quality image recognition unit, wherein the optical character recognition unit comprises a text detection module and a text recognition module, wherein the text detection module is configured to: detecting a region with characters in a scene, and executing text detection based on CTPN, Seglink, TextBox, FTSN, Pixellink and CRAFT algorithms; wherein the word recognition module is configured to: recognizing characters in the detected region, and recognizing the characters based on CRNN and CNN algorithms; the customized identification and general identification unit is provided with a customized identification module, the customized identification module is configured to structurally extract the content in the region, and identify characters in the region or extract key characters through a deep neural network; wherein the mobile device low quality image recognition unit is configured to: and carrying out filtering processing, image enhancement processing, image marginalization sharpening processing, image texture analysis processing, image segmentation processing, geometric morphology analysis processing, image matching processing and morphology processing on the identified image.

9. The PDF drawing text recognition system of claim 8, wherein the key region extraction module further comprises a POI scale extraction module, a hough transform corner detection extraction module, a text blur and exact match extraction module, a region edge characteristic extraction module, and a region text characteristic extraction module; wherein the POI scale extraction module is configured to extract key regions according to POI scale size; the Hough transform corner detection extraction module is configured to extract all frames by utilizing Hough transform and corner detection; the character fuzzy and precise matching extraction module is configured to extract a key area according to fuzzy matching and precise matching of characters; the region edge characteristic extraction module is configured to extract a key region according to edge characteristics of the region; the regional text characteristic extraction module is configured to extract key regions according to text characteristics in the regions.

10. The PDF drawing text recognition system of claim 9, wherein the edge characteristics of the region edge characteristic extraction module are shape, symmetry, angle and edge granularity, and wherein the text characteristics in the region of the region text characteristic extraction module are font, size and text type.