CN112328825A

CN112328825A - Picture construction method based on natural language processing

Info

Publication number: CN112328825A
Application number: CN202011082580.5A
Authority: CN
Inventors: 王涛
Original assignee: Suzhou Zero Spring Technology Co ltd
Current assignee: Suzhou Zero Spring Technology Co ltd
Priority date: 2020-10-15
Filing date: 2020-10-15
Publication date: 2021-02-05

Abstract

The invention discloses a picture construction method based on natural language processing, comprising the following steps of; converting the needed pdf file into a picture through Smallpdf; step two; performing expansion and corrosion operation on the picture by using OpenCV; step three; performing character recognition; step four; the invention relates to the technical field of picture construction, and aims to match recognition results. The picture construction method based on natural language processing provides great convenience for processing of digital images and application of computer vision technology, and the picture construction method not only is completely free open source software, but also contains abundant functions of various image processing and recognition, and improves the running speed and accurate matching.

Description

Picture construction method based on natural language processing

Technical Field

The invention relates to the technical field of picture searching, in particular to a picture construction method based on natural language processing.

Background

The picture information can reflect the related content of the picture through the characters, most software packages are compiled by adopting C/C + + based on the view of the calculation speed, although the software packages provide great convenience for the research of computer image processing and computer vision, the software packages have the defects, most software packages do not have advanced mathematical calculation functions, and the operation speed is slow; most software packages do not support the development of application programs of a network server structure; most software packages do not support embeddability.

Disclosure of Invention

Aiming at the defects of the prior art, the invention provides a picture construction method based on natural language processing, an OpenCV image processing algorithm library runs in a VC + + compiling environment, great convenience is provided for the processing of digital images and the application of computer vision technology, and the method not only is completely free open source software, but also contains abundant functions of various image processing and recognition.

In order to achieve the purpose, the invention is realized by the following technical scheme: a picture construction method based on natural language processing comprises the following steps:

step one; converting the needed pdf file into a picture through Smallpdf;

step two; performing expansion and corrosion operation on the picture by using OpenCV;

step three; performing character recognition;

step four; and matching the recognition results.

Further, the digital image converted by the PDF in the step one is operated, and the digital image is an image represented in a two-dimensional array form, and a digital unit of the image is a pixel.

Further, the basic elements of the digital image are pixels, and the pixels are obtained by discretizing a continuous space when the analog image is digitized.

Further, the operation in the step two includes: binary corrosion and expansion, binary switching operation, skeleton extraction, limit corrosion and hit-miss conversion.

Further, the fourth step is specifically to cross-filter the recognition result and the rule-based extraction result to obtain the text.

And further, extracting the longest common substring of the recognition result and the result extracted based on the rule, and simplifying the residual text extracted based on the rule.

Further, the third step is specifically calling an open-source Tesseract OCR API to perform character recognition.

Advantageous effects

The invention provides a picture construction method based on natural language processing. Has the following beneficial effects:

according to the picture construction method based on natural language processing, the picture is subjected to expansion and corrosion operations by using OpenCV, an OpenCV image processing algorithm library runs in a VC + + compiling environment, great convenience is provided for digital image processing and computer vision technology application, the picture construction method not only is completely free open-source software, but also contains abundant functions of various image processing and recognition, and the running speed and the accurate matching are improved.

Drawings

Fig. 1 is a flowchart of a picture construction method based on natural language processing.

Detailed Description

The technical solutions in the embodiments of the present invention will be clearly and completely described below with reference to the drawings in the embodiments of the present invention, and it is obvious that the described embodiments are only a part of the embodiments of the present invention, and not all of the embodiments. All other embodiments, which can be derived by a person skilled in the art from the embodiments given herein without making any creative effort, shall fall within the protection scope of the present invention.

Referring to fig. 1, the present invention provides a technical solution: a picture construction method based on natural language processing comprises the following steps: step one; converting the needed pdf file into a picture through Smallpdf; performing operation on a digital image converted by PDF in the first step, wherein the digital image is an image represented in a two-dimensional array form, and a digital unit of the digital image is a pixel; the basic elements of the digital image are pixels, and the digital image is obtained by discretizing a continuous space when an analog image is digitized; step two; performing expansion and corrosion operation on the picture by using OpenCV; the operation in the second step comprises the following steps: binary corrosion and expansion, binary switching operation, skeleton extraction, limit corrosion and hit-miss conversion;

structural element for expansion and corrosion operation in the invention^]Is the most important and basic concept. The role of the structural element in the morphological transformation is equivalent to the filtering window in the signal processing, denoted by b (x), and for each point x in the working space E, the definition of erosion and dilation is:

expansion:

and (3) corrosion:

the result of expansion of E by B (x) is a set of points whose intersection of B and E is not empty, as a result of translation of B, and the result of erosion of E by B (x) is a set of all points whose intersection of B is contained in E, as a result of translation of B

The dilation operation convolves the image X with a structuring element B of arbitrary shape, typically square or circular.

When the expansion operation is carried out, the structural element B is drawn across the image X, the maximum pixel value of the coverage area of the structural element B is extracted, and the pixel of the anchor point position is replaced. Obviously, this maximization will cause the bright areas in the image to begin to expand.

And the minimum value of the pixel covered by the structural element is extracted by corrosion, and when the corrosion operation is carried out, the structural element B is drawn by an image X, the minimum pixel value of the area covered by the structural element B is extracted, and the pixel at the anchor point position is replaced.

Step three, the invention; performing character recognition, wherein the third step is specifically calling a Tesseract OCR API of an open source to perform character recognition; step four; and matching the recognition results, wherein the fourth step is specifically that the recognition results and the rule-based extraction results are subjected to cross filtering to obtain texts, the recognition results and the rule-based extraction results are subjected to longest common substring extraction, and part of residual texts extracted based on the rules are simplified.

The process of first corrosion and then expansion is called as open operation, the open operation has the functions of eliminating fine objects, separating the objects at fine positions and smoothing the boundaries of larger objects, the process of first expansion and then corrosion is called as closed operation, and the closed operation has the functions of filling fine cavities in the objects and connecting adjacent objects and smooth boundaries.

It is noted that, herein, relational terms such as first and second, and the like may be used solely to distinguish one entity or action from another entity or action without necessarily requiring or implying any actual such relationship or order between such entities or actions. Also, the terms "comprises," "comprising," or any other variation thereof, are intended to cover a non-exclusive inclusion, such that a process, method, article, or apparatus that comprises a list of elements does not include only those elements but may include other elements not expressly listed or inherent to such process, method, article, or apparatus.

Although embodiments of the present invention have been shown and described, it will be appreciated by those skilled in the art that changes, modifications, substitutions and alterations can be made in these embodiments without departing from the principles and spirit of the invention, the scope of which is defined in the appended claims and their equivalents.

Claims

1. A picture construction method based on natural language processing comprises the following steps:

step one; converting the needed pdf file into a picture through Smallpdf;

step three; performing character recognition;

step four; and matching the recognition results.

2. The picture construction method based on natural language processing according to claim 1, wherein: and operating the digital image converted by the PDF in the first step, wherein the digital image is an image represented in a two-dimensional array form, and a digital unit of the digital image is a pixel.

3. The picture construction method based on natural language processing according to claim 2, wherein: the basic elements of the digital image are pixels, and the digital image is obtained by discretizing a continuous space when an analog image is digitized.

4. The picture construction method based on natural language processing according to claim 1, wherein: the operation in the second step comprises the following steps: binary corrosion and expansion, binary switching operation, skeleton extraction, limit corrosion and hit-miss transformation.

5. The picture construction method based on natural language processing according to claim 1, wherein: and step four, specifically, the recognition result and the rule-based extraction result are subjected to cross filtering to obtain a text.

6. The picture construction method based on natural language processing according to claim 5, wherein: and extracting the longest common substring of the recognition result and the result extracted based on the rule, and simplifying the residual text extracted based on the rule.

7. The picture construction method based on natural language processing according to claim 1, wherein: and step three, specifically, calling an open-source Tesseract OCR API to perform character recognition.