CN111191647A

CN111191647A - Standard formula identification method based on image processing

Info

Publication number: CN111191647A
Application number: CN201911365677.4A
Authority: CN
Inventors: 周培培; 侯幸林; 蔡纪鹤
Original assignee: Changzhou Institute of Technology
Current assignee: Changzhou Institute of Technology
Priority date: 2019-12-26
Filing date: 2019-12-26
Publication date: 2020-05-22

Abstract

The invention discloses a standard formula identification method based on image processing, which comprises the following steps: step 1, a formula to be identified; step 2, a horizontal formula after calibration; step 3, a formula after binarization processing is carried out; step 4, splitting each sub item; step 5, identifying a system by a formula; and 6, outputting the editable formula. The invention uses artificial intelligence method to output the identification result of the whole formula, and converts it into editable text, such as Latex, EdrawMath, Onenote or Word, or converts it into dynamic link library or loading item, or can be designed into software.

Description

Standard formula identification method based on image processing

Technical Field

The invention relates to the field of image processing, in particular to a standard formula identification method based on image processing.

Background

Formula editing is an extremely important part in text editing, however, large formula editing is complex and often occupies a large amount of time for text editors. For formulas in some texts (PDFs and pictures), editors need to copy all the texts, but the texts are often edited again in work, and a large amount of time is occupied, so that the design of the formula identification method has important significance.

Disclosure of Invention

1. Objects of the invention

The invention mainly takes pictures or captures screens aiming at formulas in pictures and PDF documents, and then converts the formulas into editable formula texts by an image processing method, and has high recognition rate and high recognition speed.

2. The technical scheme adopted by the invention

The invention discloses a standard formula identification method based on image processing, which comprises the following steps:

step 1, a formula to be identified;

step 2, a horizontal formula after calibration;

step 3, a formula after binarization processing is carried out;

step 4, splitting each sub item;

step 5, identifying a system by a formula;

and 6, outputting the editable formula.

And 5, identifying the formula by using image processing software:

5.1 transforming the intercepted or shot formula image to change the formula into a horizontal direction;

5.2 extracting a formula target by using an image processing algorithm;

5.3 using image processing algorithm to split the whole formula into sub-items;

5.4 recording the position corresponding relation among the sub items, wherein the specific rule is as follows:

the identification symbols and positions comprise left side, right side, upper right corner, upper and lower part type and root;

and 5.5, identifying the meaning of each sub-item according to the position relation among the sub-items, forming a logical ordering relation, classifying the relation into a method or a result, recording and storing the relation in the whole formula, and gradually reasoning from a small range to a large range after equal sign to infer the relation of each sub-item in the whole formula.

Further, said 5.2 uses image processing algorithm including threshold segmentation, binarization to extract formula object.

Furthermore, the 5.3 uses an image processing algorithm including a connected domain identification method.

Furthermore, in step 5.5, according to the position relation of each sub-item and the meaning of each sub-item, an artificial intelligence method is used for comprehensively outputting the recognition result of the whole formula and converting the recognition result into editable text comprising Latex, EdrawMath, Onenote or Word, or converting the recognition result into a dynamic link library or a loading item or independent software.

Further, step 1, the formula to be recognized is acquired by a camera.

Further, step 1, for the formula in the PDF document in the computer or the mobile phone, the formula target can be intercepted by using screenshot software as the input of the system on the premise of opening the file.

3. Advantageous effects adopted by the present invention

The invention uses artificial intelligence method to output the identification result of the whole formula, and converts it into editable text, such as Latex, EdrawMath, Onenote or Word, or converts it into dynamic link library or loading item, or can be designed into software.

Drawings

FIG. 1 is a flow chart of the present invention;

Detailed Description

The technical solutions in the examples of the present invention are clearly and completely described below with reference to the drawings in the examples of the present invention, and it is obvious that the described embodiments are only a part of the embodiments of the present invention, and not all of the embodiments. All other embodiments, which can be derived by a person skilled in the art from the embodiments of the present invention without inventive step, are within the scope of the present invention.

The present invention will be described in further detail with reference to the accompanying drawings.

Example 1

The formula of the invention can be a handwritten formula (including pen writing, liquid crystal panel writing or screen handwriting and the like), and can also be a standard formula (Latex, EdrawMath, One note or Word and the like) edited by a formula editor;

the screen capture can be performed on a PDF document, a Word document or a picture opened on a computer, or can be performed on a PDF document, a Word document or a picture opened on mobile equipment;

the invention can be used as software on a computer or mobile equipment; the plug-in can be used as a plug-in and integrated in Word, WPS and other software; and can also be called as a dynamic link library or a loading item.

The invention aims at a handwritten formula and a formula of standard editing, uses an image processing algorithm for recognition, and converts the handwritten formula and the formula of standard editing into a text which can be edited, and mainly relates to the following aspects:

acquiring an image: aiming at a handwritten formula or a standard formula in a text, a target formula is obtained by adopting a photographing or screen capturing mode;

and (3) partitioning the formula: aiming at a formula image, dividing the formula image into a plurality of sub-items by using methods such as threshold segmentation, binarization, communication domain extraction and the like;

and (3) semantic recognition: analyzing the position relation of each sub-item and the specific meaning of each symbol, and converting the position relation into a formula text which can be directly edited by using an artificial intelligence method;

and (4) outputting the result: outputting editable formula text, software, dynamic link library or loading items.

The method specifically comprises the following steps:

1. a formula to be identified;

2. a calibrated horizontal formula;

3. a formula after binarization processing;

4. splitting each sub item;

5. a formula identification system;

6. and outputting the editable formula.

Example 2

1. Selecting a formula target to be identified for selection;

1.1 for handwritten formula (paper, liquid crystal panel, computer screen, etc.), shooting target with camera and intercepting target formula as system input

1.2 for the formula in the picture, capturing the formula target after taking a picture by using a camera as the input of the system

1.3 for the formula in PDF document in computer or mobile phone, on the premise of opening the file, intercepting the formula target by using the screenshot software as the input of the system

2 identifying formulas using image processing software

2.1, the intercepted or shot formula image is subjected to position, angle and other transformations, so that the formula is changed into the horizontal direction;

2.2 extracting the formula target by using an image processing algorithm (threshold segmentation, binarization and the like);

2.3 using image processing algorithm (communication domain identification, etc.) to split the whole formula into sub-items, such as numerator, denominator, operation symbol, bracket, etc.; e.g. y ═ x²+z³Splitting to obtain sub-items of 'y' '═ x' '' 2 '' + '' z '' '3', and the like;

2.4 recording the position correspondence between the sub-items, e.g. y ═ x²+z³In 'y' on the left side of '═ 2' on the upper right of 'x', etc.;

2.5 identifying the meaning of each sub-item according to the positional relationship between the sub-items, 'y' is on the left side of ═ and then 'y' is the calculation result, 'x' '2' '+' 'z' '3' is the calculation method, etc.;

and 3, comprehensively outputting the recognition result of the whole formula by using an artificial intelligence method according to the position relation and the meaning of each sub-item, and converting the recognition result into an editable text such as Latex, EdrawMath, Onenote or Word, or converting the recognition result into a dynamic link library or a loading item, wherein the recognition result can also be designed into software for use.

The above description is only for the preferred embodiment of the present invention, but the scope of the present invention is not limited thereto, and any changes or substitutions that can be easily conceived by those skilled in the art within the technical scope of the present invention are included in the scope of the present invention. Therefore, the protection scope of the present invention shall be subject to the protection scope of the claims.

Claims

1. a standard formula recognition method based on image processing, is characterized in that:

Step 1. The formula to be identified;

Step 2. The calibrated horizontal formula;

Step 3. The formula after binarization;

Step 4. Each sub-item after splitting;

Step 5. Formula recognition system;

Step 6. Editable formula for output.

2. the standard formula identification method based on image processing according to claim 1 is characterized in that using the software of image processing in step 5 to identify formula:

5.1 Transform the captured or photographed formula image so that the formula becomes horizontal;

5.2 Use the image processing algorithm to extract the formula target;

5.3 Use the image processing algorithm to split the formula as a whole into sub-items;

5.4 Record the positional correspondence between each sub-item. The specific rules are as follows:

Identification symbols and positions include left, right, upper right, upper and lower fractions, and under the root sign;

5.5 According to the positional relationship between each sub-item, identify the meaning of each sub-item, form a logical sorting relationship, and classify it as a method or result, store this type of relationship record in the entire formula, and use the small range after the equal sign. Step-by-step reasoning to a large range of sequences can infer the relationship of each subterm in the entire formula.

3 . The standard formula recognition method based on image processing according to claim 1 , wherein the image processing algorithm used in 5.2 includes threshold segmentation and binarization to extract formula targets. 4 .

4. The standard formula recognition method based on image processing according to claim 1, characterized in that the described 5.3 using image processing algorithm includes the Unicom domain recognition method.

5. the standard formula identification method based on image processing according to claim 1, is characterized in that: in step 5.5, according to the positional relationship of each subitem and the meaning of each subitem, use artificial intelligence method synthetically output the identification result of whole formula , and convert it into editable text, including Latex, EdrawMath, Onenote or Word, or convert it into a dynamic link library or add-in or stand-alone software.

6 . The standard formula identification method based on image processing according to claim 1 , wherein: step 1. The formula to be identified is obtained by using a camera. 7 .

7. the standard formula recognition method based on image processing according to claim 1, is characterized in that: step 1. for formula in the PDF document in computer or mobile phone, can under the premise of opening file, use screenshot software to intercept formula target , as the input to the system.