CN111191647A - Standard formula identification method based on image processing - Google Patents
Standard formula identification method based on image processing Download PDFInfo
- Publication number
- CN111191647A CN111191647A CN201911365677.4A CN201911365677A CN111191647A CN 111191647 A CN111191647 A CN 111191647A CN 201911365677 A CN201911365677 A CN 201911365677A CN 111191647 A CN111191647 A CN 111191647A
- Authority
- CN
- China
- Prior art keywords
- formula
- image processing
- method based
- sub
- standard
- Prior art date
- Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
- Withdrawn
Links
- 238000000034 method Methods 0.000 title claims abstract description 25
- 238000013473 artificial intelligence Methods 0.000 claims abstract description 6
- 239000004816 latex Substances 0.000 claims abstract description 6
- 229920000126 latex Polymers 0.000 claims abstract description 6
- 230000011218 segmentation Effects 0.000 claims description 4
- 238000004364 calculation method Methods 0.000 description 2
- 238000004891 communication Methods 0.000 description 2
- 239000004973 liquid crystal related substance Substances 0.000 description 2
- 230000000694 effects Effects 0.000 description 1
- 238000000605 extraction Methods 0.000 description 1
- 238000003672 processing method Methods 0.000 description 1
- 238000000638 solvent extraction Methods 0.000 description 1
- 238000006467 substitution reaction Methods 0.000 description 1
- 230000009466 transformation Effects 0.000 description 1
- 238000000844 transformation Methods 0.000 description 1
- 230000001131 transforming effect Effects 0.000 description 1
Images
Classifications
-
- G—PHYSICS
- G06—COMPUTING; CALCULATING OR COUNTING
- G06V—IMAGE OR VIDEO RECOGNITION OR UNDERSTANDING
- G06V20/00—Scenes; Scene-specific elements
- G06V20/60—Type of objects
- G06V20/62—Text, e.g. of license plates, overlay texts or captions on TV images
- G06V20/635—Overlay text, e.g. embedded captions in a TV program
-
- G—PHYSICS
- G06—COMPUTING; CALCULATING OR COUNTING
- G06V—IMAGE OR VIDEO RECOGNITION OR UNDERSTANDING
- G06V30/00—Character recognition; Recognising digital ink; Document-oriented image-based pattern recognition
- G06V30/10—Character recognition
Landscapes
- Engineering & Computer Science (AREA)
- Physics & Mathematics (AREA)
- General Physics & Mathematics (AREA)
- Multimedia (AREA)
- Theoretical Computer Science (AREA)
- Character Discrimination (AREA)
- Character Input (AREA)
Abstract
The invention discloses a standard formula identification method based on image processing, which comprises the following steps: step 1, a formula to be identified; step 2, a horizontal formula after calibration; step 3, a formula after binarization processing is carried out; step 4, splitting each sub item; step 5, identifying a system by a formula; and 6, outputting the editable formula. The invention uses artificial intelligence method to output the identification result of the whole formula, and converts it into editable text, such as Latex, EdrawMath, Onenote or Word, or converts it into dynamic link library or loading item, or can be designed into software.
Description
Technical Field
The invention relates to the field of image processing, in particular to a standard formula identification method based on image processing.
Background
Formula editing is an extremely important part in text editing, however, large formula editing is complex and often occupies a large amount of time for text editors. For formulas in some texts (PDFs and pictures), editors need to copy all the texts, but the texts are often edited again in work, and a large amount of time is occupied, so that the design of the formula identification method has important significance.
Disclosure of Invention
1. Objects of the invention
The invention mainly takes pictures or captures screens aiming at formulas in pictures and PDF documents, and then converts the formulas into editable formula texts by an image processing method, and has high recognition rate and high recognition speed.
2. The technical scheme adopted by the invention
The invention discloses a standard formula identification method based on image processing, which comprises the following steps:
step 3, a formula after binarization processing is carried out;
step 4, splitting each sub item;
step 5, identifying a system by a formula;
and 6, outputting the editable formula.
And 5, identifying the formula by using image processing software:
5.1 transforming the intercepted or shot formula image to change the formula into a horizontal direction;
5.2 extracting a formula target by using an image processing algorithm;
5.3 using image processing algorithm to split the whole formula into sub-items;
5.4 recording the position corresponding relation among the sub items, wherein the specific rule is as follows:
the identification symbols and positions comprise left side, right side, upper right corner, upper and lower part type and root;
and 5.5, identifying the meaning of each sub-item according to the position relation among the sub-items, forming a logical ordering relation, classifying the relation into a method or a result, recording and storing the relation in the whole formula, and gradually reasoning from a small range to a large range after equal sign to infer the relation of each sub-item in the whole formula.
Further, said 5.2 uses image processing algorithm including threshold segmentation, binarization to extract formula object.
Furthermore, the 5.3 uses an image processing algorithm including a connected domain identification method.
Furthermore, in step 5.5, according to the position relation of each sub-item and the meaning of each sub-item, an artificial intelligence method is used for comprehensively outputting the recognition result of the whole formula and converting the recognition result into editable text comprising Latex, EdrawMath, Onenote or Word, or converting the recognition result into a dynamic link library or a loading item or independent software.
Further, step 1, the formula to be recognized is acquired by a camera.
Further, step 1, for the formula in the PDF document in the computer or the mobile phone, the formula target can be intercepted by using screenshot software as the input of the system on the premise of opening the file.
3. Advantageous effects adopted by the present invention
The invention uses artificial intelligence method to output the identification result of the whole formula, and converts it into editable text, such as Latex, EdrawMath, Onenote or Word, or converts it into dynamic link library or loading item, or can be designed into software.
Drawings
FIG. 1 is a flow chart of the present invention;
Detailed Description
The technical solutions in the examples of the present invention are clearly and completely described below with reference to the drawings in the examples of the present invention, and it is obvious that the described embodiments are only a part of the embodiments of the present invention, and not all of the embodiments. All other embodiments, which can be derived by a person skilled in the art from the embodiments of the present invention without inventive step, are within the scope of the present invention.
The present invention will be described in further detail with reference to the accompanying drawings.
Example 1
The formula of the invention can be a handwritten formula (including pen writing, liquid crystal panel writing or screen handwriting and the like), and can also be a standard formula (Latex, EdrawMath, One note or Word and the like) edited by a formula editor;
the screen capture can be performed on a PDF document, a Word document or a picture opened on a computer, or can be performed on a PDF document, a Word document or a picture opened on mobile equipment;
the invention can be used as software on a computer or mobile equipment; the plug-in can be used as a plug-in and integrated in Word, WPS and other software; and can also be called as a dynamic link library or a loading item.
The invention aims at a handwritten formula and a formula of standard editing, uses an image processing algorithm for recognition, and converts the handwritten formula and the formula of standard editing into a text which can be edited, and mainly relates to the following aspects:
acquiring an image: aiming at a handwritten formula or a standard formula in a text, a target formula is obtained by adopting a photographing or screen capturing mode;
and (3) partitioning the formula: aiming at a formula image, dividing the formula image into a plurality of sub-items by using methods such as threshold segmentation, binarization, communication domain extraction and the like;
and (3) semantic recognition: analyzing the position relation of each sub-item and the specific meaning of each symbol, and converting the position relation into a formula text which can be directly edited by using an artificial intelligence method;
and (4) outputting the result: outputting editable formula text, software, dynamic link library or loading items.
The method specifically comprises the following steps:
1. a formula to be identified;
2. a calibrated horizontal formula;
3. a formula after binarization processing;
4. splitting each sub item;
5. a formula identification system;
6. and outputting the editable formula.
Example 2
1. Selecting a formula target to be identified for selection;
1.1 for handwritten formula (paper, liquid crystal panel, computer screen, etc.), shooting target with camera and intercepting target formula as system input
1.2 for the formula in the picture, capturing the formula target after taking a picture by using a camera as the input of the system
1.3 for the formula in PDF document in computer or mobile phone, on the premise of opening the file, intercepting the formula target by using the screenshot software as the input of the system
2 identifying formulas using image processing software
2.1, the intercepted or shot formula image is subjected to position, angle and other transformations, so that the formula is changed into the horizontal direction;
2.2 extracting the formula target by using an image processing algorithm (threshold segmentation, binarization and the like);
2.3 using image processing algorithm (communication domain identification, etc.) to split the whole formula into sub-items, such as numerator, denominator, operation symbol, bracket, etc.; e.g. y ═ x2+z3Splitting to obtain sub-items of 'y' '═ x' '' 2 '' + '' z '' '3', and the like;
2.4 recording the position correspondence between the sub-items, e.g. y ═ x2+z3In 'y' on the left side of '═ 2' on the upper right of 'x', etc.;
2.5 identifying the meaning of each sub-item according to the positional relationship between the sub-items, 'y' is on the left side of ═ and then 'y' is the calculation result, 'x' '2' '+' 'z' '3' is the calculation method, etc.;
and 3, comprehensively outputting the recognition result of the whole formula by using an artificial intelligence method according to the position relation and the meaning of each sub-item, and converting the recognition result into an editable text such as Latex, EdrawMath, Onenote or Word, or converting the recognition result into a dynamic link library or a loading item, wherein the recognition result can also be designed into software for use.
The above description is only for the preferred embodiment of the present invention, but the scope of the present invention is not limited thereto, and any changes or substitutions that can be easily conceived by those skilled in the art within the technical scope of the present invention are included in the scope of the present invention. Therefore, the protection scope of the present invention shall be subject to the protection scope of the claims.
Claims (7)
Priority Applications (1)
Application Number | Priority Date | Filing Date | Title |
---|---|---|---|
CN201911365677.4A CN111191647A (en) | 2019-12-26 | 2019-12-26 | Standard formula identification method based on image processing |
Applications Claiming Priority (1)
Application Number | Priority Date | Filing Date | Title |
---|---|---|---|
CN201911365677.4A CN111191647A (en) | 2019-12-26 | 2019-12-26 | Standard formula identification method based on image processing |
Publications (1)
Publication Number | Publication Date |
---|---|
CN111191647A true CN111191647A (en) | 2020-05-22 |
Family
ID=70709401
Family Applications (1)
Application Number | Title | Priority Date | Filing Date |
---|---|---|---|
CN201911365677.4A Withdrawn CN111191647A (en) | 2019-12-26 | 2019-12-26 | Standard formula identification method based on image processing |
Country Status (1)
Country | Link |
---|---|
CN (1) | CN111191647A (en) |
Cited By (2)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
CN113326675A (en) * | 2021-08-04 | 2021-08-31 | 江西风向标教育科技有限公司 | Formula processing method and system for education resource library |
CN114610405A (en) * | 2022-03-03 | 2022-06-10 | 深圳盛显科技有限公司 | Multi-application screen capture and network code output method, device, medium and product |
-
2019
- 2019-12-26 CN CN201911365677.4A patent/CN111191647A/en not_active Withdrawn
Cited By (3)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
CN113326675A (en) * | 2021-08-04 | 2021-08-31 | 江西风向标教育科技有限公司 | Formula processing method and system for education resource library |
CN114610405A (en) * | 2022-03-03 | 2022-06-10 | 深圳盛显科技有限公司 | Multi-application screen capture and network code output method, device, medium and product |
CN114610405B (en) * | 2022-03-03 | 2024-03-29 | 深圳盛显科技有限公司 | Multi-application screen capturing and network code output method, equipment, medium and product |
Similar Documents
Publication | Publication Date | Title |
---|---|---|
CN110689037B (en) | Method and system for automatic object annotation using deep networks | |
CN110135411B (en) | Business card recognition method and device | |
CA3027038C (en) | Document field detection and parsing | |
JP5050075B2 (en) | Image discrimination method | |
CN103946866B (en) | The text detection that multilayer is connected component is used together with histogram | |
US20170039193A1 (en) | Language generation from flow diagrams | |
US9940511B2 (en) | Machine print, hand print, and signature discrimination | |
CN108229481B (en) | Screen content analysis method, device, computing device and storage medium | |
JP7389824B2 (en) | Object identification method and device, electronic equipment and storage medium | |
JP2024501642A (en) | Detecting annotated regions of interest in images | |
CN111191647A (en) | Standard formula identification method based on image processing | |
Marne et al. | Identification of optimal optical character recognition (OCR) engine for proposed system | |
CN113065559A (en) | Image comparison method and device, electronic equipment and storage medium | |
Nachamai | Alphabet recognition of american sign language: a hand gesture recognition approach using sift algorithm | |
CN113628181A (en) | Image processing method, image processing device, electronic equipment and storage medium | |
Dave et al. | Ocr text detector and audio convertor | |
Revathi et al. | Optical character recognition for handwritten Telugu Text | |
WO2024169656A1 (en) | Self-monitoring-based video stream feature generation method, device, and storage medium | |
Dey et al. | A comparative study of margin noise removal algorithms on marnr: A margin noise dataset of document images | |
Gladwin et al. | Read Textual features in Images and convert to Editable form by extended use of Artificial Neural Networks, Deep learning and Maximally Stable Extremal Region techniques | |
CN118447522B (en) | A PDF file component extraction device, method, electronic device and readable storage medium | |
Darahan et al. | Real-Time Page Extraction for Document Digitization | |
US20150142784A1 (en) | Retrieval device and method and computer program product | |
Joyakin | iMask—An Artificial Intelligence Based Redaction Engine | |
Lakshmi et al. | Modeling of Patterns with Spectral Data and Time-Varying PSO to Identify Concealed Character Strokes of Historical Manuscripts |
Legal Events
Date | Code | Title | Description |
---|---|---|---|
PB01 | Publication | ||
PB01 | Publication | ||
SE01 | Entry into force of request for substantive examination | ||
SE01 | Entry into force of request for substantive examination | ||
WW01 | Invention patent application withdrawn after publication |
Application publication date: 20200522 |
|
WW01 | Invention patent application withdrawn after publication |