CN111191647A - Standard formula identification method based on image processing - Google Patents

Standard formula identification method based on image processing Download PDF

Info

Publication number
CN111191647A
CN111191647A CN201911365677.4A CN201911365677A CN111191647A CN 111191647 A CN111191647 A CN 111191647A CN 201911365677 A CN201911365677 A CN 201911365677A CN 111191647 A CN111191647 A CN 111191647A
Authority
CN
China
Prior art keywords
formula
image processing
sub
item
relation
Prior art date
Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
Withdrawn
Application number
CN201911365677.4A
Other languages
Chinese (zh)
Inventor
周培培
侯幸林
蔡纪鹤
Current Assignee (The listed assignees may be inaccurate. Google has not performed a legal analysis and makes no representation or warranty as to the accuracy of the list.)
Changzhou Institute of Technology
Original Assignee
Changzhou Institute of Technology
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
Filing date
Publication date
Application filed by Changzhou Institute of Technology filed Critical Changzhou Institute of Technology
Priority to CN201911365677.4A priority Critical patent/CN111191647A/en
Publication of CN111191647A publication Critical patent/CN111191647A/en
Withdrawn legal-status Critical Current

Links

Images

Classifications

    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06VIMAGE OR VIDEO RECOGNITION OR UNDERSTANDING
    • G06V20/00Scenes; Scene-specific elements
    • G06V20/60Type of objects
    • G06V20/62Text, e.g. of license plates, overlay texts or captions on TV images
    • G06V20/635Overlay text, e.g. embedded captions in a TV program
    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06VIMAGE OR VIDEO RECOGNITION OR UNDERSTANDING
    • G06V30/00Character recognition; Recognising digital ink; Document-oriented image-based pattern recognition
    • G06V30/10Character recognition

Landscapes

  • Engineering & Computer Science (AREA)
  • Physics & Mathematics (AREA)
  • General Physics & Mathematics (AREA)
  • Multimedia (AREA)
  • Theoretical Computer Science (AREA)
  • Character Input (AREA)
  • Character Discrimination (AREA)

Abstract

The invention discloses a standard formula identification method based on image processing, which comprises the following steps: step 1, a formula to be identified; step 2, a horizontal formula after calibration; step 3, a formula after binarization processing is carried out; step 4, splitting each sub item; step 5, identifying a system by a formula; and 6, outputting the editable formula. The invention uses artificial intelligence method to output the identification result of the whole formula, and converts it into editable text, such as Latex, EdrawMath, Onenote or Word, or converts it into dynamic link library or loading item, or can be designed into software.

Description

Standard formula identification method based on image processing
Technical Field
The invention relates to the field of image processing, in particular to a standard formula identification method based on image processing.
Background
Formula editing is an extremely important part in text editing, however, large formula editing is complex and often occupies a large amount of time for text editors. For formulas in some texts (PDFs and pictures), editors need to copy all the texts, but the texts are often edited again in work, and a large amount of time is occupied, so that the design of the formula identification method has important significance.
Disclosure of Invention
1. Objects of the invention
The invention mainly takes pictures or captures screens aiming at formulas in pictures and PDF documents, and then converts the formulas into editable formula texts by an image processing method, and has high recognition rate and high recognition speed.
2. The technical scheme adopted by the invention
The invention discloses a standard formula identification method based on image processing, which comprises the following steps:
step 1, a formula to be identified;
step 2, a horizontal formula after calibration;
step 3, a formula after binarization processing is carried out;
step 4, splitting each sub item;
step 5, identifying a system by a formula;
and 6, outputting the editable formula.
And 5, identifying the formula by using image processing software:
5.1 transforming the intercepted or shot formula image to change the formula into a horizontal direction;
5.2 extracting a formula target by using an image processing algorithm;
5.3 using image processing algorithm to split the whole formula into sub-items;
5.4 recording the position corresponding relation among the sub items, wherein the specific rule is as follows:
the identification symbols and positions comprise left side, right side, upper right corner, upper and lower part type and root;
and 5.5, identifying the meaning of each sub-item according to the position relation among the sub-items, forming a logical ordering relation, classifying the relation into a method or a result, recording and storing the relation in the whole formula, and gradually reasoning from a small range to a large range after equal sign to infer the relation of each sub-item in the whole formula.
Further, said 5.2 uses image processing algorithm including threshold segmentation, binarization to extract formula object.
Furthermore, the 5.3 uses an image processing algorithm including a connected domain identification method.
Furthermore, in step 5.5, according to the position relation of each sub-item and the meaning of each sub-item, an artificial intelligence method is used for comprehensively outputting the recognition result of the whole formula and converting the recognition result into editable text comprising Latex, EdrawMath, Onenote or Word, or converting the recognition result into a dynamic link library or a loading item or independent software.
Further, step 1, the formula to be recognized is acquired by a camera.
Further, step 1, for the formula in the PDF document in the computer or the mobile phone, the formula target can be intercepted by using screenshot software as the input of the system on the premise of opening the file.
3. Advantageous effects adopted by the present invention
The invention uses artificial intelligence method to output the identification result of the whole formula, and converts it into editable text, such as Latex, EdrawMath, Onenote or Word, or converts it into dynamic link library or loading item, or can be designed into software.
Drawings
FIG. 1 is a flow chart of the present invention;
Detailed Description
The technical solutions in the examples of the present invention are clearly and completely described below with reference to the drawings in the examples of the present invention, and it is obvious that the described embodiments are only a part of the embodiments of the present invention, and not all of the embodiments. All other embodiments, which can be derived by a person skilled in the art from the embodiments of the present invention without inventive step, are within the scope of the present invention.
The present invention will be described in further detail with reference to the accompanying drawings.
Example 1
The formula of the invention can be a handwritten formula (including pen writing, liquid crystal panel writing or screen handwriting and the like), and can also be a standard formula (Latex, EdrawMath, One note or Word and the like) edited by a formula editor;
the screen capture can be performed on a PDF document, a Word document or a picture opened on a computer, or can be performed on a PDF document, a Word document or a picture opened on mobile equipment;
the invention can be used as software on a computer or mobile equipment; the plug-in can be used as a plug-in and integrated in Word, WPS and other software; and can also be called as a dynamic link library or a loading item.
The invention aims at a handwritten formula and a formula of standard editing, uses an image processing algorithm for recognition, and converts the handwritten formula and the formula of standard editing into a text which can be edited, and mainly relates to the following aspects:
acquiring an image: aiming at a handwritten formula or a standard formula in a text, a target formula is obtained by adopting a photographing or screen capturing mode;
and (3) partitioning the formula: aiming at a formula image, dividing the formula image into a plurality of sub-items by using methods such as threshold segmentation, binarization, communication domain extraction and the like;
and (3) semantic recognition: analyzing the position relation of each sub-item and the specific meaning of each symbol, and converting the position relation into a formula text which can be directly edited by using an artificial intelligence method;
and (4) outputting the result: outputting editable formula text, software, dynamic link library or loading items.
The method specifically comprises the following steps:
1. a formula to be identified;
2. a calibrated horizontal formula;
3. a formula after binarization processing;
4. splitting each sub item;
5. a formula identification system;
6. and outputting the editable formula.
Example 2
1. Selecting a formula target to be identified for selection;
1.1 for handwritten formula (paper, liquid crystal panel, computer screen, etc.), shooting target with camera and intercepting target formula as system input
1.2 for the formula in the picture, capturing the formula target after taking a picture by using a camera as the input of the system
1.3 for the formula in PDF document in computer or mobile phone, on the premise of opening the file, intercepting the formula target by using the screenshot software as the input of the system
2 identifying formulas using image processing software
2.1, the intercepted or shot formula image is subjected to position, angle and other transformations, so that the formula is changed into the horizontal direction;
2.2 extracting the formula target by using an image processing algorithm (threshold segmentation, binarization and the like);
2.3 using image processing algorithm (communication domain identification, etc.) to split the whole formula into sub-items, such as numerator, denominator, operation symbol, bracket, etc.; e.g. y ═ x2+z3Splitting to obtain sub-items of 'y' '═ x' '' 2 '' + '' z '' '3', and the like;
2.4 recording the position correspondence between the sub-items, e.g. y ═ x2+z3In 'y' on the left side of '═ 2' on the upper right of 'x', etc.;
2.5 identifying the meaning of each sub-item according to the positional relationship between the sub-items, 'y' is on the left side of ═ and then 'y' is the calculation result, 'x' '2' '+' 'z' '3' is the calculation method, etc.;
and 3, comprehensively outputting the recognition result of the whole formula by using an artificial intelligence method according to the position relation and the meaning of each sub-item, and converting the recognition result into an editable text such as Latex, EdrawMath, Onenote or Word, or converting the recognition result into a dynamic link library or a loading item, wherein the recognition result can also be designed into software for use.
The above description is only for the preferred embodiment of the present invention, but the scope of the present invention is not limited thereto, and any changes or substitutions that can be easily conceived by those skilled in the art within the technical scope of the present invention are included in the scope of the present invention. Therefore, the protection scope of the present invention shall be subject to the protection scope of the claims.

Claims (7)

1. A standard formula identification method based on image processing is characterized in that:
step 1, a formula to be identified;
step 2, a horizontal formula after calibration;
step 3, a formula after binarization processing is carried out;
step 4, splitting each sub item;
step 5, identifying a system by a formula;
and 6, outputting the editable formula.
2. The method of claim 1, wherein the image processing software is used to identify the formula in step 5:
5.1 transforming the intercepted or shot formula image to change the formula into a horizontal direction;
5.2 extracting a formula target by using an image processing algorithm;
5.3 using image processing algorithm to split the whole formula into sub-items;
5.4 recording the position corresponding relation among the sub items, wherein the specific rule is as follows:
the identification symbols and positions comprise left side, right side, upper right corner, upper and lower part type and root;
and 5.5, identifying the meaning of each sub-item according to the position relation among the sub-items, forming a logical ordering relation, classifying the relation into a method or a result, recording and storing the relation in the whole formula, and gradually reasoning from a small range to a large range after equal sign to infer the relation of each sub-item in the whole formula.
3. A standard formula recognition method based on image processing as claimed in claim 1, characterized in that 5.2 uses image processing algorithm including threshold segmentation, binarization to extract formula object.
4. The image processing-based standard formula recognition method of claim 1, wherein said 5.3 uses image processing algorithms including Unicom Domain recognition.
5. The image processing-based standard formula recognition method of claim 1, wherein: and 5.5, comprehensively outputting the identification result of the whole formula by using an artificial intelligent method according to the position relation and the meaning of each sub-item, and converting the identification result into editable texts including Latex, EdrawMath, Onenote or Word, or converting the identification result into a dynamic link library or a loading item or independent software.
6. The image processing-based standard formula recognition method of claim 1, wherein: step 1, a formula to be identified is acquired by a camera.
7. The image processing-based standard formula recognition method of claim 1, wherein: step 1, for a formula in a PDF document in a computer or a mobile phone, on the premise of opening a file, intercepting a formula target by using screenshot software as input of a system.
CN201911365677.4A 2019-12-26 2019-12-26 Standard formula identification method based on image processing Withdrawn CN111191647A (en)

Priority Applications (1)

Application Number Priority Date Filing Date Title
CN201911365677.4A CN111191647A (en) 2019-12-26 2019-12-26 Standard formula identification method based on image processing

Applications Claiming Priority (1)

Application Number Priority Date Filing Date Title
CN201911365677.4A CN111191647A (en) 2019-12-26 2019-12-26 Standard formula identification method based on image processing

Publications (1)

Publication Number Publication Date
CN111191647A true CN111191647A (en) 2020-05-22

Family

ID=70709401

Family Applications (1)

Application Number Title Priority Date Filing Date
CN201911365677.4A Withdrawn CN111191647A (en) 2019-12-26 2019-12-26 Standard formula identification method based on image processing

Country Status (1)

Country Link
CN (1) CN111191647A (en)

Cited By (2)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN113326675A (en) * 2021-08-04 2021-08-31 江西风向标教育科技有限公司 Formula processing method and system for education resource library
CN114610405A (en) * 2022-03-03 2022-06-10 深圳盛显科技有限公司 Multi-application screen capture and network code output method, device, medium and product

Cited By (3)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN113326675A (en) * 2021-08-04 2021-08-31 江西风向标教育科技有限公司 Formula processing method and system for education resource library
CN114610405A (en) * 2022-03-03 2022-06-10 深圳盛显科技有限公司 Multi-application screen capture and network code output method, device, medium and product
CN114610405B (en) * 2022-03-03 2024-03-29 深圳盛显科技有限公司 Multi-application screen capturing and network code output method, equipment, medium and product

Similar Documents

Publication Publication Date Title
CA3027038C (en) Document field detection and parsing
AU2017302250B2 (en) Optical character recognition in structured documents
JP5050075B2 (en) Image discrimination method
JP5775225B2 (en) Text detection using multi-layer connected components with histograms
CN110689037A (en) Method and system for automatic object annotation using deep networks
CN107885430B (en) Audio playing method and device, storage medium and electronic equipment
US10769427B1 (en) Detection and definition of virtual objects in remote screens
US11625871B2 (en) System and method for capturing and interpreting images into triple diagrams
US20180253852A1 (en) Method and device for locating image edge in natural background
JP7389824B2 (en) Object identification method and device, electronic equipment and storage medium
CN113221918B (en) Target detection method, training method and device of target detection model
CN111191647A (en) Standard formula identification method based on image processing
Nachamai Alphabet recognition of american sign language: a hand gesture recognition approach using sift algorithm
Karanje et al. Survey on text detection, segmentation and recognition from a natural scene images
KR20220005243A (en) Sharing and recognition method and device of handwritten scanned document
Yu et al. On-chip real-time feature extraction using semantic annotations for object recognition
Manikandan et al. Text reader for visually impaired people: any reader
Sukanya et al. A Tool for Extracting Text from Scanned Documents and Convert it into Editable Format
Revathi et al. Optical Character Recognition for Handwritten Telugu Text
JP7315639B2 (en) Paper data digitization method and device, electronic device, storage medium
Sikka et al. Camera–based equation solver for android devices
US20150142784A1 (en) Retrieval device and method and computer program product
Darahan et al. Real-Time Page Extraction for Document Digitization
Joyakin iMask—An Artificial Intelligence Based Redaction Engine
Ridwan et al. Breaking News Recognition Using OCR

Legal Events

Date Code Title Description
PB01 Publication
PB01 Publication
SE01 Entry into force of request for substantive examination
SE01 Entry into force of request for substantive examination
WW01 Invention patent application withdrawn after publication
WW01 Invention patent application withdrawn after publication

Application publication date: 20200522