EP1239406A3 - Device and method for character recognition and for recognition of mathematical expressions - Google Patents

Device and method for character recognition and for recognition of mathematical expressions Download PDF

Info

Publication number
EP1239406A3
EP1239406A3 EP02004377A EP02004377A EP1239406A3 EP 1239406 A3 EP1239406 A3 EP 1239406A3 EP 02004377 A EP02004377 A EP 02004377A EP 02004377 A EP02004377 A EP 02004377A EP 1239406 A3 EP1239406 A3 EP 1239406A3
Authority
EP
European Patent Office
Prior art keywords
mathematical expression
belonging
text
possibility
recognition
Prior art date
Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
Granted
Application number
EP02004377A
Other languages
German (de)
French (fr)
Other versions
EP1239406B1 (en
EP1239406A2 (en
Inventor
Masakazu Kyushu University Suzuki
Kazuaki Yokota
Yuko A910 Kureare Toshiba Ome Eto
Current Assignee (The listed assignees may be inaccurate. Google has not performed a legal analysis and makes no representation or warranty as to the accuracy of the list.)
Toshiba Corp
Original Assignee
Toshiba Corp
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
Filing date
Publication date
Application filed by Toshiba Corp filed Critical Toshiba Corp
Publication of EP1239406A2 publication Critical patent/EP1239406A2/en
Publication of EP1239406A3 publication Critical patent/EP1239406A3/en
Application granted granted Critical
Publication of EP1239406B1 publication Critical patent/EP1239406B1/en
Anticipated expiration legal-status Critical
Expired - Lifetime legal-status Critical Current

Links

Classifications

    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06VIMAGE OR VIDEO RECOGNITION OR UNDERSTANDING
    • G06V30/00Character recognition; Recognising digital ink; Document-oriented image-based pattern recognition
    • G06V30/10Character recognition
    • G06V30/24Character recognition characterised by the processing or recognition method
    • G06V30/242Division of the character sequences into groups prior to recognition; Selection of dictionaries
    • G06V30/244Division of the character sequences into groups prior to recognition; Selection of dictionaries using graphical properties, e.g. alphabet type or font
    • G06V30/2455Discrimination between machine-print, hand-print and cursive writing
    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06VIMAGE OR VIDEO RECOGNITION OR UNDERSTANDING
    • G06V30/00Character recognition; Recognising digital ink; Document-oriented image-based pattern recognition
    • G06V30/10Character recognition
    • G06V30/26Techniques for post-processing, e.g. correcting the recognition result
    • G06V30/262Techniques for post-processing, e.g. correcting the recognition result using context analysis, e.g. lexical, syntactic or semantic context
    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06VIMAGE OR VIDEO RECOGNITION OR UNDERSTANDING
    • G06V30/00Character recognition; Recognising digital ink; Document-oriented image-based pattern recognition
    • G06V30/10Character recognition

Abstract

A mathematical expression recognizing device comprises a character recognition unit (112) configured to recognize characters in a document image containing a text and a mathematical expression, a first dictionary (201) configured to store a pair of evaluation scores for each type of word that can be identified by means of normal expression, the score showing the possibility of belonging to the text and that of belonging to the mathematical expression, an evaluation unit (113) configured to obtain the evaluation scores showing the possibility of belonging to the text and that of belonging to the mathematical expression for each of the words included in the characters recognized by the character recognition unit with reference to the first dictionary, and a mathematical expression detecting unit (114) configured to search for an optimal path connecting words by selecting one of the text and the mathematical expression based on a formative grammar and the evaluation scores showing the possibility of belonging to the text and that of belonging to the mathematical expression for each of the words, thereby detecting characters belonging to the mathematical expression.
EP02004377A 2001-03-07 2002-03-05 Device and method for character recognition and for recognition of mathematical expressions Expired - Lifetime EP1239406B1 (en)

Applications Claiming Priority (2)

Application Number Priority Date Filing Date Title
JP2001063968 2001-03-07
JP2001063968A JP4181310B2 (en) 2001-03-07 2001-03-07 Formula recognition apparatus and formula recognition method

Publications (3)

Publication Number Publication Date
EP1239406A2 EP1239406A2 (en) 2002-09-11
EP1239406A3 true EP1239406A3 (en) 2005-03-16
EP1239406B1 EP1239406B1 (en) 2007-12-19

Family

ID=18922868

Family Applications (1)

Application Number Title Priority Date Filing Date
EP02004377A Expired - Lifetime EP1239406B1 (en) 2001-03-07 2002-03-05 Device and method for character recognition and for recognition of mathematical expressions

Country Status (4)

Country Link
US (1) US7181068B2 (en)
EP (1) EP1239406B1 (en)
JP (1) JP4181310B2 (en)
DE (1) DE60224128T2 (en)

Cited By (1)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN108038441A (en) * 2017-12-07 2018-05-15 庞军良 A kind of System and method for based on image recognition

Families Citing this family (67)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
JP2007535740A (en) * 2004-03-23 2007-12-06 アンヘル・パラショス・オルエタ Managing formulas
US7698638B2 (en) * 2004-09-15 2010-04-13 Microsoft Corporation Systems and methods for automated equation buildup
US7561739B2 (en) * 2004-09-22 2009-07-14 Microsoft Corporation Analyzing scripts and determining characters in expression recognition
US7929767B2 (en) * 2004-09-22 2011-04-19 Microsoft Corporation Analyzing subordinate sub-expressions in expression recognition
US7561737B2 (en) * 2004-09-22 2009-07-14 Microsoft Corporation Mathematical expression recognition
US7812986B2 (en) 2005-08-23 2010-10-12 Ricoh Co. Ltd. System and methods for use of voice mail and email in a mixed media environment
US7702673B2 (en) 2004-10-01 2010-04-20 Ricoh Co., Ltd. System and methods for creation and use of a mixed media environment
US9405751B2 (en) 2005-08-23 2016-08-02 Ricoh Co., Ltd. Database for mixed media document system
US9530050B1 (en) 2007-07-11 2016-12-27 Ricoh Co., Ltd. Document annotation sharing
US8156116B2 (en) 2006-07-31 2012-04-10 Ricoh Co., Ltd Dynamic presentation of targeted information in a mixed media reality recognition system
US9373029B2 (en) 2007-07-11 2016-06-21 Ricoh Co., Ltd. Invisible junction feature recognition for document security or annotation
US9384619B2 (en) 2006-07-31 2016-07-05 Ricoh Co., Ltd. Searching media content for objects specified using identifiers
US9171202B2 (en) 2005-08-23 2015-10-27 Ricoh Co., Ltd. Data organization and access for mixed media document system
US8176054B2 (en) * 2007-07-12 2012-05-08 Ricoh Co. Ltd Retrieving electronic documents by converting them to synthetic text
US8965145B2 (en) 2006-07-31 2015-02-24 Ricoh Co., Ltd. Mixed media reality recognition using multiple specialized indexes
US10192279B1 (en) 2007-07-11 2019-01-29 Ricoh Co., Ltd. Indexed document modification sharing with mixed media reality
US8249344B2 (en) * 2005-07-01 2012-08-21 Microsoft Corporation Grammatical parsing of document visual structures
US8020091B2 (en) * 2005-07-15 2011-09-13 Microsoft Corporation Alignment and breaking of mathematical expressions in documents
KR100630200B1 (en) * 2005-08-24 2006-10-02 삼성전자주식회사 Method for operating calculator mode in the portable terminal
JP2007072718A (en) * 2005-09-06 2007-03-22 Univ Of Tokyo Handwritten mathematical expression recognizing device and recognizing method
US20100254606A1 (en) * 2005-12-08 2010-10-07 Abbyy Software Ltd Method of recognizing text information from a vector/raster image
RU2309456C2 (en) * 2005-12-08 2007-10-27 "Аби Софтвер Лтд." Method for recognizing text information in vector-raster image
US8509563B2 (en) 2006-02-02 2013-08-13 Microsoft Corporation Generation of documents from images
US9411896B2 (en) 2006-02-10 2016-08-09 Nokia Technologies Oy Systems and methods for spatial thumbnails and companion maps for media objects
CA2642217C (en) 2006-02-17 2014-05-06 Lumex As Method and system for verification of uncertainly recognized words in an ocr system
US9286404B2 (en) 2006-06-28 2016-03-15 Nokia Technologies Oy Methods of systems using geographic meta-metadata in information retrieval and document displays
US9721157B2 (en) * 2006-08-04 2017-08-01 Nokia Technologies Oy Systems and methods for obtaining and using information from map images
US9020966B2 (en) 2006-07-31 2015-04-28 Ricoh Co., Ltd. Client device for interacting with a mixed media reality recognition system
US9063952B2 (en) 2006-07-31 2015-06-23 Ricoh Co., Ltd. Mixed media reality recognition with image tracking
US8201076B2 (en) 2006-07-31 2012-06-12 Ricoh Co., Ltd. Capturing symbolic information from documents upon printing
US9176984B2 (en) 2006-07-31 2015-11-03 Ricoh Co., Ltd Mixed media reality retrieval of differentially-weighted links
US8489987B2 (en) 2006-07-31 2013-07-16 Ricoh Co., Ltd. Monitoring and analyzing creation and usage of visual content using image and hotspot interaction
US7885456B2 (en) * 2007-03-29 2011-02-08 Microsoft Corporation Symbol graph generation in handwritten mathematical expression recognition
US8116570B2 (en) * 2007-04-19 2012-02-14 Microsoft Corporation User interface for providing digital ink input and correcting recognition errors
US8009915B2 (en) * 2007-04-19 2011-08-30 Microsoft Corporation Recognition of mathematical expressions
US8073258B2 (en) * 2007-08-22 2011-12-06 Microsoft Corporation Using handwriting recognition in computer algebra
US20090245646A1 (en) * 2008-03-28 2009-10-01 Microsoft Corporation Online Handwriting Expression Recognition
US8121412B2 (en) * 2008-06-06 2012-02-21 Microsoft Corporation Recognition of tabular structures
US20100115403A1 (en) * 2008-11-06 2010-05-06 Microsoft Corporation Transforming math text objects using build down and build up
US20100166314A1 (en) * 2008-12-30 2010-07-01 Microsoft Corporation Segment Sequence-Based Handwritten Expression Recognition
JP4775462B2 (en) * 2009-03-12 2011-09-21 カシオ計算機株式会社 Computer and program
JP5471126B2 (en) * 2009-07-31 2014-04-16 カシオ計算機株式会社 Electronic device and program
US8571270B2 (en) 2010-05-10 2013-10-29 Microsoft Corporation Segmentation of a word bitmap into individual characters or glyphs during an OCR process
US8751550B2 (en) 2010-06-09 2014-06-10 Microsoft Corporation Freeform mathematical computations
JP5790070B2 (en) * 2010-08-26 2015-10-07 カシオ計算機株式会社 Display control apparatus and program
CN103250149B (en) * 2010-12-07 2015-11-25 Sk电信有限公司 For extracting semantic distance and according to the method for semantic distance to mathematics statement classification and the device for the method from mathematics statement
JP5267546B2 (en) * 2010-12-22 2013-08-21 カシオ計算機株式会社 Electronic computer and program with handwritten mathematical expression recognition function
US8943113B2 (en) * 2011-07-21 2015-01-27 Xiaohua Yi Methods and systems for parsing and interpretation of mathematical statements
US9058331B2 (en) 2011-07-27 2015-06-16 Ricoh Co., Ltd. Generating a conversation in a social network based on visual search results
US9600587B2 (en) 2011-10-19 2017-03-21 Zalag Corporation Methods and apparatuses for generating search expressions from content, for applying search expressions to content collections, and/or for analyzing corresponding search results
US9208218B2 (en) * 2011-10-19 2015-12-08 Zalag Corporation Methods and apparatuses for generating search expressions from content, for applying search expressions to content collections, and/or for analyzing corresponding search results
US9928225B2 (en) 2012-01-23 2018-03-27 Microsoft Technology Licensing, Llc Formula detection engine
JP5950700B2 (en) * 2012-06-06 2016-07-13 キヤノン株式会社 Image processing apparatus, image processing method, and program
CN103679129A (en) * 2012-09-21 2014-03-26 中兴通讯股份有限公司 Method and device for identifying object in image
JP2014127188A (en) * 2012-12-27 2014-07-07 Toshiba Corp Shaping device and method
US9330070B2 (en) 2013-03-11 2016-05-03 Microsoft Technology Licensing, Llc Detection and reconstruction of east asian layout features in a fixed format document
JP2014203393A (en) * 2013-04-09 2014-10-27 株式会社東芝 Electronic apparatus, handwritten document processing method, and handwritten document processing program
CN103996055B (en) * 2014-06-13 2017-06-09 上海珉智信息科技有限公司 Recognition methods based on grader in image file electronic bits of data identifying system
RU2596600C2 (en) * 2014-09-02 2016-09-10 Общество с ограниченной ответственностью "Аби Девелопмент" Methods and systems for processing images of mathematical expressions
CN107092902B (en) * 2016-02-18 2021-04-06 富士通株式会社 Character string recognition method and system
US10025976B1 (en) * 2016-12-28 2018-07-17 Konica Minolta Laboratory U.S.A., Inc. Data normalization for handwriting recognition
JP2019168935A (en) * 2018-03-23 2019-10-03 カシオ計算機株式会社 Input device, input method and program
US11610502B2 (en) 2018-11-28 2023-03-21 Kyndryl, Inc. Portable computing device for learning mathematical concepts
CN110796137A (en) * 2019-10-10 2020-02-14 中国建设银行股份有限公司 Method and device for identifying image
KR20210061523A (en) * 2019-11-19 2021-05-28 삼성전자주식회사 Electronic device and operating method for converting from handwriting input to text
CN112712075B (en) * 2020-12-30 2023-12-01 科大讯飞股份有限公司 Arithmetic detection method, electronic equipment and storage device
KR102449336B1 (en) * 2021-09-23 2022-09-30 (주)웅진씽크빅 Apparatus and method for recommending learning using optical character recognition

Family Cites Families (1)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
IL109268A (en) * 1994-04-10 1999-01-26 Advanced Recognition Tech Pattern recognition method and system

Non-Patent Citations (6)

* Cited by examiner, † Cited by third party
Title
BLOSTEIN D ET AL: "COMPUTING WITH GRAPHS AND GRAPH TRANSFORMATIONS", SOFTWARE PRACTICE & EXPERIENCE, JOHN WILEY & SONS LTD. CHICHESTER, GB, vol. 29, no. 3, March 1999 (1999-03-01), pages 197 - 217, XP000802322, ISSN: 0038-0644 *
CHANG CHAO-HUANG: "Word class discovery for postprocessing chinese handwriting recognition", PROC. 15TH INT. CONF. ON COMPUTATIONAL LINGUISTICS - COLING 1994, 5 August 1994 (1994-08-05), pages 1221 - 1225, XP002314335 *
JÖRG HUNSINGER ET AL.: "A single-stage top-down probabilistic approach towards understanding spoken and handwritten mathematical formulas", PROC. 6TH INT. CONF. ON SPOKEN LANGUAGE PROCESSING - CHINA MILITARY FRIENDSHIP PUBLISHERS, vol. 4, 16 October 2000 (2000-10-16), BEIJING, CHINA, pages 386 - 389, XP002314337 *
SRIHARI R K ET AL: "Incorporating syntactic constraints in recognizing handwritten sentences", IJCAI-93. PROCEEDINGS OF THE THIRTEENTH INTERNATIONAL JOINT CONFERENCE ON ARTIFICIAL INTELLIGENCE MORGAN KAUFMANN PUBLISHERS SAN MATEO, CA, USA, vol. 2, 28 August 1993 (1993-08-28), CHAMBÉRY, FRANCE, pages 1262 - 1267, XP002314334 *
TOUMIT J -Y ET AL: "A hierarchical and recursive model of mathematical expressions for automatic reading of mathematical documents", PROC. OF THE 5TH INT. CONF. ON DOCUMENT ANALYSIS AND RECOGNITION. ICDAR '99 (CAT. NO.PR00318) IEEE COMPUT. SOC LOS ALAMITOS, CA, USA, 20 September 1999 (1999-09-20), BANGALORE, INDIA, pages 119 - 122, XP002314336, ISBN: 0-7695-0318-7 *
TWAAKYONDO H M ET AL: "Structure analysis and recognition of mathematical expressions", DOCUMENT ANALYSIS AND RECOGNITION, 1995., PROCEEDINGS OF THE THIRD INTERNATIONAL CONFERENCE ON MONTREAL, QUE., CANADA 14-16 AUG. 1995, LOS ALAMITOS, CA, USA,IEEE COMPUT. SOC, US, vol. 1, 14 August 1995 (1995-08-14), pages 430 - 437, XP010230970, ISBN: 0-8186-7128-9 *

Cited By (2)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN108038441A (en) * 2017-12-07 2018-05-15 庞军良 A kind of System and method for based on image recognition
CN108038441B (en) * 2017-12-07 2021-03-16 潘晓梅 System and method based on image recognition

Also Published As

Publication number Publication date
EP1239406B1 (en) 2007-12-19
US7181068B2 (en) 2007-02-20
DE60224128T2 (en) 2008-12-04
JP2002269499A (en) 2002-09-20
EP1239406A2 (en) 2002-09-11
DE60224128D1 (en) 2008-01-31
US20020126905A1 (en) 2002-09-12
JP4181310B2 (en) 2008-11-12

Similar Documents

Publication Publication Date Title
EP1239406A3 (en) Device and method for character recognition and for recognition of mathematical expressions
CN100403235C (en) Information processing method and information processing device
WO2004042641A3 (en) Post-processing system and method for correcting machine recognized text
ATE389225T1 (en) VOICE RECOGNITION
CN105404621B (en) A kind of method and system that Chinese character is read for blind person
DE60217299D1 (en) HOLISTIC-ANALYTICAL DETECTION OF HAND-WRITTEN TEXT
DE69907513T2 (en) HANDWRITTEN OR SPEECH WORD RECOGNITION WITH NEURONAL NETWORKS
Sanchez et al. ICDAR 2015 competition HTRtS: Handwritten Text Recognition on the tranScriptorium dataset
EP1246075A3 (en) Determining language for character sequence
EP1550939A3 (en) Method for entering text
CN113435186A (en) Chinese text error correction system, method, device and computer readable storage medium
EP1168799A3 (en) Data processing system with vocalisation mechanism
US7424156B2 (en) Recognition method and the same system of ingegrating vocal input and handwriting input
EP1225567A3 (en) Method and apparatus for speech recognition
EP1359515A3 (en) System and method for filtering far east languages
KR100831991B1 (en) Information processing method and information processing device
EP0997839A3 (en) Word recognizing apparatus and method for dynamically generating feature amount of word
WO2022060439A1 (en) Language autodetection from non-character sub-token signals
ES2162405T3 (en) SEARCH PROCEDURE FOR THE CONTENT OF TEXTUAL DOCUMENTS USING ORAL RECOGNITION.
JP2006053866A (en) Detection method of notation variability of katakana character string
EP1096462A3 (en) Language learning
JPS62251986A (en) Misread character correction processor
KR950020298A (en) Hangul Handwriting Online Character Recognition Device and Method
JP2890241B2 (en) Optical character recognition device
JPS6160189A (en) Optical character reader

Legal Events

Date Code Title Description
PUAI Public reference made under article 153(3) epc to a published international application that has entered the european phase

Free format text: ORIGINAL CODE: 0009012

17P Request for examination filed

Effective date: 20020305

AK Designated contracting states

Kind code of ref document: A2

Designated state(s): AT BE CH CY DE DK ES FI FR GB GR IE IT LI LU MC NL PT SE TR

AX Request for extension of the european patent

Free format text: AL;LT;LV;MK;RO;SI

PUAL Search report despatched

Free format text: ORIGINAL CODE: 0009013

AK Designated contracting states

Kind code of ref document: A3

Designated state(s): AT BE CH CY DE DK ES FI FR GB GR IE IT LI LU MC NL PT SE TR

AX Request for extension of the european patent

Extension state: AL LT LV MK RO SI

17Q First examination report despatched

Effective date: 20050510

AKX Designation fees paid

Designated state(s): DE NL

GRAP Despatch of communication of intention to grant a patent

Free format text: ORIGINAL CODE: EPIDOSNIGR1

RIC1 Information provided on ipc code assigned before grant

Ipc: G06F 17/27 20060101ALN20070619BHEP

Ipc: G06K 9/68 20060101AFI20070619BHEP

Ipc: G06K 9/72 20060101ALI20070619BHEP

GRAS Grant fee paid

Free format text: ORIGINAL CODE: EPIDOSNIGR3

GRAA (expected) grant

Free format text: ORIGINAL CODE: 0009210

AK Designated contracting states

Kind code of ref document: B1

Designated state(s): DE NL

REF Corresponds to:

Ref document number: 60224128

Country of ref document: DE

Date of ref document: 20080131

Kind code of ref document: P

PLBE No opposition filed within time limit

Free format text: ORIGINAL CODE: 0009261

STAA Information on the status of an ep patent application or granted ep patent

Free format text: STATUS: NO OPPOSITION FILED WITHIN TIME LIMIT

26N No opposition filed

Effective date: 20080922

PGFP Annual fee paid to national office [announced via postgrant information from national office to epo]

Ref country code: NL

Payment date: 20210312

Year of fee payment: 20

PGFP Annual fee paid to national office [announced via postgrant information from national office to epo]

Ref country code: DE

Payment date: 20210224

Year of fee payment: 20

REG Reference to a national code

Ref country code: DE

Ref legal event code: R079

Ref document number: 60224128

Country of ref document: DE

Free format text: PREVIOUS MAIN CLASS: G06K0009680000

Ipc: G06V0030196000

REG Reference to a national code

Ref country code: DE

Ref legal event code: R071

Ref document number: 60224128

Country of ref document: DE

REG Reference to a national code

Ref country code: NL

Ref legal event code: MK

Effective date: 20220304