JP5214642B2 - ベクトル系列用モデル基準比較指標及びそれを用いたワードスポッティング - Google Patents

ベクトル系列用モデル基準比較指標及びそれを用いたワードスポッティング Download PDF

Info

Publication number
JP5214642B2
JP5214642B2 JP2010016733A JP2010016733A JP5214642B2 JP 5214642 B2 JP5214642 B2 JP 5214642B2 JP 2010016733 A JP2010016733 A JP 2010016733A JP 2010016733 A JP2010016733 A JP 2010016733A JP 5214642 B2 JP5214642 B2 JP 5214642B2
Authority
JP
Japan
Prior art keywords
hmm
ordered
word
sequence
storage medium
Prior art date
Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
Expired - Fee Related
Application number
JP2010016733A
Other languages
English (en)
Japanese (ja)
Other versions
JP2010176672A (ja
JP2010176672A5 (enExample
Inventor
エー. ロドリゲス セラノ ホセ
ペロンナン フローラン
Current Assignee (The listed assignees may be inaccurate. Google has not performed a legal analysis and makes no representation or warranty as to the accuracy of the list.)
Xerox Corp
Original Assignee
Xerox Corp
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
Filing date
Publication date
Application filed by Xerox Corp filed Critical Xerox Corp
Publication of JP2010176672A publication Critical patent/JP2010176672A/ja
Publication of JP2010176672A5 publication Critical patent/JP2010176672A5/ja
Application granted granted Critical
Publication of JP5214642B2 publication Critical patent/JP5214642B2/ja
Expired - Fee Related legal-status Critical Current
Anticipated expiration legal-status Critical

Links

Images

Classifications

    • GPHYSICS
    • G06COMPUTING OR CALCULATING; COUNTING
    • G06VIMAGE OR VIDEO RECOGNITION OR UNDERSTANDING
    • G06V30/00Character recognition; Recognising digital ink; Document-oriented image-based pattern recognition
    • G06V30/10Character recognition
    • G06V30/22Character recognition characterised by the type of writing
    • G06V30/226Character recognition characterised by the type of writing of cursive writing
    • G06V30/2268Character recognition characterised by the type of writing of cursive writing using stroke segmentation
    • G06V30/2276Character recognition characterised by the type of writing of cursive writing using stroke segmentation with probabilistic networks, e.g. hidden Markov models
    • GPHYSICS
    • G06COMPUTING OR CALCULATING; COUNTING
    • G06FELECTRIC DIGITAL DATA PROCESSING
    • G06F18/00Pattern recognition
    • G06F18/20Analysing
    • G06F18/22Matching criteria, e.g. proximity measures
    • GPHYSICS
    • G06COMPUTING OR CALCULATING; COUNTING
    • G06FELECTRIC DIGITAL DATA PROCESSING
    • G06F18/00Pattern recognition
    • G06F18/20Analysing
    • G06F18/29Graphical models, e.g. Bayesian networks
    • G06F18/295Markov models or related models, e.g. semi-Markov models; Markov random fields; Networks embedding Markov models
    • GPHYSICS
    • G06COMPUTING OR CALCULATING; COUNTING
    • G06VIMAGE OR VIDEO RECOGNITION OR UNDERSTANDING
    • G06V10/00Arrangements for image or video recognition or understanding
    • G06V10/70Arrangements for image or video recognition or understanding using pattern recognition or machine learning
    • G06V10/74Image or video pattern matching; Proximity measures in feature spaces
    • G06V10/761Proximity, similarity or dissimilarity measures
    • GPHYSICS
    • G06COMPUTING OR CALCULATING; COUNTING
    • G06VIMAGE OR VIDEO RECOGNITION OR UNDERSTANDING
    • G06V10/00Arrangements for image or video recognition or understanding
    • G06V10/70Arrangements for image or video recognition or understanding using pattern recognition or machine learning
    • G06V10/84Arrangements for image or video recognition or understanding using pattern recognition or machine learning using probabilistic graphical models from image or video features, e.g. Markov models or Bayesian networks
    • G06V10/85Markov-related models; Markov random fields
    • GPHYSICS
    • G06COMPUTING OR CALCULATING; COUNTING
    • G06VIMAGE OR VIDEO RECOGNITION OR UNDERSTANDING
    • G06V30/00Character recognition; Recognising digital ink; Document-oriented image-based pattern recognition
    • G06V30/10Character recognition
    • G06V30/14Image acquisition
    • G06V30/1444Selective acquisition, locating or processing of specific regions, e.g. highlighted text, fiducial marks or predetermined fields
    • GPHYSICS
    • G06COMPUTING OR CALCULATING; COUNTING
    • G06VIMAGE OR VIDEO RECOGNITION OR UNDERSTANDING
    • G06V30/00Character recognition; Recognising digital ink; Document-oriented image-based pattern recognition
    • G06V30/10Character recognition

Landscapes

  • Engineering & Computer Science (AREA)
  • Theoretical Computer Science (AREA)
  • Physics & Mathematics (AREA)
  • Computer Vision & Pattern Recognition (AREA)
  • General Physics & Mathematics (AREA)
  • Software Systems (AREA)
  • Data Mining & Analysis (AREA)
  • Artificial Intelligence (AREA)
  • Multimedia (AREA)
  • Evolutionary Computation (AREA)
  • Probability & Statistics with Applications (AREA)
  • Medical Informatics (AREA)
  • General Health & Medical Sciences (AREA)
  • Health & Medical Sciences (AREA)
  • Databases & Information Systems (AREA)
  • Computing Systems (AREA)
  • Life Sciences & Earth Sciences (AREA)
  • Bioinformatics & Cheminformatics (AREA)
  • Bioinformatics & Computational Biology (AREA)
  • Evolutionary Biology (AREA)
  • General Engineering & Computer Science (AREA)
  • Character Discrimination (AREA)
  • Automatic Analysis And Handling Materials Therefor (AREA)
JP2010016733A 2009-01-28 2010-01-28 ベクトル系列用モデル基準比較指標及びそれを用いたワードスポッティング Expired - Fee Related JP5214642B2 (ja)

Applications Claiming Priority (2)

Application Number Priority Date Filing Date Title
US12/361,178 US8311335B2 (en) 2009-01-28 2009-01-28 Model-based comparative measure for vector sequences and word spotting using same
US12/361,178 2009-01-28

Publications (3)

Publication Number Publication Date
JP2010176672A JP2010176672A (ja) 2010-08-12
JP2010176672A5 JP2010176672A5 (enExample) 2013-03-07
JP5214642B2 true JP5214642B2 (ja) 2013-06-19

Family

ID=42167314

Family Applications (1)

Application Number Title Priority Date Filing Date
JP2010016733A Expired - Fee Related JP5214642B2 (ja) 2009-01-28 2010-01-28 ベクトル系列用モデル基準比較指標及びそれを用いたワードスポッティング

Country Status (3)

Country Link
US (1) US8311335B2 (enExample)
EP (1) EP2214123A3 (enExample)
JP (1) JP5214642B2 (enExample)

Families Citing this family (17)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
JP5524102B2 (ja) * 2011-02-03 2014-06-18 株式会社東芝 手書き単語認識装置および手書き単語認識用モデル学習装置
US8732664B2 (en) 2011-05-17 2014-05-20 Microsoft Corporation Document serialization and comparison via object model
US9430563B2 (en) * 2012-02-02 2016-08-30 Xerox Corporation Document processing employing probabilistic topic modeling of documents represented as text words transformed to a continuous space
US20130282600A1 (en) * 2012-04-23 2013-10-24 Sap Ag Pattern Based Audit Issue Reporting
US9336302B1 (en) 2012-07-20 2016-05-10 Zuci Realty Llc Insight and algorithmic clustering for automated synthesis
JP5735126B2 (ja) * 2013-04-26 2015-06-17 株式会社東芝 システムおよび筆跡検索方法
CN104298982B (zh) * 2013-07-16 2019-03-08 深圳市腾讯计算机系统有限公司 一种文字识别方法及装置
CN103440652B (zh) * 2013-08-27 2016-03-30 电子科技大学 一种基于一二阶合并的目标检测区域特征描述方法
US9646613B2 (en) * 2013-11-29 2017-05-09 Daon Holdings Limited Methods and systems for splitting a digital signal
KR101552525B1 (ko) 2014-02-04 2015-09-14 한국기술교육대학교 산학협력단 폰트를 인식하고 폰트정보를 제공하는 시스템 및 그 방법
US11205103B2 (en) 2016-12-09 2021-12-21 The Research Foundation for the State University Semisupervised autoencoder for sentiment analysis
WO2019113576A1 (en) * 2017-12-10 2019-06-13 Walmart Apollo, Llc Systems and methods for automated classification of regulatory reports
US11462037B2 (en) 2019-01-11 2022-10-04 Walmart Apollo, Llc System and method for automated analysis of electronic travel data
CN111695354A (zh) * 2020-05-20 2020-09-22 平安科技(深圳)有限公司 基于命名实体的文本问答方法、装置及可读存储介质
CN112345261B (zh) * 2020-10-29 2022-05-03 南京航空航天大学 基于改进dbscan算法的航空发动机泵调系统异常检测方法
CN112906755A (zh) * 2021-01-27 2021-06-04 深圳职业技术学院 一种植物抗性蛋白识别方法、装置、设备和存储介质
US20240394332A1 (en) * 2021-09-21 2024-11-28 British Telecommunications Public Limited Company Efficient vector comparison for event identification

Family Cites Families (3)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
US6009390A (en) * 1997-09-11 1999-12-28 Lucent Technologies Inc. Technique for selective use of Gaussian kernels and mixture component weights of tied-mixture hidden Markov models for speech recognition
JP2004294916A (ja) * 2003-03-27 2004-10-21 Matsushita Electric Ind Co Ltd 標準モデル作成装置および標準モデル作成方法
US8014603B2 (en) * 2007-08-30 2011-09-06 Xerox Corporation System and method for characterizing handwritten or typed words in a document

Also Published As

Publication number Publication date
EP2214123A3 (en) 2011-07-06
EP2214123A2 (en) 2010-08-04
JP2010176672A (ja) 2010-08-12
US20100191532A1 (en) 2010-07-29
US8311335B2 (en) 2012-11-13

Similar Documents

Publication Publication Date Title
JP5214642B2 (ja) ベクトル系列用モデル基準比較指標及びそれを用いたワードスポッティング
Fischer et al. Lexicon-free handwritten word spotting using character HMMs
Graves et al. A novel connectionist system for unconstrained handwriting recognition
AlKhateeb et al. Offline handwritten Arabic cursive text recognition using Hidden Markov Models and re-ranking
Wshah et al. Script independent word spotting in offline handwritten documents based on hidden markov models
Nguyen et al. A database of unconstrained Vietnamese online handwriting and recognition experiments by recurrent neural networks
Natarajan et al. Multi-lingual offline handwriting recognition using hidden Markov models: A script-independent approach
FR2963695A1 (fr) Apprentissage de poids de polices pour des echantillons tapes dans le reperage de mots-cles manuscrits
Dreuw et al. RWTH OCR: A large vocabulary optical character recognition system for Arabic scripts
Saabni et al. Keyword searching for Arabic handwritten documents
Addis et al. Printed ethiopic script recognition by using lstm networks
Rodríguez-Serrano et al. A similarity measure between vector sequences with application to handwritten word image retrieval
US8340428B2 (en) Unsupervised writer style adaptation for handwritten word spotting
Singh et al. Handwritten words recognition for legal amounts of bank cheques in English script
Alma'adeed Recognition of off-line handwritten arabic words using neural network
Liwicki et al. Handwriting recognition of whiteboard notes—studying the influence of training set size and type
Kumar et al. Bayesian background models for keyword spotting in handwritten documents
Pechwitz et al. Handwritten Arabic word recognition using the IFN/ENIT-database
Wilkinson et al. Neural word search in historical manuscript collections
Rodríguez-Serrano et al. Unsupervised writer adaptation of whole-word HMMs with application to word-spotting
Kadi et al. Isolated arabic characters recognition using a robust method against noise and scaling based on the «hough transform»
Majeed et al. Ancient but digitized: developing handwritten optical character recognition for east syriac script through creating KHAMIS dataset
Ajao et al. Hidden markov model approach for offline Yoruba handwritten word recognition
Mezghani et al. Arabic offline writer identification on a new version of AHTID/MW database
Dolfing Whole page recognition of historical handwriting

Legal Events

Date Code Title Description
A521 Request for written amendment filed

Free format text: JAPANESE INTERMEDIATE CODE: A523

Effective date: 20130118

A621 Written request for application examination

Free format text: JAPANESE INTERMEDIATE CODE: A621

Effective date: 20130118

A871 Explanation of circumstances concerning accelerated examination

Free format text: JAPANESE INTERMEDIATE CODE: A871

Effective date: 20130118

A975 Report on accelerated examination

Free format text: JAPANESE INTERMEDIATE CODE: A971005

Effective date: 20130205

TRDD Decision of grant or rejection written
A01 Written decision to grant a patent or to grant a registration (utility model)

Free format text: JAPANESE INTERMEDIATE CODE: A01

Effective date: 20130219

A61 First payment of annual fees (during grant procedure)

Free format text: JAPANESE INTERMEDIATE CODE: A61

Effective date: 20130227

R150 Certificate of patent or registration of utility model

Free format text: JAPANESE INTERMEDIATE CODE: R150

FPAY Renewal fee payment (event date is renewal date of database)

Free format text: PAYMENT UNTIL: 20160308

Year of fee payment: 3

R250 Receipt of annual fees

Free format text: JAPANESE INTERMEDIATE CODE: R250

R250 Receipt of annual fees

Free format text: JAPANESE INTERMEDIATE CODE: R250

LAPS Cancellation because of no payment of annual fees