JP6615054B2 - 辞書を使わない、マッチングベースの単語画像認識 - Google Patents

辞書を使わない、マッチングベースの単語画像認識 Download PDF

Info

Publication number
JP6615054B2
JP6615054B2 JP2016123798A JP2016123798A JP6615054B2 JP 6615054 B2 JP6615054 B2 JP 6615054B2 JP 2016123798 A JP2016123798 A JP 2016123798A JP 2016123798 A JP2016123798 A JP 2016123798A JP 6615054 B2 JP6615054 B2 JP 6615054B2
Authority
JP
Japan
Prior art keywords
utility
image
character
embedded
value
Prior art date
Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
Expired - Fee Related
Application number
JP2016123798A
Other languages
English (en)
Japanese (ja)
Other versions
JP2017021792A5 (enExample
JP2017021792A (ja
Inventor
アルバート・ゴード・ソルデヴィラ
ジョン・アルマーザン
Original Assignee
コンデュエント ビジネス サービシーズ エルエルシー
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
Filing date
Publication date
Application filed by コンデュエント ビジネス サービシーズ エルエルシー filed Critical コンデュエント ビジネス サービシーズ エルエルシー
Publication of JP2017021792A publication Critical patent/JP2017021792A/ja
Publication of JP2017021792A5 publication Critical patent/JP2017021792A5/ja
Application granted granted Critical
Publication of JP6615054B2 publication Critical patent/JP6615054B2/ja
Expired - Fee Related legal-status Critical Current
Anticipated expiration legal-status Critical

Links

Classifications

    • GPHYSICS
    • G06COMPUTING OR CALCULATING; COUNTING
    • G06VIMAGE OR VIDEO RECOGNITION OR UNDERSTANDING
    • G06V30/00Character recognition; Recognising digital ink; Document-oriented image-based pattern recognition
    • G06V30/10Character recognition
    • G06V30/19Recognition using electronic means
    • G06V30/196Recognition using electronic means using sequential comparisons of the image signals with a plurality of references
    • G06V30/1983Syntactic or structural pattern recognition, e.g. symbolic string recognition
    • G06V30/1985Syntactic analysis, e.g. using a grammatical approach
    • GPHYSICS
    • G06COMPUTING OR CALCULATING; COUNTING
    • G06FELECTRIC DIGITAL DATA PROCESSING
    • G06F18/00Pattern recognition
    • G06F18/20Analysing
    • G06F18/29Graphical models, e.g. Bayesian networks
    • GPHYSICS
    • G06COMPUTING OR CALCULATING; COUNTING
    • G06VIMAGE OR VIDEO RECOGNITION OR UNDERSTANDING
    • G06V30/00Character recognition; Recognising digital ink; Document-oriented image-based pattern recognition
    • G06V30/10Character recognition
    • G06V30/19Recognition using electronic means
    • G06V30/191Design or setup of recognition systems or techniques; Extraction of features in feature space; Clustering techniques; Blind source separation
    • G06V30/19187Graphical models, e.g. Bayesian networks or Markov models
    • GPHYSICS
    • G06COMPUTING OR CALCULATING; COUNTING
    • G06VIMAGE OR VIDEO RECOGNITION OR UNDERSTANDING
    • G06V30/00Character recognition; Recognising digital ink; Document-oriented image-based pattern recognition
    • G06V30/10Character recognition
    • G06V30/22Character recognition characterised by the type of writing
    • G06V30/224Character recognition characterised by the type of writing of printed characters having additional code marks or containing code marks
    • GPHYSICS
    • G06COMPUTING OR CALCULATING; COUNTING
    • G06VIMAGE OR VIDEO RECOGNITION OR UNDERSTANDING
    • G06V30/00Character recognition; Recognising digital ink; Document-oriented image-based pattern recognition
    • G06V30/10Character recognition
    • G06V30/22Character recognition characterised by the type of writing
    • G06V30/226Character recognition characterised by the type of writing of cursive writing
    • GPHYSICS
    • G06COMPUTING OR CALCULATING; COUNTING
    • G06VIMAGE OR VIDEO RECOGNITION OR UNDERSTANDING
    • G06V30/00Character recognition; Recognising digital ink; Document-oriented image-based pattern recognition
    • G06V30/10Character recognition
    • G06V30/26Techniques for post-processing, e.g. correcting the recognition result
    • G06V30/262Techniques for post-processing, e.g. correcting the recognition result using context analysis, e.g. lexical, syntactic or semantic context
    • G06V30/268Lexical context
    • GPHYSICS
    • G06COMPUTING OR CALCULATING; COUNTING
    • G06VIMAGE OR VIDEO RECOGNITION OR UNDERSTANDING
    • G06V30/00Character recognition; Recognising digital ink; Document-oriented image-based pattern recognition
    • G06V30/10Character recognition

Landscapes

  • Engineering & Computer Science (AREA)
  • Computer Vision & Pattern Recognition (AREA)
  • Theoretical Computer Science (AREA)
  • Physics & Mathematics (AREA)
  • General Physics & Mathematics (AREA)
  • Multimedia (AREA)
  • Data Mining & Analysis (AREA)
  • Life Sciences & Earth Sciences (AREA)
  • Software Systems (AREA)
  • Artificial Intelligence (AREA)
  • Bioinformatics & Cheminformatics (AREA)
  • Bioinformatics & Computational Biology (AREA)
  • Evolutionary Biology (AREA)
  • Evolutionary Computation (AREA)
  • General Engineering & Computer Science (AREA)
  • Computational Linguistics (AREA)
  • Character Discrimination (AREA)
JP2016123798A 2015-07-08 2016-06-22 辞書を使わない、マッチングベースの単語画像認識 Expired - Fee Related JP6615054B2 (ja)

Applications Claiming Priority (2)

Application Number Priority Date Filing Date Title
US14/794,479 US9928436B2 (en) 2015-07-08 2015-07-08 Lexicon-free, matching-based word-image recognition
US14/794,479 2015-07-08

Publications (3)

Publication Number Publication Date
JP2017021792A JP2017021792A (ja) 2017-01-26
JP2017021792A5 JP2017021792A5 (enExample) 2019-08-08
JP6615054B2 true JP6615054B2 (ja) 2019-12-04

Family

ID=56321774

Family Applications (1)

Application Number Title Priority Date Filing Date
JP2016123798A Expired - Fee Related JP6615054B2 (ja) 2015-07-08 2016-06-22 辞書を使わない、マッチングベースの単語画像認識

Country Status (3)

Country Link
US (1) US9928436B2 (enExample)
EP (1) EP3144848A1 (enExample)
JP (1) JP6615054B2 (enExample)

Families Citing this family (2)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
JP6798055B1 (ja) * 2020-03-24 2020-12-09 株式会社東芝 情報処理装置、情報処理方法、プログラムおよび順序情報
US11252113B1 (en) * 2021-06-15 2022-02-15 Drift.com, Inc. Proactive and reactive directing of conversational bot-human interactions

Family Cites Families (12)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
ATE75552T1 (de) 1987-10-16 1992-05-15 Computer Ges Konstanz Verfahren zur automatischen zeichenerkennung.
US5774588A (en) 1995-06-07 1998-06-30 United Parcel Service Of America, Inc. Method and system for comparing strings with entries of a lexicon
US5933525A (en) 1996-04-10 1999-08-03 Bbn Corporation Language-independent and segmentation-free optical character recognition system and method
US6823084B2 (en) 2000-09-22 2004-11-23 Sri International Method and apparatus for portably recognizing text in an image sequence of scene imagery
ES2295130T3 (es) 2001-01-18 2008-04-16 Federal Express Corporation Lectura y descodificacion de informacion en paquetes.
US7917286B2 (en) 2005-12-16 2011-03-29 Google Inc. Database assisted OCR for street scenes and other images
US8503797B2 (en) 2007-09-05 2013-08-06 The Neat Company, Inc. Automatic document classification using lexical and physical features
JP5252596B2 (ja) * 2010-11-02 2013-07-31 国立大学法人東京農工大学 文字認識装置、文字認識方法及びプログラム
US8472727B2 (en) 2011-01-07 2013-06-25 Yuval Gronau Document comparison and analysis for improved OCR
US9008429B2 (en) * 2013-02-01 2015-04-14 Xerox Corporation Label-embedding for text recognition
US9384423B2 (en) * 2013-05-28 2016-07-05 Xerox Corporation System and method for OCR output verification
JP6301647B2 (ja) * 2013-12-24 2018-03-28 株式会社東芝 探索装置、探索方法およびプログラム

Also Published As

Publication number Publication date
US20170011273A1 (en) 2017-01-12
US9928436B2 (en) 2018-03-27
EP3144848A1 (en) 2017-03-22
JP2017021792A (ja) 2017-01-26

Similar Documents

Publication Publication Date Title
US11830233B2 (en) Systems and methods for stamp detection and classification
CN109344830B (zh) 语句输出、模型训练方法、装置、计算机设备及存储介质
US8699789B2 (en) Document classification using multiple views
US20180373955A1 (en) Leveraging captions to learn a global visual representation for semantic retrieval
US9454696B2 (en) Dynamically generating table of contents for printable or scanned content
WO2022035942A1 (en) Systems and methods for machine learning-based document classification
CN111797886A (zh) 通过解析pdl文件为神经网络生成ocr用训练数据
US9582483B2 (en) Automatically tagging variable data documents
CN110546603A (zh) 机器学习命令交互
CN112307749B (zh) 文本检错方法、装置、计算机设备和存储介质
Kumar et al. Distortion, rotation and scale invariant recognition of hollow Hindi characters
JP6615054B2 (ja) 辞書を使わない、マッチングベースの単語画像認識
US20060285748A1 (en) Document processing device
CN115545036A (zh) 文档中的阅读顺序检测
US11379534B2 (en) Document feature repository management
JP7322468B2 (ja) 情報処理装置、情報処理方法及びプログラム
US12425524B2 (en) Generating file of distinct writer based on handwriting text
US9665786B2 (en) Confirming automatically recognized handwritten answers
US20220269898A1 (en) Information processing device, information processing system, information processing method, and non-transitory computer readable medium
US20230098086A1 (en) Storing form field data
US20200250841A1 (en) Information processing device and non-transitory computer readable medium
CN118410877B (zh) 一种答案确定方法、装置、电子设备及存储介质
Yuadi et al. Evaluation for Optical Character Recognition of Mobile Application
Nancy Deborah et al. Efficient Information Retrieval: AWS Textract in Action
US20240112348A1 (en) Edge identification of documents within captured image

Legal Events

Date Code Title Description
RD03 Notification of appointment of power of attorney

Free format text: JAPANESE INTERMEDIATE CODE: A7423

Effective date: 20160629

RD04 Notification of resignation of power of attorney

Free format text: JAPANESE INTERMEDIATE CODE: A7424

Effective date: 20160926

A711 Notification of change in applicant

Free format text: JAPANESE INTERMEDIATE CODE: A711

Effective date: 20181010

A521 Request for written amendment filed

Free format text: JAPANESE INTERMEDIATE CODE: A523

Effective date: 20181120

A521 Request for written amendment filed

Free format text: JAPANESE INTERMEDIATE CODE: A523

Effective date: 20181214

A521 Request for written amendment filed

Free format text: JAPANESE INTERMEDIATE CODE: A523

Effective date: 20190624

A621 Written request for application examination

Free format text: JAPANESE INTERMEDIATE CODE: A621

Effective date: 20190624

A871 Explanation of circumstances concerning accelerated examination

Free format text: JAPANESE INTERMEDIATE CODE: A871

Effective date: 20190624

A977 Report on retrieval

Free format text: JAPANESE INTERMEDIATE CODE: A971007

Effective date: 20191007

A975 Report on accelerated examination

Free format text: JAPANESE INTERMEDIATE CODE: A971005

Effective date: 20191008

TRDD Decision of grant or rejection written
A01 Written decision to grant a patent or to grant a registration (utility model)

Free format text: JAPANESE INTERMEDIATE CODE: A01

Effective date: 20191023

A61 First payment of annual fees (during grant procedure)

Free format text: JAPANESE INTERMEDIATE CODE: A61

Effective date: 20191105

R150 Certificate of patent or registration of utility model

Ref document number: 6615054

Country of ref document: JP

Free format text: JAPANESE INTERMEDIATE CODE: R150

LAPS Cancellation because of no payment of annual fees