JP7317612B2 - 情報処理装置、情報処理方法及びプログラム - Google Patents

情報処理装置、情報処理方法及びプログラム Download PDF

Info

Publication number
JP7317612B2
JP7317612B2 JP2019132818A JP2019132818A JP7317612B2 JP 7317612 B2 JP7317612 B2 JP 7317612B2 JP 2019132818 A JP2019132818 A JP 2019132818A JP 2019132818 A JP2019132818 A JP 2019132818A JP 7317612 B2 JP7317612 B2 JP 7317612B2
Authority
JP
Japan
Prior art keywords
character
character string
word
cpu
ocr
Prior art date
Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
Active
Application number
JP2019132818A
Other languages
English (en)
Japanese (ja)
Other versions
JP2021018520A5 (https=
JP2021018520A (ja
Inventor
聡史 河原
Current Assignee (The listed assignees may be inaccurate. Google has not performed a legal analysis and makes no representation or warranty as to the accuracy of the list.)
Canon Inc
Original Assignee
Canon Inc
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
Filing date
Publication date
Application filed by Canon Inc filed Critical Canon Inc
Priority to JP2019132818A priority Critical patent/JP7317612B2/ja
Priority to US16/928,447 priority patent/US11972208B2/en
Publication of JP2021018520A publication Critical patent/JP2021018520A/ja
Publication of JP2021018520A5 publication Critical patent/JP2021018520A5/ja
Application granted granted Critical
Publication of JP7317612B2 publication Critical patent/JP7317612B2/ja
Active legal-status Critical Current
Anticipated expiration legal-status Critical

Links

Images

Classifications

    • GPHYSICS
    • G06COMPUTING OR CALCULATING; COUNTING
    • G06FELECTRIC DIGITAL DATA PROCESSING
    • G06F40/00Handling natural language data
    • G06F40/20Natural language analysis
    • G06F40/237Lexical tools
    • G06F40/242Dictionaries
    • GPHYSICS
    • G06COMPUTING OR CALCULATING; COUNTING
    • G06FELECTRIC DIGITAL DATA PROCESSING
    • G06F18/00Pattern recognition
    • G06F18/20Analysing
    • G06F18/22Matching criteria, e.g. proximity measures
    • GPHYSICS
    • G06COMPUTING OR CALCULATING; COUNTING
    • G06FELECTRIC DIGITAL DATA PROCESSING
    • G06F40/00Handling natural language data
    • G06F40/20Natural language analysis
    • G06F40/279Recognition of textual entities
    • GPHYSICS
    • G06COMPUTING OR CALCULATING; COUNTING
    • G06VIMAGE OR VIDEO RECOGNITION OR UNDERSTANDING
    • G06V30/00Character recognition; Recognising digital ink; Document-oriented image-based pattern recognition
    • G06V30/10Character recognition
    • G06V30/14Image acquisition
    • G06V30/148Segmentation of character regions
    • G06V30/153Segmentation of character regions using recognition of characters or words
    • GPHYSICS
    • G06COMPUTING OR CALCULATING; COUNTING
    • G06VIMAGE OR VIDEO RECOGNITION OR UNDERSTANDING
    • G06V30/00Character recognition; Recognising digital ink; Document-oriented image-based pattern recognition
    • G06V30/10Character recognition
    • G06V30/26Techniques for post-processing, e.g. correcting the recognition result
    • G06V30/262Techniques for post-processing, e.g. correcting the recognition result using context analysis, e.g. lexical, syntactic or semantic context
    • GPHYSICS
    • G06COMPUTING OR CALCULATING; COUNTING
    • G06VIMAGE OR VIDEO RECOGNITION OR UNDERSTANDING
    • G06V30/00Character recognition; Recognising digital ink; Document-oriented image-based pattern recognition
    • G06V30/10Character recognition
    • GPHYSICS
    • G06COMPUTING OR CALCULATING; COUNTING
    • G06VIMAGE OR VIDEO RECOGNITION OR UNDERSTANDING
    • G06V30/00Character recognition; Recognising digital ink; Document-oriented image-based pattern recognition
    • G06V30/40Document-oriented image-based pattern recognition

Landscapes

  • Engineering & Computer Science (AREA)
  • Theoretical Computer Science (AREA)
  • Physics & Mathematics (AREA)
  • General Physics & Mathematics (AREA)
  • Computational Linguistics (AREA)
  • General Engineering & Computer Science (AREA)
  • Artificial Intelligence (AREA)
  • General Health & Medical Sciences (AREA)
  • Health & Medical Sciences (AREA)
  • Audiology, Speech & Language Pathology (AREA)
  • Computer Vision & Pattern Recognition (AREA)
  • Multimedia (AREA)
  • Data Mining & Analysis (AREA)
  • Life Sciences & Earth Sciences (AREA)
  • Bioinformatics & Cheminformatics (AREA)
  • Bioinformatics & Computational Biology (AREA)
  • Evolutionary Biology (AREA)
  • Evolutionary Computation (AREA)
  • Character Discrimination (AREA)
JP2019132818A 2019-07-18 2019-07-18 情報処理装置、情報処理方法及びプログラム Active JP7317612B2 (ja)

Priority Applications (2)

Application Number Priority Date Filing Date Title
JP2019132818A JP7317612B2 (ja) 2019-07-18 2019-07-18 情報処理装置、情報処理方法及びプログラム
US16/928,447 US11972208B2 (en) 2019-07-18 2020-07-14 Information processing device and information processing method

Applications Claiming Priority (1)

Application Number Priority Date Filing Date Title
JP2019132818A JP7317612B2 (ja) 2019-07-18 2019-07-18 情報処理装置、情報処理方法及びプログラム

Publications (3)

Publication Number Publication Date
JP2021018520A JP2021018520A (ja) 2021-02-15
JP2021018520A5 JP2021018520A5 (https=) 2022-07-26
JP7317612B2 true JP7317612B2 (ja) 2023-07-31

Family

ID=74343184

Family Applications (1)

Application Number Title Priority Date Filing Date
JP2019132818A Active JP7317612B2 (ja) 2019-07-18 2019-07-18 情報処理装置、情報処理方法及びプログラム

Country Status (2)

Country Link
US (1) US11972208B2 (https=)
JP (1) JP7317612B2 (https=)

Families Citing this family (1)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
JP7234495B2 (ja) * 2018-01-25 2023-03-08 富士フイルムビジネスイノベーション株式会社 画像処理装置及びプログラム

Citations (6)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
US20040086179A1 (en) 2002-11-04 2004-05-06 Yue Ma Post-processing system and method for correcting machine recognized text
JP2010102668A (ja) 2008-10-27 2010-05-06 Hitachi Software Eng Co Ltd メタデータ抽出装置およびその方法
JP2014170452A (ja) 2013-03-05 2014-09-18 Fuji Xerox Co Ltd 画像処理装置及びプログラム
JP2015138396A (ja) 2014-01-22 2015-07-30 富士ゼロックス株式会社 画像処理装置及び画像処理プログラム
JP2016201013A (ja) 2015-04-13 2016-12-01 富士ゼロックス株式会社 文字認識装置、文字認識処理システム、およびプログラム
US20170372161A1 (en) 2016-06-24 2017-12-28 Accenture Global Solutions Limited Intelligent automatic license plate recognition for electronic tolling environments

Family Cites Families (10)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
JPH0589281A (ja) * 1991-09-26 1993-04-09 Fuji Facom Corp 誤読修正・検出方法
JP3919617B2 (ja) 2002-07-09 2007-05-30 キヤノン株式会社 文字認識装置および文字認識方法、プログラムおよび記憶媒体
US9727804B1 (en) * 2005-04-15 2017-08-08 Matrox Electronic Systems, Ltd. Method of correcting strings
US8340425B2 (en) * 2010-08-10 2012-12-25 Xerox Corporation Optical character recognition with two-pass zoning
US9256795B1 (en) * 2013-03-15 2016-02-09 A9.Com, Inc. Text entity recognition
US9305226B1 (en) * 2013-05-13 2016-04-05 Amazon Technologies, Inc. Semantic boosting rules for improving text recognition
EP3286693A1 (en) * 2015-04-20 2018-02-28 3M Innovative Properties Company Dual embedded optical character recognition (ocr) engines
US10769200B1 (en) * 2015-07-01 2020-09-08 A9.Com, Inc. Result re-ranking for object recognition
US10621237B1 (en) * 2016-08-01 2020-04-14 Amazon Technologies, Inc. Contextual overlay for documents
US10963717B1 (en) * 2018-12-21 2021-03-30 Automation Anywhere, Inc. Auto-correction of pattern defined strings

Patent Citations (6)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
US20040086179A1 (en) 2002-11-04 2004-05-06 Yue Ma Post-processing system and method for correcting machine recognized text
JP2010102668A (ja) 2008-10-27 2010-05-06 Hitachi Software Eng Co Ltd メタデータ抽出装置およびその方法
JP2014170452A (ja) 2013-03-05 2014-09-18 Fuji Xerox Co Ltd 画像処理装置及びプログラム
JP2015138396A (ja) 2014-01-22 2015-07-30 富士ゼロックス株式会社 画像処理装置及び画像処理プログラム
JP2016201013A (ja) 2015-04-13 2016-12-01 富士ゼロックス株式会社 文字認識装置、文字認識処理システム、およびプログラム
US20170372161A1 (en) 2016-06-24 2017-12-28 Accenture Global Solutions Limited Intelligent automatic license plate recognition for electronic tolling environments

Also Published As

Publication number Publication date
US11972208B2 (en) 2024-04-30
JP2021018520A (ja) 2021-02-15
US20210019554A1 (en) 2021-01-21

Similar Documents

Publication Publication Date Title
JP4742404B2 (ja) 画像認識装置、画像形成装置、画像認識方法および画像認識プログラムを記憶したコンピュータ読取り可能な記録媒体
US20210064859A1 (en) Image processing system, image processing method, and storage medium
US11475688B2 (en) Information processing apparatus and information processing method for extracting information from document image
US11418658B2 (en) Image processing apparatus, image processing system, image processing method, and storage medium
US11941903B2 (en) Image processing apparatus, image processing method, and non-transitory storage medium
US20210081660A1 (en) Information processing apparatus and non-transitory computer readable medium
US12412409B2 (en) Information processing apparatus, information processing method, and storage medium
US12148234B2 (en) Information processing with iteratively improved estimates of data attributes based on user modifications, and apparatus, method, and storage medium thereof
US20060285748A1 (en) Document processing device
JP7268389B2 (ja) 情報処理装置及びプログラム
CN111444751B (zh) 信息处理装置、储存介质及信息处理方法
CN112528889A (zh) Ocr信息检测修正方法、装置、终端及存储介质
JP7317612B2 (ja) 情報処理装置、情報処理方法及びプログラム
JP2008257543A (ja) 画像処理システム及びプログラム
JP2022116983A (ja) 画像処理装置、画像処理方法及びプログラム
JP7021496B2 (ja) 情報処理装置及びプログラム
JP7430219B2 (ja) 文書情報構造化装置、文書情報構造化方法およびプログラム
JP7705468B2 (ja) 情報処理システム、原稿種識別方法、モデル生成方法及びプログラム
JP7719878B2 (ja) 情報処理システム、項目値抽出方法、モデル生成方法及びプログラム
JP2020184275A (ja) 画像処理装置、画像処理方法、及びプログラム
JP6763173B2 (ja) 文書修正方法、文書修正装置、およびコンピュータプログラム
JP2024003769A (ja) 文字認識システム、コンピュータによる文字の認識方法、および文字検索システム
JP5284342B2 (ja) 文字認識システムおよび文字認識プログラム
JP6682827B2 (ja) 情報処理装置及び情報処理プログラム
JP7664329B2 (ja) 帳票認識システム及び帳票認識方法

Legal Events

Date Code Title Description
A521 Request for written amendment filed

Free format text: JAPANESE INTERMEDIATE CODE: A523

Effective date: 20220715

A621 Written request for application examination

Free format text: JAPANESE INTERMEDIATE CODE: A621

Effective date: 20220715

A131 Notification of reasons for refusal

Free format text: JAPANESE INTERMEDIATE CODE: A131

Effective date: 20230418

A521 Request for written amendment filed

Free format text: JAPANESE INTERMEDIATE CODE: A523

Effective date: 20230613

TRDD Decision of grant or rejection written
A01 Written decision to grant a patent or to grant a registration (utility model)

Free format text: JAPANESE INTERMEDIATE CODE: A01

Effective date: 20230620

A61 First payment of annual fees (during grant procedure)

Free format text: JAPANESE INTERMEDIATE CODE: A61

Effective date: 20230719

R151 Written notification of patent or utility model registration

Ref document number: 7317612

Country of ref document: JP

Free format text: JAPANESE INTERMEDIATE CODE: R151