JP6938318B2 - Information processing apparatus, information processing method, and program - Google Patents

Information processing apparatus, information processing method, and program

Info

Publication number
JP6938318B2
JP6938318B2 JP2017193520A JP2017193520A JP6938318B2 JP 6938318 B2 JP6938318 B2 JP 6938318B2 JP 2017193520 A JP2017193520 A JP 2017193520A JP 2017193520 A JP2017193520 A JP 2017193520A JP 6938318 B2 JP6938318 B2 JP 6938318B2
Authority
JP
Japan
Prior art keywords
character string
ocr
character
electronic document
image
Prior art date
Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
Expired - Fee Related
Application number
JP2017193520A
Other languages
English (en)
Japanese (ja)
Other versions
JP2019067235A5
JP2019067235A (ja)
Inventor
忠則 中塚
Current Assignee (The listed assignees may be inaccurate. Google has not performed a legal analysis and makes no representation or warranty as to the accuracy of the list.)
Canon Inc
Original Assignee
Canon Inc
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
Filing date
Publication date
Application filed by Canon Inc
Priority to JP2017193520A (JP6938318B2)
Priority to US16/139,987 (US10970580B2)
Publication of JP2019067235A
Publication of JP2019067235A5
Application granted
Publication of JP6938318B2
Expired - Fee Related (current legal status)
Anticipated expiration

Classifications

    • G PHYSICS
    • G06 COMPUTING OR CALCULATING; COUNTING
    • G06V IMAGE OR VIDEO RECOGNITION OR UNDERSTANDING
    • G06V30/00 Character recognition; Recognising digital ink; Document-oriented image-based pattern recognition
    • G06V30/10 Character recognition
    • G06V30/14 Image acquisition
    • G06V30/1444 Selective acquisition, locating or processing of specific regions, e.g. highlighted text, fiducial marks or predetermined fields
    • G06V30/1456 Selective acquisition, locating or processing of specific regions, e.g. highlighted text, fiducial marks or predetermined fields based on user interactions
    • G PHYSICS
    • G06 COMPUTING OR CALCULATING; COUNTING
    • G06F ELECTRIC DIGITAL DATA PROCESSING
    • G06F40/00 Handling natural language data
    • G06F40/10 Text processing
    • G PHYSICS
    • G06 COMPUTING OR CALCULATING; COUNTING
    • G06T IMAGE DATA PROCESSING OR GENERATION, IN GENERAL
    • G06T11/00 2D [Two Dimensional] image generation
    • G06T11/60 Editing figures and text; Combining figures or text
    • G PHYSICS
    • G06 COMPUTING OR CALCULATING; COUNTING
    • G06V IMAGE OR VIDEO RECOGNITION OR UNDERSTANDING
    • G06V30/00 Character recognition; Recognising digital ink; Document-oriented image-based pattern recognition
    • G06V30/10 Character recognition
    • G06V30/14 Image acquisition
    • G06V30/148 Segmentation of character regions
    • G06V30/153 Segmentation of character regions using recognition of characters or words
    • H ELECTRICITY
    • H04 ELECTRIC COMMUNICATION TECHNIQUE
    • H04N PICTORIAL COMMUNICATION, e.g. TELEVISION
    • H04N1/00 Scanning, transmission or reproduction of documents or the like, e.g. facsimile transmission; Details thereof
    • H ELECTRICITY
    • H04 ELECTRIC COMMUNICATION TECHNIQUE
    • H04N PICTORIAL COMMUNICATION, e.g. TELEVISION
    • H04N1/00 Scanning, transmission or reproduction of documents or the like, e.g. facsimile transmission; Details thereof
    • H04N1/00127 Connection or combination of a still picture apparatus with another apparatus, e.g. for storage, processing or transmission of still picture signals or of information associated with a still picture
    • H04N1/00326 Connection or combination of a still picture apparatus with another apparatus, e.g. for storage, processing or transmission of still picture signals or of information associated with a still picture with a data reading, recognizing or recording apparatus, e.g. with a bar-code apparatus
    • H04N1/00328 Connection or combination of a still picture apparatus with another apparatus, e.g. for storage, processing or transmission of still picture signals or of information associated with a still picture with a data reading, recognizing or recording apparatus, e.g. with a bar-code apparatus with an apparatus processing optically-read information
    • H04N1/00331 Connection or combination of a still picture apparatus with another apparatus, e.g. for storage, processing or transmission of still picture signals or of information associated with a still picture with a data reading, recognizing or recording apparatus, e.g. with a bar-code apparatus with an apparatus processing optically-read information with an apparatus performing optical character recognition
    • H ELECTRICITY
    • H04 ELECTRIC COMMUNICATION TECHNIQUE
    • H04N PICTORIAL COMMUNICATION, e.g. TELEVISION
    • H04N1/00 Scanning, transmission or reproduction of documents or the like, e.g. facsimile transmission; Details thereof
    • H04N1/32 Circuits or arrangements for control or supervision between transmitter and receiver or between image input and image output device, e.g. between a still-image camera and its memory or between a still-image camera and a printer device
    • H04N1/32101 Display, printing, storage or transmission of additional information, e.g. ID code, date and time or title
    • H04N1/32106 Display, printing, storage or transmission of additional information, e.g. ID code, date and time or title separate from the image data, e.g. in a different computer file
    • H04N1/32112 Display, printing, storage or transmission of additional information, e.g. ID code, date and time or title separate from the image data, e.g. in a different computer file in a separate computer file, document page or paper sheet, e.g. a fax cover sheet
    • H ELECTRICITY
    • H04 ELECTRIC COMMUNICATION TECHNIQUE
    • H04N PICTORIAL COMMUNICATION, e.g. TELEVISION
    • H04N1/00 Scanning, transmission or reproduction of documents or the like, e.g. facsimile transmission; Details thereof
    • H04N1/387 Composing, repositioning or otherwise geometrically modifying originals
    • H04N1/3871 Composing, repositioning or otherwise geometrically modifying originals the composed originals being of different kinds, e.g. low- and high-resolution originals
    • H ELECTRICITY
    • H04 ELECTRIC COMMUNICATION TECHNIQUE
    • H04N PICTORIAL COMMUNICATION, e.g. TELEVISION
    • H04N1/00 Scanning, transmission or reproduction of documents or the like, e.g. facsimile transmission; Details thereof
    • H04N1/40 Picture signal circuits
    • H04N1/40062 Discrimination between different image types, e.g. two-tone, continuous tone
    • H ELECTRICITY
    • H04 ELECTRIC COMMUNICATION TECHNIQUE
    • H04N PICTORIAL COMMUNICATION, e.g. TELEVISION
    • H04N1/00 Scanning, transmission or reproduction of documents or the like, e.g. facsimile transmission; Details thereof
    • H04N1/44 Secrecy systems
    • H04N1/4406 Restricting access, e.g. according to user identity
    • H04N1/444 Restricting access, e.g. according to user identity to a particular document or image or part thereof
    • H ELECTRICITY
    • H04 ELECTRIC COMMUNICATION TECHNIQUE
    • H04N PICTORIAL COMMUNICATION, e.g. TELEVISION
    • H04N1/00 Scanning, transmission or reproduction of documents or the like, e.g. facsimile transmission; Details thereof
    • H04N1/44 Secrecy systems
    • H04N1/448 Rendering the image unintelligible, e.g. scrambling
    • G PHYSICS
    • G06 COMPUTING OR CALCULATING; COUNTING
    • G06V IMAGE OR VIDEO RECOGNITION OR UNDERSTANDING
    • G06V30/00 Character recognition; Recognising digital ink; Document-oriented image-based pattern recognition
    • G06V30/10 Character recognition

Landscapes

  • Engineering & Computer Science (AREA)
  • Multimedia (AREA)
  • Theoretical Computer Science (AREA)
  • Signal Processing (AREA)
  • Physics & Mathematics (AREA)
  • General Physics & Mathematics (AREA)
  • General Engineering & Computer Science (AREA)
  • Computer Vision & Pattern Recognition (AREA)
  • Health & Medical Sciences (AREA)
  • Artificial Intelligence (AREA)
  • Audiology, Speech & Language Pathology (AREA)
  • Computational Linguistics (AREA)
  • General Health & Medical Sciences (AREA)
  • Human Computer Interaction (AREA)
  • Information Retrieval, Db Structures And Fs Structures Therefor (AREA)
  • Document Processing Apparatus (AREA)
  • Character Input (AREA)
  • Character Discrimination (AREA)
JP2017193520A 2017-10-03 2017-10-03 Information processing apparatus, information processing method, and program Expired - Fee Related JP6938318B2 (ja)

Priority Applications (2)

Application Number Priority Date Filing Date Title
JP2017193520A JP6938318B2 (ja) 2017-10-03 2017-10-03 Information processing apparatus, information processing method, and program
US16/139,987 US10970580B2 (en) 2017-10-03 2018-09-24 Information processing apparatus, information processing method, and storage medium

Applications Claiming Priority (1)

Application Number Priority Date Filing Date Title
JP2017193520A JP6938318B2 (ja) 2017-10-03 2017-10-03 Information processing apparatus, information processing method, and program

Publications (3)

Publication Number Publication Date
JP2019067235A (ja) 2019-04-25
JP2019067235A5 2020-11-19
JP6938318B2 (ja) 2021-09-22

Family

ID=65897274

Family Applications (1)

Application Number Title Priority Date Filing Date
JP2017193520A Expired - Fee Related JP6938318B2 (ja) 2017-10-03 2017-10-03 Information processing apparatus, information processing method, and program

Country Status (2)

Country Link
US (1) US10970580B2 (en)
JP (1) JP6938318B2 (ja)

Families Citing this family (5)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
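JP6797610B2 (ja) * 2016-08-31 2020-12-09 Canon Kabushiki Kaisha Apparatus, method, and program
JP7225017B2 (ja) * 2019-04-19 2023-02-20 Canon Kabushiki Kaisha Image processing apparatus for character input using a touch panel, control method therefor, and program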
US20220383650A1 (en) 2021-06-01 2022-12-01 Digital Legal Medical Records, LLC d/b/a Advita, LLC Methods and System of Electronic Image Analysis
CN113239156B (zh) * 2021-06-04 2022-05-17 Hangzhou NetEase Zhiqi Technology Co., Ltd. Text processing method, apparatus, computing device, and medium
US11966641B1 (en) * 2023-02-13 2024-04-23 Xerox Corporation User-defined boundaries for documents

Family Cites Families (15)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
EP0472313B1 (en) 1990-08-03 1998-11-11 Canon Kabushiki Kaisha Image processing method and apparatus therefor
US5703962A (en) 1991-08-29 1997-12-30 Canon Kabushiki Kaisha Image processing method and apparatus
US6504540B1 (en) 1995-06-19 2003-01-07 Canon Kabushiki Kaisha Method and apparatus for altering one or more attributes of one or more blocks of image data in a document
US7698647B2 (en) * 2006-01-30 2010-04-13 Fast-Cat, Llc Portable dataport device and method for retrieving, inter-relating, annotating and managing electronic documents at a point of need
US8494280B2 (en) * 2006-04-27 2013-07-23 Xerox Corporation Automated method for extracting highlighted regions in scanned source
JP2009251655A (ja) 2008-04-01 2009-10-29 Nec Corp Filtering device, filtering method, program, and recording medium
JP4783441B2 (ja) * 2009-02-09 2011-09-28 Sharp Corporation Image processing apparatus and scanner apparatus
US8879846B2 (en) * 2009-02-10 2014-11-04 Kofax, Inc. Systems, methods and computer program products for processing financial documents
JP5197464B2 (ja) * 2009-03-27 2013-05-15 Sharp Corporation Image processing method, image processing apparatus, image forming apparatus, computer program, and recording medium
JP5482223B2 (ja) * 2010-01-22 2014-05-07 Ricoh Co., Ltd. Information processing apparatus and information processing method
JP5789120B2 (ja) 2011-04-20 2015-10-07 Oki Data Corporation Image reading apparatus
US8787670B2 (en) * 2011-08-15 2014-07-22 Victor John Cowley Software for text and image edit recognition for editing of images that contain text
JP2015118612A (ja) * 2013-12-19 2015-06-25 Canon Kabushiki Kaisha Redaction apparatus and redaction method
US10320807B2 (en) * 2014-02-25 2019-06-11 Sal Khan Systems and methods relating to the authenticity and verification of photographic identity documents
US10607381B2 (en) 2014-07-07 2020-03-31 Canon Kabushiki Kaisha Information processing apparatus

Also Published As

Publication number Publication date
US10970580B2 (en) 2021-04-06
JP2019067235A (ja) 2019-04-25
US20190102645A1 (en) 2019-04-04

Similar Documents

Publication Publication Date Title
JP6938318B2 (ja) Information processing apparatus, information processing method, and program
US8456654B2 (en) Process for electronic document redaction
US8155444B2 (en) Image text to character information conversion
US20070041668A1 (en) Search apparatus and search method
US20040234169A1 (en) Image processing apparatus, control method therefor, and program
JPH11203491A (ja) Image processing apparatus and method
JP4920928B2 (ja) Image processing apparatus, control method therefor, and program
US11182343B2 (en) File management device and file management method and non-transitory computer readable medium
US9798724B2 (en) Document discovery strategy to find original electronic file from hardcopy version
RU2571379C2 (ru) Intelligent processing of an electronic document
JP6262708B2 (ja) Objectification with deep searchability, and document detection method for detecting the original electronic file from a hardcopy
US20140211229A1 (en) Image processing apparatus, an image processing method, and an image processing program
US20090300001A1 (en) Server apparatus, catalog processing method, and computer-readable storage medium
US12143550B2 (en) Information processing apparatus, information processing method, and storage medium
US10803308B2 (en) Apparatus for deciding whether to include text in searchable data, and method and storage medium thereof
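KR20220005243A (ko) Method and apparatus for recognizing and sharing handwritten scanned electronic files
JP5569367B2 (ja) Image processing apparatus, image processing method, and program
JP2004334340A (ja) Image processing method and apparatus
JP2006134041A (ja) Data management apparatus
JP2022019445A (ja) Image processing apparatus, method, and program
JPH0793348A (ja) Image information processing apparatus
CN116226885B (zh) Copier confidentiality inspection and evidence collection system and method
JP4489828B1 (ja) Information processing apparatus, information processing method, and program
JP5121591B2 (ja) Image processing apparatus, image processing method in an image processing apparatus, program, and computer-readable storage medium storing the program
JP2002132755A (ja) Document processing system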

Legal Events

Date Code Title Description
A521 Request for written amendment filed

Free format text: JAPANESE INTERMEDIATE CODE: A523

Effective date: 20201005

A621 Written request for application examination

Free format text: JAPANESE INTERMEDIATE CODE: A621

Effective date: 20201005

TRDD Decision of grant or rejection written
A977 Report on retrieval

Free format text: JAPANESE INTERMEDIATE CODE: A971007

Effective date: 20210730

A01 Written decision to grant a patent or to grant a registration (utility model)

Free format text: JAPANESE INTERMEDIATE CODE: A01

Effective date: 20210803

A61 First payment of annual fees (during grant procedure)

Free format text: JAPANESE INTERMEDIATE CODE: A61

Effective date: 20210901

R151 Written notification of patent or utility model registration

Ref document number: 6938318

Country of ref document: JP

Free format text: JAPANESE INTERMEDIATE CODE: R151

LAPS Cancellation because of no payment of annual fees