CN109543501B - 图像处理装置、图像处理方法和存储介质 - Google Patents

图像处理装置、图像处理方法和存储介质 Download PDF

Info

Publication number
CN109543501B
CN109543501B CN201811107931.6A CN201811107931A CN109543501B CN 109543501 B CN109543501 B CN 109543501B CN 201811107931 A CN201811107931 A CN 201811107931A CN 109543501 B CN109543501 B CN 109543501B
Authority
CN
China
Prior art keywords
image
similarity
document
block
document image
Prior art date
Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
Active
Application number
CN201811107931.6A
Other languages
English (en)
Chinese (zh)
Other versions
CN109543501A (zh
Inventor
荒川纯也
Current Assignee (The listed assignees may be inaccurate. Google has not performed a legal analysis and makes no representation or warranty as to the accuracy of the list.)
Canon Inc
Original Assignee
Canon Inc
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
Filing date
Publication date
Application filed by Canon Inc filed Critical Canon Inc
Publication of CN109543501A publication Critical patent/CN109543501A/zh
Application granted granted Critical
Publication of CN109543501B publication Critical patent/CN109543501B/zh
Active legal-status Critical Current
Anticipated expiration legal-status Critical

Links

Images

Classifications

    • GPHYSICS
    • G06COMPUTING OR CALCULATING; COUNTING
    • G06VIMAGE OR VIDEO RECOGNITION OR UNDERSTANDING
    • G06V30/00Character recognition; Recognising digital ink; Document-oriented image-based pattern recognition
    • G06V30/40Document-oriented image-based pattern recognition
    • G06V30/41Analysis of document content
    • G06V30/414Extracting the geometrical structure, e.g. layout tree; Block segmentation, e.g. bounding boxes for graphics or text
    • HELECTRICITY
    • H04ELECTRIC COMMUNICATION TECHNIQUE
    • H04NPICTORIAL COMMUNICATION, e.g. TELEVISION
    • H04N1/00Scanning, transmission or reproduction of documents or the like, e.g. facsimile transmission; Details thereof
    • H04N1/0035User-machine interface; Control console
    • H04N1/00352Input means
    • H04N1/00355Mark-sheet input
    • H04N1/00376Means for identifying a mark sheet or area
    • GPHYSICS
    • G06COMPUTING OR CALCULATING; COUNTING
    • G06FELECTRIC DIGITAL DATA PROCESSING
    • G06F16/00Information retrieval; Database structures therefor; File system structures therefor
    • G06F16/50Information retrieval; Database structures therefor; File system structures therefor of still image data
    • G06F16/58Retrieval characterised by using metadata, e.g. metadata not derived from the content or metadata generated manually
    • G06F16/583Retrieval characterised by using metadata, e.g. metadata not derived from the content or metadata generated manually using metadata automatically derived from the content
    • GPHYSICS
    • G06COMPUTING OR CALCULATING; COUNTING
    • G06FELECTRIC DIGITAL DATA PROCESSING
    • G06F16/00Information retrieval; Database structures therefor; File system structures therefor
    • G06F16/90Details of database functions independent of the retrieved data types
    • G06F16/93Document management systems
    • GPHYSICS
    • G06COMPUTING OR CALCULATING; COUNTING
    • G06FELECTRIC DIGITAL DATA PROCESSING
    • G06F40/00Handling natural language data
    • G06F40/10Text processing
    • G06F40/166Editing, e.g. inserting or deleting
    • G06F40/174Form filling; Merging
    • GPHYSICS
    • G06COMPUTING OR CALCULATING; COUNTING
    • G06TIMAGE DATA PROCESSING OR GENERATION, IN GENERAL
    • G06T1/00General purpose image data processing
    • G06T1/0007Image acquisition
    • GPHYSICS
    • G06COMPUTING OR CALCULATING; COUNTING
    • G06TIMAGE DATA PROCESSING OR GENERATION, IN GENERAL
    • G06T7/00Image analysis
    • G06T7/10Segmentation; Edge detection
    • G06T7/13Edge detection
    • GPHYSICS
    • G06COMPUTING OR CALCULATING; COUNTING
    • G06VIMAGE OR VIDEO RECOGNITION OR UNDERSTANDING
    • G06V10/00Arrangements for image or video recognition or understanding
    • G06V10/20Image preprocessing
    • G06V10/22Image preprocessing by selection of a specific region containing or referencing a pattern; Locating or processing of specific regions to guide the detection or recognition
    • G06V10/235Image preprocessing by selection of a specific region containing or referencing a pattern; Locating or processing of specific regions to guide the detection or recognition based on user input or interaction
    • GPHYSICS
    • G06COMPUTING OR CALCULATING; COUNTING
    • G06VIMAGE OR VIDEO RECOGNITION OR UNDERSTANDING
    • G06V10/00Arrangements for image or video recognition or understanding
    • G06V10/70Arrangements for image or video recognition or understanding using pattern recognition or machine learning
    • G06V10/74Image or video pattern matching; Proximity measures in feature spaces
    • G06V10/75Organisation of the matching processes, e.g. simultaneous or sequential comparisons of image or video features; Coarse-fine approaches, e.g. multi-scale approaches; using context analysis; Selection of dictionaries
    • G06V10/751Comparing pixel values or logical combinations thereof, or feature values having positional relevance, e.g. template matching
    • HELECTRICITY
    • H04ELECTRIC COMMUNICATION TECHNIQUE
    • H04NPICTORIAL COMMUNICATION, e.g. TELEVISION
    • H04N1/00Scanning, transmission or reproduction of documents or the like, e.g. facsimile transmission; Details thereof
    • H04N1/00002Diagnosis, testing or measuring; Detecting, analysing or monitoring not otherwise provided for
    • H04N1/00005Diagnosis, testing or measuring; Detecting, analysing or monitoring not otherwise provided for relating to image data
    • HELECTRICITY
    • H04ELECTRIC COMMUNICATION TECHNIQUE
    • H04NPICTORIAL COMMUNICATION, e.g. TELEVISION
    • H04N1/00Scanning, transmission or reproduction of documents or the like, e.g. facsimile transmission; Details thereof
    • H04N1/00127Connection or combination of a still picture apparatus with another apparatus, e.g. for storage, processing or transmission of still picture signals or of information associated with a still picture
    • H04N1/00326Connection or combination of a still picture apparatus with another apparatus, e.g. for storage, processing or transmission of still picture signals or of information associated with a still picture with a data reading, recognizing or recording apparatus, e.g. with a bar-code apparatus
    • H04N1/00328Connection or combination of a still picture apparatus with another apparatus, e.g. for storage, processing or transmission of still picture signals or of information associated with a still picture with a data reading, recognizing or recording apparatus, e.g. with a bar-code apparatus with an apparatus processing optically-read information
    • H04N1/00331Connection or combination of a still picture apparatus with another apparatus, e.g. for storage, processing or transmission of still picture signals or of information associated with a still picture with a data reading, recognizing or recording apparatus, e.g. with a bar-code apparatus with an apparatus processing optically-read information with an apparatus performing optical character recognition
    • HELECTRICITY
    • H04ELECTRIC COMMUNICATION TECHNIQUE
    • H04NPICTORIAL COMMUNICATION, e.g. TELEVISION
    • H04N1/00Scanning, transmission or reproduction of documents or the like, e.g. facsimile transmission; Details thereof
    • H04N1/00912Arrangements for controlling a still picture apparatus or components thereof not otherwise provided for
    • H04N1/00938Software related arrangements, e.g. loading applications
    • H04N1/00949Combining applications, e.g. to create workflows
    • HELECTRICITY
    • H04ELECTRIC COMMUNICATION TECHNIQUE
    • H04NPICTORIAL COMMUNICATION, e.g. TELEVISION
    • H04N1/00Scanning, transmission or reproduction of documents or the like, e.g. facsimile transmission; Details thereof
    • H04N1/40Picture signal circuits
    • H04N1/40062Discrimination between different image types, e.g. two-tone, continuous tone
    • HELECTRICITY
    • H04ELECTRIC COMMUNICATION TECHNIQUE
    • H04NPICTORIAL COMMUNICATION, e.g. TELEVISION
    • H04N1/00Scanning, transmission or reproduction of documents or the like, e.g. facsimile transmission; Details thereof
    • H04N1/40Picture signal circuits
    • H04N1/407Control or modification of tonal gradation or of extreme levels, e.g. background level
    • H04N1/4072Control or modification of tonal gradation or of extreme levels, e.g. background level dependent on the contents of the original
    • HELECTRICITY
    • H04ELECTRIC COMMUNICATION TECHNIQUE
    • H04NPICTORIAL COMMUNICATION, e.g. TELEVISION
    • H04N1/00Scanning, transmission or reproduction of documents or the like, e.g. facsimile transmission; Details thereof
    • H04N1/40Picture signal circuits
    • H04N1/407Control or modification of tonal gradation or of extreme levels, e.g. background level
    • H04N1/4072Control or modification of tonal gradation or of extreme levels, e.g. background level dependent on the contents of the original
    • H04N1/4074Control or modification of tonal gradation or of extreme levels, e.g. background level dependent on the contents of the original using histograms
    • HELECTRICITY
    • H04ELECTRIC COMMUNICATION TECHNIQUE
    • H04NPICTORIAL COMMUNICATION, e.g. TELEVISION
    • H04N2201/00Indexing scheme relating to scanning, transmission or reproduction of documents or the like, and to details thereof
    • H04N2201/0077Types of the still picture apparatus
    • H04N2201/0094Multifunctional device, i.e. a device capable of all of reading, reproducing, copying, facsimile transception, file transception

Landscapes

  • Engineering & Computer Science (AREA)
  • Theoretical Computer Science (AREA)
  • Multimedia (AREA)
  • Physics & Mathematics (AREA)
  • Signal Processing (AREA)
  • General Physics & Mathematics (AREA)
  • General Engineering & Computer Science (AREA)
  • Computer Vision & Pattern Recognition (AREA)
  • Databases & Information Systems (AREA)
  • Library & Information Science (AREA)
  • Health & Medical Sciences (AREA)
  • General Health & Medical Sciences (AREA)
  • Artificial Intelligence (AREA)
  • Data Mining & Analysis (AREA)
  • Audiology, Speech & Language Pathology (AREA)
  • Computational Linguistics (AREA)
  • General Business, Economics & Management (AREA)
  • Business, Economics & Management (AREA)
  • Computing Systems (AREA)
  • Computer Graphics (AREA)
  • Geometry (AREA)
  • Biomedical Technology (AREA)
  • Evolutionary Computation (AREA)
  • Medical Informatics (AREA)
  • Software Systems (AREA)
  • Information Retrieval, Db Structures And Fs Structures Therefor (AREA)
  • Processing Or Creating Images (AREA)
  • Character Input (AREA)
CN201811107931.6A 2017-09-21 2018-09-21 图像处理装置、图像处理方法和存储介质 Active CN109543501B (zh)

Applications Claiming Priority (2)

Application Number Priority Date Filing Date Title
JP2017181695A JP7013182B2 (ja) 2017-09-21 2017-09-21 情報処理装置、情報処理方法およびプログラム
JP2017-181695 2017-09-21

Publications (2)

Publication Number Publication Date
CN109543501A CN109543501A (zh) 2019-03-29
CN109543501B true CN109543501B (zh) 2023-07-04

Family

ID=65719329

Family Applications (1)

Application Number Title Priority Date Filing Date
CN201811107931.6A Active CN109543501B (zh) 2017-09-21 2018-09-21 图像处理装置、图像处理方法和存储介质

Country Status (4)

Country Link
US (1) US10817559B2 (https=)
JP (1) JP7013182B2 (https=)
KR (1) KR102403964B1 (https=)
CN (1) CN109543501B (https=)

Families Citing this family (19)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
JP7013182B2 (ja) * 2017-09-21 2022-01-31 キヤノン株式会社 情報処理装置、情報処理方法およびプログラム
JP2021027556A (ja) 2019-08-08 2021-02-22 キヤノン株式会社 情報処理装置、情報処理方法及びプログラム
JP7486954B2 (ja) * 2020-01-08 2024-05-20 Tis株式会社 帳票処理プログラム、帳票処理装置及び帳票処理方法
JP7391672B2 (ja) * 2020-01-21 2023-12-05 キヤノン株式会社 文書を電子化するための画像処理システム、その制御方法及びプログラム
US12223261B2 (en) * 2020-03-12 2025-02-11 Canon Kabushiki Kaisha Image processing apparatus, image processing method, and storage medium
JP7516170B2 (ja) * 2020-03-12 2024-07-16 キヤノン株式会社 画像処理装置、画像処理方法、およびプログラム
KR102284781B1 (ko) * 2020-05-19 2021-08-02 (주)가온아이 문서의 스캔 이미지에 대한 보정이 가능한 전자 장치 및 그 동작 방법
CN112783840B (zh) * 2020-06-08 2024-06-25 北京金山办公软件股份有限公司 一种存储文档的方法、装置、电子设备及存储介质
CN112000834B (zh) * 2020-08-26 2024-08-09 北京百度网讯科技有限公司 文档处理方法、装置、系统、电子设备及存储介质
US11500843B2 (en) * 2020-09-02 2022-11-15 Coupa Software Incorporated Text-based machine learning extraction of table data from a read-only document
CN112052835B (zh) * 2020-09-29 2022-10-11 北京百度网讯科技有限公司 信息处理方法、信息处理装置、电子设备和存储介质
JP2022100071A (ja) 2020-12-23 2022-07-05 キヤノン株式会社 画像処理装置、画像処理システム、その制御方法及びプログラム
JP2022101136A (ja) * 2020-12-24 2022-07-06 キヤノン株式会社 情報処理装置、情報処理方法およびプログラム
CN113569886A (zh) * 2021-01-15 2021-10-29 腾讯科技(深圳)有限公司 网络结构调整方法、装置和存储介质及电子设备
JP2022159774A (ja) 2021-04-05 2022-10-18 キヤノン株式会社 画像処理装置、画像処理システム、その制御方法及びプログラム
CN113095316B (zh) * 2021-04-15 2023-04-07 西安电子科技大学 基于多级融合和角点偏移的图像旋转目标检测方法
JP2022170175A (ja) * 2021-04-28 2022-11-10 キヤノン株式会社 情報処理装置、情報処理方法、及びプログラム
JP7690354B2 (ja) * 2021-08-26 2025-06-10 キヤノン株式会社 画像処理装置、プログラム、画像処理方法
KR102394483B1 (ko) * 2021-09-02 2022-05-04 (주)가온아이 전자 문서에 오류가 있는지 여부를 판단하는 오류 판단 서비스를 제공하기 위한 서비스 제공 서버 및 그 동작 방법

Citations (1)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN105426884A (zh) * 2015-11-10 2016-03-23 佛山科学技术学院 一种基于全幅特征提取的快速文档类型识别方法

Family Cites Families (22)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
JP2000322512A (ja) * 1999-05-13 2000-11-24 Canon Inc 帳票処理装置及び帳票処理方法
JP4140221B2 (ja) * 2001-09-18 2008-08-27 富士ゼロックス株式会社 画像照合装置および画像照合プログラム
JP2004348706A (ja) 2003-04-30 2004-12-09 Canon Inc 情報処理装置及び情報処理方法ならびに記憶媒体、プログラム
JP2004334337A (ja) 2003-04-30 2004-11-25 Canon Inc 画像処理装置
JP4366119B2 (ja) * 2003-05-29 2009-11-18 キヤノン株式会社 文書処理装置
JP4328692B2 (ja) * 2004-08-11 2009-09-09 国立大学法人東京工業大学 物体検出装置
JP2007172077A (ja) 2005-12-19 2007-07-05 Fuji Xerox Co Ltd 画像検索システム及び方法及びプログラム
JP4859025B2 (ja) * 2005-12-16 2012-01-18 株式会社リコー 類似画像検索装置、類似画像検索処理方法、プログラム及び情報記録媒体
US7639893B2 (en) * 2006-05-17 2009-12-29 Xerox Corporation Histogram adjustment for high dynamic range image mapping
JP2008181460A (ja) * 2007-01-26 2008-08-07 Ricoh Co Ltd 文書画像検索装置および文書画像検索方法
JP4420085B2 (ja) * 2007-08-20 2010-02-24 ソニー株式会社 データ処理装置、データ処理方法、プログラムおよび記録媒体
JP5006764B2 (ja) * 2007-11-08 2012-08-22 キヤノン株式会社 画像処理装置、画像処理方法、プログラム、および記憶媒体
JP5111268B2 (ja) 2008-07-09 2013-01-09 キヤノン株式会社 画像処理装置、画像処理方法、そのプログラムおよび記憶媒体
WO2010122721A1 (ja) * 2009-04-22 2010-10-28 日本電気株式会社 照合装置、照合方法および照合プログラム
JP4934701B2 (ja) * 2009-06-30 2012-05-16 株式会社日立製作所 ステレオ画像処理装置およびステレオ画像処理方法
JP4940270B2 (ja) 2009-07-06 2012-05-30 シャープ株式会社 画像形成装置
JP2011141664A (ja) * 2010-01-06 2011-07-21 Canon Inc 文書比較装置、文書比較方法、及びプログラム
US8582890B2 (en) * 2010-10-15 2013-11-12 DigitalOptics Corporation Europe Limited Image sharpening via gradient environment detection
JP6511986B2 (ja) * 2015-06-26 2019-05-15 富士通株式会社 プログラム生成装置、プログラム生成方法および生成プログラム
JP6496025B2 (ja) 2015-07-10 2019-04-03 株式会社日立製作所 文書処理システム及び文書処理方法
US10528542B2 (en) * 2016-08-24 2020-01-07 Google Llc Change direction based map interface updating system
JP7013182B2 (ja) * 2017-09-21 2022-01-31 キヤノン株式会社 情報処理装置、情報処理方法およびプログラム

Patent Citations (1)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN105426884A (zh) * 2015-11-10 2016-03-23 佛山科学技术学院 一种基于全幅特征提取的快速文档类型识别方法

Also Published As

Publication number Publication date
KR20190033451A (ko) 2019-03-29
CN109543501A (zh) 2019-03-29
JP7013182B2 (ja) 2022-01-31
US20190087444A1 (en) 2019-03-21
US10817559B2 (en) 2020-10-27
JP2019057173A (ja) 2019-04-11
KR102403964B1 (ko) 2022-06-02

Similar Documents

Publication Publication Date Title
CN109543501B (zh) 图像处理装置、图像处理方法和存储介质
US12294678B2 (en) Image processing apparatus, control method for image processing apparatus, and non-transitory storage medium
US7593961B2 (en) Information processing apparatus for retrieving image data similar to an entered image
JP5059545B2 (ja) 画像処理装置及び画像処理方法
US20040218838A1 (en) Image processing apparatus and method therefor
US12223261B2 (en) Image processing apparatus, image processing method, and storage medium
US20100158375A1 (en) Signal processing apparatus, signal processing method, computer-readable medium and computer data signal
US20170099403A1 (en) Document distribution system, document distribution apparatus, information processing method, and storage medium
JP4533273B2 (ja) 画像処理装置及び画像処理方法、プログラム
US20220350956A1 (en) Information processing apparatus, information processing method, and storage medium
US20130050765A1 (en) Method and apparatus for document authentication using image comparison on a block-by-block basis
JP2018042067A (ja) 画像処理システム、画像処理方法、情報処理装置
JP2019153919A (ja) 画像処理装置、その制御方法、及びプログラム
US12423350B2 (en) Image processing apparatus deriving condition for estimating text block, image processing method, and storage medium
JP3733310B2 (ja) 文書書式識別装置および識別方法
JP2008022159A (ja) 文書処理装置及び文書処理方法
US12475164B2 (en) Drawing search device, drawing database construction device, drawing search system, drawing search method, and recording medium
JP6700705B2 (ja) 振り分けシステム、情報処理方法、及びプログラム
JP7516170B2 (ja) 画像処理装置、画像処理方法、およびプログラム
JP7570843B2 (ja) 画像処理装置、画像形成システム、画像処理方法、およびプログラム
JP2020047138A (ja) 情報処理装置
JP2001034763A (ja) 文書画像処理装置、その文書タイトル抽出方法及び文書タグ情報付与方法
CN110390323B (zh) 信息处理装置以及计算机可读介质
JP2007041709A (ja) 文書処理システム、文書処理システムの制御方法、文書処理装置、並びに、コンピュータプログラム及びコンピュータ可読記憶媒体
US20220309812A1 (en) Information processing apparatus, information processing system, computer-readable non-transitory recording medium storing information processing program, and information processing method

Legal Events

Date Code Title Description
PB01 Publication
PB01 Publication
SE01 Entry into force of request for substantive examination
SE01 Entry into force of request for substantive examination
GR01 Patent grant
GR01 Patent grant