DE602004006682D1 - Extraktion von Metadaten aus gekennzeichneten Bereichen eines Dokuments - Google Patents

Extraktion von Metadaten aus gekennzeichneten Bereichen eines Dokuments

Info

Publication number
DE602004006682D1
DE602004006682D1 DE602004006682T DE602004006682T DE602004006682D1 DE 602004006682 D1 DE602004006682 D1 DE 602004006682D1 DE 602004006682 T DE602004006682 T DE 602004006682T DE 602004006682 T DE602004006682 T DE 602004006682T DE 602004006682 D1 DE602004006682 D1 DE 602004006682D1
Authority
DE
Germany
Prior art keywords
metadata
image
pixels
document
extraction
Prior art date
Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
Active
Application number
DE602004006682T
Other languages
English (en)
Other versions
DE602004006682T2 (de
Inventor
Jodocus F Jager
Current Assignee (The listed assignees may be inaccurate. Google has not performed a legal analysis and makes no representation or warranty as to the accuracy of the list.)
Canon Production Printing Netherlands BV
Original Assignee
Oce Technologies BV
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
Filing date
Publication date
Application filed by Oce Technologies BV filed Critical Oce Technologies BV
Publication of DE602004006682D1 publication Critical patent/DE602004006682D1/de
Application granted granted Critical
Publication of DE602004006682T2 publication Critical patent/DE602004006682T2/de
Active legal-status Critical Current
Anticipated expiration legal-status Critical

Links

Classifications

    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06FELECTRIC DIGITAL DATA PROCESSING
    • G06F16/00Information retrieval; Database structures therefor; File system structures therefor
    • G06F16/90Details of database functions independent of the retrieved data types
    • G06F16/93Document management systems
    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06VIMAGE OR VIDEO RECOGNITION OR UNDERSTANDING
    • G06V30/00Character recognition; Recognising digital ink; Document-oriented image-based pattern recognition
    • G06V30/40Document-oriented image-based pattern recognition
    • G06V30/41Analysis of document content
    • G06V30/416Extracting the logical structure, e.g. chapters, sections or page numbers; Identifying elements of the document, e.g. authors

Landscapes

  • Engineering & Computer Science (AREA)
  • Theoretical Computer Science (AREA)
  • Databases & Information Systems (AREA)
  • Computer Vision & Pattern Recognition (AREA)
  • Physics & Mathematics (AREA)
  • General Physics & Mathematics (AREA)
  • General Business, Economics & Management (AREA)
  • Business, Economics & Management (AREA)
  • Multimedia (AREA)
  • Artificial Intelligence (AREA)
  • Data Mining & Analysis (AREA)
  • General Engineering & Computer Science (AREA)
  • Processing Or Creating Images (AREA)
  • User Interface Of Digital Computer (AREA)
  • Character Input (AREA)
  • Information Retrieval, Db Structures And Fs Structures Therefor (AREA)
  • Editing Of Facsimile Originals (AREA)
DE602004006682T 2003-08-20 2004-08-13 Extraktion von Metadaten aus gekennzeichneten Bereichen eines Dokuments Active DE602004006682T2 (de)

Applications Claiming Priority (2)

Application Number Priority Date Filing Date Title
EP03077643 2003-08-20
EP03077643 2003-08-20

Publications (2)

Publication Number Publication Date
DE602004006682D1 true DE602004006682D1 (de) 2007-07-12
DE602004006682T2 DE602004006682T2 (de) 2008-01-31

Family

ID=34178536

Family Applications (1)

Application Number Title Priority Date Filing Date
DE602004006682T Active DE602004006682T2 (de) 2003-08-20 2004-08-13 Extraktion von Metadaten aus gekennzeichneten Bereichen eines Dokuments

Country Status (6)

Country Link
US (1) US7756332B2 (de)
EP (1) EP1510962B1 (de)
JP (2) JP4970714B2 (de)
CN (2) CN100382096C (de)
AT (1) ATE363700T1 (de)
DE (1) DE602004006682T2 (de)

Families Citing this family (52)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
DE10342594B4 (de) 2003-09-15 2005-09-15 Océ Document Technologies GmbH Verfahren und System zum Erfassen von Daten aus mehreren maschinell lesbaren Dokumenten
US20060004745A1 (en) * 2004-06-04 2006-01-05 Agfa Corporation Structured reporting report data manager
US7475336B2 (en) * 2004-08-11 2009-01-06 Kabushiki Kaisha Toshiba Document information processing apparatus and document information processing program
JP4536461B2 (ja) * 2004-09-06 2010-09-01 株式会社沖データ 画像処理装置
US8495061B1 (en) 2004-09-29 2013-07-23 Google Inc. Automatic metadata identification
EP1729235A1 (de) * 2005-06-03 2006-12-06 Agfa Corporation Berichtdatenmanager mit strukturierten Berichten
US20060290789A1 (en) * 2005-06-22 2006-12-28 Nokia Corporation File naming with optical character recognition
DE102005032046A1 (de) * 2005-07-08 2007-01-11 Océ Document Technologies GmbH Verfahren, System und Computerprogramm-Produkt zum Übertragen von Daten aus einer Dokumentenanwendung in eine Datenanwendung
US20070035780A1 (en) * 2005-08-02 2007-02-15 Kabushiki Kaisha Toshiba System and method for defining characteristic data of a scanned document
US7765184B2 (en) * 2005-09-22 2010-07-27 Nokia Corporation Metadata triggered notification for content searching
JP4856925B2 (ja) * 2005-10-07 2012-01-18 株式会社リコー 画像処理装置、画像処理方法及び画像処理プログラム
JP2007249429A (ja) * 2006-03-14 2007-09-27 Ricoh Co Ltd 電子メール編集装置、画像形成装置、電子メール編集方法、およびその方法をコンピュータに実行させるプログラム
JP5078413B2 (ja) * 2006-04-17 2012-11-21 株式会社リコー 画像閲覧システム
US10380231B2 (en) * 2006-05-24 2019-08-13 International Business Machines Corporation System and method for dynamic organization of information sets
US8768983B2 (en) * 2006-10-04 2014-07-01 International Business Machines Corporation Dynamic configuration of multiple sources and source types in a business process
US20080162602A1 (en) * 2006-12-28 2008-07-03 Google Inc. Document archiving system
JP4501016B2 (ja) * 2007-03-22 2010-07-14 村田機械株式会社 原稿読取装置
EP2015554B1 (de) 2007-07-13 2012-05-16 Ricoh Company, Ltd. Verfahren zur Erzeugung von Benutzerschnittstellen, Bildgebungsvorrichtung und Computerprogrammprodukt
US8144988B2 (en) * 2007-09-06 2012-03-27 Ricoh Company, Ltd. Document-image-data providing system, document-image-data providing device, information processing device, document-image-data providing method, information processing method, document-image-data providing program, and information processing program
US8194982B2 (en) * 2007-09-18 2012-06-05 Ricoh Company, Ltd. Document-image-data providing system, document-image-data providing device, information processing device, document-image-data providing method, information processing method, document-image-data providing program, and information processing program
US8510312B1 (en) * 2007-09-28 2013-08-13 Google Inc. Automatic metadata identification
US8009316B2 (en) * 2007-10-26 2011-08-30 Ricoh Production Print Solutions LLC Methods and apparatus for efficient sheetside bitmap processing using meta-data information
JP4604100B2 (ja) * 2008-03-21 2010-12-22 シャープ株式会社 画像処理方法、画像処理装置、画像形成装置、プログラムおよび記憶媒体
KR101023309B1 (ko) 2008-03-31 2011-03-18 후지츠 프론테크 가부시키가이샤 문자 인식 장치
JP4909311B2 (ja) 2008-03-31 2012-04-04 富士通フロンテック株式会社 文字認識装置
CN101577832B (zh) * 2008-05-06 2012-03-21 联咏科技股份有限公司 用于加强文字显示效果的图像处理电路及其方法
US20090279127A1 (en) * 2008-05-08 2009-11-12 Infoprint Solutions Company Llc Mechanism for data extraction of variable positioned data
TWI423052B (zh) * 2008-07-04 2014-01-11 Hon Hai Prec Ind Co Ltd 資料庫主動掃描系統及方法
US8682072B2 (en) * 2008-12-30 2014-03-25 Yahoo! Inc. Image segmentation
JP2010252266A (ja) * 2009-04-20 2010-11-04 Olympus Imaging Corp 画像整理装置
JP5340847B2 (ja) * 2009-07-27 2013-11-13 株式会社日立ソリューションズ 文書データ処理装置
US8542198B2 (en) * 2009-08-24 2013-09-24 Xerox Corporation Multi-touch input actual-size display screen for scanned items
KR101164353B1 (ko) * 2009-10-23 2012-07-09 삼성전자주식회사 미디어 콘텐츠 열람 및 관련 기능 실행 방법과 장치
KR20120033718A (ko) * 2010-09-30 2012-04-09 삼성전자주식회사 화상형성장치 및 그 장치에서의 이메일 전송 방법
CN102147684B (zh) * 2010-11-30 2014-04-23 广东威创视讯科技股份有限公司 一种触摸屏屏幕扫描方法及其系统
CN102253746B (zh) 2011-06-23 2017-05-03 中兴通讯股份有限公司 用于具有触控屏的电子设备的信息处理方法及设备
CN102855264B (zh) * 2011-07-01 2015-11-25 富士通株式会社 文档处理方法及其装置
US9292537B1 (en) 2013-02-23 2016-03-22 Bryant Christopher Lee Autocompletion of filename based on text in a file to be saved
JP2014174923A (ja) * 2013-03-12 2014-09-22 Ricoh Co Ltd 文書処理装置、文書処理方法、および文書処理プログラム
JP6163839B2 (ja) 2013-04-09 2017-07-19 富士通株式会社 電子機器および複写制御プログラム
US10325511B2 (en) * 2015-01-30 2019-06-18 Conduent Business Services, Llc Method and system to attribute metadata to preexisting documents
US20170039683A1 (en) * 2015-08-06 2017-02-09 Fuji Xerox Co., Ltd. Image processing apparatus, image processing method, image processing system, and non-transitory computer readable medium
US10810240B2 (en) * 2015-11-06 2020-10-20 RedShred LLC Automatically assessing structured data for decision making
CN107229932B (zh) * 2016-03-25 2021-05-28 阿里巴巴集团控股有限公司 一种图像文本的识别方法和装置
JP6891073B2 (ja) * 2017-08-22 2021-06-18 キヤノン株式会社 スキャン画像にファイル名等を設定するための装置、その制御方法及びプログラム
JP7043929B2 (ja) * 2018-03-29 2022-03-30 株式会社リコー 情報処理システムおよび情報処理方法
US11227153B2 (en) 2019-12-11 2022-01-18 Optum Technology, Inc. Automated systems and methods for identifying fields and regions of interest within a document image
US11210507B2 (en) 2019-12-11 2021-12-28 Optum Technology, Inc. Automated systems and methods for identifying fields and regions of interest within a document image
US11228687B2 (en) * 2020-01-21 2022-01-18 Canon Kabushiki Kaisha Image processing system that computerizes document, control method thereof, and storage medium
JP7434001B2 (ja) 2020-03-13 2024-02-20 キヤノン株式会社 情報処理装置、プログラム、情報処理方法
CN111949230B (zh) * 2020-08-10 2022-07-05 智业软件股份有限公司 一种基于LibreOffice文档的覆盖打印方法、终端设备及存储介质
US11960816B2 (en) 2021-01-15 2024-04-16 RedShred LLC Automatic document generation and segmentation system

Family Cites Families (22)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
JPH04276885A (ja) * 1991-03-04 1992-10-01 Sumitomo Electric Ind Ltd 文字切出し装置
JPH04276855A (ja) 1991-03-04 1992-10-01 Nippon Telegr & Teleph Corp <Ntt> 文書保管方式
JP3285686B2 (ja) * 1993-06-29 2002-05-27 株式会社リコー 領域分割方法
JPH08166959A (ja) * 1994-12-12 1996-06-25 Canon Inc 画像処理方法
JPH09128479A (ja) * 1995-11-01 1997-05-16 Ricoh Co Ltd 領域分割方法及び領域分割装置
US5761686A (en) * 1996-06-27 1998-06-02 Xerox Corporation Embedding encoded information in an iconic version of a text image
CN1266646C (zh) * 1997-09-19 2006-07-26 王永民 名片管理器及其操作系统
JP3773642B2 (ja) * 1997-12-18 2006-05-10 株式会社東芝 画像処理装置および画像形成装置
AUPP400998A0 (en) * 1998-06-10 1998-07-02 Canon Kabushiki Kaisha Face detection in digital images
US6353823B1 (en) * 1999-03-08 2002-03-05 Intel Corporation Method and system for using associative metadata
JP2001084332A (ja) * 1999-09-10 2001-03-30 Toshiba Corp 読取装置と読取方法
US6360951B1 (en) * 1999-12-16 2002-03-26 Xerox Corporation Hand-held scanning system for heuristically organizing scanned information
FR2806814B1 (fr) * 2000-03-22 2006-02-03 Oce Ind Sa Procede de reconnaissance et d'indexation de documents
NL1015943C2 (nl) 2000-08-16 2002-02-19 Ocu Technologies B V Interpretatie van gekleurde documenten.
KR100411894B1 (ko) * 2000-12-28 2003-12-24 한국전자통신연구원 문서영상 영역해석 방법
US6804684B2 (en) * 2001-05-07 2004-10-12 Eastman Kodak Company Method for associating semantic information with multiple images in an image database environment
EP1256900A1 (de) * 2001-05-09 2002-11-13 Requisite Technology Inc. Datenbankeingabesystem und -methode mit optischer Zeichenerkennung
US7432940B2 (en) * 2001-10-12 2008-10-07 Canon Kabushiki Kaisha Interactive animation of sprites in a video production
US7043474B2 (en) * 2002-04-15 2006-05-09 International Business Machines Corporation System and method for measuring image similarity based on semantic meaning
US7050629B2 (en) * 2002-05-31 2006-05-23 Intel Corporation Methods and systems to index and retrieve pixel data
GB2399245B (en) * 2003-03-03 2005-07-27 Motorola Inc Method for segmenting an image and an image transmission system and image transmission unit therefor
US7236632B2 (en) * 2003-04-11 2007-06-26 Ricoh Company, Ltd. Automated techniques for comparing contents of images

Also Published As

Publication number Publication date
CN1839396A (zh) 2006-09-27
EP1510962A1 (de) 2005-03-02
ATE363700T1 (de) 2007-06-15
US7756332B2 (en) 2010-07-13
US20050041860A1 (en) 2005-02-24
JP4970714B2 (ja) 2012-07-11
JP2005071349A (ja) 2005-03-17
EP1510962B1 (de) 2007-05-30
CN100382096C (zh) 2008-04-16
CN100476859C (zh) 2009-04-08
DE602004006682T2 (de) 2008-01-31
JP2012053911A (ja) 2012-03-15
CN1604120A (zh) 2005-04-06

Similar Documents

Publication Publication Date Title
DE602004006682D1 (de) Extraktion von Metadaten aus gekennzeichneten Bereichen eines Dokuments
ATE356389T1 (de) Dokumentenscanner
CN1251056C (zh) 计算机设备
Arai et al. PaperLink: a technique for hyperlinking from real paper to electronic content
JP6317772B2 (ja) 外国語の文字セットおよびそれらの翻訳を資源に制約のあるモバイル機器上にリアルタイムで表示するためのシステムおよび方法
EP2306270B1 (de) Zeicheneingabeverfahren und -system
WO2014176912A1 (en) Two dimensional-code scanning method and device
JP6010253B2 (ja) 電子機器、方法およびプログラム
WO2009075061A1 (ja) 情報入力装置、情報処理装置、情報入力システム、情報処理システム、2次元書式情報サーバ、情報入力方法、制御プログラム、および記録媒体
TW200603007A (en) Apparatus and method for handwriting recognition
KR20100051648A (ko) 디지털 영상의 영역들을 조작하는 방법
JPH07141101A (ja) 画像を用いた入力システム
AU2003283447A1 (en) Method and user interface for entering characters
EP1416426A3 (de) Vorrichtung, Programm und Verfahren zur handschriftlichen Zeicheneingabe
SE0104041L (sv) Elektronisk penna och metod för registrering av handskriven information
US20020191005A1 (en) Visual cue for on-screen scrolling
US8787670B2 (en) Software for text and image edit recognition for editing of images that contain text
CN101354789A (zh) 一种图像面具特效的实现方法和设备
CN104020853A (zh) 基于Kinect的操纵网络浏览器的系统及方法
CN102053949A (zh) 处理生僻字的方法和装置
KR19990045918A (ko) 영상표시기능을수반한컴퓨터포인터및표시방법
EP1701292A3 (de) Dokumentenlayoutanalyse mit Steuerung des zeichenlosen Bereiches
KR101550419B1 (ko) 웹 이미지 대체 텍스트 생성 장치 및 방법
JP2015114955A (ja) 情報処理装置、情報処理方法、およびプログラム
WO2015107692A1 (ja) 手書きのための電子機器および方法

Legal Events

Date Code Title Description
8364 No opposition during term of opposition