EP2682881A3 - Document Processing Apparatus, Image Processing Apparatus, Document Processing Method, and Medium - Google Patents

Document Processing Apparatus, Image Processing Apparatus, Document Processing Method, and Medium

Info

Publication number
EP2682881A3
Authority
EP
European Patent Office
Prior art keywords
processing apparatus
document
document processing
title
medium
Prior art date
Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
Withdrawn
Application number
EP13172935.2A
Other languages
German (de)
French (fr)
Other versions
EP2682881A2 (en)
Inventor
Yoshihisa Ohguro
Current Assignee (The listed assignees may be inaccurate. Google has not performed a legal analysis and makes no representation or warranty as to the accuracy of the list.)
Ricoh Co Ltd
Original Assignee
Ricoh Co Ltd
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
2012-07-05
Filing date
2013-06-20
Publication date
2016-10-26
Application filed by Ricoh Co Ltd
Publication of EP2682881A2
Publication of EP2682881A3
Current legal status: Withdrawn

Classifications

    • G PHYSICS
    • G06 COMPUTING; CALCULATING OR COUNTING
    • G06V IMAGE OR VIDEO RECOGNITION OR UNDERSTANDING
    • G06V20/00 Scenes; Scene-specific elements
    • G06V20/60 Type of objects
    • G06V20/62 Text, e.g. of license plates, overlay texts or captions on TV images
    • G PHYSICS
    • G06 COMPUTING; CALCULATING OR COUNTING
    • G06F ELECTRIC DIGITAL DATA PROCESSING
    • G06F40/00 Handling natural language data
    • G06F40/10 Text processing
    • G06F40/12 Use of codes for handling textual entities
    • G PHYSICS
    • G06 COMPUTING; CALCULATING OR COUNTING
    • G06F ELECTRIC DIGITAL DATA PROCESSING
    • G06F16/00 Information retrieval; Database structures therefor; File system structures therefor
    • G06F16/90 Details of database functions independent of the retrieved data types
    • G06F16/93 Document management systems

Landscapes

  • Engineering & Computer Science (AREA)
  • Theoretical Computer Science (AREA)
  • Physics & Mathematics (AREA)
  • General Physics & Mathematics (AREA)
  • Databases & Information Systems (AREA)
  • General Engineering & Computer Science (AREA)
  • Business, Economics & Management (AREA)
  • Multimedia (AREA)
  • Data Mining & Analysis (AREA)
  • General Business, Economics & Management (AREA)
  • General Health & Medical Sciences (AREA)
  • Health & Medical Sciences (AREA)
  • Computational Linguistics (AREA)
  • Audiology, Speech & Language Pathology (AREA)
  • Artificial Intelligence (AREA)
  • Character Discrimination (AREA)
  • Information Retrieval, Db Structures And Fs Structures Therefor (AREA)
  • Character Input (AREA)
  • Document Processing Apparatus (AREA)
  • Machine Translation (AREA)

Abstract

In a document processing apparatus (1), an OCR unit (13) extracts character information from document image data scanned by a document scanner (12). A title generator (14) extracts, from the character information produced by the OCR unit (13), a predefined number of strings that characterize the document image data and treats them as title strings. A document name generator (15) then generates, from the title strings extracted by the title generator (14), a string that satisfies a predefined output condition and uses it as the document name.
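The flow described in the abstract (OCR text, then title strings, then a document name that meets an output condition) can be sketched roughly as follows. This is only an illustration under stated assumptions, not the method claimed in the application: the function names `extract_title_strings` and `generate_document_name` are hypothetical, the "first non-empty lines" heuristic is a placeholder for whatever the title generator (14) actually does, and the output condition is assumed here to be "a file-system-safe name of limited length". The OCR step itself is not shown; the sketch starts from text that an OCR unit would already have produced.

```python
import re
import unicodedata
from typing import List, Sequence


def extract_title_strings(ocr_text: str, count: int = 2) -> List[str]:
    """Hypothetical stand-in for the title generator (14).

    Assumption: the first `count` non-empty lines of the OCR text are
    taken as title strings. The abstract does not specify the heuristic.
    """
    lines = [line.strip() for line in ocr_text.splitlines()]
    return [line for line in lines if line][:count]


def generate_document_name(title_strings: Sequence[str],
                           max_length: int = 64,
                           extension: str = ".pdf") -> str:
    """Hypothetical stand-in for the document name generator (15).

    Assumed output condition: ASCII-compatible digits and spaces,
    no characters that common file systems reject, limited length.
    """
    joined = "_".join(title_strings)
    # NFKC folds full-width (double-byte) digits and spaces to ASCII forms.
    normalized = unicodedata.normalize("NFKC", joined)
    # Replace runs of reserved characters and whitespace with one underscore.
    safe = re.sub(r'[\\/:*?"<>|\s]+', "_", normalized).strip("_")
    # Truncate to the length limit; fall back to a fixed name if empty.
    stem = safe[:max_length].rstrip("_") or "untitled"
    return stem + extension


if __name__ == "__main__":
    # Sample OCR output with full-width digits in the first line.
    sample = "  Quarterly Report ２０１２\n\nSales: Q2 / Asia\nBody text follows."
    titles = extract_title_strings(sample, count=2)
    print(generate_document_name(titles))
```

Keeping title extraction and name generation as separate functions mirrors the separation between units (14) and (15) in the abstract, so the output condition (length limit, character set, extension) can change without touching how title strings are chosen.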
EP13172935.2A 2012-07-05 2013-06-20 Document Processing Apparatus, Image Processing Apparatus, Document Processing Method, and Medium Withdrawn EP2682881A3 (en)

Applications Claiming Priority (1)

Application Number Priority Date Filing Date Title
JP2012151256A JP2014013534A (en) 2012-07-05 2012-07-05 Document processor, image processor, image processing method and document processing program

Publications (2)

Publication Number Publication Date
EP2682881A2 (en) 2014-01-08
EP2682881A3 (en) 2016-10-26

Family

Family ID: 48793869

Family Applications (1)

Application Number Title Priority Date Filing Date
EP13172935.2A Withdrawn EP2682881A3 (en) 2012-07-05 2013-06-20 Document Processing Apparatus, Image Processing Apparatus, Document Processing Method, and Medium

Country Status (3)

Country Link
US (1) US20140013220A1 (en)
EP (1) EP2682881A3 (en)
JP (1) JP2014013534A (en)

Families Citing this family (11)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
JP6051827B2 (en) * 2012-12-07 2016-12-27 株式会社リコー Document processing apparatus, image processing apparatus, document processing method, and document processing program
US9400833B2 (en) * 2013-11-15 2016-07-26 Citrix Systems, Inc. Generating electronic summaries of online meetings
US9342561B2 (en) * 2014-01-08 2016-05-17 International Business Machines Corporation Creating and using titles in untitled documents to answer questions
CN103870939B (en) * 2014-04-01 2017-08-29 北京中电普华信息技术有限公司 A kind of object oriented generation method and system
JP6470071B2 (en) * 2015-03-06 2019-02-13 シャープ株式会社 Image processing device
US9542136B2 (en) 2015-03-19 2017-01-10 Ricoh Company, Ltd. Communication control system, communication control apparatus, and communication control method
EP3507722A4 (en) 2016-09-02 2020-03-18 FutureVault Inc. Automated document filing and processing methods and systems
US10289963B2 (en) * 2017-02-27 2019-05-14 International Business Machines Corporation Unified text analytics annotator development life cycle combining rule-based and machine learning based techniques
JP6690596B2 (en) * 2017-04-28 2020-04-28 京セラドキュメントソリューションズ株式会社 Information processing equipment
JP7129357B2 (en) * 2019-02-18 2022-09-01 株式会社東芝 METHOD FOR VERTICAL INVERSION OF CASING HALF, ROTATING SHAFT BRACKET AND INVERSION BASE USED FOR THE METHOD
JP2024010503A (en) * 2022-07-12 2024-01-24 京セラドキュメントソリューションズ株式会社 Image read-out device and image formation device

Citations (4)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
JP2003016076A (en) * 2001-06-28 2003-01-17 Ricoh Co Ltd Method for title extraction from document image
JP2007122403A (en) * 2005-10-28 2007-05-17 Fuji Xerox Co Ltd Device, method, and program for automatically extracting document title and relevant information
JP2010113735A (en) * 2010-01-21 2010-05-20 Omron Corp Data name determination device
JP2011155548A (en) * 2010-01-28 2011-08-11 Kyocera Mita Corp Device, program and method for creation of file

Family Cites Families (15)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
JP3070436B2 (en) * 1995-04-17 2000-07-31 ブラザー工業株式会社 Facsimile machine
JP3425834B2 (en) * 1995-09-06 2003-07-14 富士通株式会社 Title extraction apparatus and method from document image
WO1999042936A1 (en) * 1998-02-24 1999-08-26 Gateway 2000, Inc. Software management system
US7099507B2 (en) * 1998-11-05 2006-08-29 Ricoh Company, Ltd Method and system for extracting title from document image
US20020078069A1 (en) * 2000-12-15 2002-06-20 International Business Machines Corporation Automatic file name/attribute generator for object oriented desktop shells
US20020143804A1 (en) * 2001-04-02 2002-10-03 Dowdy Jacklyn M. Electronic filer
JP2004070523A (en) * 2002-08-02 2004-03-04 Canon Inc Information processor and its method
GB0327694D0 (en) * 2003-11-28 2003-12-31 Ibm A system for distributed communications
JP2005202714A (en) * 2004-01-16 2005-07-28 Giken Shoji International Co Ltd Document retrieval system
JP4134056B2 (en) * 2005-01-27 2008-08-13 京セラミタ株式会社 Image reading apparatus and image reading program
JP4964080B2 (en) * 2007-01-17 2012-06-27 株式会社東芝 Image processing system, image processing method, and image processing program
US8189920B2 (en) * 2007-01-17 2012-05-29 Kabushiki Kaisha Toshiba Image processing system, image processing method, and image processing program
WO2008120030A1 (en) * 2007-04-02 2008-10-09 Sobha Renaissance Information Latent metonymical analysis and indexing [lmai]
JP2009027648A (en) * 2007-07-23 2009-02-05 Murata Mach Ltd Image processing device
JP5256099B2 (en) * 2009-03-31 2013-08-07 株式会社日立ソリューションズ Recognition parameter tuning method

Non-Patent Citations (1)

* Cited by examiner, † Cited by third party
Title
ANONYMOUS: "Convert double-byte numbers and spaces in filenames to ASCII - Stack Overflow", 26 September 2010 (2010-09-26), XP055303123, Retrieved from the Internet <URL:http://web.archive.org/web/20100926143040/http://stackoverflow.com/questions/1992581/convert-double-byte-numbers-and-spaces-in-filenames-to-ascii> [retrieved on 20160915] *

Also Published As

Publication number Publication date
EP2682881A2 (en) 2014-01-08
US20140013220A1 (en) 2014-01-09
JP2014013534A (en) 2014-01-23

Similar Documents

Publication Publication Date Title
EP2682881A3 (en) Document Processing Apparatus, Image Processing Apparatus, Document Processing Method, and Medium
EP2343670A3 (en) Apparatus and method for digitizing documents
EP2584800A3 (en) Digital system and method of processing service data thereof
EP2565825A3 (en) Schedule managing method and apparatus using optical character reader
EP2746989A3 (en) Document processing device, image processing apparatus, document processing method and computer program product
EP2306301A3 (en) Image processing system, image processing method and image processing program
EP1603059A3 (en) Paper-based document upload and tracking system
EP2573711A3 (en) Traffic sign detecting method and traffic sign detecting device
EP2230593A3 (en) Job management apparatus, control method, and program
EP2202992A3 (en) Image processing method and apparatus therefor
EP2704061A3 (en) Apparatus and method for recognizing a character in terminal equipment
EP2364011A3 (en) Fine-grained visual document fingerprinting for accurate document comparison and retrieval
EP2426622A3 (en) Finding low variance regions in document images for generating image anchor templates for content anchoring, data extraction, and document classification
EP2019553A3 (en) Image transmitting apparatus, image transmitting method, receiving apparatus, and image transmitting system
EP2506154A3 (en) Text, character encoding and language recognition
EP2814233A3 (en) Image forming apparatus and image forming method
EP2418606A3 (en) Apparatus and method for recognizing objects using filter information
EP2264995A3 (en) Image processing apparatus, image processing method, and computer program
EP2731054A3 (en) Method and device for recognizing document image, and photographing method using the same
EP2587477A3 (en) Document reading-out support apparatus and method
EP2407898A3 (en) Information processing apparatus, processing method of the same, and non-transitory computer-readable storage
EP2919115A3 (en) Task migration method and apparatus
EP1901233A3 (en) Techniques for image segment accumulation in document rendering
EP2772870A3 (en) Image forming apparatus and image forming method
EP1981272A3 (en) Image capturing apparatus, image processing apparatus and control methods thereof

Legal Events

Date Code Title Description
PUAI Public reference made under Article 153(3) EPC to a published international application that has entered the European phase

Free format text: ORIGINAL CODE: 0009012

17P Request for examination filed

Effective date: 20130620

AK Designated contracting states

Kind code of ref document: A2

Designated state(s): AL AT BE BG CH CY CZ DE DK EE ES FI FR GB GR HR HU IE IS IT LI LT LU LV MC MK MT NL NO PL PT RO RS SE SI SK SM TR

AX Request for extension of the European patent

Extension state: BA ME

PUAL Search report despatched

Free format text: ORIGINAL CODE: 0009013

AK Designated contracting states

Kind code of ref document: A3

Designated state(s): AL AT BE BG CH CY CZ DE DK EE ES FI FR GB GR HR HU IE IS IT LI LT LU LV MC MK MT NL NO PL PT RO RS SE SI SK SM TR

AX Request for extension of the European patent

Extension state: BA ME

RIC1 Information provided on ipc code assigned before grant

Ipc: G06K 9/32 20060101ALI20160920BHEP

Ipc: G06K 9/00 20060101ALI20160920BHEP

Ipc: G06F 17/30 20060101AFI20160920BHEP

STAA Information on the status of an EP patent application or granted EP patent

Free format text: STATUS: THE APPLICATION IS DEEMED TO BE WITHDRAWN

18D Application deemed to be withdrawn

Effective date: 20170427