DE602007010850D1 - Schnelle Berechnung von Ähnlichkeitsverbindungen zur Bestimmung eines Inhaltsverzeichnisses - Google Patents

Schnelle Berechnung von Ähnlichkeitsverbindungen zur Bestimmung eines Inhaltsverzeichnisses

Info

Publication number
DE602007010850D1
DE602007010850D1 DE602007010850T DE602007010850T DE602007010850D1 DE 602007010850 D1 DE602007010850 D1 DE 602007010850D1 DE 602007010850 T DE602007010850 T DE 602007010850T DE 602007010850 T DE602007010850 T DE 602007010850T DE 602007010850 D1 DE602007010850 D1 DE 602007010850D1
Authority
DE
Germany
Prior art keywords
contents
quick calculation
similarity links
similarity
links
Prior art date
Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
Active
Application number
DE602007010850T
Other languages
English (en)
Inventor
Jean-Luc Meunier
Current Assignee (The listed assignees may be inaccurate. Google has not performed a legal analysis and makes no representation or warranty as to the accuracy of the list.)
Xerox Corp
Original Assignee
Xerox Corp
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
Filing date
Publication date
Application filed by Xerox Corp filed Critical Xerox Corp
Publication of DE602007010850D1 publication Critical patent/DE602007010850D1/de
Active legal-status Critical Current
Anticipated expiration legal-status Critical

Links

Classifications

    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06FELECTRIC DIGITAL DATA PROCESSING
    • G06F40/00Handling natural language data
    • G06F40/20Natural language analysis
    • G06F40/258Heading extraction; Automatic titling; Numbering

Landscapes

  • Engineering & Computer Science (AREA)
  • Theoretical Computer Science (AREA)
  • Audiology, Speech & Language Pathology (AREA)
  • Health & Medical Sciences (AREA)
  • Computational Linguistics (AREA)
  • General Health & Medical Sciences (AREA)
  • Physics & Mathematics (AREA)
  • General Engineering & Computer Science (AREA)
  • General Physics & Mathematics (AREA)
  • Artificial Intelligence (AREA)
  • Information Retrieval, Db Structures And Fs Structures Therefor (AREA)
  • Document Processing Apparatus (AREA)
  • Machine Translation (AREA)
DE602007010850T 2006-02-23 2007-02-21 Schnelle Berechnung von Ähnlichkeitsverbindungen zur Bestimmung eines Inhaltsverzeichnisses Active DE602007010850D1 (de)

Applications Claiming Priority (1)

Application Number Priority Date Filing Date Title
US11/360,951 US7890859B2 (en) 2006-02-23 2006-02-23 Rapid similarity links computation for table of contents determination

Publications (1)

Publication Number Publication Date
DE602007010850D1 true DE602007010850D1 (de) 2011-01-13

Family

ID=38230118

Family Applications (1)

Application Number Title Priority Date Filing Date
DE602007010850T Active DE602007010850D1 (de) 2006-02-23 2007-02-21 Schnelle Berechnung von Ähnlichkeitsverbindungen zur Bestimmung eines Inhaltsverzeichnisses

Country Status (4)

Country Link
US (1) US7890859B2 (de)
EP (1) EP1826683B1 (de)
JP (1) JP5037965B2 (de)
DE (1) DE602007010850D1 (de)

Families Citing this family (18)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
US9317494B2 (en) * 2007-04-03 2016-04-19 Sap Se Graphical hierarchy conversion
US8504553B2 (en) * 2007-04-19 2013-08-06 Barnesandnoble.Com Llc Unstructured and semistructured document processing and searching
US8065599B1 (en) * 2007-06-29 2011-11-22 Emc Corporation Electronic submission preparation
US7991709B2 (en) * 2008-01-28 2011-08-02 Xerox Corporation Method and apparatus for structuring documents utilizing recognition of an ordered sequence of identifiers
JP5412903B2 (ja) * 2009-03-17 2014-02-12 コニカミノルタ株式会社 文書画像処理装置、文書画像処理方法および文書画像処理プログラム
US8719702B2 (en) * 2010-03-09 2014-05-06 Xerox Corporation Document organizing based on page numbers
US8340425B2 (en) 2010-08-10 2012-12-25 Xerox Corporation Optical character recognition with two-pass zoning
JP5536687B2 (ja) * 2011-01-31 2014-07-02 インターナショナル・ビジネス・マシーンズ・コーポレーション 目次と見出しの対応付け方法、対応付け装置、及び対応付けプログラム
US20130174030A1 (en) * 2012-01-04 2013-07-04 Freedom Solutions Group, LLC, d/b/a Microsystems Method and apparatus for analyzing abbreviations in a document
US8830487B2 (en) 2012-07-09 2014-09-09 Xerox Corporation System and method for separating image and text in a document
US8812870B2 (en) 2012-10-10 2014-08-19 Xerox Corporation Confidentiality preserving document analysis system and method
US20140258851A1 (en) * 2013-03-11 2014-09-11 Microsoft Corporation Table of Contents Detection in a Fixed Format Document
CN103729422A (zh) * 2013-12-23 2014-04-16 武汉传神信息技术有限公司 一种信息碎片关联输出的方法及系统
CN103744883A (zh) * 2013-12-23 2014-04-23 武汉传神信息技术有限公司 一种快速选取信息碎片的方法及系统
CN103744884A (zh) * 2013-12-23 2014-04-23 武汉传神信息技术有限公司 一种整理信息碎片的方法及系统
US9454696B2 (en) * 2014-04-17 2016-09-27 Xerox Corporation Dynamically generating table of contents for printable or scanned content
US20150310128A1 (en) * 2014-04-28 2015-10-29 Elwha Llc Methods, systems, and devices for machines and machine states that manage relation data for modification of documents based on various corpora and/or modification data
US10635743B2 (en) * 2018-03-12 2020-04-28 Microsoft Technology Licensing, Llc Automatic extraction of document page numbers from PDF

Family Cites Families (13)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
US5434962A (en) 1990-09-07 1995-07-18 Fuji Xerox Co., Ltd. Method and system for automatically generating logical structures of electronic documents
US5359729A (en) * 1991-05-31 1994-10-25 Timeline, Inc. Method for searching for a given point in regions defined by attribute ranges, then sorted by lower and upper range values and dimension
US5491628A (en) 1993-12-10 1996-02-13 Xerox Corporation Method and apparatus for document transformation based on attribute grammars and attribute couplings
US6298357B1 (en) 1997-06-03 2001-10-02 Adobe Systems Incorporated Structure extraction on electronic documents
IE980959A1 (en) 1998-03-31 1999-10-20 Datapage Ireland Ltd Document Production
US6487566B1 (en) 1998-10-05 2002-11-26 International Business Machines Corporation Transforming documents using pattern matching and a replacement language
JP2000330979A (ja) * 1999-05-18 2000-11-30 Ntt Data Corp 検索対象となる電子文書の解析方法及び電子文書登録システム
US20040003028A1 (en) * 2002-05-08 2004-01-01 David Emmett Automatic display of web content to smaller display devices: improved summarization and navigation
US20020143818A1 (en) 2001-03-30 2002-10-03 Roberts Elizabeth A. System for generating a structured document
JP2003150586A (ja) 2001-11-12 2003-05-23 Ntt Docomo Inc 文書変換システム、文書変換方法及び文書変換プログラムを記録したコンピュータ読み取り可能な記録媒体
US7137062B2 (en) * 2001-12-28 2006-11-14 International Business Machines Corporation System and method for hierarchical segmentation with latent semantic indexing in scale space
US6907431B2 (en) 2002-05-03 2005-06-14 Hewlett-Packard Development Company, L.P. Method for determining a logical structure of a document
US20040024780A1 (en) 2002-08-01 2004-02-05 Koninklijke Philips Electronics N.V. Method, system and program product for generating a content-based table of contents

Also Published As

Publication number Publication date
EP1826683B1 (de) 2010-12-01
JP5037965B2 (ja) 2012-10-03
EP1826683A2 (de) 2007-08-29
JP2007226797A (ja) 2007-09-06
EP1826683A3 (de) 2008-07-02
US7890859B2 (en) 2011-02-15
US20070198912A1 (en) 2007-08-23

Similar Documents

Publication Publication Date Title
DE602007010850D1 (de) Schnelle Berechnung von Ähnlichkeitsverbindungen zur Bestimmung eines Inhaltsverzeichnisses
BRPI0906031A2 (pt) "aperfeiçoamentos relacionados à manipulação e processamento de elevados números de instruções de processamento em tempo real"
FR2962131B1 (fr) Procede de fonctionnalisation de corps gras d'origine naturelle
BRPI0917093A2 (pt) anotação dos itens de conteúdo de mídia
ATE534294T1 (de) Verwendung von dithiin-tetracarboximiden zum bekämpfen phytopathogener pilze
BR112013007361A2 (pt) papelão ou cartolina
DE602006002287D1 (de) Lernen zur Spracherkennung
BRPI1006392A2 (pt) comutador e método para fazer um computador
DE602006012716D1 (de) Verfahren zur induktion von resistenz gegen schadpilze
BRPI0913705A2 (pt) processo para formarum corpo compósito e compósito
BRPI0917191A2 (pt) aperfeiçoamentos em ou relacionados a composições
BRPI1010803A2 (pt) "resposta imune aumentada na espécie aviária"
DE102009035615B8 (de) Entfernung von Ausbuchtungseffekten bei einer Nanomusterung
DE502007001254D1 (de) Bestimmung von Korrespondenz-Objekt-Paaren zur medizinischen Navigation
IT1399570B1 (it) "impianto di legatura per libri a copertina rigida e metodo dilegatura per tali libri"
BRPI0920872A2 (pt) Mutações relacionadas com heterose induzida.
DE112008000113A5 (de) Anordnung zur Unterdrückung von Eigen-Resonanzen in einer hydraulischen Strecke
DK2716299T3 (da) Peptid bfp 4 til fremme af knogledannelse eller vaskulogenese og anvendelse dertil
BRPI1011480A2 (pt) aparelho compreendendo uma arvore e uma luva de equilibrio
DE112006003845A5 (de) Vorrichtung zum Abtrennen von Raumbereichen eines Raums
DE602006014154D1 (de) Festschnallen/verriegeln eines schenkels zur gewährleistung von eingriff
BRPI0813147A2 (pt) Oleos ricos em fitoquímicos e métodos relacionados a mesmos
DE112009002509A5 (de) Verfahren zur Wegleitung eines Benutzers in einem Gebäude
ES1062345Y (es) Maquina para reponer y quitar plasticos en invernaderos
DE502007005041D1 (de) Vorrichtung zum stapelförmigen ablegen von blättern