WO2000049517A2 - Multi-document summarization system and method - Google Patents
Multi-document summarization system and method
- Publication number
- WO2000049517A2 (PCT/US2000/004118)
- Authority
- WO
- WIPO (PCT)
- Prior art keywords
- phrases
- nodes
- phrase
- temporal
- documents
Classifications
-
- G—PHYSICS
- G06—COMPUTING; CALCULATING OR COUNTING
- G06F—ELECTRIC DIGITAL DATA PROCESSING
- G06F16/00—Information retrieval; Database structures therefor; File system structures therefor
- G06F16/30—Information retrieval; Database structures therefor; File system structures therefor of unstructured textual data
- G06F16/34—Browsing; Visualisation therefor
- G06F16/345—Summarisation for human users
-
- G—PHYSICS
- G06—COMPUTING; CALCULATING OR COUNTING
- G06F—ELECTRIC DIGITAL DATA PROCESSING
- G06F40/00—Handling natural language data
- G06F40/10—Text processing
- G06F40/194—Calculation of difference between files
-
- G—PHYSICS
- G06—COMPUTING; CALCULATING OR COUNTING
- G06F—ELECTRIC DIGITAL DATA PROCESSING
- G06F40/00—Handling natural language data
- G06F40/30—Semantic analysis
- G06F40/35—Discourse or dialogue representation
Definitions
- A method for generating a summary of related documents in a collection includes extracting, from the documents, phrases that have common focus elements. Phrase intersection analysis is performed on the extracted phrases to generate a phrase intersection table. Temporal processing can be performed on the phrases in the phrase intersection table to remove ambiguous temporal references and to sort the phrases into a temporal sequence. Sentence generation is then performed using the phrases in the phrase intersection table to generate the multi-document summary.
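The four stages named in the abstract can be pictured with a small Python sketch. This is only an illustration under stated assumptions, not the method claimed in the patent: the `Phrase` record, the exact-match grouping rule, the `min_docs` threshold, and the sentence template are all hypothetical, and each phrase is assumed to arrive with its focus element already identified and its date already resolved to an absolute value.

```python
from collections import defaultdict
from dataclasses import dataclass
from datetime import date


@dataclass(frozen=True)
class Phrase:
    doc_id: str   # document the phrase was extracted from
    focus: str    # hypothetical "focus element", e.g. the main subject
    text: str     # the extracted phrase
    when: date    # assumed to be already resolved to an absolute date


def build_intersection_table(phrases, min_docs=2):
    """Group phrases by (focus, text); keep groups seen in >= min_docs documents.

    Exact string matching is a simplification chosen for this sketch.
    """
    table = defaultdict(list)
    for p in phrases:
        table[(p.focus, p.text)].append(p)
    return {
        key: group
        for key, group in table.items()
        if len({p.doc_id for p in group}) >= min_docs
    }


def order_temporally(table):
    """Sort table entries by the earliest date attached to each entry."""
    return sorted(table.items(), key=lambda item: min(p.when for p in item[1]))


def generate_summary(ordered):
    """Emit one templated sentence per entry (a stand-in for sentence generation)."""
    return [
        f"On {min(p.when for p in group).isoformat()}, {focus} {text}."
        for (focus, text), group in ordered
    ]


def summarize(phrases):
    return generate_summary(order_temporally(build_intersection_table(phrases)))


# Made-up input: three documents reporting on the same (invented) events.
example = [
    Phrase("d1", "the committee", "approved the budget", date(1999, 2, 19)),
    Phrase("d2", "the committee", "approved the budget", date(1999, 2, 19)),
    Phrase("d1", "the committee", "opened hearings", date(1999, 2, 10)),
    Phrase("d3", "the committee", "opened hearings", date(1999, 2, 10)),
    Phrase("d2", "the committee", "adjourned early", date(1999, 2, 20)),
]
for sentence in summarize(example):
    print(sentence)
```

In this sketch, a phrase reported by only one document ("adjourned early") is dropped from the table, and the remaining entries are emitted in date order. A real system would need a looser notion of phrase overlap than exact string equality, and would have to resolve relative expressions such as "yesterday" before sorting.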
Landscapes
- Engineering & Computer Science (AREA)
- Theoretical Computer Science (AREA)
- Physics & Mathematics (AREA)
- General Engineering & Computer Science (AREA)
- General Physics & Mathematics (AREA)
- Audiology, Speech & Language Pathology (AREA)
- Computational Linguistics (AREA)
- General Health & Medical Sciences (AREA)
- Health & Medical Sciences (AREA)
- Artificial Intelligence (AREA)
- Data Mining & Analysis (AREA)
- Databases & Information Systems (AREA)
- Information Retrieval, Db Structures And Fs Structures Therefor (AREA)
- Machine Translation (AREA)
- Document Processing Apparatus (AREA)
Priority Applications (7)
Application Number | Priority Date | Filing Date | Title |
---|---|---|---|
EP00919318A EP1190343A4 (en) | 1999-02-19 | 2000-02-18 | Multi-document summarization system and method |
IL14495100A IL144951A0 (en) | 1999-02-19 | 2000-02-18 | Multi-document summarization system and method |
AU40026/00A AU775978B2 (en) | 1999-02-19 | 2000-02-18 | Multi-document summarization system and method |
CA2363017A CA2363017C (en) | 1999-02-19 | 2000-02-18 | Multi-document summarization system and method |
US09/913,745 US7366711B1 (en) | 1999-02-19 | 2000-02-18 | Multi-document summarization system and method |
IL144951A IL144951A (en) | 1999-02-19 | 2001-08-16 | Multi-document summarization system and method |
HK02106992.3A HK1045391A1 (en) | 1999-02-19 | 2002-09-25 | Multi-document summarization system and method |
Applications Claiming Priority (2)
Application Number | Priority Date | Filing Date | Title |
---|---|---|---|
US12065999P | 1999-02-19 | 1999-02-19 | |
US60/120,659 | 1999-02-19 |
Publications (2)
Publication Number | Publication Date |
---|---|
WO2000049517A2 (en) | 2000-08-24 |
WO2000049517A3 (en) | 2000-11-30 |
Family
ID=22391735
Family Applications (1)
Application Number | Title | Priority Date | Filing Date |
---|---|---|---|
PCT/US2000/004118 WO2000049517A2 (en) | 1999-02-19 | 2000-02-18 | Multi-document summarization system and method |
Country Status (6)
Country | Link |
---|---|
EP (1) | EP1190343A4 (en) |
AU (1) | AU775978B2 (en) |
CA (1) | CA2363017C (en) |
HK (1) | HK1045391A1 (en) |
IL (2) | IL144951A0 (en) |
WO (1) | WO2000049517A2 (en) |
Citations (10)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
US4965763A (en) * | 1987-03-03 | 1990-10-23 | International Business Machines Corporation | Computer method for automatic extraction of commonly specified information from business correspondence |
US5077668A (en) * | 1988-09-30 | 1991-12-31 | Kabushiki Kaisha Toshiba | Method and apparatus for producing an abstract of a document |
US5297027A (en) * | 1990-05-11 | 1994-03-22 | Hitachi, Ltd. | Method of and apparatus for promoting the understanding of a text by using an abstract of that text |
US5384703A (en) * | 1993-07-02 | 1995-01-24 | Xerox Corporation | Method and apparatus for summarizing documents according to theme |
US5638543A (en) * | 1993-06-03 | 1997-06-10 | Xerox Corporation | Method and apparatus for automatic document summarization |
US5689716A (en) * | 1995-04-14 | 1997-11-18 | Xerox Corporation | Automatic method of generating thematic summaries |
US5778397A (en) * | 1995-06-28 | 1998-07-07 | Xerox Corporation | Automatic method of generating feature probabilities for automatic extracting summarization |
US5838323A (en) * | 1995-09-29 | 1998-11-17 | Apple Computer, Inc. | Document summary computer system user interface |
US5848191A (en) * | 1995-12-14 | 1998-12-08 | Xerox Corporation | Automatic method of generating thematic summaries from a document image without performing character recognition |
US5924108A (en) * | 1996-03-29 | 1999-07-13 | Microsoft Corporation | Document summarizer for word processors |
2000
- 2000-02-18: CA, application CA2363017A, publication CA2363017C, not active (Expired - Fee Related)
- 2000-02-18: IL, application IL14495100A, publication IL144951A0, active (IP Right Grant)
- 2000-02-18: WO, application PCT/US2000/004118, publication WO2000049517A2, active (Application Filing)
- 2000-02-18: AU, application AU40026/00A, publication AU775978B2, not active (Ceased)
- 2000-02-18: EP, application EP00919318A, publication EP1190343A4, not active (Ceased)
2001
- 2001-08-16: IL, application IL144951A, publication IL144951A, not active (IP Right Cessation)
2002
- 2002-09-25: HK, application HK02106992.3A, publication HK1045391A1, status unknown
Non-Patent Citations (1)
Title |
---|
See also references of EP1190343A2 * |
Cited By (10)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
WO2002035376A2 (en) * | 2000-10-27 | 2002-05-02 | Science Applications International Corporation | Ontology-based parser for natural language processing |
WO2002035376A3 (en) * | 2000-10-27 | 2003-08-28 | Science Applic Int Corp | Ontology-based parser for natural language processing |
US7027974B1 (en) | 2000-10-27 | 2006-04-11 | Science Applications International Corporation | Ontology-based parser for natural language processing |
US7496561B2 (en) | 2001-01-18 | 2009-02-24 | Science Applications International Corporation | Method and system of ranking and clustering for document indexing and retrieval |
WO2008155225A1 (en) * | 2007-06-20 | 2008-12-24 | Amadeus S.A.S. | System and method for integrating and displaying travel advices gathered from a plurality of reliable sources |
CN101765857A (en) * | 2007-06-20 | 2010-06-30 | 阿玛得斯两合公司 | System and method for integrating and displaying travel advices gathered from a plurality of reliable sources |
JP2010530580A (en) * | 2007-06-20 | 2010-09-09 | アマデウス エス.エイ.エス | System and method for integrated display of travel advice collected from multiple trusted sources |
US7818117B2 (en) | 2007-06-20 | 2010-10-19 | Amadeus S.A.S. | System and method for integrating and displaying travel advices gathered from a plurality of reliable sources |
CN101765857B (en) * | 2007-06-20 | 2013-06-19 | 阿玛得斯两合公司 | System and method for integrating and displaying travel advices gathered from a plurality of reliable sources |
US11374888B2 (en) | 2015-09-25 | 2022-06-28 | Microsoft Technology Licensing, Llc | User-defined notification templates |
Also Published As
Publication number | Publication date |
---|---|
EP1190343A4 (en) | 2006-08-09 |
HK1045391A1 (en) | 2002-11-22 |
IL144951A0 (en) | 2002-06-30 |
AU775978B2 (en) | 2004-08-19 |
CA2363017C (en) | 2011-04-19 |
WO2000049517A3 (en) | 2000-11-30 |
CA2363017A1 (en) | 2000-08-24 |
EP1190343A2 (en) | 2002-03-27 |
IL144951A (en) | 2006-08-01 |
AU4002600A (en) | 2000-09-04 |
Similar Documents
Publication | Publication Date | Title |
---|---|---|
US7366711B1 (en) | Multi-document summarization system and method | |
US7412385B2 (en) | System for identifying paraphrases using machine translation | |
US8260817B2 (en) | Semantic matching using predicate-argument structure | |
Harabagiu et al. | Topic themes for multi-document summarization | |
US6658377B1 (en) | Method and system for text analysis based on the tagging, processing, and/or reformatting of the input text | |
US20020078090A1 (en) | Ontological concept-based, user-centric text summarization | |
EP0886226A1 (en) | Linguistic search system | |
US20020046018A1 (en) | Discourse parsing and summarization | |
Jungermann | Information extraction with rapidminer | |
Smadja | From n-grams to collocations: An evaluation of Xtract | |
Moschitti et al. | Open Domain Information Extraction via Automatic Semantic Labeling. | |
AU775978B2 (en) | Multi-document summarization system and method | |
Alias et al. | A Malay text corpus analysis for sentence compression using pattern-growth method | |
MalarSelvi et al. | Analysis of Different Approaches for Automatic Text Summarization | |
Rahat et al. | A recursive algorithm for open information extraction from Persian texts | |
Zeni et al. | Annotating legal documents with GaiusT 2.0 | |
Al-sarrayrih et al. | Clustering arabic documents using frequent itemset-based hierarchical clustering with an N-grams | |
Muthusamy | Processing the Textual Information Using Open Natural Language Processing | |
Piotrowski | NLP-supported full-text retrieval | |
Osoba | Information Extraction for Road Accident Data | |
Chebanyuk | Multilingual Question-Driven Approach and Software System to Obtaining Information from Texts | |
Benafia et al. | From Linguistic to Conceptual: A Framework Based on a Pipeline for Building Ontologies from Texts. | |
Mashkovskyi | Developing of the related data search lsa-based algorithm and its programmed realization | |
Ou et al. | Multi‐document summarization of news articles using an event‐based framework | |
Harikumar et al. | An augmented semantic search tool for multilingual news analytics |
Legal Events
Code | Title | Description |
---|---|---|
AK | Designated states | Kind code of ref document: A2; Designated state(s): AE AL AM AT AU AZ BA BB BG BR BY CA CH CN CR CU CZ DE DK DM EE ES FI GB GD GE GH GM HR HU ID IL IN IS JP KE KG KP KR KZ LC LK LR LS LT LU LV MA MD MG MK MN MW MX NO NZ PL PT RO RU SD SE SG SI SK SL TJ TM TR TT TZ UA UG US UZ VN YU ZA ZW |
AL | Designated countries for regional patents | Kind code of ref document: A2; Designated state(s): GH GM KE LS MW SD SL SZ TZ UG ZW AM AZ BY KG KZ MD RU TJ TM AT BE CH CY DE DK ES FI FR GB GR IE IT LU MC NL PT SE BF BJ CF CG CI CM GA GN GW ML MR NE SN TD TG |
121 | EP: the EPO has been informed by WIPO that EP was designated in this application | |
AK | Designated states | Kind code of ref document: A3; Designated state(s): AE AL AM AT AU AZ BA BB BG BR BY CA CH CN CR CU CZ DE DK DM EE ES FI GB GD GE GH GM HR HU ID IL IN IS JP KE KG KP KR KZ LC LK LR LS LT LU LV MA MD MG MK MN MW MX NO NZ PL PT RO RU SD SE SG SI SK SL TJ TM TR TT TZ UA UG US UZ VN YU ZA ZW |
AL | Designated countries for regional patents | Kind code of ref document: A3; Designated state(s): GH GM KE LS MW SD SL SZ TZ UG ZW AM AZ BY KG KZ MD RU TJ TM AT BE CH CY DE DK ES FI FR GB GR IE IT LU MC NL PT SE BF BJ CF CG CI CM GA GN GW ML MR NE SN TD TG |
DFPE | Request for preliminary examination filed prior to expiration of 19th month from priority date (PCT application filed before 20040101) | |
WWE | WIPO information: entry into national phase | Ref document number: 144951; Country of ref document: IL |
ENP | Entry into the national phase | Ref document number: 2363017; Country of ref document: CA; Kind code of ref document: A |
WWE | WIPO information: entry into national phase | Ref document number: IN/PCT/2001/00737/DE; Country of ref document: IN |
WWE | WIPO information: entry into national phase | Ref document number: 2000919318; Country of ref document: EP |
WWE | WIPO information: entry into national phase | Ref document number: 09913745; Country of ref document: US |
REG | Reference to national code | Ref country code: DE; Ref legal event code: 8642 |
WWP | WIPO information: published in national office | Ref document number: 2000919318; Country of ref document: EP |