WO2009029903A3 - Coreference resolution in an ambiguity-sensitive natural language processing system - Google Patents

Coreference resolution in an ambiguity-sensitive natural language processing system Download PDF

Info

Publication number
WO2009029903A3
WO2009029903A3 PCT/US2008/074935 US2008074935W WO2009029903A3 WO 2009029903 A3 WO2009029903 A3 WO 2009029903A3 US 2008074935 W US2008074935 W US 2008074935W WO 2009029903 A3 WO2009029903 A3 WO 2009029903A3
Authority
WO
WIPO (PCT)
Prior art keywords
ambiguity
processing system
natural language
language processing
resolution
Prior art date
Application number
PCT/US2008/074935
Other languages
French (fr)
Other versions
WO2009029903A2 (en
Inventor
Den Berg Martin Van
Richard Crouch
Franco Salvetti
Giovanni Lorenzo Thione
David Ahn
Original Assignee
Powerset Inc
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
Filing date
Publication date
Priority to AU2008292779A priority Critical patent/AU2008292779B2/en
Application filed by Powerset Inc filed Critical Powerset Inc
Priority to CN200880105563XA priority patent/CN101796508B/en
Priority to EP08828084.7A priority patent/EP2183684A4/en
Priority to JP2010523185A priority patent/JP2010538374A/en
Priority to MX2010002349A priority patent/MX2010002349A/en
Priority to CA2698054A priority patent/CA2698054C/en
Priority to RU2010107148/08A priority patent/RU2480822C2/en
Priority claimed from US12/200,962 external-priority patent/US8712758B2/en
Priority to KR1020107006475A priority patent/KR101522049B1/en
Priority to BRPI0815826-6A2A priority patent/BRPI0815826A2/en
Publication of WO2009029903A2 publication Critical patent/WO2009029903A2/en
Publication of WO2009029903A3 publication Critical patent/WO2009029903A3/en
Priority to ZA2010/01259A priority patent/ZA201001259B/en

Links

Classifications

    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06FELECTRIC DIGITAL DATA PROCESSING
    • G06F16/00Information retrieval; Database structures therefor; File system structures therefor
    • G06F16/30Information retrieval; Database structures therefor; File system structures therefor of unstructured textual data
    • G06F16/33Querying
    • G06F16/3331Query processing
    • G06F16/334Query execution
    • G06F16/3344Query execution using natural language analysis
    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06FELECTRIC DIGITAL DATA PROCESSING
    • G06F40/00Handling natural language data
    • G06F40/20Natural language analysis
    • G06F40/205Parsing
    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06FELECTRIC DIGITAL DATA PROCESSING
    • G06F40/00Handling natural language data
    • G06F40/30Semantic analysis

Abstract

Technologies are described herein for coreference resolution in an ambiguity-sensitive natural language processing system. Techniques for integrating reference resolution functionality into a natural language processing system can processes documents to be indexed within an information search and retrieval system. Ambiguity awareness features, as well as ambiguity resolution functionality, can operate in coordination with coreference resolution. Annotation of coreference entities, as well as ambiguous interpretations, can be supported by in-line markup within text content or by external entity maps. Information expressed within documents can be formally organized in terms of facts, or relationships between entities in the text. Expansion can support applying multiple aliases, or ambiguities, to an entity being indexed so that all of the possibly references or interpretations for that entity are captured into the index. Alternative stored descriptions can support retrieval of a fact by either the original description or a coreferential description.
PCT/US2008/074935 2007-08-31 2008-08-29 Coreference resolution in an ambiguity-sensitive natural language processing system WO2009029903A2 (en)

Priority Applications (10)

Application Number Priority Date Filing Date Title
CA2698054A CA2698054C (en) 2007-08-31 2008-08-29 Coreference resolution in an ambiguity-sensitive natural language processing system
CN200880105563XA CN101796508B (en) 2007-08-31 2008-08-29 Coreference resolution in an ambiguity-sensitive natural language processing system
EP08828084.7A EP2183684A4 (en) 2007-08-31 2008-08-29 Coreference resolution in an ambiguity-sensitive natural language processing system
JP2010523185A JP2010538374A (en) 2007-08-31 2008-08-29 Resolving the same instructions in an ambiguous natural language processing system
MX2010002349A MX2010002349A (en) 2007-08-31 2008-08-29 Coreference resolution in an ambiguity-sensitive natural language processing system.
AU2008292779A AU2008292779B2 (en) 2007-08-31 2008-08-29 Coreference resolution in an ambiguity-sensitive natural language processing system
RU2010107148/08A RU2480822C2 (en) 2007-08-31 2008-08-29 Coreference resolution in ambiguity-sensitive natural language processing system
BRPI0815826-6A2A BRPI0815826A2 (en) 2007-08-31 2008-08-29 CO-REFERENCE RESOLUTION IN AN AMBIGUITY-SENSING NATURAL LANGUAGE PROCESSING SYSTEM
KR1020107006475A KR101522049B1 (en) 2007-08-31 2008-08-29 Coreference resolution in an ambiguity-sensitive natural language processing system
ZA2010/01259A ZA201001259B (en) 2007-08-31 2010-02-22 Coreference resolution in an ambiguity-sensitive natural language processing system

Applications Claiming Priority (6)

Application Number Priority Date Filing Date Title
US96948307P 2007-08-31 2007-08-31
US96942607P 2007-08-31 2007-08-31
US60/969,483 2007-08-31
US60/969,426 2007-08-31
US12/200,962 2008-08-29
US12/200,962 US8712758B2 (en) 2007-08-31 2008-08-29 Coreference resolution in an ambiguity-sensitive natural language processing system

Publications (2)

Publication Number Publication Date
WO2009029903A2 WO2009029903A2 (en) 2009-03-05
WO2009029903A3 true WO2009029903A3 (en) 2009-05-07

Family

ID=42041476

Family Applications (1)

Application Number Title Priority Date Filing Date
PCT/US2008/074935 WO2009029903A2 (en) 2007-08-31 2008-08-29 Coreference resolution in an ambiguity-sensitive natural language processing system

Country Status (11)

Country Link
EP (1) EP2183684A4 (en)
JP (2) JP2010538374A (en)
KR (1) KR101522049B1 (en)
CN (1) CN101796508B (en)
AU (1) AU2008292779B2 (en)
BR (1) BRPI0815826A2 (en)
CA (1) CA2698054C (en)
MX (1) MX2010002349A (en)
RU (1) RU2480822C2 (en)
WO (1) WO2009029903A2 (en)
ZA (1) ZA201001259B (en)

Families Citing this family (18)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
RU2563148C2 (en) * 2013-07-15 2015-09-20 Общество с ограниченной ответственностью "Аби ИнфоПоиск" System and method for semantic search
RU2643438C2 (en) * 2013-12-25 2018-02-01 Общество с ограниченной ответственностью "Аби Продакшн" Detection of linguistic ambiguity in a text
JP5699789B2 (en) * 2011-05-10 2015-04-15 ソニー株式会社 Information processing apparatus, information processing method, program, and information processing system
US9286291B2 (en) * 2013-02-15 2016-03-15 International Business Machines Corporation Disambiguation of dependent referring expression in natural language processing
CN104462053B (en) * 2013-09-22 2018-10-12 江苏金鸽网络科技有限公司 A kind of personal pronoun reference resolution method based on semantic feature in text
US9606977B2 (en) * 2014-01-22 2017-03-28 Google Inc. Identifying tasks in messages
US9497153B2 (en) * 2014-01-30 2016-11-15 Google Inc. Associating a segment of an electronic message with one or more segment addressees
US9678945B2 (en) * 2014-05-12 2017-06-13 Google Inc. Automated reading comprehension
RU2674331C2 (en) * 2014-09-03 2018-12-06 Дзе Дан Энд Брэдстрит Корпорейшн System and process for analysis, qualification and acquisition of sources of unstructured data by means of empirical attribution
RU2591175C1 (en) * 2015-03-19 2016-07-10 Общество с ограниченной ответственностью "Аби ИнфоПоиск" Method and system for global identification in collection of documents
CN106815215B (en) * 2015-11-30 2019-11-26 华为技术有限公司 The method and apparatus for generating annotation repository
CN107515851B (en) * 2016-06-16 2021-09-10 佳能株式会社 Apparatus and method for coreference resolution, information extraction and similar document retrieval
JP7135399B2 (en) * 2018-04-12 2022-09-13 富士通株式会社 Specific program, specific method and information processing device
WO2020005986A1 (en) * 2018-06-25 2020-01-02 Diffeo, Inc. Systems and method for investigating relationships among entities
US20200074322A1 (en) * 2018-09-04 2020-03-05 Rovi Guides, Inc. Methods and systems for using machine-learning extracts and semantic graphs to create structured data to drive search, recommendation, and discovery
CN109815482B (en) * 2018-12-17 2023-05-23 北京百度网讯科技有限公司 News interaction method, device, equipment and computer storage medium
WO2021012263A1 (en) * 2019-07-25 2021-01-28 Baidu.Com Times Technology (Beijing) Co., Ltd. Systems and methods for end-to-end deep reinforcement learning based coreference resolution
US11151321B2 (en) * 2019-12-10 2021-10-19 International Business Machines Corporation Anaphora resolution

Citations (1)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
US6185592B1 (en) * 1997-11-18 2001-02-06 Apple Computer, Inc. Summarizing text documents by resolving co-referentiality among actors or objects around which a story unfolds

Family Cites Families (12)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
JPH0268661A (en) * 1988-09-05 1990-03-08 Agency Of Ind Science & Technol Context comprehending device
DE897158T1 (en) * 1996-04-29 1999-07-22 G Nauchnotekhnichesky Ts Giper METHOD FOR THE AUTOMATIC PROCESSING OF INFORMATION ABOUT USER DATA
JPH1011462A (en) * 1996-06-26 1998-01-16 Fuji Xerox Co Ltd Similar relation development dictionary, similarity evaluating device, and retrieval device
JP3504439B2 (en) * 1996-07-25 2004-03-08 日本電信電話株式会社 Video search method
JPH11282844A (en) * 1998-03-26 1999-10-15 Toshiba Corp Preparing method of document, information processor and recording medium
CA2419105C (en) * 2002-02-20 2007-01-09 Xerox Corporation Generating with lexical functional grammars
US20050108630A1 (en) * 2003-11-19 2005-05-19 Wasson Mark D. Extraction of facts from text
US20050149499A1 (en) * 2003-12-30 2005-07-07 Google Inc., A Delaware Corporation Systems and methods for improving search quality
US7401077B2 (en) * 2004-12-21 2008-07-15 Palo Alto Research Center Incorporated Systems and methods for using and constructing user-interest sensitive indicators of search results
JP4439431B2 (en) * 2005-05-25 2010-03-24 株式会社東芝 Communication support device, communication support method, and communication support program
JP4654780B2 (en) * 2005-06-10 2011-03-23 富士ゼロックス株式会社 Question answering system, data retrieval method, and computer program
US8060357B2 (en) * 2006-01-27 2011-11-15 Xerox Corporation Linguistic user interface

Patent Citations (1)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
US6185592B1 (en) * 1997-11-18 2001-02-06 Apple Computer, Inc. Summarizing text documents by resolving co-referentiality among actors or objects around which a story unfolds

Non-Patent Citations (3)

* Cited by examiner, † Cited by third party
Title
"Proceedings of the Twen-tieth International Joint Conference on Artificial Intelligence (IJCAI-07), JANUARY 2007", article VINCENT NG: "Shallow semantics for coreference resolution", pages: 1689 - 1694, XP008130205 *
BONTCHEVA K. ET AL.: "The Proceedings of TALN 2002 Workshop, Nancy, France, 24-27 JUNE 2002", article "Shallow Methods for Named Entity coreference Resolution", XP055003530 *
See also references of EP2183684A4 *

Also Published As

Publication number Publication date
CA2698054C (en) 2015-12-22
EP2183684A4 (en) 2017-10-18
CA2698054A1 (en) 2009-03-05
MX2010002349A (en) 2010-07-30
JP2014238865A (en) 2014-12-18
ZA201001259B (en) 2012-05-30
AU2008292779B2 (en) 2012-09-06
RU2480822C2 (en) 2013-04-27
RU2010107148A (en) 2011-09-10
WO2009029903A2 (en) 2009-03-05
CN101796508A (en) 2010-08-04
BRPI0815826A2 (en) 2015-02-18
JP2010538374A (en) 2010-12-09
KR101522049B1 (en) 2015-05-20
AU2008292779A1 (en) 2009-03-05
EP2183684A2 (en) 2010-05-12
CN101796508B (en) 2013-03-06
KR20100075451A (en) 2010-07-02

Similar Documents

Publication Publication Date Title
WO2009029903A3 (en) Coreference resolution in an ambiguity-sensitive natural language processing system
Staal Agni: The Vedic ritual of the fire altar
WO2012118764A3 (en) Systems, methods and media for translating informational content
WO2007143666A3 (en) Element query method and system
MX2008000176A (en) Processing collocation mistakes in documents.
AU2016219688A1 (en) Matching techniques for cross-platform monitoring and information
WO2008070362A3 (en) System and method for converting a natural language query into a logical query
WO2008002578A3 (en) Methods and apparatus for improving data warehouse performance
WO2008100938A3 (en) A method and system for integrating a social network and data repository to enable map creation
WO2008079850A3 (en) Annotation framework for video
WO2009014993A3 (en) Composite nested streams
WO2011088080A3 (en) Crowdsourced multi-media data relationships
WO2007041607A3 (en) System and method for supplementing a radio playlist with local content
WO2012012080A3 (en) Extracting facts from social network messages
WO2009057047A3 (en) Fast and editing-friendly sample association method for multimedia file formats
WO2009055689A3 (en) Text enhancement mechanism
WO2009005989A3 (en) Server directory schema comparator
WO2014130136A3 (en) Method and system for global federation of wide area motion imagery collection web services
WO2006107347A3 (en) System and method for grouping a collection of documents using document series
WO2012076376A3 (en) Generating semantic structured documents from text documents
Salehi et al. Perceptions on Qualitative Characteristics in Financial Reporting: Iranian Evidence
Surve et al. Simultaneous determination of pseudoephedrine hydrochloride, Ambroxol hydrochloride, guaiphenesin and chlorpheniramine maleate in multicomponent pharmaceutical preparations (syrup) by RP-HPLC
Feng et al. Retraction Note to: The model for improving big data sub-image retrieval performance using scalable vocabulary tree based on predictive clustering
Fuller TECHNOLOGY TRANSFER: RECENT ADVANCES
Grass Building bridges: between the Orthodox and Evangelical traditions

Legal Events

Date Code Title Description
WWE Wipo information: entry into national phase

Ref document number: 200880105563.X

Country of ref document: CN

121 Ep: the epo has been informed by wipo that ep was designated in this application

Ref document number: 08828084

Country of ref document: EP

Kind code of ref document: A2

WWE Wipo information: entry into national phase

Ref document number: 204107

Country of ref document: IL

Ref document number: 2008828084

Country of ref document: EP

ENP Entry into the national phase

Ref document number: 2010523185

Country of ref document: JP

Kind code of ref document: A

WWE Wipo information: entry into national phase

Ref document number: 2698054

Country of ref document: CA

Ref document number: 2010107148

Country of ref document: RU

Ref document number: 2008292779

Country of ref document: AU

Ref document number: MX/A/2010/002349

Country of ref document: MX

NENP Non-entry into the national phase

Ref country code: DE

WWE Wipo information: entry into national phase

Ref document number: 1395/CHENP/2010

Country of ref document: IN

ENP Entry into the national phase

Ref document number: 20107006475

Country of ref document: KR

Kind code of ref document: A

ENP Entry into the national phase

Ref document number: 2008292779

Country of ref document: AU

Date of ref document: 20080829

Kind code of ref document: A

WWE Wipo information: entry into national phase

Ref document number: PI 2010000820

Country of ref document: MY

ENP Entry into the national phase

Ref document number: PI0815826

Country of ref document: BR

Kind code of ref document: A2

Effective date: 20100226