CN101796508B - 歧义敏感自然语言处理系统中的共指消解 - Google Patents

歧义敏感自然语言处理系统中的共指消解 Download PDF

Info

Publication number
CN101796508B
CN101796508B CN200880105563XA CN200880105563A CN101796508B CN 101796508 B CN101796508 B CN 101796508B CN 200880105563X A CN200880105563X A CN 200880105563XA CN 200880105563 A CN200880105563 A CN 200880105563A CN 101796508 B CN101796508 B CN 101796508B
Authority
CN
China
Prior art keywords
ambiguity
text
fact
information
identification
Prior art date
Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
Active
Application number
CN200880105563XA
Other languages
English (en)
Chinese (zh)
Other versions
CN101796508A (zh
Inventor
M·范登伯格
R·克鲁奇
F·萨尔维蒂
G·L·蒂奥内
D·安
Current Assignee (The listed assignees may be inaccurate. Google has not performed a legal analysis and makes no representation or warranty as to the accuracy of the list.)
Ivalley Holding Co Ltd
Original Assignee
Microsoft Corp
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
Filing date
Publication date
Application filed by Microsoft Corp filed Critical Microsoft Corp
Priority claimed from US12/200,962 external-priority patent/US8712758B2/en
Publication of CN101796508A publication Critical patent/CN101796508A/zh
Application granted granted Critical
Publication of CN101796508B publication Critical patent/CN101796508B/zh
Active legal-status Critical Current
Anticipated expiration legal-status Critical

Links

Images

Classifications

    • GPHYSICS
    • G06COMPUTING OR CALCULATING; COUNTING
    • G06FELECTRIC DIGITAL DATA PROCESSING
    • G06F40/00Handling natural language data
    • G06F40/20Natural language analysis
    • G06F40/205Parsing
    • GPHYSICS
    • G06COMPUTING OR CALCULATING; COUNTING
    • G06FELECTRIC DIGITAL DATA PROCESSING
    • G06F16/00Information retrieval; Database structures therefor; File system structures therefor
    • G06F16/30Information retrieval; Database structures therefor; File system structures therefor of unstructured textual data
    • G06F16/33Querying
    • G06F16/3331Query processing
    • G06F16/334Query execution
    • G06F16/3344Query execution using natural language analysis
    • GPHYSICS
    • G06COMPUTING OR CALCULATING; COUNTING
    • G06FELECTRIC DIGITAL DATA PROCESSING
    • G06F40/00Handling natural language data
    • G06F40/30Semantic analysis

Landscapes

  • Engineering & Computer Science (AREA)
  • Theoretical Computer Science (AREA)
  • Physics & Mathematics (AREA)
  • Computational Linguistics (AREA)
  • General Engineering & Computer Science (AREA)
  • General Physics & Mathematics (AREA)
  • Artificial Intelligence (AREA)
  • Audiology, Speech & Language Pathology (AREA)
  • General Health & Medical Sciences (AREA)
  • Health & Medical Sciences (AREA)
  • Data Mining & Analysis (AREA)
  • Databases & Information Systems (AREA)
  • Machine Translation (AREA)
  • Information Retrieval, Db Structures And Fs Structures Therefor (AREA)
  • Document Processing Apparatus (AREA)
CN200880105563XA 2007-08-31 2008-08-29 歧义敏感自然语言处理系统中的共指消解 Active CN101796508B (zh)

Applications Claiming Priority (7)

Application Number Priority Date Filing Date Title
US96942607P 2007-08-31 2007-08-31
US96948307P 2007-08-31 2007-08-31
US60/969,426 2007-08-31
US60/969,483 2007-08-31
PCT/US2008/074935 WO2009029903A2 (en) 2007-08-31 2008-08-29 Coreference resolution in an ambiguity-sensitive natural language processing system
US12/200,962 2008-08-29
US12/200,962 US8712758B2 (en) 2007-08-31 2008-08-29 Coreference resolution in an ambiguity-sensitive natural language processing system

Publications (2)

Publication Number Publication Date
CN101796508A CN101796508A (zh) 2010-08-04
CN101796508B true CN101796508B (zh) 2013-03-06

Family

ID=42041476

Family Applications (1)

Application Number Title Priority Date Filing Date
CN200880105563XA Active CN101796508B (zh) 2007-08-31 2008-08-29 歧义敏感自然语言处理系统中的共指消解

Country Status (11)

Country Link
EP (1) EP2183684A4 (https=)
JP (2) JP2010538374A (https=)
KR (1) KR101522049B1 (https=)
CN (1) CN101796508B (https=)
AU (1) AU2008292779B2 (https=)
BR (1) BRPI0815826A2 (https=)
CA (1) CA2698054C (https=)
MX (1) MX2010002349A (https=)
RU (1) RU2480822C2 (https=)
WO (1) WO2009029903A2 (https=)
ZA (1) ZA201001259B (https=)

Families Citing this family (19)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
RU2643438C2 (ru) * 2013-12-25 2018-02-01 Общество с ограниченной ответственностью "Аби Продакшн" Обнаружение языковой неоднозначности в тексте
RU2563148C2 (ru) * 2013-07-15 2015-09-20 Общество с ограниченной ответственностью "Аби ИнфоПоиск" Система и метод семантического поиска
JP5699789B2 (ja) * 2011-05-10 2015-04-15 ソニー株式会社 情報処理装置、情報処理方法、プログラム及び情報処理システム
US9286291B2 (en) * 2013-02-15 2016-03-15 International Business Machines Corporation Disambiguation of dependent referring expression in natural language processing
CN104462053B (zh) * 2013-09-22 2018-10-12 江苏金鸽网络科技有限公司 一种文本内的基于语义特征的人称代词指代消解方法
US9606977B2 (en) * 2014-01-22 2017-03-28 Google Inc. Identifying tasks in messages
US9497153B2 (en) * 2014-01-30 2016-11-15 Google Inc. Associating a segment of an electronic message with one or more segment addressees
EP3143519A1 (en) * 2014-05-12 2017-03-22 Google, Inc. Automated reading comprehension
SG11201701613YA (en) 2014-09-03 2017-03-30 Dun & Bradstreet Corp System and process for analyzing, qualifying and ingesting sources of unstructured data via empirical attribution
RU2591175C1 (ru) * 2015-03-19 2016-07-10 Общество с ограниченной ответственностью "Аби ИнфоПоиск" Способ и система для глобальной идентификации в коллекции документов
CN106815215B (zh) * 2015-11-30 2019-11-26 华为技术有限公司 生成标注库的方法和装置
CN107515851B (zh) * 2016-06-16 2021-09-10 佳能株式会社 用于共指消解、信息提取以及相似文档检索的装置和方法
JP7135399B2 (ja) * 2018-04-12 2022-09-13 富士通株式会社 特定プログラム、特定方法および情報処理装置
CN112585596B (zh) * 2018-06-25 2024-11-12 硕动力公司 用于调查实体之间的关系的系统和方法
US20200074322A1 (en) * 2018-09-04 2020-03-05 Rovi Guides, Inc. Methods and systems for using machine-learning extracts and semantic graphs to create structured data to drive search, recommendation, and discovery
CN109815482B (zh) * 2018-12-17 2023-05-23 北京百度网讯科技有限公司 一种新闻交互的方法、装置、设备和计算机存储介质
CN112740200B (zh) * 2019-07-25 2024-05-03 百度时代网络技术(北京)有限公司 用于基于共指消解的端到端深度强化学习的系统和方法
US11151321B2 (en) * 2019-12-10 2021-10-19 International Business Machines Corporation Anaphora resolution
CN115409045A (zh) * 2022-08-29 2022-11-29 科大讯飞股份有限公司 一种文档翻译方法、装置、设备及存储介质

Citations (2)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
US6185592B1 (en) * 1997-11-18 2001-02-06 Apple Computer, Inc. Summarizing text documents by resolving co-referentiality among actors or objects around which a story unfolds
CN1898670A (zh) * 2003-12-30 2007-01-17 Google公司 提高搜索质量的系统和方法

Family Cites Families (11)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
JPH0268661A (ja) * 1988-09-05 1990-03-08 Agency Of Ind Science & Technol 文脈理解装置
RU2096824C1 (ru) * 1996-04-29 1997-11-20 Государственный научно-технический центр гиперинформационных технологий Способы автоматизированной обработки информационных материалов для персонализированного использования
JPH1011462A (ja) * 1996-06-26 1998-01-16 Fuji Xerox Co Ltd 類似関係展開辞書、類似度評価装置、検索装置
JP3504439B2 (ja) * 1996-07-25 2004-03-08 日本電信電話株式会社 映像検索方法
JPH11282844A (ja) * 1998-03-26 1999-10-15 Toshiba Corp 文書作成方法および情報処理装置および記録媒体
CA2419105C (en) * 2002-02-20 2007-01-09 Xerox Corporation Generating with lexical functional grammars
US20050108630A1 (en) * 2003-11-19 2005-05-19 Wasson Mark D. Extraction of facts from text
US7401077B2 (en) * 2004-12-21 2008-07-15 Palo Alto Research Center Incorporated Systems and methods for using and constructing user-interest sensitive indicators of search results
JP4439431B2 (ja) * 2005-05-25 2010-03-24 株式会社東芝 コミュニケーション支援装置、コミュニケーション支援方法およびコミュニケーション支援プログラム
JP4654780B2 (ja) * 2005-06-10 2011-03-23 富士ゼロックス株式会社 質問応答システム、およびデータ検索方法、並びにコンピュータ・プログラム
US8060357B2 (en) * 2006-01-27 2011-11-15 Xerox Corporation Linguistic user interface

Patent Citations (2)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
US6185592B1 (en) * 1997-11-18 2001-02-06 Apple Computer, Inc. Summarizing text documents by resolving co-referentiality among actors or objects around which a story unfolds
CN1898670A (zh) * 2003-12-30 2007-01-17 Google公司 提高搜索质量的系统和方法

Also Published As

Publication number Publication date
JP2010538374A (ja) 2010-12-09
JP2014238865A (ja) 2014-12-18
EP2183684A2 (en) 2010-05-12
CN101796508A (zh) 2010-08-04
MX2010002349A (es) 2010-07-30
WO2009029903A3 (en) 2009-05-07
CA2698054C (en) 2015-12-22
RU2480822C2 (ru) 2013-04-27
EP2183684A4 (en) 2017-10-18
AU2008292779A1 (en) 2009-03-05
KR20100075451A (ko) 2010-07-02
KR101522049B1 (ko) 2015-05-20
ZA201001259B (en) 2012-05-30
BRPI0815826A2 (pt) 2015-02-18
WO2009029903A2 (en) 2009-03-05
CA2698054A1 (en) 2009-03-05
AU2008292779B2 (en) 2012-09-06
RU2010107148A (ru) 2011-09-10

Similar Documents

Publication Publication Date Title
CN101796508B (zh) 歧义敏感自然语言处理系统中的共指消解
US8712758B2 (en) Coreference resolution in an ambiguity-sensitive natural language processing system
US9760570B2 (en) Finding and disambiguating references to entities on web pages
US8041697B2 (en) Semi-automatic example-based induction of semantic translation rules to support natural language search
US8463593B2 (en) Natural language hypernym weighting for word sense disambiguation
CN102253930B (zh) 一种文本翻译的方法及装置
US20160132572A1 (en) Collecting, organizing, and searching knowledge about a dataset
US8280721B2 (en) Efficiently representing word sense probabilities
CN103136352A (zh) 基于双层语义分析的全文检索系统
Moncla et al. Automated geoparsing of paris street names in 19th century novels
KR101709055B1 (ko) 오픈 웹 질의응답을 위한 질문분석 장치 및 방법
CN101398858A (zh) 一种基于本体学习的Web服务语义提取方法
US8229970B2 (en) Efficient storage and retrieval of posting lists
Garrido et al. GEO-NASS: A semantic tagging experience from geographical data on the media
Yunus et al. Semantic method for query translation.
RU2618375C2 (ru) Расширение возможностей информационного поиска
Giannini et al. A Logic-based approach to Named-Entity Disambiguation in the Web of Data
WO2001024053A2 (en) System and method for automatic context creation for electronic documents
Moscato et al. Mowis: A system for building multimedia ontologies from web information sources
Smits et al. Personal semantic indexation of images using textual annotations
CN120295975A (zh) 一种基于意图识别规则的文件检索方法、装置及相关介质
Millan et al. Unsupervised Web-based Automatic Annotation.
Nagi et al. Creating Facets Hierarchy for Unstructured Arabic Documents.
Peters et al. Within-Language Information Retrieval
Hazman et al. Ontology Learning from Web Organization Documents

Legal Events

Date Code Title Description
C06 Publication
PB01 Publication
C10 Entry into substantive examination
SE01 Entry into force of request for substantive examination
C14 Grant of patent or utility model
GR01 Patent grant
ASS Succession or assignment of patent right

Owner name: MICROSOFT TECHNOLOGY LICENSING LLC

Free format text: FORMER OWNER: MICROSOFT CORP.

Effective date: 20150421

C41 Transfer of patent application or patent right or utility model
TR01 Transfer of patent right

Effective date of registration: 20150421

Address after: Washington State

Patentee after: Micro soft technique license Co., Ltd

Address before: Washington State

Patentee before: Microsoft Corp.

C41 Transfer of patent application or patent right or utility model
TR01 Transfer of patent right

Effective date of registration: 20160722

Address after: Grand Cayman, Georgetown, Cayman Islands

Patentee after: IValley Holding Co., Ltd.

Address before: Washington State

Patentee before: Micro soft technique license Co., Ltd