RU2480822C2 - Разрешение кореференции в чувствительной к неоднозначности системе обработки естественного языка - Google Patents

Разрешение кореференции в чувствительной к неоднозначности системе обработки естественного языка Download PDF

Info

Publication number
RU2480822C2
RU2480822C2 RU2010107148/08A RU2010107148A RU2480822C2 RU 2480822 C2 RU2480822 C2 RU 2480822C2 RU 2010107148/08 A RU2010107148/08 A RU 2010107148/08A RU 2010107148 A RU2010107148 A RU 2010107148A RU 2480822 C2 RU2480822 C2 RU 2480822C2
Authority
RU
Russia
Prior art keywords
text
coreference
fact
computer
natural language
Prior art date
Application number
RU2010107148/08A
Other languages
English (en)
Russian (ru)
Other versions
RU2010107148A (ru
Inventor
ДЕН БЕРГ Мартин ВАН
Ричард КРАУЧ
Франко САЛВЕТТИ
Джованни Лоренцо ТИОНЕ
Дэвид АН
Original Assignee
Майкрософт Корпорейшн
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
Filing date
Publication date
Application filed by Майкрософт Корпорейшн filed Critical Майкрософт Корпорейшн
Priority claimed from US12/200,962 external-priority patent/US8712758B2/en
Publication of RU2010107148A publication Critical patent/RU2010107148A/ru
Application granted granted Critical
Publication of RU2480822C2 publication Critical patent/RU2480822C2/ru

Links

Images

Classifications

    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06FELECTRIC DIGITAL DATA PROCESSING
    • G06F16/00Information retrieval; Database structures therefor; File system structures therefor
    • G06F16/30Information retrieval; Database structures therefor; File system structures therefor of unstructured textual data
    • G06F16/33Querying
    • G06F16/3331Query processing
    • G06F16/334Query execution
    • G06F16/3344Query execution using natural language analysis
    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06FELECTRIC DIGITAL DATA PROCESSING
    • G06F40/00Handling natural language data
    • G06F40/20Natural language analysis
    • G06F40/205Parsing
    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06FELECTRIC DIGITAL DATA PROCESSING
    • G06F40/00Handling natural language data
    • G06F40/30Semantic analysis

Landscapes

  • Engineering & Computer Science (AREA)
  • Theoretical Computer Science (AREA)
  • Physics & Mathematics (AREA)
  • Computational Linguistics (AREA)
  • General Engineering & Computer Science (AREA)
  • General Physics & Mathematics (AREA)
  • Artificial Intelligence (AREA)
  • Audiology, Speech & Language Pathology (AREA)
  • General Health & Medical Sciences (AREA)
  • Health & Medical Sciences (AREA)
  • Data Mining & Analysis (AREA)
  • Databases & Information Systems (AREA)
  • Machine Translation (AREA)
  • Information Retrieval, Db Structures And Fs Structures Therefor (AREA)
  • Document Processing Apparatus (AREA)
RU2010107148/08A 2007-08-31 2008-08-29 Разрешение кореференции в чувствительной к неоднозначности системе обработки естественного языка RU2480822C2 (ru)

Applications Claiming Priority (7)

Application Number Priority Date Filing Date Title
US96948307P 2007-08-31 2007-08-31
US96942607P 2007-08-31 2007-08-31
US60/969,426 2007-08-31
US60/969,483 2007-08-31
US12/200,962 2008-08-29
US12/200,962 US8712758B2 (en) 2007-08-31 2008-08-29 Coreference resolution in an ambiguity-sensitive natural language processing system
PCT/US2008/074935 WO2009029903A2 (en) 2007-08-31 2008-08-29 Coreference resolution in an ambiguity-sensitive natural language processing system

Publications (2)

Publication Number Publication Date
RU2010107148A RU2010107148A (ru) 2011-09-10
RU2480822C2 true RU2480822C2 (ru) 2013-04-27

Family

ID=42041476

Family Applications (1)

Application Number Title Priority Date Filing Date
RU2010107148/08A RU2480822C2 (ru) 2007-08-31 2008-08-29 Разрешение кореференции в чувствительной к неоднозначности системе обработки естественного языка

Country Status (11)

Country Link
EP (1) EP2183684A4 (ja)
JP (2) JP2010538374A (ja)
KR (1) KR101522049B1 (ja)
CN (1) CN101796508B (ja)
AU (1) AU2008292779B2 (ja)
BR (1) BRPI0815826A2 (ja)
CA (1) CA2698054C (ja)
MX (1) MX2010002349A (ja)
RU (1) RU2480822C2 (ja)
WO (1) WO2009029903A2 (ja)
ZA (1) ZA201001259B (ja)

Cited By (3)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
RU2563148C2 (ru) * 2013-07-15 2015-09-20 Общество с ограниченной ответственностью "Аби ИнфоПоиск" Система и метод семантического поиска
RU2643438C2 (ru) * 2013-12-25 2018-02-01 Общество с ограниченной ответственностью "Аби Продакшн" Обнаружение языковой неоднозначности в тексте
RU2674331C2 (ru) * 2014-09-03 2018-12-06 Дзе Дан Энд Брэдстрит Корпорейшн Система и процесс для анализа, квалифицирования и проглатывания источников неструктурированных данных посредством эмпирической атрибуции

Families Citing this family (15)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
JP5699789B2 (ja) * 2011-05-10 2015-04-15 ソニー株式会社 情報処理装置、情報処理方法、プログラム及び情報処理システム
US9286291B2 (en) * 2013-02-15 2016-03-15 International Business Machines Corporation Disambiguation of dependent referring expression in natural language processing
CN104462053B (zh) * 2013-09-22 2018-10-12 江苏金鸽网络科技有限公司 一种文本内的基于语义特征的人称代词指代消解方法
US9606977B2 (en) * 2014-01-22 2017-03-28 Google Inc. Identifying tasks in messages
US9497153B2 (en) * 2014-01-30 2016-11-15 Google Inc. Associating a segment of an electronic message with one or more segment addressees
WO2015175443A1 (en) * 2014-05-12 2015-11-19 Google Inc. Automated reading comprehension
RU2591175C1 (ru) * 2015-03-19 2016-07-10 Общество с ограниченной ответственностью "Аби ИнфоПоиск" Способ и система для глобальной идентификации в коллекции документов
CN106815215B (zh) * 2015-11-30 2019-11-26 华为技术有限公司 生成标注库的方法和装置
CN107515851B (zh) * 2016-06-16 2021-09-10 佳能株式会社 用于共指消解、信息提取以及相似文档检索的装置和方法
JP7135399B2 (ja) * 2018-04-12 2022-09-13 富士通株式会社 特定プログラム、特定方法および情報処理装置
WO2020005986A1 (en) * 2018-06-25 2020-01-02 Diffeo, Inc. Systems and method for investigating relationships among entities
US20200074322A1 (en) * 2018-09-04 2020-03-05 Rovi Guides, Inc. Methods and systems for using machine-learning extracts and semantic graphs to create structured data to drive search, recommendation, and discovery
CN109815482B (zh) * 2018-12-17 2023-05-23 北京百度网讯科技有限公司 一种新闻交互的方法、装置、设备和计算机存储介质
CN112740200B (zh) * 2019-07-25 2024-05-03 百度时代网络技术(北京)有限公司 用于基于共指消解的端到端深度强化学习的系统和方法
US11151321B2 (en) * 2019-12-10 2021-10-19 International Business Machines Corporation Anaphora resolution

Citations (3)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
RU2096824C1 (ru) * 1996-04-29 1997-11-20 Государственный научно-технический центр гиперинформационных технологий Способы автоматизированной обработки информационных материалов для персонализированного использования
US6185592B1 (en) * 1997-11-18 2001-02-06 Apple Computer, Inc. Summarizing text documents by resolving co-referentiality among actors or objects around which a story unfolds
EP1675025A2 (en) * 2004-12-21 2006-06-28 Palo Alto Research Center Incorporated Systems and methods for generating user-interest sensitive abstracts of search results

Family Cites Families (10)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
JPH0268661A (ja) * 1988-09-05 1990-03-08 Agency Of Ind Science & Technol 文脈理解装置
JPH1011462A (ja) * 1996-06-26 1998-01-16 Fuji Xerox Co Ltd 類似関係展開辞書、類似度評価装置、検索装置
JP3504439B2 (ja) * 1996-07-25 2004-03-08 日本電信電話株式会社 映像検索方法
JPH11282844A (ja) * 1998-03-26 1999-10-15 Toshiba Corp 文書作成方法および情報処理装置および記録媒体
CA2419105C (en) * 2002-02-20 2007-01-09 Xerox Corporation Generating with lexical functional grammars
US20050108630A1 (en) * 2003-11-19 2005-05-19 Wasson Mark D. Extraction of facts from text
US20050149499A1 (en) * 2003-12-30 2005-07-07 Google Inc., A Delaware Corporation Systems and methods for improving search quality
JP4439431B2 (ja) * 2005-05-25 2010-03-24 株式会社東芝 コミュニケーション支援装置、コミュニケーション支援方法およびコミュニケーション支援プログラム
JP4654780B2 (ja) * 2005-06-10 2011-03-23 富士ゼロックス株式会社 質問応答システム、およびデータ検索方法、並びにコンピュータ・プログラム
US8060357B2 (en) * 2006-01-27 2011-11-15 Xerox Corporation Linguistic user interface

Patent Citations (3)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
RU2096824C1 (ru) * 1996-04-29 1997-11-20 Государственный научно-технический центр гиперинформационных технологий Способы автоматизированной обработки информационных материалов для персонализированного использования
US6185592B1 (en) * 1997-11-18 2001-02-06 Apple Computer, Inc. Summarizing text documents by resolving co-referentiality among actors or objects around which a story unfolds
EP1675025A2 (en) * 2004-12-21 2006-06-28 Palo Alto Research Center Incorporated Systems and methods for generating user-interest sensitive abstracts of search results

Cited By (4)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
RU2563148C2 (ru) * 2013-07-15 2015-09-20 Общество с ограниченной ответственностью "Аби ИнфоПоиск" Система и метод семантического поиска
RU2643438C2 (ru) * 2013-12-25 2018-02-01 Общество с ограниченной ответственностью "Аби Продакшн" Обнаружение языковой неоднозначности в тексте
RU2674331C2 (ru) * 2014-09-03 2018-12-06 Дзе Дан Энд Брэдстрит Корпорейшн Система и процесс для анализа, квалифицирования и проглатывания источников неструктурированных данных посредством эмпирической атрибуции
US10621182B2 (en) 2014-09-03 2020-04-14 The Dun & Bradstreet Corporation System and process for analyzing, qualifying and ingesting sources of unstructured data via empirical attribution

Also Published As

Publication number Publication date
MX2010002349A (es) 2010-07-30
CA2698054A1 (en) 2009-03-05
KR101522049B1 (ko) 2015-05-20
JP2014238865A (ja) 2014-12-18
WO2009029903A2 (en) 2009-03-05
BRPI0815826A2 (pt) 2015-02-18
AU2008292779B2 (en) 2012-09-06
CN101796508B (zh) 2013-03-06
ZA201001259B (en) 2012-05-30
WO2009029903A3 (en) 2009-05-07
EP2183684A2 (en) 2010-05-12
CN101796508A (zh) 2010-08-04
RU2010107148A (ru) 2011-09-10
EP2183684A4 (en) 2017-10-18
JP2010538374A (ja) 2010-12-09
AU2008292779A1 (en) 2009-03-05
CA2698054C (en) 2015-12-22
KR20100075451A (ko) 2010-07-02

Similar Documents

Publication Publication Date Title
RU2480822C2 (ru) Разрешение кореференции в чувствительной к неоднозначности системе обработки естественного языка
US8712758B2 (en) Coreference resolution in an ambiguity-sensitive natural language processing system
US11080295B2 (en) Collecting, organizing, and searching knowledge about a dataset
US8041697B2 (en) Semi-automatic example-based induction of semantic translation rules to support natural language search
Kowalski Information retrieval systems: theory and implementation
US8463593B2 (en) Natural language hypernym weighting for word sense disambiguation
US9569527B2 (en) Machine translation for query expansion
US20140114942A1 (en) Dynamic Pruning of a Search Index Based on Search Results
WO2010082207A9 (en) Dynamic indexing while authoring
Moncla et al. Automated geoparsing of paris street names in 19th century novels
Armentano et al. NLP-based faceted search: Experience in the development of a science and technology search engine
Agichtein Scaling Information Extraction to Large Document Collections.
Al-Zoghby et al. Semantic relations extraction and ontology learning from Arabic texts—a survey
US8229970B2 (en) Efficient storage and retrieval of posting lists
Garrido et al. GEO-NASS: A semantic tagging experience from geographical data on the media
RU2563148C2 (ru) Система и метод семантического поиска
Fauzi et al. Image understanding and the web: a state-of-the-art review
Song et al. Semantic query graph based SPARQL generation from natural language questions
Hazman et al. An ontology based approach for automatically annotating document segments
Tran et al. A model of vietnamese person named entity question answering system
Xu et al. Building large collections of Chinese and English medical terms from semi-structured and encyclopedia websites
Giannini et al. A Logic-based approach to Named-Entity Disambiguation in the Web of Data
Jena et al. Semantic desktop search application for Hindi-English code-mixed user query with query sequence analysis
Singh et al. Intelligent Bilingual Data Extraction and Rebuilding Using Data Mining for Big Data
Maheshwari et al. Entity Resolution and Location Disambiguation in the Ancient Hindu Temples Domain using Web Data

Legal Events

Date Code Title Description
PC41 Official registration of the transfer of exclusive right

Effective date: 20150526

MM4A The patent is invalid due to non-payment of fees

Effective date: 20170830