CN101233484B - Definition extraction - Google Patents

Definition extraction

Info

Publication number
CN101233484B
Authority
CN
China
Prior art keywords
phrase
text unit
definition
prompting
text
Prior art date
Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
Expired - Fee Related
Application number
CN200680027965.3A
Other languages
English (en)
Chinese (zh)
Other versions
CN101233484A (zh)
Inventor
K. R. Powell
K. W. Humphreys
S. Azzam
Current Assignee (The listed assignees may be inaccurate. Google has not performed a legal analysis and makes no representation or warranty as to the accuracy of the list.)
Microsoft Technology Licensing LLC
Original Assignee
Microsoft Corp
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
Filing date
Publication date
Application filed by Microsoft Corp
Publication of CN101233484A
Application granted
Publication of CN101233484B
Expired - Fee Related
Anticipated expiration

Classifications

    • G PHYSICS
    • G06 COMPUTING OR CALCULATING; COUNTING
    • G06F ELECTRIC DIGITAL DATA PROCESSING
    • G06F16/00 Information retrieval; Database structures therefor; File system structures therefor
    • G06F16/30 Information retrieval; Database structures therefor; File system structures therefor of unstructured textual data
    • G06F16/36 Creation of semantic tools, e.g. ontology or thesauri
    • G06F16/374 Thesaurus
    • G PHYSICS
    • G06 COMPUTING OR CALCULATING; COUNTING
    • G06F ELECTRIC DIGITAL DATA PROCESSING
    • G06F40/00 Handling natural language data
    • G06F40/20 Natural language analysis
    • G06F40/237 Lexical tools
    • G06F40/242 Dictionaries
    • G PHYSICS
    • G06 COMPUTING OR CALCULATING; COUNTING
    • G06F ELECTRIC DIGITAL DATA PROCESSING
    • G06F40/00 Handling natural language data
    • G06F40/20 Natural language analysis
    • G06F40/279 Recognition of textual entities
    • G06F40/289 Phrasal analysis, e.g. finite state techniques or chunking
    • G06F40/295 Named entity recognition

Landscapes

  • Engineering & Computer Science (AREA)
  • Theoretical Computer Science (AREA)
  • General Physics & Mathematics (AREA)
  • Computational Linguistics (AREA)
  • Physics & Mathematics (AREA)
  • General Engineering & Computer Science (AREA)
  • Artificial Intelligence (AREA)
  • Health & Medical Sciences (AREA)
  • Audiology, Speech & Language Pathology (AREA)
  • General Health & Medical Sciences (AREA)
  • Databases & Information Systems (AREA)
  • Data Mining & Analysis (AREA)
  • Machine Translation (AREA)
  • Document Processing Apparatus (AREA)
CN200680027965.3A 2005-08-01 2006-08-01 Definition extraction Expired - Fee Related CN101233484B (zh)

Applications Claiming Priority (3)

Application Number Priority Date Filing Date Title
US11/194,873 US7376551B2 (en) 2005-08-01 2005-08-01 Definition extraction
US11/194,873 2005-08-01
PCT/US2006/030094 WO2007016628A2 (en) 2005-08-01 2006-08-01 Definition extraction

Publications (2)

Publication Number Publication Date
CN101233484A (zh) 2008-07-30
CN101233484B (zh) 2014-06-11

Family

ID=37695583

Family Applications (1)

Application Number Title Priority Date Filing Date
CN200680027965.3A Expired - Fee Related CN101233484B (zh) 2005-08-01 2006-08-01 Definition extraction

Country Status (6)

Country Link
US (1) US7376551B2 (en)
EP (1) EP1913464A4 (en)
JP (1) JP5113750B2 (en)
KR (1) KR101279707B1 (en)
CN (1) CN101233484B (en)
WO (1) WO2007016628A2 (en)

Families Citing this family (23)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
US7668791B2 (en) * 2006-07-31 2010-02-23 Microsoft Corporation Distinguishing facts from opinions using a multi-stage approach
US20100076965A1 (en) * 2006-11-20 2010-03-25 Access Co., Ltd. Information display device, information display program and information display system
US20080147579A1 (en) * 2006-12-14 2008-06-19 Microsoft Corporation Discriminative training using boosted lasso
TW200843642A (en) * 2007-03-08 2008-11-16 Du Pont Liquid sulfonylurea herbicide formulations
US20100228538A1 (en) * 2009-03-03 2010-09-09 Yamada John A Computational linguistic systems and methods
US8433559B2 (en) * 2009-03-24 2013-04-30 Microsoft Corporation Text analysis using phrase definitions and containers
US8321848B2 (en) * 2009-04-16 2012-11-27 The Mathworks, Inc. Method and system for syntax error repair in programming languages
KR101072100B1 (ko) * 2009-10-23 2011-10-10 포항공과대학교 산학협력단 Document processing apparatus and method for extracting expressions and descriptions
US8788260B2 (en) * 2010-05-11 2014-07-22 Microsoft Corporation Generating snippets based on content features
CA2747669C (en) * 2010-07-28 2016-03-08 Wairever Inc. Method and system for validation of claims against policy with contextualized semantic interoperability
CN102541955B (zh) * 2010-12-30 2015-03-11 中国移动通信集团公司 Method, device and system for saving contact information
US8589791B2 (en) 2011-06-28 2013-11-19 Microsoft Corporation Automatically generating a glossary of terms for a given document or group of documents
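CN104572628B (zh) * 2015-02-05 2017-08-08 《中国学术期刊(光盘版)》电子杂志社有限公司 System and method for automatic extraction of academic definitions based on syntactic features
CN107402913B (zh) * 2016-05-20 2020-10-09 腾讯科技(深圳)有限公司 Method and device for determining antecedents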
US10740365B2 (en) * 2017-06-14 2020-08-11 International Business Machines Corporation Gap identification in corpora
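CN107423363B (zh) * 2017-06-22 2021-02-19 百度在线网络技术(北京)有限公司 Artificial-intelligence-based script generation method, apparatus, device and storage medium
CN111742322A (zh) 2017-12-29 2020-10-02 Robert Bosch GmbH System and method for domain- and language-independent definition extraction using deep neural networks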
US10642939B2 (en) 2018-02-24 2020-05-05 Twenty Lane Media, LLC Systems and methods for generating jokes
US10878817B2 (en) 2018-02-24 2020-12-29 Twenty Lane Media, LLC Systems and methods for generating comedy
US11080485B2 (en) 2018-02-24 2021-08-03 Twenty Lane Media, LLC Systems and methods for generating and recognizing jokes
US12135938B2 (en) * 2021-05-11 2024-11-05 Corascloud, Inc. Extended open information extraction by identifying nested relationships
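CN116127971B (zh) * 2022-11-21 2025-07-22 北京智谱华章科技股份有限公司 Method and device for extracting named entities from English tweets based on subjective and objective word lists
CN119484138B (zh) * 2024-11-26 2025-11-28 中国农业银行股份有限公司 Ranger-based system and method for data lake operation and maintenance permission control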

Family Cites Families (4)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
US5715468A (en) * 1994-09-30 1998-02-03 Budzinski; Robert Lucius Memory system for storing and retrieving experience and knowledge with natural language
US5841895A (en) * 1996-10-25 1998-11-24 Pricewaterhousecoopers, Llp Method for learning local syntactic relationships for use in example-based information-extraction-pattern learning
JP2000259657A (ja) * 1999-03-10 2000-09-22 Fujitsu Ltd Apparatus for searching and collecting term definitions
WO2003027894A1 (en) * 2001-09-26 2003-04-03 The Trustees Of Columbia University In The City Of New York System and method of generating dictionary entries

Also Published As

Publication number Publication date
JP5113750B2 (ja) 2013-01-09
US20070027863A1 (en) 2007-02-01
CN101233484A (zh) 2008-07-30
KR20080033325A (ko) 2008-04-16
EP1913464A2 (en) 2008-04-23
EP1913464A4 (en) 2013-06-26
US7376551B2 (en) 2008-05-20
WO2007016628A3 (en) 2007-12-13
WO2007016628A2 (en) 2007-02-08
JP2009503739A (ja) 2009-01-29
KR101279707B1 (ko) 2013-06-27

Similar Documents

Publication Publication Date Title
CN101233484B (zh) Definition extraction
Poibeau et al. Proper name extraction from non-journalistic texts
US8374844B2 (en) Hybrid system for named entity resolution
Lita et al. Truecasing
RU2571373C2 (ru) Method for sentiment analysis of text data
US8447588B2 (en) Region-matching transducers for natural language processing
US7191115B2 (en) Statistical method and apparatus for learning translation relationships among words
US8266169B2 (en) Complex queries for corpus indexing and search
JP4714400B2 (ja) Scalable machine translation system
US20100332217A1 (en) Method for text improvement via linguistic abstractions
US8510097B2 (en) Region-matching transducers for text-characterization
US20150120788A1 (en) Classification of hashtags in micro-blogs
EP2915068A2 (en) Natural language processing system and method
CN101346717A (zh) Method and apparatus for language processing
Bloom Sentiment analysis based on appraisal theory and functional local grammars
Clews et al. Rudimentary lexicon based method for sarcasm detection
Nwesri Effective retrieval techniques for Arabic text
Brun et al. Intertwining deep syntactic processing and named entity detection
Hirpassa Information extraction system for Amharic text
Shimohata Acquiring paraphrases from corpora and its application to machine translation
DeRose Stochastic methods for resolution of grammatical category ambiguity in inflected and uninflected languages
Mesfar Towards a cascade of morpho-syntactic tools for arabic natural language processing
JPH0773200A (ja) Keyword extraction method
Ljubešic et al. Generating a morphological lexicon of organization entity names
Pandey et al. A Technique for Anaphora Resolution of Text

Legal Events

Date Code Title Description
C06 Publication
PB01 Publication
C10 Entry into substantive examination
SE01 Entry into force of request for substantive examination
C14 Grant of patent or utility model
GR01 Patent grant
ASS Succession or assignment of patent right

Owner name: MICROSOFT TECHNOLOGY LICENSING LLC

Free format text: FORMER OWNER: MICROSOFT CORP.

Effective date: 20150428

C41 Transfer of patent application or patent right or utility model
TR01 Transfer of patent right

Effective date of registration: 20150428

Address after: Washington State

Patentee after: Microsoft Technology Licensing LLC

Address before: Washington State

Patentee before: Microsoft Corp.

CF01 Termination of patent right due to non-payment of annual fee

Granted publication date: 20140611

Termination date: 20200801
