JP5113750B2 - 定義の抽出 - Google Patents

定義の抽出 Download PDF

Info

Publication number
JP5113750B2
JP5113750B2 JP2008525156A JP2008525156A JP5113750B2 JP 5113750 B2 JP5113750 B2 JP 5113750B2 JP 2008525156 A JP2008525156 A JP 2008525156A JP 2008525156 A JP2008525156 A JP 2008525156A JP 5113750 B2 JP5113750 B2 JP 5113750B2
Authority
JP
Japan
Prior art keywords
phrase
text
definition
cue
scoring
Prior art date
Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
Expired - Fee Related
Application number
JP2008525156A
Other languages
English (en)
Japanese (ja)
Other versions
JP2009503739A5 (enExample
JP2009503739A (ja
Inventor
アール.パウエル ケビン
ダブリュ.ハンフリーズ ケビン
アッザーム サリハ
Current Assignee (The listed assignees may be inaccurate. Google has not performed a legal analysis and makes no representation or warranty as to the accuracy of the list.)
Microsoft Corp
Original Assignee
Microsoft Corp
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
Filing date
Publication date
Application filed by Microsoft Corp filed Critical Microsoft Corp
Publication of JP2009503739A publication Critical patent/JP2009503739A/ja
Publication of JP2009503739A5 publication Critical patent/JP2009503739A5/ja
Application granted granted Critical
Publication of JP5113750B2 publication Critical patent/JP5113750B2/ja
Expired - Fee Related legal-status Critical Current
Anticipated expiration legal-status Critical

Links

Images

Classifications

    • GPHYSICS
    • G06COMPUTING OR CALCULATING; COUNTING
    • G06FELECTRIC DIGITAL DATA PROCESSING
    • G06F40/00Handling natural language data
    • G06F40/20Natural language analysis
    • G06F40/237Lexical tools
    • G06F40/242Dictionaries
    • GPHYSICS
    • G06COMPUTING OR CALCULATING; COUNTING
    • G06FELECTRIC DIGITAL DATA PROCESSING
    • G06F16/00Information retrieval; Database structures therefor; File system structures therefor
    • G06F16/30Information retrieval; Database structures therefor; File system structures therefor of unstructured textual data
    • G06F16/36Creation of semantic tools, e.g. ontology or thesauri
    • G06F16/374Thesaurus
    • GPHYSICS
    • G06COMPUTING OR CALCULATING; COUNTING
    • G06FELECTRIC DIGITAL DATA PROCESSING
    • G06F40/00Handling natural language data
    • G06F40/20Natural language analysis
    • G06F40/279Recognition of textual entities
    • G06F40/289Phrasal analysis, e.g. finite state techniques or chunking
    • G06F40/295Named entity recognition

Landscapes

  • Engineering & Computer Science (AREA)
  • Theoretical Computer Science (AREA)
  • General Physics & Mathematics (AREA)
  • Computational Linguistics (AREA)
  • Physics & Mathematics (AREA)
  • General Engineering & Computer Science (AREA)
  • Artificial Intelligence (AREA)
  • Health & Medical Sciences (AREA)
  • Audiology, Speech & Language Pathology (AREA)
  • General Health & Medical Sciences (AREA)
  • Databases & Information Systems (AREA)
  • Data Mining & Analysis (AREA)
  • Machine Translation (AREA)
  • Document Processing Apparatus (AREA)
JP2008525156A 2005-08-01 2006-08-01 定義の抽出 Expired - Fee Related JP5113750B2 (ja)

Applications Claiming Priority (3)

Application Number Priority Date Filing Date Title
US11/194,873 US7376551B2 (en) 2005-08-01 2005-08-01 Definition extraction
US11/194,873 2005-08-01
PCT/US2006/030094 WO2007016628A2 (en) 2005-08-01 2006-08-01 Definition extraction

Publications (3)

Publication Number Publication Date
JP2009503739A JP2009503739A (ja) 2009-01-29
JP2009503739A5 JP2009503739A5 (enExample) 2009-09-24
JP5113750B2 true JP5113750B2 (ja) 2013-01-09

Family

ID=37695583

Family Applications (1)

Application Number Title Priority Date Filing Date
JP2008525156A Expired - Fee Related JP5113750B2 (ja) 2005-08-01 2006-08-01 定義の抽出

Country Status (6)

Country Link
US (1) US7376551B2 (enExample)
EP (1) EP1913464A4 (enExample)
JP (1) JP5113750B2 (enExample)
KR (1) KR101279707B1 (enExample)
CN (1) CN101233484B (enExample)
WO (1) WO2007016628A2 (enExample)

Families Citing this family (23)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
US7668791B2 (en) * 2006-07-31 2010-02-23 Microsoft Corporation Distinguishing facts from opinions using a multi-stage approach
US20100076965A1 (en) * 2006-11-20 2010-03-25 Access Co., Ltd. Information display device, information display program and information display system
US20080147579A1 (en) * 2006-12-14 2008-06-19 Microsoft Corporation Discriminative training using boosted lasso
TW200843642A (en) * 2007-03-08 2008-11-16 Du Pont Liquid sulfonylurea herbicide formulations
US20100228538A1 (en) * 2009-03-03 2010-09-09 Yamada John A Computational linguistic systems and methods
US8433559B2 (en) * 2009-03-24 2013-04-30 Microsoft Corporation Text analysis using phrase definitions and containers
US8321848B2 (en) * 2009-04-16 2012-11-27 The Mathworks, Inc. Method and system for syntax error repair in programming languages
KR101072100B1 (ko) * 2009-10-23 2011-10-10 포항공과대학교 산학협력단 표현 및 설명 추출을 위한 문서 처리 장치 및 방법
US8788260B2 (en) * 2010-05-11 2014-07-22 Microsoft Corporation Generating snippets based on content features
CA2747669C (en) * 2010-07-28 2016-03-08 Wairever Inc. Method and system for validation of claims against policy with contextualized semantic interoperability
CN102541955B (zh) * 2010-12-30 2015-03-11 中国移动通信集团公司 一种联系人信息保存的方法、设备及系统
US8589791B2 (en) 2011-06-28 2013-11-19 Microsoft Corporation Automatically generating a glossary of terms for a given document or group of documents
CN104572628B (zh) * 2015-02-05 2017-08-08 《中国学术期刊(光盘版)》电子杂志社有限公司 一种基于句法特征的学术定义自动抽取系统及方法
CN107402913B (zh) * 2016-05-20 2020-10-09 腾讯科技(深圳)有限公司 先行词的确定方法和装置
US10740365B2 (en) * 2017-06-14 2020-08-11 International Business Machines Corporation Gap identification in corpora
CN107423363B (zh) * 2017-06-22 2021-02-19 百度在线网络技术(北京)有限公司 基于人工智能的话术生成方法、装置、设备及存储介质
CN111742322A (zh) 2017-12-29 2020-10-02 罗伯特·博世有限公司 用于使用深度神经网络来进行独立于领域和语言的定义提取的系统和方法
US10642939B2 (en) 2018-02-24 2020-05-05 Twenty Lane Media, LLC Systems and methods for generating jokes
US10878817B2 (en) 2018-02-24 2020-12-29 Twenty Lane Media, LLC Systems and methods for generating comedy
US11080485B2 (en) 2018-02-24 2021-08-03 Twenty Lane Media, LLC Systems and methods for generating and recognizing jokes
US12135938B2 (en) * 2021-05-11 2024-11-05 Corascloud, Inc. Extended open information extraction by identifying nested relationships
CN116127971B (zh) * 2022-11-21 2025-07-22 北京智谱华章科技股份有限公司 一种基于主客观词表的英语推文命名实体提取方法及设备
CN119484138B (zh) * 2024-11-26 2025-11-28 中国农业银行股份有限公司 基于Ranger的数据湖运维权限控制系统及方法

Family Cites Families (4)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
US5715468A (en) * 1994-09-30 1998-02-03 Budzinski; Robert Lucius Memory system for storing and retrieving experience and knowledge with natural language
US5841895A (en) * 1996-10-25 1998-11-24 Pricewaterhousecoopers, Llp Method for learning local syntactic relationships for use in example-based information-extraction-pattern learning
JP2000259657A (ja) * 1999-03-10 2000-09-22 Fujitsu Ltd 用語定義の検索/収集装置
WO2003027894A1 (en) * 2001-09-26 2003-04-03 The Trustees Of Columbia University In The City Of New York System and method of generating dictionary entries

Also Published As

Publication number Publication date
US20070027863A1 (en) 2007-02-01
CN101233484A (zh) 2008-07-30
KR20080033325A (ko) 2008-04-16
EP1913464A2 (en) 2008-04-23
EP1913464A4 (en) 2013-06-26
US7376551B2 (en) 2008-05-20
WO2007016628A3 (en) 2007-12-13
CN101233484B (zh) 2014-06-11
WO2007016628A2 (en) 2007-02-08
JP2009503739A (ja) 2009-01-29
KR101279707B1 (ko) 2013-06-27

Similar Documents

Publication Publication Date Title
JP5113750B2 (ja) 定義の抽出
JP4714400B2 (ja) スケーラブル機械翻訳システム
US8447588B2 (en) Region-matching transducers for natural language processing
JP5538820B2 (ja) 2カ国語コーパスからの変換マッピングの自動抽出プログラム
US8266169B2 (en) Complex queries for corpus indexing and search
Lita et al. Truecasing
Wacholder et al. Disambiguation of proper names in text
US7783476B2 (en) Word extraction method and system for use in word-breaking using statistical information
US8510097B2 (en) Region-matching transducers for text-characterization
US7092871B2 (en) Tokenizer for a natural language processing system
JP5167546B2 (ja) 文単位検索方法、文単位検索装置、コンピュータプログラム、記録媒体及び文書記憶装置
US9239826B2 (en) Method and system for generating new entries in natural language dictionary
US20060095250A1 (en) Parser for natural language processing
JP2008539476A (ja) スペル提示の生成方法およびシステム
EP0839357A1 (en) Method and apparatus for automated search and retrieval processing
WO2005059771A1 (ja) 対訳判断装置、方法及びプログラム
US8204736B2 (en) Access to multilingual textual resources
US7328404B2 (en) Method for predicting the readings of japanese ideographs
EP1503295A1 (en) Text generation method and text generation device
US20100094615A1 (en) Document translation apparatus and method
US20050086214A1 (en) Computer system and method for multilingual associative searching
Appelt et al. Named entity extraction from speech: Approach and results using the TextPro system
Nguyen et al. Named entity disambiguation: A hybrid statistical and rule-based incremental approach
Alfonseca et al. German decompounding in a difficult corpus
US20050102278A1 (en) Expanded search keywords

Legal Events

Date Code Title Description
A521 Request for written amendment filed

Free format text: JAPANESE INTERMEDIATE CODE: A523

Effective date: 20090803

A621 Written request for application examination

Free format text: JAPANESE INTERMEDIATE CODE: A621

Effective date: 20090803

A977 Report on retrieval

Free format text: JAPANESE INTERMEDIATE CODE: A971007

Effective date: 20120524

A131 Notification of reasons for refusal

Free format text: JAPANESE INTERMEDIATE CODE: A131

Effective date: 20120601

A521 Request for written amendment filed

Free format text: JAPANESE INTERMEDIATE CODE: A523

Effective date: 20120903

TRDD Decision of grant or rejection written
A01 Written decision to grant a patent or to grant a registration (utility model)

Free format text: JAPANESE INTERMEDIATE CODE: A01

Effective date: 20120928

A01 Written decision to grant a patent or to grant a registration (utility model)

Free format text: JAPANESE INTERMEDIATE CODE: A01

A61 First payment of annual fees (during grant procedure)

Free format text: JAPANESE INTERMEDIATE CODE: A61

Effective date: 20121012

FPAY Renewal fee payment (event date is renewal date of database)

Free format text: PAYMENT UNTIL: 20151019

Year of fee payment: 3

R150 Certificate of patent or registration of utility model

Ref document number: 5113750

Country of ref document: JP

Free format text: JAPANESE INTERMEDIATE CODE: R150

Free format text: JAPANESE INTERMEDIATE CODE: R150

S111 Request for change of ownership or part of ownership

Free format text: JAPANESE INTERMEDIATE CODE: R313113

R350 Written notification of registration of transfer

Free format text: JAPANESE INTERMEDIATE CODE: R350

R250 Receipt of annual fees

Free format text: JAPANESE INTERMEDIATE CODE: R250

R250 Receipt of annual fees

Free format text: JAPANESE INTERMEDIATE CODE: R250

R250 Receipt of annual fees

Free format text: JAPANESE INTERMEDIATE CODE: R250

R250 Receipt of annual fees

Free format text: JAPANESE INTERMEDIATE CODE: R250

R250 Receipt of annual fees

Free format text: JAPANESE INTERMEDIATE CODE: R250

LAPS Cancellation because of no payment of annual fees