NO20085387L - Optimalisering av faktainnhenting i en flertrinnstilnaerming - Google Patents

Optimalisering av faktainnhenting i en flertrinnstilnaerming

Info

Publication number
NO20085387L
NO20085387L NO20085387A NO20085387A NO20085387L NO 20085387 L NO20085387 L NO 20085387L NO 20085387 A NO20085387 A NO 20085387A NO 20085387 A NO20085387 A NO 20085387A NO 20085387 L NO20085387 L NO 20085387L
Authority
NO
Norway
Prior art keywords
fact
words
facts
phrases
entire
Prior art date
Application number
NO20085387A
Other languages
English (en)
Inventor
Saliha Azzam
Kevin William Humphreys
Original Assignee
Microsoft Corp
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
Filing date
Publication date
Application filed by Microsoft Corp filed Critical Microsoft Corp
Publication of NO20085387L publication Critical patent/NO20085387L/no

Links

Classifications

    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06FELECTRIC DIGITAL DATA PROCESSING
    • G06F16/00Information retrieval; Database structures therefor; File system structures therefor
    • G06F16/30Information retrieval; Database structures therefor; File system structures therefor of unstructured textual data
    • G06F16/34Browsing; Visualisation therefor
    • G06F16/345Summarisation for human users
    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06FELECTRIC DIGITAL DATA PROCESSING
    • G06F40/00Handling natural language data
    • G06F40/30Semantic analysis

Landscapes

  • Engineering & Computer Science (AREA)
  • Theoretical Computer Science (AREA)
  • Physics & Mathematics (AREA)
  • General Engineering & Computer Science (AREA)
  • General Physics & Mathematics (AREA)
  • Audiology, Speech & Language Pathology (AREA)
  • Computational Linguistics (AREA)
  • General Health & Medical Sciences (AREA)
  • Health & Medical Sciences (AREA)
  • Artificial Intelligence (AREA)
  • Data Mining & Analysis (AREA)
  • Databases & Information Systems (AREA)
  • Information Retrieval, Db Structures And Fs Structures Therefor (AREA)
  • Machine Translation (AREA)
  • Document Processing Apparatus (AREA)

Abstract

Fakta blir innhentet fra elektroniske dokumenter ved å gjenkjenne saklige beskrivelser ved hjelp av en faktaordtabell som sammenliknes med ord i de elektroniske dokumentene. Ordene i disse saklige beskrivelsene kan bli merket med passende ordklasse. Mer detaljert analyse blir så utført på disse saklige beskrivelsene heller enn på hele det elektroniske dokumentet, og spesielt på teksten i området rundt oppdagede faktaord. Analysen kan omfatte det å identifisere språkbestanddelene i hver frase og bestemme deres rolle som enten subjekt eller objekt. Utelukkingsregler kan anvendes for å fjerne de frasene som trolig ikke er del av fakta, der utelukkingsreglene delvis er basert på språkbestanddelene. Poengsettingsregler kan bli anvendt på gjenværende fraser, og for de frasene som har en poengverdi som overstiger en terskel, kan den tilhørende setningsdelen, hele setningen, hele avsnittet eller en annen dokumentdel bli presentert som en representasjon av et faktum eller flere fakta.
NO20085387A 2006-07-31 2008-12-29 Optimalisering av faktainnhenting i en flertrinnstilnaerming NO20085387L (no)

Applications Claiming Priority (2)

Application Number Priority Date Filing Date Title
US11/496,650 US7668791B2 (en) 2006-07-31 2006-07-31 Distinguishing facts from opinions using a multi-stage approach
PCT/US2007/016435 WO2008016491A1 (en) 2006-07-31 2007-07-20 Optimization of fact extraction using a multi-stage approach

Publications (1)

Publication Number Publication Date
NO20085387L true NO20085387L (no) 2009-01-19

Family

ID=38987573

Family Applications (1)

Application Number Title Priority Date Filing Date
NO20085387A NO20085387L (no) 2006-07-31 2008-12-29 Optimalisering av faktainnhenting i en flertrinnstilnaerming

Country Status (10)

Country Link
US (1) US7668791B2 (no)
EP (1) EP2050019A4 (no)
JP (1) JP5202524B2 (no)
AU (1) AU2007281638B2 (no)
BR (1) BRPI0714311A2 (no)
MX (1) MX2009000588A (no)
NO (1) NO20085387L (no)
RU (1) RU2451999C2 (no)
TW (1) TWI431493B (no)
WO (1) WO2008016491A1 (no)

Families Citing this family (54)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
US7269875B1 (en) * 2003-11-19 2007-09-18 David Brian Grimes Cleaning apparatus
US9495358B2 (en) 2006-10-10 2016-11-15 Abbyy Infopoisk Llc Cross-language text clustering
US8671341B1 (en) * 2007-01-05 2014-03-11 Linguastat, Inc. Systems and methods for identifying claims associated with electronic text
US8190628B1 (en) * 2007-11-30 2012-05-29 Google Inc. Phrase generation
TWI544349B (zh) 2008-06-13 2016-08-01 尼爾 揚 可分類與可更新之編譯及封存平台以及其使用
US20110231387A1 (en) * 2010-03-22 2011-09-22 Yahoo! Inc. Engaging content provision
US8719692B2 (en) * 2011-03-11 2014-05-06 Microsoft Corporation Validation, rejection, and modification of automatically generated document annotations
US8812301B2 (en) * 2011-09-26 2014-08-19 Xerox Corporation Linguistically-adapted structural query annotation
CN102929934A (zh) * 2012-09-25 2013-02-13 东莞宇龙通信科技有限公司 照片信息显示的方法及移动终端
US10922326B2 (en) * 2012-11-27 2021-02-16 Google Llc Triggering knowledge panels
US10289653B2 (en) 2013-03-15 2019-05-14 International Business Machines Corporation Adapting tabular data for narration
USD805535S1 (en) 2013-06-04 2017-12-19 Abbyy Production Llc Display screen or portion thereof with a transitional graphical user interface
USD802609S1 (en) 2013-06-04 2017-11-14 Abbyy Production Llc Display screen with graphical user interface
US9164977B2 (en) 2013-06-24 2015-10-20 International Business Machines Corporation Error correction in tables using discovered functional dependencies
US9600461B2 (en) 2013-07-01 2017-03-21 International Business Machines Corporation Discovering relationships in tabular data
US9830314B2 (en) 2013-11-18 2017-11-28 International Business Machines Corporation Error correction in tables using a question and answer system
RU2665239C2 (ru) 2014-01-15 2018-08-28 Общество с ограниченной ответственностью "Аби Продакшн" Автоматическое извлечение именованных сущностей из текста
RU2586577C2 (ru) 2014-01-15 2016-06-10 Общество с ограниченной ответственностью "Аби ИнфоПоиск" Фильтрация дуг в синтаксическом графе
US10331782B2 (en) 2014-11-19 2019-06-25 Lexisnexis, A Division Of Reed Elsevier Inc. Systems and methods for automatic identification of potential material facts in documents
US9626358B2 (en) 2014-11-26 2017-04-18 Abbyy Infopoisk Llc Creating ontologies by analyzing natural language texts
RU2592396C1 (ru) 2015-02-03 2016-07-20 Общество с ограниченной ответственностью "Аби ИнфоПоиск" Способ и система для машинного извлечения и интерпретации текстовой информации
RU2610241C2 (ru) 2015-03-19 2017-02-08 Общество с ограниченной ответственностью "Аби ИнфоПоиск" Способ и система синтеза текста на основе извлеченной информации в виде rdf-графа с использованием шаблонов
US10095740B2 (en) * 2015-08-25 2018-10-09 International Business Machines Corporation Selective fact generation from table data in a cognitive system
CN105260091B (zh) * 2015-09-07 2019-06-21 努比亚技术有限公司 照片处理方法及装置
US10776587B2 (en) * 2016-07-11 2020-09-15 International Business Machines Corporation Claim generation
RU2637992C1 (ru) * 2016-08-25 2017-12-08 Общество с ограниченной ответственностью "Аби Продакшн" Способ извлечения фактов из текстов на естественном языке
CN106648390B (zh) * 2016-12-05 2018-12-21 网易(杭州)网络有限公司 一种控制指令生成方法、装置及移动终端
CN106649786B (zh) * 2016-12-28 2020-04-07 北京百度网讯科技有限公司 基于深度问答的答案检索方法及装置
CN106924963B (zh) * 2017-04-26 2023-06-27 温州大学 一种视力听力康复训练娱乐打靶机
CN108038263A (zh) * 2017-11-15 2018-05-15 南京邮电大学 考虑性能相关结构不确定的芯片多元参数成品率预测方法
CN108257380B (zh) * 2017-12-05 2020-11-10 北京掌行通信息技术有限公司 一种基于路况信息检测拥堵事件的方法及系统
US10303771B1 (en) * 2018-02-14 2019-05-28 Capital One Services, Llc Utilizing machine learning models to identify insights in a document
CN109344993B (zh) * 2018-08-23 2021-08-24 江西省水利科学研究院 一种基于条件概率分布的河道洪峰水位预报方法
CN111026597B (zh) * 2019-01-31 2023-12-26 安天科技集团股份有限公司 一种芯片隐藏存储空间的检测方法、装置及存储介质
CN110007589B (zh) * 2019-02-26 2021-05-18 湖南盛世威得科技有限公司 一种具有火灾自动求救功能的智能手表
CN110057634B (zh) * 2019-04-11 2021-09-07 东北石油大学 一种制造岩心裂缝的装置及方法
CN111858225A (zh) * 2019-04-28 2020-10-30 中国移动通信集团上海有限公司 延时预测方法、装置、设备及计算机存储介质
CN111090785A (zh) * 2019-06-10 2020-05-01 工盒(嘉兴)网络技术有限公司 一种紧固云系统
CN110597108B (zh) * 2019-08-23 2021-12-21 广州电力设计院有限公司 电缆隧道区域控制系统、控制方法、装置及计算机设备
CN110737010B (zh) * 2019-09-19 2021-11-16 西安空间无线电技术研究所 一种基于低轨通信卫星的安全定位授时信号生成系统
CN111078849B (zh) * 2019-12-02 2023-07-25 百度在线网络技术(北京)有限公司 用于输出信息的方法和装置
CN111126057B (zh) * 2019-12-09 2023-08-01 航天科工网络信息发展有限公司 一种分级神经网络的案件情节精准量刑系统
DE102020103941A1 (de) * 2020-02-14 2021-08-19 Grimme Landmaschinenfabrik Gmbh & Co. Kg Verfahren zum Betrieb einer Maschine zum Ernten und/oder Trennen von Hackfrüchten, zugehörige Maschine und zugehöriges Computerprogrammprodukt
JP2021164005A (ja) * 2020-03-30 2021-10-11 Kddi株式会社 画像復号装置、画像復号方法及びプログラム
CN111526397A (zh) * 2020-03-30 2020-08-11 深圳市懿美莱科技有限公司 一种智能家庭网络播放器
CN111836065B (zh) * 2020-07-14 2022-04-29 北京场景互娱传媒科技有限公司 一种直播商标自动隐藏的智能方法
CN111882828B (zh) * 2020-07-22 2021-08-20 淮北智淮科技有限公司 一种防滑坡预警装置及其使用方法
CN112182895B (zh) * 2020-10-10 2022-08-23 中际联合(天津)科技有限公司 一种风机塔筒爬梯及防坠落布置方案图的自动分析方法
CN112890771B (zh) * 2021-01-14 2022-08-26 四川写正智能科技有限公司 一种基于毫米波雷达传感器监测睡眠状态的儿童手表
US11687539B2 (en) 2021-03-17 2023-06-27 International Business Machines Corporation Automatic neutral point of view content generation
US11972210B2 (en) * 2021-05-13 2024-04-30 Motorola Solutions, Inc. System and method for predicting a penal code and modifying an annotation based on the prediction
CN115191786B (zh) * 2022-08-04 2023-12-19 慕思健康睡眠股份有限公司 一种控制方法、装置、设备和存储介质
CN115432851B (zh) * 2022-08-23 2023-06-23 长兴瑷晟环保装备有限公司 一种高效混凝水力空化一体机
CN118278385B (zh) * 2024-05-29 2024-09-17 暗物智能科技(广州)有限公司 一种基于篇章卷面分析的测试方法、装置及可读存储介质

Family Cites Families (22)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
JPH0756933A (ja) * 1993-06-24 1995-03-03 Xerox Corp 文書検索方法
US5519608A (en) * 1993-06-24 1996-05-21 Xerox Corporation Method for extracting from a text corpus answers to questions stated in natural language by using linguistic analysis and hypothesis generation
US5331556A (en) * 1993-06-28 1994-07-19 General Electric Company Method for natural language data processing using morphological and part-of-speech information
US5715468A (en) * 1994-09-30 1998-02-03 Budzinski; Robert Lucius Memory system for storing and retrieving experience and knowledge with natural language
JP2000029902A (ja) * 1998-07-15 2000-01-28 Nec Corp 構造化文書分類装置およびこの構造化文書分類装置をコンピュータで実現するプログラムを記録した記録媒体、並びに、構造化文書検索システムおよびこの構造化文書検索システムをコンピュータで実現するプログラムを記録した記録媒体
US6167370A (en) * 1998-09-09 2000-12-26 Invention Machine Corporation Document semantic analysis/selection with knowledge creativity capability utilizing subject-action-object (SAO) structures
US6741986B2 (en) * 2000-12-08 2004-05-25 Ingenuity Systems, Inc. Method and system for performing information extraction and quality control for a knowledgebase
US6665661B1 (en) * 2000-09-29 2003-12-16 Battelle Memorial Institute System and method for use in text analysis of documents and records
JP4630480B2 (ja) * 2001-03-19 2011-02-09 株式会社東芝 要約抽出プログラム、文書分析支援プログラム、要約抽出方法、文書分析支援方法、文書分析支援システム
JP2001357064A (ja) * 2001-04-09 2001-12-26 Toshiba Corp 情報共有支援システム
US9009590B2 (en) * 2001-07-31 2015-04-14 Invention Machines Corporation Semantic processor for recognition of cause-effect relations in natural language documents
US7526425B2 (en) * 2001-08-14 2009-04-28 Evri Inc. Method and system for extending keyword searching to syntactically and semantically annotated data
US7254530B2 (en) * 2001-09-26 2007-08-07 The Trustees Of Columbia University In The City Of New York System and method of generating dictionary entries
US7398269B2 (en) * 2002-11-15 2008-07-08 Justsystems Evans Research Inc. Method and apparatus for document filtering using ensemble filters
WO2004072780A2 (en) * 2003-02-05 2004-08-26 Verint Systems, Inc. Method for automatic and semi-automatic classification and clustering of non-deterministic texts
RU2236699C1 (ru) * 2003-02-25 2004-09-20 Открытое акционерное общество "Телепортал. Ру" Способ поиска и выборки информации с повышенной релевантностью
KR100515641B1 (ko) * 2003-04-24 2005-09-22 우순조 모빌적 형상 개념을 기초로 한 구문 분석방법 및 이를이용한 자연어 검색 방법
US20050108630A1 (en) * 2003-11-19 2005-05-19 Wasson Mark D. Extraction of facts from text
US7496500B2 (en) * 2004-03-01 2009-02-24 Microsoft Corporation Systems and methods that determine intent of data and respond to the data based on the intent
US7970600B2 (en) * 2004-11-03 2011-06-28 Microsoft Corporation Using a first natural language parser to train a second parser
US20070027860A1 (en) * 2005-07-28 2007-02-01 International Business Machines Corporation Method and apparatus for eliminating partitions of a database table from a join query using implicit limitations on a partition key value
US7376551B2 (en) * 2005-08-01 2008-05-20 Microsoft Corporation Definition extraction

Also Published As

Publication number Publication date
TWI431493B (zh) 2014-03-21
RU2009103145A (ru) 2010-08-10
AU2007281638B2 (en) 2011-10-06
WO2008016491A1 (en) 2008-02-07
BRPI0714311A2 (pt) 2013-04-24
MX2009000588A (es) 2009-01-27
JP2009545808A (ja) 2009-12-24
EP2050019A1 (en) 2009-04-22
RU2451999C2 (ru) 2012-05-27
TW200817947A (en) 2008-04-16
JP5202524B2 (ja) 2013-06-05
US7668791B2 (en) 2010-02-23
AU2007281638A1 (en) 2008-02-07
EP2050019A4 (en) 2012-03-21
US20080027888A1 (en) 2008-01-31

Similar Documents

Publication Publication Date Title
NO20085387L (no) Optimalisering av faktainnhenting i en flertrinnstilnaerming
Halteren Linguistic profiling for authorship recognition and verification
Pettersson et al. Normalisation of historical text using context-sensitive weighted Levenshtein distance and compound splitting
WO2012050743A3 (en) Language identification in multilingual text
WO2007130544A3 (en) Method for domain identification of documents in a document database
Barnes et al. Sentiment analysis is not solved! assessing and probing sentiment classification
CN103617158A (zh) 一种对话文本情感摘要的生成方法
Janda et al. Aspectual pairs in the Russian national corpus
Jauhiainen et al. HeLI-based experiments in Swiss German dialect identification
CN105302794A (zh) 一种中文同指事件识别方法及系统
Mustafa et al. An enhanced approach for arabic sentiment analysis
Weller et al. Distinguishing degrees of compositionality in compound splitting for statistical machine translation
Ibrahim et al. Sentiment analysis of Arabic tweets: With special reference restaurant tweets
Gupta et al. Preprocessing phase of Punjabi language text summarization
Posadas-Durán et al. Author verification using syntactic n-grams
Yeong et al. A hybrid of sentence-level approach and fragment-level approach of parallel text extraction from comparable text
Liu Sentiment lexicon generation
Libovický et al. Tolerant BLEU: a submission to the WMT14 metrics task
Klenner et al. Gender-tailored semantic role profiling for german
He et al. Method of new word identification based on lager-scale corpus
Domtasouj et al. Are Sentences that Begin with Kāna and other Similar Auxiliary Verbs Nominal or Verbal?(With an Emphasis on Their Implications)
Shahriari et al. A Corpus-based Investigation of the Words Used in the Representation of Offender, Victim, and Crime in Iranian Press based on Critical Stylistics Framework
Liu et al. Pengyuan@ PKU: Extracting infrequent sense instance with the same n-gram pattern for the semeval-2010 task 15
Luangpiensamut et al. Using tagged and untagged corpora to improve thai morphological analysis with unknown word boundary detections
송민영 The Semantics of Epistemic must

Legal Events

Date Code Title Description
CHAD Change of the owner's name or address (par. 44 patent law, par. patentforskriften)

Owner name: MICROSOFT TECHNOLOGY LICENSING, US

FC2A Withdrawal, rejection or dismissal of laid open patent application