NO20085387L - Optimization of fact extraction using a multi-stage approach - Google Patents
Optimization of fact extraction using a multi-stage approach
- Publication number
- NO20085387L
- Authority
- NO
- Norway
- Prior art keywords
- fact
- words
- facts
- phrases
- entire
- Prior art date
Classifications
-
- G—PHYSICS
- G06—COMPUTING; CALCULATING OR COUNTING
- G06F—ELECTRIC DIGITAL DATA PROCESSING
- G06F16/00—Information retrieval; Database structures therefor; File system structures therefor
- G06F16/30—Information retrieval; Database structures therefor; File system structures therefor of unstructured textual data
- G06F16/34—Browsing; Visualisation therefor
- G06F16/345—Summarisation for human users
-
- G—PHYSICS
- G06—COMPUTING; CALCULATING OR COUNTING
- G06F—ELECTRIC DIGITAL DATA PROCESSING
- G06F40/00—Handling natural language data
- G06F40/30—Semantic analysis
Landscapes
- Engineering & Computer Science (AREA)
- Theoretical Computer Science (AREA)
- Physics & Mathematics (AREA)
- General Engineering & Computer Science (AREA)
- General Physics & Mathematics (AREA)
- Audiology, Speech & Language Pathology (AREA)
- Computational Linguistics (AREA)
- General Health & Medical Sciences (AREA)
- Health & Medical Sciences (AREA)
- Artificial Intelligence (AREA)
- Data Mining & Analysis (AREA)
- Databases & Information Systems (AREA)
- Information Retrieval, Db Structures And Fs Structures Therefor (AREA)
- Machine Translation (AREA)
- Document Processing Apparatus (AREA)
Abstract
Facts are extracted from electronic documents by recognizing factual descriptions using a fact-word table that is compared against the words in the electronic documents. The words in these factual descriptions may be tagged with the appropriate part of speech. More detailed analysis is then performed on these factual descriptions rather than on the entire electronic document, and in particular on the text surrounding detected fact words. The analysis may include identifying the linguistic constituents of each phrase and determining their role as either subject or object. Exclusion rules may be applied to remove phrases that are unlikely to be part of a fact, the exclusion rules being based in part on the linguistic constituents. Scoring rules may be applied to the remaining phrases, and for those phrases whose score exceeds a threshold, the corresponding sentence portion, entire sentence, entire paragraph, or other document portion may be presented as a representation of one or more facts.
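The stages named in the abstract (fact-word lookup, part-of-speech tagging near the fact word, subject/object roles, exclusion rules, scoring against a threshold) can be illustrated with a minimal Python sketch. This is not the patented implementation: the word tables, the toy tagger, the window size, the individual rules, and the threshold value are all hypothetical stand-ins chosen only to make the flow concrete.

```python
import re

# All tables and constants below are illustrative assumptions, not from the patent.
FACT_WORDS = {"is", "was", "are", "were", "founded", "acquired"}
OPINION_WORDS = {"think", "believe", "probably", "maybe", "great", "terrible"}
PRONOUNS = {"i", "we", "you", "he", "she", "it", "they"}
DETERMINERS = {"the", "a", "an"}
WINDOW = 4     # tokens inspected on each side of a detected fact word
THRESHOLD = 1  # a phrase is reported only if its score exceeds this value


def split_sentences(text):
    """Split text after '.', '!' or '?' followed by whitespace."""
    return [s.strip() for s in re.split(r"(?<=[.!?])\s+", text) if s.strip()]


def tag(token):
    """Toy part-of-speech tagger standing in for a real one."""
    lower = token.lower()
    if token.isdigit():
        return "NUM"
    if lower in PRONOUNS:
        return "PRON"
    if lower in DETERMINERS:
        return "DET"
    if lower in OPINION_WORDS:
        return "OPINION"
    if lower in FACT_WORDS:
        return "FACT"
    # Crude heuristic: any other capitalized token counts as a proper noun.
    return "PROPN" if token[0].isupper() else "OTHER"


def analyze(sentence):
    """Find the first fact word and tag only the window of text around it.

    Returns (subject, object) as lists of (token, tag) pairs, or None when
    the sentence contains no fact word and is skipped entirely.
    """
    tokens = re.findall(r"[A-Za-z]+|\d+", sentence)
    for i, token in enumerate(tokens):
        if token.lower() in FACT_WORDS:
            subject = [(t, tag(t)) for t in tokens[max(0, i - WINDOW):i]]
            obj = [(t, tag(t)) for t in tokens[i + 1:i + 1 + WINDOW]]
            return subject, obj
    return None


def excluded(subject, obj):
    """Exclusion rules based on the tagged constituents (hypothetical rules)."""
    if not subject or not obj:
        return True
    if any(t == "OPINION" for _, t in subject + obj):
        return True
    return all(t == "PRON" for _, t in subject)


def score(subject, obj):
    """Scoring rules applied to phrases that survive exclusion (hypothetical)."""
    points = 0
    if any(t == "PROPN" for _, t in subject):
        points += 1
    if any(t in ("PROPN", "NUM") for _, t in obj):
        points += 1
    if len(obj) >= 2:
        points += 1
    return points


def extract_facts(text):
    """Return the sentences whose score exceeds THRESHOLD."""
    facts = []
    for sentence in split_sentences(text):
        parts = analyze(sentence)
        if parts is None or excluded(*parts):
            continue
        if score(*parts) > THRESHOLD:
            facts.append(sentence)
    return facts
```

For the input `"Acme was founded in 1999. I think it is great. The weather today."`, `extract_facts` returns only the first sentence: the second is removed by the opinion-word exclusion rule, and the third contains no fact word and is never analyzed. The sketch reports the whole sentence; the abstract also allows a sentence portion, paragraph, or other document portion.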
Applications Claiming Priority (2)
Application Number | Priority Date | Filing Date | Title |
---|---|---|---|
US11/496,650 US7668791B2 (en) | 2006-07-31 | 2006-07-31 | Distinguishing facts from opinions using a multi-stage approach |
PCT/US2007/016435 WO2008016491A1 (en) | 2006-07-31 | 2007-07-20 | Optimization of fact extraction using a multi-stage approach |
Publications (1)
Publication Number | Publication Date |
---|---|
NO20085387L (no) | 2009-01-19 |
Family
ID=38987573
Family Applications (1)
Application Number | Priority Date | Filing Date | Title |
---|---|---|---|
NO20085387A NO20085387L (no) | 2006-07-31 | 2008-12-29 | Optimization of fact extraction using a multi-stage approach |
Country Status (10)
Country | Link |
---|---|
US (1) | US7668791B2 (no) |
EP (1) | EP2050019A4 (no) |
JP (1) | JP5202524B2 (no) |
AU (1) | AU2007281638B2 (no) |
BR (1) | BRPI0714311A2 (no) |
MX (1) | MX2009000588A (no) |
NO (1) | NO20085387L (no) |
RU (1) | RU2451999C2 (no) |
TW (1) | TWI431493B (no) |
WO (1) | WO2008016491A1 (no) |
Families Citing this family (54)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
US7269875B1 (en) * | 2003-11-19 | 2007-09-18 | David Brian Grimes | Cleaning apparatus |
US9495358B2 (en) | 2006-10-10 | 2016-11-15 | Abbyy Infopoisk Llc | Cross-language text clustering |
US8671341B1 (en) * | 2007-01-05 | 2014-03-11 | Linguastat, Inc. | Systems and methods for identifying claims associated with electronic text |
US8190628B1 (en) * | 2007-11-30 | 2012-05-29 | Google Inc. | Phrase generation |
TWI544349B (zh) | 2008-06-13 | 2016-08-01 | 尼爾 揚 | Sortable and updatable compilation and archiving platform and use thereof |
US20110231387A1 (en) * | 2010-03-22 | 2011-09-22 | Yahoo! Inc. | Engaging content provision |
US8719692B2 (en) * | 2011-03-11 | 2014-05-06 | Microsoft Corporation | Validation, rejection, and modification of automatically generated document annotations |
US8812301B2 (en) * | 2011-09-26 | 2014-08-19 | Xerox Corporation | Linguistically-adapted structural query annotation |
CN102929934A (zh) * | 2012-09-25 | 2013-02-13 | 东莞宇龙通信科技有限公司 | Method for displaying photo information and mobile terminal |
US10922326B2 (en) * | 2012-11-27 | 2021-02-16 | Google Llc | Triggering knowledge panels |
US10289653B2 (en) | 2013-03-15 | 2019-05-14 | International Business Machines Corporation | Adapting tabular data for narration |
USD805535S1 (en) | 2013-06-04 | 2017-12-19 | Abbyy Production Llc | Display screen or portion thereof with a transitional graphical user interface |
USD802609S1 (en) | 2013-06-04 | 2017-11-14 | Abbyy Production Llc | Display screen with graphical user interface |
US9164977B2 (en) | 2013-06-24 | 2015-10-20 | International Business Machines Corporation | Error correction in tables using discovered functional dependencies |
US9600461B2 (en) | 2013-07-01 | 2017-03-21 | International Business Machines Corporation | Discovering relationships in tabular data |
US9830314B2 (en) | 2013-11-18 | 2017-11-28 | International Business Machines Corporation | Error correction in tables using a question and answer system |
RU2665239C2 (ru) | 2014-01-15 | 2018-08-28 | Общество с ограниченной ответственностью "Аби Продакшн" | Automatic extraction of named entities from text |
RU2586577C2 (ru) | 2014-01-15 | 2016-06-10 | Общество с ограниченной ответственностью "Аби ИнфоПоиск" | Filtering arcs in a syntactic graph |
US10331782B2 (en) | 2014-11-19 | 2019-06-25 | Lexisnexis, A Division Of Reed Elsevier Inc. | Systems and methods for automatic identification of potential material facts in documents |
US9626358B2 (en) | 2014-11-26 | 2017-04-18 | Abbyy Infopoisk Llc | Creating ontologies by analyzing natural language texts |
RU2592396C1 (ru) | 2015-02-03 | 2016-07-20 | Общество с ограниченной ответственностью "Аби ИнфоПоиск" | Method and system for machine extraction and interpretation of textual information |
RU2610241C2 (ru) | 2015-03-19 | 2017-02-08 | Общество с ограниченной ответственностью "Аби ИнфоПоиск" | Method and system for text synthesis based on extracted information in the form of an RDF graph using templates |
US10095740B2 (en) * | 2015-08-25 | 2018-10-09 | International Business Machines Corporation | Selective fact generation from table data in a cognitive system |
CN105260091B (zh) * | 2015-09-07 | 2019-06-21 | 努比亚技术有限公司 | Photo processing method and device |
US10776587B2 (en) * | 2016-07-11 | 2020-09-15 | International Business Machines Corporation | Claim generation |
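RU2637992C1 (ru) * | 2016-08-25 | 2017-12-08 | Общество с ограниченной ответственностью "Аби Продакшн" | Method for extracting facts from natural language texts |
CN106648390B (zh) * | 2016-12-05 | 2018-12-21 | 网易(杭州)网络有限公司 | Control instruction generation method, device and mobile terminal |
CN106649786B (zh) * | 2016-12-28 | 2020-04-07 | 北京百度网讯科技有限公司 | Answer retrieval method and device based on deep question answering |
CN106924963B (zh) * | 2017-04-26 | 2023-06-27 | 温州大学 | Target-shooting entertainment machine for vision and hearing rehabilitation training |
CN108038263A (zh) * | 2017-11-15 | 2018-05-15 | 南京邮电大学 | Chip multi-parameter yield prediction method considering uncertainty in the performance correlation structure |
CN108257380B (zh) * | 2017-12-05 | 2020-11-10 | 北京掌行通信息技术有限公司 | Method and system for detecting congestion events based on road condition information |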
US10303771B1 (en) * | 2018-02-14 | 2019-05-28 | Capital One Services, Llc | Utilizing machine learning models to identify insights in a document |
CN109344993B (zh) * | 2018-08-23 | 2021-08-24 | 江西省水利科学研究院 | River flood peak water level forecasting method based on conditional probability distribution |
CN111026597B (zh) * | 2019-01-31 | 2023-12-26 | 安天科技集团股份有限公司 | Method, device and storage medium for detecting hidden storage space of a chip |
CN110007589B (zh) * | 2019-02-26 | 2021-05-18 | 湖南盛世威得科技有限公司 | Smart watch with automatic fire distress call function |
CN110057634B (zh) * | 2019-04-11 | 2021-09-07 | 东北石油大学 | Device and method for creating fractures in rock cores |
CN111858225A (zh) * | 2019-04-28 | 2020-10-30 | 中国移动通信集团上海有限公司 | Delay prediction method, apparatus, device and computer storage medium |
CN111090785A (zh) * | 2019-06-10 | 2020-05-01 | 工盒(嘉兴)网络技术有限公司 | Fastening cloud system |
CN110597108B (zh) * | 2019-08-23 | 2021-12-21 | 广州电力设计院有限公司 | Cable tunnel area control system, control method, apparatus and computer device |
CN110737010B (zh) * | 2019-09-19 | 2021-11-16 | 西安空间无线电技术研究所 | Secure positioning and timing signal generation system based on low-orbit communication satellites |
CN111078849B (zh) * | 2019-12-02 | 2023-07-25 | 百度在线网络技术(北京)有限公司 | Method and apparatus for outputting information |
CN111126057B (zh) * | 2019-12-09 | 2023-08-01 | 航天科工网络信息发展有限公司 | Precise sentencing system for case circumstances using a hierarchical neural network |
DE102020103941A1 (de) * | 2020-02-14 | 2021-08-19 | Grimme Landmaschinenfabrik Gmbh & Co. Kg | Method for operating a machine for harvesting and/or separating root crops, associated machine and associated computer program product |
JP2021164005A (ja) * | 2020-03-30 | 2021-10-11 | Kddi株式会社 | Image decoding device, image decoding method and program |
CN111526397A (zh) * | 2020-03-30 | 2020-08-11 | 深圳市懿美莱科技有限公司 | Smart home network player |
CN111836065B (zh) * | 2020-07-14 | 2022-04-29 | 北京场景互娱传媒科技有限公司 | Intelligent method for automatically hiding trademarks in live streaming |
CN111882828B (zh) * | 2020-07-22 | 2021-08-20 | 淮北智淮科技有限公司 | Landslide prevention early-warning device and method of using the same |
CN112182895B (zh) * | 2020-10-10 | 2022-08-23 | 中际联合(天津)科技有限公司 | Automatic analysis method for wind turbine tower ladder and fall-protection layout drawings |
CN112890771B (zh) * | 2021-01-14 | 2022-08-26 | 四川写正智能科技有限公司 | Children's watch for monitoring sleep state based on a millimeter-wave radar sensor |
US11687539B2 (en) | 2021-03-17 | 2023-06-27 | International Business Machines Corporation | Automatic neutral point of view content generation |
US11972210B2 (en) * | 2021-05-13 | 2024-04-30 | Motorola Solutions, Inc. | System and method for predicting a penal code and modifying an annotation based on the prediction |
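CN115191786B (zh) * | 2022-08-04 | 2023-12-19 | 慕思健康睡眠股份有限公司 | Control method, apparatus, device and storage medium |
CN115432851B (zh) * | 2022-08-23 | 2023-06-23 | 长兴瑷晟环保装备有限公司 | High-efficiency coagulation and hydrodynamic cavitation integrated machine |
CN118278385B (zh) * | 2024-05-29 | 2024-09-17 | 暗物智能科技(广州)有限公司 | Testing method, device and readable storage medium based on discourse-level answer sheet analysis |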
Family Cites Families (22)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
JPH0756933A (ja) * | 1993-06-24 | 1995-03-03 | Xerox Corp | Document retrieval method |
US5519608A (en) * | 1993-06-24 | 1996-05-21 | Xerox Corporation | Method for extracting from a text corpus answers to questions stated in natural language by using linguistic analysis and hypothesis generation |
US5331556A (en) * | 1993-06-28 | 1994-07-19 | General Electric Company | Method for natural language data processing using morphological and part-of-speech information |
US5715468A (en) * | 1994-09-30 | 1998-02-03 | Budzinski; Robert Lucius | Memory system for storing and retrieving experience and knowledge with natural language |
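JP2000029902A (ja) * | 1998-07-15 | 2000-01-28 | Nec Corp | Structured document classification device and recording medium storing a program for implementing the structured document classification device on a computer, and structured document retrieval system and recording medium storing a program for implementing the structured document retrieval system on a computer |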
US6167370A (en) * | 1998-09-09 | 2000-12-26 | Invention Machine Corporation | Document semantic analysis/selection with knowledge creativity capability utilizing subject-action-object (SAO) structures |
US6741986B2 (en) * | 2000-12-08 | 2004-05-25 | Ingenuity Systems, Inc. | Method and system for performing information extraction and quality control for a knowledgebase |
US6665661B1 (en) * | 2000-09-29 | 2003-12-16 | Battelle Memorial Institute | System and method for use in text analysis of documents and records |
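JP4630480B2 (ja) * | 2001-03-19 | 2011-02-09 | 株式会社東芝 | Summary extraction program, document analysis support program, summary extraction method, document analysis support method, and document analysis support system |
JP2001357064A (ja) * | 2001-04-09 | 2001-12-26 | Toshiba Corp | Information sharing support system |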
US9009590B2 (en) * | 2001-07-31 | 2015-04-14 | Invention Machines Corporation | Semantic processor for recognition of cause-effect relations in natural language documents |
US7526425B2 (en) * | 2001-08-14 | 2009-04-28 | Evri Inc. | Method and system for extending keyword searching to syntactically and semantically annotated data |
US7254530B2 (en) * | 2001-09-26 | 2007-08-07 | The Trustees Of Columbia University In The City Of New York | System and method of generating dictionary entries |
US7398269B2 (en) * | 2002-11-15 | 2008-07-08 | Justsystems Evans Research Inc. | Method and apparatus for document filtering using ensemble filters |
WO2004072780A2 (en) * | 2003-02-05 | 2004-08-26 | Verint Systems, Inc. | Method for automatic and semi-automatic classification and clustering of non-deterministic texts |
RU2236699C1 (ru) * | 2003-02-25 | 2004-09-20 | Открытое акционерное общество "Телепортал. Ру" | Method for searching and retrieving information with increased relevance |
KR100515641B1 (ko) * | 2003-04-24 | 2005-09-22 | 우순조 | Syntactic analysis method based on the concept of mobile configuration and natural language search method using the same |
US20050108630A1 (en) * | 2003-11-19 | 2005-05-19 | Wasson Mark D. | Extraction of facts from text |
US7496500B2 (en) * | 2004-03-01 | 2009-02-24 | Microsoft Corporation | Systems and methods that determine intent of data and respond to the data based on the intent |
US7970600B2 (en) * | 2004-11-03 | 2011-06-28 | Microsoft Corporation | Using a first natural language parser to train a second parser |
US20070027860A1 (en) * | 2005-07-28 | 2007-02-01 | International Business Machines Corporation | Method and apparatus for eliminating partitions of a database table from a join query using implicit limitations on a partition key value |
US7376551B2 (en) * | 2005-08-01 | 2008-05-20 | Microsoft Corporation | Definition extraction |
2006
- 2006-07-31: US application US11/496,650, published as US7668791B2 (en); status: active
2007
- 2007-07-18: TW application TW096126248A, published as TWI431493B (zh); status: not active, IP Right Cessation
- 2007-07-20: AU application AU2007281638A, published as AU2007281638B2 (en); status: active
- 2007-07-20: WO application PCT/US2007/016435, published as WO2008016491A1 (en); status: active, Application Filing
- 2007-07-20: JP application JP2009522777A, published as JP5202524B2 (ja); status: active
- 2007-07-20: EP application EP07796948A, published as EP2050019A4 (en); status: not active, Ceased
- 2007-07-20: BR application BRPI0714311-7A, published as BRPI0714311A2 (pt); status: not active, IP Right Cessation
- 2007-07-20: RU application RU2009103145/08A, published as RU2451999C2 (ru); status: active
- 2007-07-20: MX application MX2009000588A, published as MX2009000588A (es); status: unknown
2008
- 2008-12-29: NO application NO20085387A, published as NO20085387L (no); status: not active, Application Discontinuation
Also Published As
Publication number | Publication date |
---|---|
TWI431493B (zh) | 2014-03-21 |
RU2009103145A (ru) | 2010-08-10 |
AU2007281638B2 (en) | 2011-10-06 |
WO2008016491A1 (en) | 2008-02-07 |
BRPI0714311A2 (pt) | 2013-04-24 |
MX2009000588A (es) | 2009-01-27 |
JP2009545808A (ja) | 2009-12-24 |
EP2050019A1 (en) | 2009-04-22 |
RU2451999C2 (ru) | 2012-05-27 |
TW200817947A (en) | 2008-04-16 |
JP5202524B2 (ja) | 2013-06-05 |
US7668791B2 (en) | 2010-02-23 |
AU2007281638A1 (en) | 2008-02-07 |
EP2050019A4 (en) | 2012-03-21 |
US20080027888A1 (en) | 2008-01-31 |
Similar Documents
Publication | Title | Publication Date |
---|---|---|
NO20085387L (no) | Optimization of fact extraction using a multi-stage approach | |
Halteren | Linguistic profiling for authorship recognition and verification | |
Pettersson et al. | Normalisation of historical text using context-sensitive weighted Levenshtein distance and compound splitting | |
WO2012050743A3 (en) | Language identification in multilingual text | |
WO2007130544A3 (en) | Method for domain identification of documents in a document database | |
Barnes et al. | Sentiment analysis is not solved! assessing and probing sentiment classification | |
CN103617158A (zh) | Method for generating sentiment summaries of dialogue text | |
Janda et al. | Aspectual pairs in the Russian national corpus | |
Jauhiainen et al. | HeLI-based experiments in Swiss German dialect identification | |
CN105302794A (zh) | Chinese coreferential event recognition method and system | |
Mustafa et al. | An enhanced approach for arabic sentiment analysis | |
Weller et al. | Distinguishing degrees of compositionality in compound splitting for statistical machine translation | |
Ibrahim et al. | Sentiment analysis of Arabic tweets: With special reference restaurant tweets | |
Gupta et al. | Preprocessing phase of Punjabi language text summarization | |
Posadas-Durán et al. | Author verification using syntactic n-grams | |
Yeong et al. | A hybrid of sentence-level approach and fragment-level approach of parallel text extraction from comparable text | |
Liu | Sentiment lexicon generation | |
Libovický et al. | Tolerant BLEU: a submission to the WMT14 metrics task | |
Klenner et al. | Gender-tailored semantic role profiling for german | |
He et al. | Method of new word identification based on larger-scale corpus | |
Domtasouj et al. | Are Sentences that Begin with Kāna and other Similar Auxiliary Verbs Nominal or Verbal? (With an Emphasis on Their Implications) | |
Shahriari et al. | A Corpus-based Investigation of the Words Used in the Representation of Offender, Victim, and Crime in Iranian Press based on Critical Stylistics Framework | |
Liu et al. | Pengyuan@ PKU: Extracting infrequent sense instance with the same n-gram pattern for the semeval-2010 task 15 | |
Luangpiensamut et al. | Using tagged and untagged corpora to improve thai morphological analysis with unknown word boundary detections | |
송민영 | The Semantics of Epistemic must |
Legal Events
Code | Title | Description |
---|---|---|
CHAD | Change of the owner's name or address (par. 44 patent law, par. patentforskriften) | Owner name: MICROSOFT TECHNOLOGY LICENSING, US |
FC2A | Withdrawal, rejection or dismissal of laid open patent application | |