WO2010151788A3 - Système et procédé pour récupérer des informations numériques basées sur des unités - Google Patents
Système et procédé pour récupérer des informations numériques basées sur des unités Download PDFInfo
- Publication number
- WO2010151788A3 WO2010151788A3 PCT/US2010/040024 US2010040024W WO2010151788A3 WO 2010151788 A3 WO2010151788 A3 WO 2010151788A3 US 2010040024 W US2010040024 W US 2010040024W WO 2010151788 A3 WO2010151788 A3 WO 2010151788A3
- Authority
- WO
- WIPO (PCT)
- Prior art keywords
- numeric
- data
- numeric data
- information retrieval
- extracted
- Prior art date
Links
Classifications
-
- G—PHYSICS
- G06—COMPUTING; CALCULATING OR COUNTING
- G06F—ELECTRIC DIGITAL DATA PROCESSING
- G06F16/00—Information retrieval; Database structures therefor; File system structures therefor
- G06F16/30—Information retrieval; Database structures therefor; File system structures therefor of unstructured textual data
- G06F16/31—Indexing; Data structures therefor; Storage structures
- G06F16/316—Indexing structures
-
- G—PHYSICS
- G06—COMPUTING; CALCULATING OR COUNTING
- G06F—ELECTRIC DIGITAL DATA PROCESSING
- G06F16/00—Information retrieval; Database structures therefor; File system structures therefor
- G06F16/30—Information retrieval; Database structures therefor; File system structures therefor of unstructured textual data
- G06F16/31—Indexing; Data structures therefor; Storage structures
- G06F16/313—Selection or weighting of terms for indexing
-
- G—PHYSICS
- G06—COMPUTING; CALCULATING OR COUNTING
- G06F—ELECTRIC DIGITAL DATA PROCESSING
- G06F16/00—Information retrieval; Database structures therefor; File system structures therefor
- G06F16/30—Information retrieval; Database structures therefor; File system structures therefor of unstructured textual data
- G06F16/33—Querying
- G06F16/3331—Query processing
- G06F16/334—Query execution
-
- G—PHYSICS
- G06—COMPUTING; CALCULATING OR COUNTING
- G06F—ELECTRIC DIGITAL DATA PROCESSING
- G06F16/00—Information retrieval; Database structures therefor; File system structures therefor
- G06F16/30—Information retrieval; Database structures therefor; File system structures therefor of unstructured textual data
- G06F16/33—Querying
- G06F16/3331—Query processing
- G06F16/334—Query execution
- G06F16/3347—Query execution using vector based model
-
- G—PHYSICS
- G06—COMPUTING; CALCULATING OR COUNTING
- G06F—ELECTRIC DIGITAL DATA PROCESSING
- G06F16/00—Information retrieval; Database structures therefor; File system structures therefor
- G06F16/30—Information retrieval; Database structures therefor; File system structures therefor of unstructured textual data
- G06F16/33—Querying
- G06F16/338—Presentation of query results
-
- G—PHYSICS
- G06—COMPUTING; CALCULATING OR COUNTING
- G06F—ELECTRIC DIGITAL DATA PROCESSING
- G06F16/00—Information retrieval; Database structures therefor; File system structures therefor
- G06F16/90—Details of database functions independent of the retrieved data types
- G06F16/93—Document management systems
-
- G—PHYSICS
- G06—COMPUTING; CALCULATING OR COUNTING
- G06F—ELECTRIC DIGITAL DATA PROCESSING
- G06F2216/00—Indexing scheme relating to additional aspects of information retrieval not explicitly covered by G06F16/00 and subgroups
- G06F2216/11—Patent retrieval
Landscapes
- Engineering & Computer Science (AREA)
- Theoretical Computer Science (AREA)
- Databases & Information Systems (AREA)
- Data Mining & Analysis (AREA)
- Physics & Mathematics (AREA)
- General Engineering & Computer Science (AREA)
- General Physics & Mathematics (AREA)
- Computational Linguistics (AREA)
- Software Systems (AREA)
- Business, Economics & Management (AREA)
- General Business, Economics & Management (AREA)
- Information Retrieval, Db Structures And Fs Structures Therefor (AREA)
Abstract
La présente invention concerne un système de récupération et d'analyse d'informations pour des données numériques. Ledit système fournit une précision élevée et un rappel pour la recherche numérique et il utilise une méthodologie pour déterminer une contextualisation des données extraites. Les capacités comprennent l'extraction, l'analyse syntaxique et la contextualisation des données numériques comprenant une valeur numérique et une unité associée. Ce système facilite l'organisation de données numériques en grande partie non structurées en un index inversé et en d'autres formats de base de données. La présente invention concerne également un système de récupération d'informations qui permet l'exploration et la décomposition d'un ensemble de données numériques extraites défini par une entrée de recherche qui peut être précise ou initialement vague. Ce système facilite également l'analyse et la représentation graphique de données numériques, la création de connaissances en combinant des données provenant de plusieurs sources, l'extraction de corrélations entre des variables apparemment disparates et la reconnaissance de tendances pour les données numériques. Ce système utilise un traitement local de langage naturel, une analyse mathématique et une heuristique scientifique à base d'expert pour noter la pertinence numérique et contextuelle des données pour les paramètres d'interrogation.
Applications Claiming Priority (4)
Application Number | Priority Date | Filing Date | Title |
---|---|---|---|
US22061709P | 2009-06-26 | 2009-06-26 | |
US61/220,617 | 2009-06-26 | ||
US12/496,199 | 2009-07-01 | ||
US12/496,199 US8756229B2 (en) | 2009-06-26 | 2009-07-01 | System and methods for units-based numeric information retrieval |
Publications (2)
Publication Number | Publication Date |
---|---|
WO2010151788A2 WO2010151788A2 (fr) | 2010-12-29 |
WO2010151788A3 true WO2010151788A3 (fr) | 2011-06-16 |
Family
ID=43381865
Family Applications (1)
Application Number | Title | Priority Date | Filing Date |
---|---|---|---|
PCT/US2010/040024 WO2010151788A2 (fr) | 2009-06-26 | 2010-06-25 | Système et procédé pour récupérer des informations numériques basées sur des unités |
Country Status (2)
Country | Link |
---|---|
US (2) | US8756229B2 (fr) |
WO (1) | WO2010151788A2 (fr) |
Families Citing this family (60)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
US8996416B2 (en) * | 2002-01-22 | 2015-03-31 | Lavante, Inc. | OCR enabled management of accounts payable and/or accounts receivable auditing data |
US20110153509A1 (en) | 2005-05-27 | 2011-06-23 | Ip Development Venture | Method and apparatus for cross-referencing important ip relationships |
US7895104B1 (en) * | 2007-10-04 | 2011-02-22 | Ip Street Inc. | Presentation and analysis of docket information and financial information |
US20220327484A1 (en) * | 2008-03-21 | 2022-10-13 | Brian Gale | System and method for clinical practice and health risk reduction monitoring |
US20100131513A1 (en) | 2008-10-23 | 2010-05-27 | Lundberg Steven W | Patent mapping |
US20100250340A1 (en) * | 2009-03-24 | 2010-09-30 | Ip Street, Inc. | Processing and Presenting Intellectual Property and Other Information |
US20100262512A1 (en) * | 2009-04-13 | 2010-10-14 | Ip Street, Inc. | Processing and Presenting Intellectual Property and Other Information |
US8122043B2 (en) * | 2009-06-30 | 2012-02-21 | Ebsco Industries, Inc | System and method for using an exemplar document to retrieve relevant documents from an inverted index of a large corpus |
US20110145283A1 (en) * | 2009-12-10 | 2011-06-16 | International Business Machines Corporation | Intelligent mechanism for identifying ontological hypertext and pre-fetching and presenting the target information |
US20110255788A1 (en) * | 2010-01-15 | 2011-10-20 | Copanion, Inc. | Systems and methods for automatically extracting data from electronic documents using external data |
US20120284305A1 (en) * | 2010-01-19 | 2012-11-08 | Nec Corporation | Trend information search device, trend information search method and recording medium |
US8751520B1 (en) * | 2010-06-23 | 2014-06-10 | Google Inc. | Query suggestions with high utility |
US20120174017A1 (en) * | 2010-12-29 | 2012-07-05 | Verisign, Inc. | Systems, methods and computer software for innovation management |
EP2697710A4 (fr) * | 2011-04-15 | 2014-10-08 | Ip Street Inc | Évaluation d'une propriété intellectuelle |
US10891701B2 (en) | 2011-04-15 | 2021-01-12 | Rowan TELS Corp. | Method and system for evaluating intellectual property |
US9904726B2 (en) | 2011-05-04 | 2018-02-27 | Black Hills IP Holdings, LLC. | Apparatus and method for automated and assisted patent claim mapping and expense planning |
US10001898B1 (en) | 2011-07-12 | 2018-06-19 | Domo, Inc. | Automated provisioning of relational information for a summary data visualization |
US9792017B1 (en) | 2011-07-12 | 2017-10-17 | Domo, Inc. | Automatic creation of drill paths |
US9202297B1 (en) | 2011-07-12 | 2015-12-01 | Domo, Inc. | Dynamic expansion of data visualizations |
RU2459242C1 (ru) * | 2011-08-09 | 2012-08-20 | Олег Александрович Серебренников | Способ создания и использования рекурсивного индекса поисковых машин |
CA2849292A1 (fr) * | 2011-09-21 | 2013-03-28 | ValueCorp Pacific, Inc. | Systeme et procede d'extraction et de recherche d'ontologie mathematique |
US20130085946A1 (en) | 2011-10-03 | 2013-04-04 | Steven W. Lundberg | Systems, methods and user interfaces in a patent management system |
KR20140139521A (ko) * | 2012-03-29 | 2014-12-05 | 무 시그마 비지니스 솔루션스 피브이티 엘티디 | 데이터 솔루션 시스템 |
US9641431B1 (en) * | 2012-04-18 | 2017-05-02 | Google Inc. | System and methods for utilization-based balancing of traffic to an information retrieval system |
US8949263B1 (en) * | 2012-05-14 | 2015-02-03 | NetBase Solutions, Inc. | Methods and apparatus for sentiment analysis |
US8849843B1 (en) * | 2012-06-18 | 2014-09-30 | Ez-XBRL Solutions, Inc. | System and method for facilitating associating semantic labels with content |
US9135327B1 (en) | 2012-08-30 | 2015-09-15 | Ez-XBRL Solutions, Inc. | System and method to facilitate the association of structured content in a structured document with unstructured content in an unstructured document |
US20140129543A1 (en) * | 2012-11-02 | 2014-05-08 | Microsoft Corporation | Search service including indexing text containing numbers in part using one or more number index structures |
US20140280290A1 (en) * | 2013-03-14 | 2014-09-18 | Microsoft Corporation | Selection and display of alternative suggested sub-strings in a query |
US9244952B2 (en) | 2013-03-17 | 2016-01-26 | Alation, Inc. | Editable and searchable markup pages automatically populated through user query monitoring |
US9792330B1 (en) | 2013-04-30 | 2017-10-17 | Google Inc. | Identifying local experts for local search |
US20140324656A1 (en) * | 2013-04-30 | 2014-10-30 | Omx Technology Ab | Order life-cycle visualization |
US20140372216A1 (en) * | 2013-06-13 | 2014-12-18 | Microsoft Corporation | Contextual mobile application advertisements |
US10726018B2 (en) | 2014-02-10 | 2020-07-28 | Microsoft Technology Licensing, Llc | Semantic matching and annotation of attributes |
US9477782B2 (en) | 2014-03-21 | 2016-10-25 | Microsoft Corporation | User interface mechanisms for query refinement |
US9460075B2 (en) * | 2014-06-17 | 2016-10-04 | International Business Machines Corporation | Solving and answering arithmetic and algebraic problems using natural language processing |
US20170177704A1 (en) * | 2014-07-29 | 2017-06-22 | Hewlett Packard Enterprise Development Lp | Similarity in a structured dataset |
US9514185B2 (en) * | 2014-08-07 | 2016-12-06 | International Business Machines Corporation | Answering time-sensitive questions |
US20160063095A1 (en) * | 2014-08-27 | 2016-03-03 | International Business Machines Corporation | Unstructured data guided query modification |
US9613134B2 (en) * | 2014-09-07 | 2017-04-04 | Microsoft Technology Licensing, Llc | Identifying mathematical operators in natural language text for knowledge-based matching |
US9430557B2 (en) | 2014-09-17 | 2016-08-30 | International Business Machines Corporation | Automatic data interpretation and answering analytical questions with tables and charts |
US11275775B2 (en) | 2014-10-09 | 2022-03-15 | Splunk Inc. | Performing search queries for key performance indicators using an optimized common information model |
US10289679B2 (en) | 2014-12-10 | 2019-05-14 | International Business Machines Corporation | Data relationships in a question-answering environment |
US10509800B2 (en) * | 2015-01-23 | 2019-12-17 | Hewlett-Packard Development Company, L.P. | Visually interactive identification of a cohort of data objects similar to a query based on domain knowledge |
US10019442B2 (en) * | 2015-05-31 | 2018-07-10 | Thomson Reuters Global Resources Unlimited Company | Method and system for peer detection |
US10325385B2 (en) | 2015-09-24 | 2019-06-18 | International Business Machines Corporation | Comparative visualization of numerical information |
US9679198B2 (en) * | 2015-11-05 | 2017-06-13 | International Business Machines Corporation | Ingestion plan based on table uniqueness |
US20170220950A1 (en) * | 2016-01-29 | 2017-08-03 | International Business Machines Corporation | Numerical expression analysis |
WO2017214266A1 (fr) * | 2016-06-07 | 2017-12-14 | Panoramix Solutions | Systèmes et procédés d'identification et de classification de texte |
US11934465B2 (en) | 2016-11-28 | 2024-03-19 | Thomson Reuters Enterprise Centre Gmbh | System and method for finding similar documents based on semantic factual similarity |
US11205103B2 (en) | 2016-12-09 | 2021-12-21 | The Research Foundation for the State University | Semisupervised autoencoder for sentiment analysis |
US11100100B2 (en) * | 2017-03-20 | 2021-08-24 | International Business Machines Corporation | Numeric data type support for cognitive intelligence queries |
US10268688B2 (en) | 2017-05-03 | 2019-04-23 | International Business Machines Corporation | Corpus-scoped annotation and analysis |
US10360302B2 (en) * | 2017-09-15 | 2019-07-23 | International Business Machines Corporation | Visual comparison of documents using latent semantic differences |
CN107957989B9 (zh) | 2017-10-23 | 2021-01-12 | 创新先进技术有限公司 | 基于集群的词向量处理方法、装置以及设备 |
CN108170663A (zh) | 2017-11-14 | 2018-06-15 | 阿里巴巴集团控股有限公司 | 基于集群的词向量处理方法、装置以及设备 |
US10963627B2 (en) * | 2018-06-11 | 2021-03-30 | Adobe Inc. | Automatically generating digital enterprise content variants |
CN110197197B (zh) * | 2019-04-15 | 2022-08-30 | 贵州电网有限责任公司 | 一种基于文本相似度改进的电网档案相似度计算方法 |
US11003865B1 (en) * | 2020-05-20 | 2021-05-11 | Google Llc | Retrieval-augmented language model pre-training and fine-tuning |
CN112100393B (zh) * | 2020-08-07 | 2022-03-15 | 浙江大学 | 一种低资源场景下的知识三元组抽取方法 |
Citations (4)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
US20020073115A1 (en) * | 2000-02-17 | 2002-06-13 | Davis Russell T. | RDL search engine |
US20030225779A1 (en) * | 2002-05-09 | 2003-12-04 | Yasuhiro Matsuda | Inverted index system and method for numeric attributes |
US20060031183A1 (en) * | 2004-08-04 | 2006-02-09 | Tolga Oral | System and method for enhancing keyword relevance by user's interest on the search result documents |
EP1930816A1 (fr) * | 2006-11-07 | 2008-06-11 | Fast Serach & Transfer ASA | Navigation dépendant du contextes des resultats selon le contexte et l'importance pondérée pour moteurs de recherche |
Family Cites Families (20)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
US5911138A (en) * | 1993-06-04 | 1999-06-08 | International Business Machines Corporation | Database search facility having improved user interface |
US6714933B2 (en) * | 2000-05-09 | 2004-03-30 | Cnet Networks, Inc. | Content aggregation method and apparatus for on-line purchasing system |
US5809502A (en) * | 1996-08-09 | 1998-09-15 | Digital Equipment Corporation | Object-oriented interface for an index |
US5950189A (en) * | 1997-01-02 | 1999-09-07 | At&T Corp | Retrieval system and method |
US6418419B1 (en) * | 1999-07-23 | 2002-07-09 | 5Th Market, Inc. | Automated system for conditional order transactions in securities or other items in commerce |
US7197479B1 (en) * | 1999-09-02 | 2007-03-27 | Cnet Europe Sa | Methods and apparatus for implementing a multi-lingual catalog system |
US6704728B1 (en) * | 2000-05-02 | 2004-03-09 | Iphase.Com, Inc. | Accessing information from a collection of data |
US7246110B1 (en) * | 2000-05-25 | 2007-07-17 | Cnet Networks, Inc. | Product feature and relation comparison system |
US7421418B2 (en) * | 2003-02-19 | 2008-09-02 | Nahava Inc. | Method and apparatus for fundamental operations on token sequences: computing similarity, extracting term values, and searching efficiently |
US7149748B1 (en) * | 2003-05-06 | 2006-12-12 | Sap Ag | Expanded inverted index |
US7693824B1 (en) * | 2003-10-20 | 2010-04-06 | Google Inc. | Number-range search system and method |
US7299224B2 (en) * | 2003-12-19 | 2007-11-20 | International Business Machines Corporation | Method and infrastructure for processing queries in a database |
US7370037B2 (en) | 2003-12-29 | 2008-05-06 | International Business Machines Corporation | Methods for processing a text search query in a collection of documents |
JP2005250980A (ja) * | 2004-03-05 | 2005-09-15 | Oki Electric Ind Co Ltd | 文書検索システム、検索条件入力装置、検索実行装置、文書検索方法、および文書検索プログラム |
US7461064B2 (en) * | 2004-09-24 | 2008-12-02 | International Buiness Machines Corporation | Method for searching documents for ranges of numeric values |
US8312034B2 (en) * | 2005-06-24 | 2012-11-13 | Purediscovery Corporation | Concept bridge and method of operating the same |
US7680789B2 (en) * | 2006-01-18 | 2010-03-16 | Microsoft Corporation | Indexing and searching numeric ranges |
US20070185870A1 (en) * | 2006-01-27 | 2007-08-09 | Hogue Andrew W | Data object visualization using graphs |
US8589869B2 (en) * | 2006-09-07 | 2013-11-19 | Wolfram Alpha Llc | Methods and systems for determining a formula |
GB2457267B (en) * | 2008-02-07 | 2010-04-07 | Yves Dassas | A method and system of indexing numerical data |
-
2009
- 2009-07-01 US US12/496,199 patent/US8756229B2/en active Active
-
2010
- 2010-06-25 WO PCT/US2010/040024 patent/WO2010151788A2/fr active Application Filing
-
2014
- 2014-05-12 US US14/275,840 patent/US9830378B2/en active Active
Patent Citations (4)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
US20020073115A1 (en) * | 2000-02-17 | 2002-06-13 | Davis Russell T. | RDL search engine |
US20030225779A1 (en) * | 2002-05-09 | 2003-12-04 | Yasuhiro Matsuda | Inverted index system and method for numeric attributes |
US20060031183A1 (en) * | 2004-08-04 | 2006-02-09 | Tolga Oral | System and method for enhancing keyword relevance by user's interest on the search result documents |
EP1930816A1 (fr) * | 2006-11-07 | 2008-06-11 | Fast Serach & Transfer ASA | Navigation dépendant du contextes des resultats selon le contexte et l'importance pondérée pour moteurs de recherche |
Also Published As
Publication number | Publication date |
---|---|
WO2010151788A2 (fr) | 2010-12-29 |
US9830378B2 (en) | 2017-11-28 |
US20100332511A1 (en) | 2010-12-30 |
US20140250130A1 (en) | 2014-09-04 |
US8756229B2 (en) | 2014-06-17 |
Similar Documents
Publication | Publication Date | Title |
---|---|---|
WO2010151788A3 (fr) | Système et procédé pour récupérer des informations numériques basées sur des unités | |
Trupthi et al. | Sentiment analysis on twitter using streaming API | |
CN111221983A (zh) | 时序知识图谱生成方法、装置、设备和介质 | |
CN112507068A (zh) | 文档查询方法、装置、电子设备和存储介质 | |
CN102169495A (zh) | 行业词典生成方法及装置 | |
JP2013529805A5 (ja) | 検索方法、検索システム及びコンピュータプログラム | |
US20160085855A1 (en) | Perspective data analysis and management | |
WO2004072757A3 (fr) | Recherches de textes et d'attributs de memoires de donnees contenant des objets commerciaux | |
WO2014005657A4 (fr) | Système et procédé pour la génération automatique d'un contenu riche en informations à partir de microblogues multiples, chaque microblogue contenant seulement des informations éparses | |
US11157540B2 (en) | Search space reduction for knowledge graph querying and interactions | |
CN104281698A (zh) | 一种高效的大数据查询方法 | |
KR101651780B1 (ko) | 빅 데이터 처리 기술을 이용한 연관 단어 추출 방법 및 그 시스템 | |
CN109508441B (zh) | 通过自然语言实现数据统计分析的方法、装置及电子设备 | |
US20140019482A1 (en) | Apparatus and method for searching for personalized content based on user's comment | |
US20170337477A1 (en) | System for determination of automated response follow-up | |
JP5834795B2 (ja) | 情報処理装置及びプログラム | |
Shekhawat | Sentiment classification of current public opinion on BREXIT: Naïve Bayes classifier model vs Python’s TextBlob approach | |
CN103150331A (zh) | 一种提供搜索引擎标签的方法和装置 | |
US20220365956A1 (en) | Method and apparatus for generating patent summary information, and electronic device and medium | |
KR20130022075A (ko) | 감성 어휘 정보 구축 방법 및 장치 | |
CN106653006B (zh) | 基于语音交互的搜索方法和装置 | |
CN103020311A (zh) | 一种用户检索词的处理方法及系统 | |
Walha et al. | A Lexicon approach to multidimensional analysis of tweets opinion | |
Chinthala et al. | Sentiment analysis on twitter streaming data | |
JP6305630B2 (ja) | 文書検索装置、方法及びプログラム |
Legal Events
Date | Code | Title | Description |
---|---|---|---|
NENP | Non-entry into the national phase |
Ref country code: DE |
|
122 | Ep: pct application non-entry in european phase |
Ref document number: 10792737 Country of ref document: EP Kind code of ref document: A2 |