WO2010151788A3 - Système et procédé pour récupérer des informations numériques basées sur des unités - Google Patents

Système et procédé pour récupérer des informations numériques basées sur des unités Download PDF

Info

Publication number
WO2010151788A3
WO2010151788A3 PCT/US2010/040024 US2010040024W WO2010151788A3 WO 2010151788 A3 WO2010151788 A3 WO 2010151788A3 US 2010040024 W US2010040024 W US 2010040024W WO 2010151788 A3 WO2010151788 A3 WO 2010151788A3
Authority
WO
WIPO (PCT)
Prior art keywords
numeric
data
numeric data
information retrieval
extracted
Prior art date
Application number
PCT/US2010/040024
Other languages
English (en)
Other versions
WO2010151788A2 (fr
Inventor
John Kenton Stockton
Ari Keith Tuchman
Original Assignee
Entanglement Technologies, Llc
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
Filing date
Publication date
Application filed by Entanglement Technologies, Llc filed Critical Entanglement Technologies, Llc
Publication of WO2010151788A2 publication Critical patent/WO2010151788A2/fr
Publication of WO2010151788A3 publication Critical patent/WO2010151788A3/fr

Links

Classifications

    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06FELECTRIC DIGITAL DATA PROCESSING
    • G06F16/00Information retrieval; Database structures therefor; File system structures therefor
    • G06F16/30Information retrieval; Database structures therefor; File system structures therefor of unstructured textual data
    • G06F16/31Indexing; Data structures therefor; Storage structures
    • G06F16/316Indexing structures
    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06FELECTRIC DIGITAL DATA PROCESSING
    • G06F16/00Information retrieval; Database structures therefor; File system structures therefor
    • G06F16/30Information retrieval; Database structures therefor; File system structures therefor of unstructured textual data
    • G06F16/31Indexing; Data structures therefor; Storage structures
    • G06F16/313Selection or weighting of terms for indexing
    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06FELECTRIC DIGITAL DATA PROCESSING
    • G06F16/00Information retrieval; Database structures therefor; File system structures therefor
    • G06F16/30Information retrieval; Database structures therefor; File system structures therefor of unstructured textual data
    • G06F16/33Querying
    • G06F16/3331Query processing
    • G06F16/334Query execution
    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06FELECTRIC DIGITAL DATA PROCESSING
    • G06F16/00Information retrieval; Database structures therefor; File system structures therefor
    • G06F16/30Information retrieval; Database structures therefor; File system structures therefor of unstructured textual data
    • G06F16/33Querying
    • G06F16/3331Query processing
    • G06F16/334Query execution
    • G06F16/3347Query execution using vector based model
    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06FELECTRIC DIGITAL DATA PROCESSING
    • G06F16/00Information retrieval; Database structures therefor; File system structures therefor
    • G06F16/30Information retrieval; Database structures therefor; File system structures therefor of unstructured textual data
    • G06F16/33Querying
    • G06F16/338Presentation of query results
    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06FELECTRIC DIGITAL DATA PROCESSING
    • G06F16/00Information retrieval; Database structures therefor; File system structures therefor
    • G06F16/90Details of database functions independent of the retrieved data types
    • G06F16/93Document management systems
    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06FELECTRIC DIGITAL DATA PROCESSING
    • G06F2216/00Indexing scheme relating to additional aspects of information retrieval not explicitly covered by G06F16/00 and subgroups
    • G06F2216/11Patent retrieval

Landscapes

  • Engineering & Computer Science (AREA)
  • Theoretical Computer Science (AREA)
  • Databases & Information Systems (AREA)
  • Data Mining & Analysis (AREA)
  • Physics & Mathematics (AREA)
  • General Engineering & Computer Science (AREA)
  • General Physics & Mathematics (AREA)
  • Computational Linguistics (AREA)
  • Software Systems (AREA)
  • Business, Economics & Management (AREA)
  • General Business, Economics & Management (AREA)
  • Information Retrieval, Db Structures And Fs Structures Therefor (AREA)

Abstract

La présente invention concerne un système de récupération et d'analyse d'informations pour des données numériques. Ledit système fournit une précision élevée et un rappel pour la recherche numérique et il utilise une méthodologie pour déterminer une contextualisation des données extraites. Les capacités comprennent l'extraction, l'analyse syntaxique et la contextualisation des données numériques comprenant une valeur numérique et une unité associée. Ce système facilite l'organisation de données numériques en grande partie non structurées en un index inversé et en d'autres formats de base de données. La présente invention concerne également un système de récupération d'informations qui permet l'exploration et la décomposition d'un ensemble de données numériques extraites défini par une entrée de recherche qui peut être précise ou initialement vague. Ce système facilite également l'analyse et la représentation graphique de données numériques, la création de connaissances en combinant des données provenant de plusieurs sources, l'extraction de corrélations entre des variables apparemment disparates et la reconnaissance de tendances pour les données numériques. Ce système utilise un traitement local de langage naturel, une analyse mathématique et une heuristique scientifique à base d'expert pour noter la pertinence numérique et contextuelle des données pour les paramètres d'interrogation.
PCT/US2010/040024 2009-06-26 2010-06-25 Système et procédé pour récupérer des informations numériques basées sur des unités WO2010151788A2 (fr)

Applications Claiming Priority (4)

Application Number Priority Date Filing Date Title
US22061709P 2009-06-26 2009-06-26
US61/220,617 2009-06-26
US12/496,199 2009-07-01
US12/496,199 US8756229B2 (en) 2009-06-26 2009-07-01 System and methods for units-based numeric information retrieval

Publications (2)

Publication Number Publication Date
WO2010151788A2 WO2010151788A2 (fr) 2010-12-29
WO2010151788A3 true WO2010151788A3 (fr) 2011-06-16

Family

ID=43381865

Family Applications (1)

Application Number Title Priority Date Filing Date
PCT/US2010/040024 WO2010151788A2 (fr) 2009-06-26 2010-06-25 Système et procédé pour récupérer des informations numériques basées sur des unités

Country Status (2)

Country Link
US (2) US8756229B2 (fr)
WO (1) WO2010151788A2 (fr)

Families Citing this family (60)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
US8996416B2 (en) * 2002-01-22 2015-03-31 Lavante, Inc. OCR enabled management of accounts payable and/or accounts receivable auditing data
US20110153509A1 (en) 2005-05-27 2011-06-23 Ip Development Venture Method and apparatus for cross-referencing important ip relationships
US7895104B1 (en) * 2007-10-04 2011-02-22 Ip Street Inc. Presentation and analysis of docket information and financial information
US20220327484A1 (en) * 2008-03-21 2022-10-13 Brian Gale System and method for clinical practice and health risk reduction monitoring
US20100131513A1 (en) 2008-10-23 2010-05-27 Lundberg Steven W Patent mapping
US20100250340A1 (en) * 2009-03-24 2010-09-30 Ip Street, Inc. Processing and Presenting Intellectual Property and Other Information
US20100262512A1 (en) * 2009-04-13 2010-10-14 Ip Street, Inc. Processing and Presenting Intellectual Property and Other Information
US8122043B2 (en) * 2009-06-30 2012-02-21 Ebsco Industries, Inc System and method for using an exemplar document to retrieve relevant documents from an inverted index of a large corpus
US20110145283A1 (en) * 2009-12-10 2011-06-16 International Business Machines Corporation Intelligent mechanism for identifying ontological hypertext and pre-fetching and presenting the target information
US20110255788A1 (en) * 2010-01-15 2011-10-20 Copanion, Inc. Systems and methods for automatically extracting data from electronic documents using external data
US20120284305A1 (en) * 2010-01-19 2012-11-08 Nec Corporation Trend information search device, trend information search method and recording medium
US8751520B1 (en) * 2010-06-23 2014-06-10 Google Inc. Query suggestions with high utility
US20120174017A1 (en) * 2010-12-29 2012-07-05 Verisign, Inc. Systems, methods and computer software for innovation management
EP2697710A4 (fr) * 2011-04-15 2014-10-08 Ip Street Inc Évaluation d'une propriété intellectuelle
US10891701B2 (en) 2011-04-15 2021-01-12 Rowan TELS Corp. Method and system for evaluating intellectual property
US9904726B2 (en) 2011-05-04 2018-02-27 Black Hills IP Holdings, LLC. Apparatus and method for automated and assisted patent claim mapping and expense planning
US10001898B1 (en) 2011-07-12 2018-06-19 Domo, Inc. Automated provisioning of relational information for a summary data visualization
US9792017B1 (en) 2011-07-12 2017-10-17 Domo, Inc. Automatic creation of drill paths
US9202297B1 (en) 2011-07-12 2015-12-01 Domo, Inc. Dynamic expansion of data visualizations
RU2459242C1 (ru) * 2011-08-09 2012-08-20 Олег Александрович Серебренников Способ создания и использования рекурсивного индекса поисковых машин
CA2849292A1 (fr) * 2011-09-21 2013-03-28 ValueCorp Pacific, Inc. Systeme et procede d'extraction et de recherche d'ontologie mathematique
US20130085946A1 (en) 2011-10-03 2013-04-04 Steven W. Lundberg Systems, methods and user interfaces in a patent management system
KR20140139521A (ko) * 2012-03-29 2014-12-05 무 시그마 비지니스 솔루션스 피브이티 엘티디 데이터 솔루션 시스템
US9641431B1 (en) * 2012-04-18 2017-05-02 Google Inc. System and methods for utilization-based balancing of traffic to an information retrieval system
US8949263B1 (en) * 2012-05-14 2015-02-03 NetBase Solutions, Inc. Methods and apparatus for sentiment analysis
US8849843B1 (en) * 2012-06-18 2014-09-30 Ez-XBRL Solutions, Inc. System and method for facilitating associating semantic labels with content
US9135327B1 (en) 2012-08-30 2015-09-15 Ez-XBRL Solutions, Inc. System and method to facilitate the association of structured content in a structured document with unstructured content in an unstructured document
US20140129543A1 (en) * 2012-11-02 2014-05-08 Microsoft Corporation Search service including indexing text containing numbers in part using one or more number index structures
US20140280290A1 (en) * 2013-03-14 2014-09-18 Microsoft Corporation Selection and display of alternative suggested sub-strings in a query
US9244952B2 (en) 2013-03-17 2016-01-26 Alation, Inc. Editable and searchable markup pages automatically populated through user query monitoring
US9792330B1 (en) 2013-04-30 2017-10-17 Google Inc. Identifying local experts for local search
US20140324656A1 (en) * 2013-04-30 2014-10-30 Omx Technology Ab Order life-cycle visualization
US20140372216A1 (en) * 2013-06-13 2014-12-18 Microsoft Corporation Contextual mobile application advertisements
US10726018B2 (en) 2014-02-10 2020-07-28 Microsoft Technology Licensing, Llc Semantic matching and annotation of attributes
US9477782B2 (en) 2014-03-21 2016-10-25 Microsoft Corporation User interface mechanisms for query refinement
US9460075B2 (en) * 2014-06-17 2016-10-04 International Business Machines Corporation Solving and answering arithmetic and algebraic problems using natural language processing
US20170177704A1 (en) * 2014-07-29 2017-06-22 Hewlett Packard Enterprise Development Lp Similarity in a structured dataset
US9514185B2 (en) * 2014-08-07 2016-12-06 International Business Machines Corporation Answering time-sensitive questions
US20160063095A1 (en) * 2014-08-27 2016-03-03 International Business Machines Corporation Unstructured data guided query modification
US9613134B2 (en) * 2014-09-07 2017-04-04 Microsoft Technology Licensing, Llc Identifying mathematical operators in natural language text for knowledge-based matching
US9430557B2 (en) 2014-09-17 2016-08-30 International Business Machines Corporation Automatic data interpretation and answering analytical questions with tables and charts
US11275775B2 (en) 2014-10-09 2022-03-15 Splunk Inc. Performing search queries for key performance indicators using an optimized common information model
US10289679B2 (en) 2014-12-10 2019-05-14 International Business Machines Corporation Data relationships in a question-answering environment
US10509800B2 (en) * 2015-01-23 2019-12-17 Hewlett-Packard Development Company, L.P. Visually interactive identification of a cohort of data objects similar to a query based on domain knowledge
US10019442B2 (en) * 2015-05-31 2018-07-10 Thomson Reuters Global Resources Unlimited Company Method and system for peer detection
US10325385B2 (en) 2015-09-24 2019-06-18 International Business Machines Corporation Comparative visualization of numerical information
US9679198B2 (en) * 2015-11-05 2017-06-13 International Business Machines Corporation Ingestion plan based on table uniqueness
US20170220950A1 (en) * 2016-01-29 2017-08-03 International Business Machines Corporation Numerical expression analysis
WO2017214266A1 (fr) * 2016-06-07 2017-12-14 Panoramix Solutions Systèmes et procédés d'identification et de classification de texte
US11934465B2 (en) 2016-11-28 2024-03-19 Thomson Reuters Enterprise Centre Gmbh System and method for finding similar documents based on semantic factual similarity
US11205103B2 (en) 2016-12-09 2021-12-21 The Research Foundation for the State University Semisupervised autoencoder for sentiment analysis
US11100100B2 (en) * 2017-03-20 2021-08-24 International Business Machines Corporation Numeric data type support for cognitive intelligence queries
US10268688B2 (en) 2017-05-03 2019-04-23 International Business Machines Corporation Corpus-scoped annotation and analysis
US10360302B2 (en) * 2017-09-15 2019-07-23 International Business Machines Corporation Visual comparison of documents using latent semantic differences
CN107957989B9 (zh) 2017-10-23 2021-01-12 创新先进技术有限公司 基于集群的词向量处理方法、装置以及设备
CN108170663A (zh) 2017-11-14 2018-06-15 阿里巴巴集团控股有限公司 基于集群的词向量处理方法、装置以及设备
US10963627B2 (en) * 2018-06-11 2021-03-30 Adobe Inc. Automatically generating digital enterprise content variants
CN110197197B (zh) * 2019-04-15 2022-08-30 贵州电网有限责任公司 一种基于文本相似度改进的电网档案相似度计算方法
US11003865B1 (en) * 2020-05-20 2021-05-11 Google Llc Retrieval-augmented language model pre-training and fine-tuning
CN112100393B (zh) * 2020-08-07 2022-03-15 浙江大学 一种低资源场景下的知识三元组抽取方法

Citations (4)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
US20020073115A1 (en) * 2000-02-17 2002-06-13 Davis Russell T. RDL search engine
US20030225779A1 (en) * 2002-05-09 2003-12-04 Yasuhiro Matsuda Inverted index system and method for numeric attributes
US20060031183A1 (en) * 2004-08-04 2006-02-09 Tolga Oral System and method for enhancing keyword relevance by user's interest on the search result documents
EP1930816A1 (fr) * 2006-11-07 2008-06-11 Fast Serach & Transfer ASA Navigation dépendant du contextes des resultats selon le contexte et l'importance pondérée pour moteurs de recherche

Family Cites Families (20)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
US5911138A (en) * 1993-06-04 1999-06-08 International Business Machines Corporation Database search facility having improved user interface
US6714933B2 (en) * 2000-05-09 2004-03-30 Cnet Networks, Inc. Content aggregation method and apparatus for on-line purchasing system
US5809502A (en) * 1996-08-09 1998-09-15 Digital Equipment Corporation Object-oriented interface for an index
US5950189A (en) * 1997-01-02 1999-09-07 At&T Corp Retrieval system and method
US6418419B1 (en) * 1999-07-23 2002-07-09 5Th Market, Inc. Automated system for conditional order transactions in securities or other items in commerce
US7197479B1 (en) * 1999-09-02 2007-03-27 Cnet Europe Sa Methods and apparatus for implementing a multi-lingual catalog system
US6704728B1 (en) * 2000-05-02 2004-03-09 Iphase.Com, Inc. Accessing information from a collection of data
US7246110B1 (en) * 2000-05-25 2007-07-17 Cnet Networks, Inc. Product feature and relation comparison system
US7421418B2 (en) * 2003-02-19 2008-09-02 Nahava Inc. Method and apparatus for fundamental operations on token sequences: computing similarity, extracting term values, and searching efficiently
US7149748B1 (en) * 2003-05-06 2006-12-12 Sap Ag Expanded inverted index
US7693824B1 (en) * 2003-10-20 2010-04-06 Google Inc. Number-range search system and method
US7299224B2 (en) * 2003-12-19 2007-11-20 International Business Machines Corporation Method and infrastructure for processing queries in a database
US7370037B2 (en) 2003-12-29 2008-05-06 International Business Machines Corporation Methods for processing a text search query in a collection of documents
JP2005250980A (ja) * 2004-03-05 2005-09-15 Oki Electric Ind Co Ltd 文書検索システム、検索条件入力装置、検索実行装置、文書検索方法、および文書検索プログラム
US7461064B2 (en) * 2004-09-24 2008-12-02 International Buiness Machines Corporation Method for searching documents for ranges of numeric values
US8312034B2 (en) * 2005-06-24 2012-11-13 Purediscovery Corporation Concept bridge and method of operating the same
US7680789B2 (en) * 2006-01-18 2010-03-16 Microsoft Corporation Indexing and searching numeric ranges
US20070185870A1 (en) * 2006-01-27 2007-08-09 Hogue Andrew W Data object visualization using graphs
US8589869B2 (en) * 2006-09-07 2013-11-19 Wolfram Alpha Llc Methods and systems for determining a formula
GB2457267B (en) * 2008-02-07 2010-04-07 Yves Dassas A method and system of indexing numerical data

Patent Citations (4)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
US20020073115A1 (en) * 2000-02-17 2002-06-13 Davis Russell T. RDL search engine
US20030225779A1 (en) * 2002-05-09 2003-12-04 Yasuhiro Matsuda Inverted index system and method for numeric attributes
US20060031183A1 (en) * 2004-08-04 2006-02-09 Tolga Oral System and method for enhancing keyword relevance by user's interest on the search result documents
EP1930816A1 (fr) * 2006-11-07 2008-06-11 Fast Serach & Transfer ASA Navigation dépendant du contextes des resultats selon le contexte et l'importance pondérée pour moteurs de recherche

Also Published As

Publication number Publication date
WO2010151788A2 (fr) 2010-12-29
US9830378B2 (en) 2017-11-28
US20100332511A1 (en) 2010-12-30
US20140250130A1 (en) 2014-09-04
US8756229B2 (en) 2014-06-17

Similar Documents

Publication Publication Date Title
WO2010151788A3 (fr) Système et procédé pour récupérer des informations numériques basées sur des unités
Trupthi et al. Sentiment analysis on twitter using streaming API
CN111221983A (zh) 时序知识图谱生成方法、装置、设备和介质
CN112507068A (zh) 文档查询方法、装置、电子设备和存储介质
CN102169495A (zh) 行业词典生成方法及装置
JP2013529805A5 (ja) 検索方法、検索システム及びコンピュータプログラム
US20160085855A1 (en) Perspective data analysis and management
WO2004072757A3 (fr) Recherches de textes et d'attributs de memoires de donnees contenant des objets commerciaux
WO2014005657A4 (fr) Système et procédé pour la génération automatique d'un contenu riche en informations à partir de microblogues multiples, chaque microblogue contenant seulement des informations éparses
US11157540B2 (en) Search space reduction for knowledge graph querying and interactions
CN104281698A (zh) 一种高效的大数据查询方法
KR101651780B1 (ko) 빅 데이터 처리 기술을 이용한 연관 단어 추출 방법 및 그 시스템
CN109508441B (zh) 通过自然语言实现数据统计分析的方法、装置及电子设备
US20140019482A1 (en) Apparatus and method for searching for personalized content based on user's comment
US20170337477A1 (en) System for determination of automated response follow-up
JP5834795B2 (ja) 情報処理装置及びプログラム
Shekhawat Sentiment classification of current public opinion on BREXIT: Naïve Bayes classifier model vs Python’s TextBlob approach
CN103150331A (zh) 一种提供搜索引擎标签的方法和装置
US20220365956A1 (en) Method and apparatus for generating patent summary information, and electronic device and medium
KR20130022075A (ko) 감성 어휘 정보 구축 방법 및 장치
CN106653006B (zh) 基于语音交互的搜索方法和装置
CN103020311A (zh) 一种用户检索词的处理方法及系统
Walha et al. A Lexicon approach to multidimensional analysis of tweets opinion
Chinthala et al. Sentiment analysis on twitter streaming data
JP6305630B2 (ja) 文書検索装置、方法及びプログラム

Legal Events

Date Code Title Description
NENP Non-entry into the national phase

Ref country code: DE

122 Ep: pct application non-entry in european phase

Ref document number: 10792737

Country of ref document: EP

Kind code of ref document: A2