CN104221017B - Finding data in connected corpuses using examples - Google Patents

Finding data in connected corpuses using examples

Info

Publication number
CN104221017B
Authority
CN
China
Prior art keywords
data
values
user
datasets
proposed
Prior art date
Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
Active
Application number
CN201380019331.3A
Other languages
English (en)
Chinese (zh)
Other versions
CN104221017A (zh)
Inventor
J. C. Platt
S. Chaudhuri
L. Novik
H. J. M. Meijer
E. Hudis
K. Mukerjee
C. A. Hays
Current Assignee (The listed assignees may be inaccurate. Google has not performed a legal analysis and makes no representation or warranty as to the accuracy of the list.)
Microsoft Technology Licensing LLC
Original Assignee
Microsoft Technology Licensing LLC
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
Filing date
Publication date
Application filed by Microsoft Technology Licensing LLC
Publication of CN104221017A
Application granted
Publication of CN104221017B
Legal status: Active
Anticipated expiration

Classifications

    • G PHYSICS
    • G06 COMPUTING OR CALCULATING; COUNTING
    • G06F ELECTRIC DIGITAL DATA PROCESSING
    • G06F16/00 Information retrieval; Database structures therefor; File system structures therefor
    • G06F16/60 Information retrieval; Database structures therefor; File system structures therefor of audio data
    • G06F16/63 Querying
    • G06F16/632 Query formulation
    • G06F16/634 Query by example, e.g. query by humming
    • G PHYSICS
    • G06 COMPUTING OR CALCULATING; COUNTING
    • G06F ELECTRIC DIGITAL DATA PROCESSING
    • G06F16/00 Information retrieval; Database structures therefor; File system structures therefor
    • G06F16/20 Information retrieval; Database structures therefor; File system structures therefor of structured data, e.g. relational data
    • G06F16/21 Design, administration or maintenance of databases
    • G06F16/215 Improving data quality; Data cleansing, e.g. de-duplication, removing invalid entries or correcting typographical errors
    • G PHYSICS
    • G06 COMPUTING OR CALCULATING; COUNTING
    • G06F ELECTRIC DIGITAL DATA PROCESSING
    • G06F16/00 Information retrieval; Database structures therefor; File system structures therefor
    • G06F16/20 Information retrieval; Database structures therefor; File system structures therefor of structured data, e.g. relational data
    • G06F16/24 Querying
    • G06F16/242 Query formulation
    • G06F16/2425 Iterative querying; Query formulation based on the results of a preceding query
    • G PHYSICS
    • G06 COMPUTING OR CALCULATING; COUNTING
    • G06F ELECTRIC DIGITAL DATA PROCESSING
    • G06F16/00 Information retrieval; Database structures therefor; File system structures therefor
    • G06F16/20 Information retrieval; Database structures therefor; File system structures therefor of structured data, e.g. relational data
    • G06F16/24 Querying
    • G06F16/245 Query processing
    • G06F16/2457 Query processing with adaptation to user needs
    • G06F16/24578 Query processing with adaptation to user needs using ranking
    • G PHYSICS
    • G06 COMPUTING OR CALCULATING; COUNTING
    • G06F ELECTRIC DIGITAL DATA PROCESSING
    • G06F16/00 Information retrieval; Database structures therefor; File system structures therefor
    • G06F16/20 Information retrieval; Database structures therefor; File system structures therefor of structured data, e.g. relational data
    • G06F16/24 Querying
    • G06F16/245 Query processing
    • G06F16/2458 Special types of queries, e.g. statistical queries, fuzzy queries or distributed queries
    • G06F16/2465 Query processing support for facilitating data mining operations in structured databases
    • G PHYSICS
    • G06 COMPUTING OR CALCULATING; COUNTING
    • G06F ELECTRIC DIGITAL DATA PROCESSING
    • G06F16/00 Information retrieval; Database structures therefor; File system structures therefor
    • G06F16/20 Information retrieval; Database structures therefor; File system structures therefor of structured data, e.g. relational data
    • G06F16/28 Databases characterised by their database models, e.g. relational or object models
    • G06F16/284 Relational databases
    • G PHYSICS
    • G06 COMPUTING OR CALCULATING; COUNTING
    • G06F ELECTRIC DIGITAL DATA PROCESSING
    • G06F16/00 Information retrieval; Database structures therefor; File system structures therefor
    • G06F16/30 Information retrieval; Database structures therefor; File system structures therefor of unstructured textual data
    • G06F16/38 Retrieval characterised by using metadata, e.g. metadata not derived from the content or metadata generated manually
    • G PHYSICS
    • G06 COMPUTING OR CALCULATING; COUNTING
    • G06F ELECTRIC DIGITAL DATA PROCESSING
    • G06F16/00 Information retrieval; Database structures therefor; File system structures therefor
    • G06F16/90 Details of database functions independent of the retrieved data types
    • G06F16/95 Retrieval from the web
    • G06F16/953 Querying, e.g. by the use of web search engines
    • G06F16/9535 Search customisation based on user profiles and personalisation

Landscapes

  • Engineering & Computer Science (AREA)
  • Theoretical Computer Science (AREA)
  • Databases & Information Systems (AREA)
  • Physics & Mathematics (AREA)
  • Data Mining & Analysis (AREA)
  • General Physics & Mathematics (AREA)
  • General Engineering & Computer Science (AREA)
  • Mathematical Physics (AREA)
  • Computational Linguistics (AREA)
  • Software Systems (AREA)
  • Probability & Statistics with Applications (AREA)
  • Fuzzy Systems (AREA)
  • Multimedia (AREA)
  • Library & Information Science (AREA)
  • Quality & Reliability (AREA)
  • Information Retrieval, Db Structures And Fs Structures Therefor (AREA)
CN201380019331.3A 2012-04-10 2013-04-08 Finding data in connected corpuses using examples Active CN104221017B (zh)

Applications Claiming Priority (3)

Application Number Priority Date Filing Date Title
US13/443,681 2012-04-10
US13/443,681 US8983954B2 (en) 2012-04-10 2012-04-10 Finding data in connected corpuses using examples
PCT/US2013/035539 WO2013154951A1 (en) 2012-04-10 2013-04-08 Finding data in connected corpuses using examples

Publications (2)

Publication Number Publication Date
CN104221017A (zh) 2014-12-17
CN104221017B (zh) 2018-05-22

Family

ID=48289606

Family Applications (1)

Application Number Title Priority Date Filing Date
CN201380019331.3A Active CN104221017B (zh) 2012-04-10 2013-04-08 Finding data in connected corpuses using examples

Country Status (5)

Country Link
US (2) US8983954B2
EP (1) EP2836935B1
CN (1) CN104221017B
BR (1) BR112014023495B1
WO (1) WO2013154951A1

Families Citing this family (19)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
US8983954B2 (en) 2012-04-10 2015-03-17 Microsoft Technology Licensing, Llc Finding data in connected corpuses using examples
US11163732B2 (en) * 2015-12-28 2021-11-02 International Business Machines Corporation Linking, deploying, and executing distributed analytics with distributed datasets
US11157520B2 (en) * 2016-03-28 2021-10-26 DataSpark, Pte Ltd. Uniqueness level for anonymized datasets
US10417439B2 (en) 2016-04-08 2019-09-17 Google Llc Post-hoc management of datasets
US11157498B1 (en) 2016-09-26 2021-10-26 Splunk Inc. Query generation using a dataset association record of a metadata catalog
US11093564B1 (en) 2016-09-26 2021-08-17 Splunk Inc. Identifying configuration parameters for a query using a metadata catalog
WO2018236341A1 (en) * 2017-06-19 2018-12-27 Hitachi, Ltd. Method of creating metadata automatically for data lake
US11301495B2 (en) * 2017-11-21 2022-04-12 Cherre, Inc. Entity resolution computing system and methods
US20190220537A1 (en) * 2018-01-15 2019-07-18 Microsoft Technology Licensing, Llc Context-sensitive methods of surfacing comprehensive knowledge in and between applications
US11238049B1 (en) 2018-04-30 2022-02-01 Splunk Inc. Revising catalog metadata based on parsing queries
US11573955B1 (en) 2018-04-30 2023-02-07 Splunk Inc. Data-determinant query terms
US11392578B1 (en) * 2018-04-30 2022-07-19 Splunk Inc. Automatically generating metadata for a metadata catalog based on detected changes to the metadata catalog
GB201817074D0 (en) 2018-10-19 2018-12-05 Palantir Technologies Inc System and method for querying a data repository
US11048675B2 (en) 2019-01-31 2021-06-29 EMC IP Holding Company LLC Structured data enrichment
FR3094508A1 (fr) * 2019-03-29 2020-10-02 Orange System and method for data enrichment
US11715051B1 (en) 2019-04-30 2023-08-01 Splunk Inc. Service provider instance recommendations using machine-learned classifications and reconciliation
EP4095713A1 (en) * 2021-05-27 2022-11-30 Ovh Systems and methods for generating a target dataset having a target data format on a user device
US20240160999A1 (en) * 2022-11-15 2024-05-16 Fujitsu Limited Automated custom feature engineering
US12393604B2 (en) * 2023-11-29 2025-08-19 Sap Se Systems and methods for preventing database deadlocks during synchronization

Citations (7)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
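CN1508728A (zh) * 2002-12-18 2004-06-30 �Ҵ���˾ Method and system for creating multidimensional data sets in a relational database using metadata
CN1758251A (zh) * 2004-10-09 2006-04-12 Microsoft Corporation Interaction of static and dynamic data sets
CN101479697A (zh) * 2006-05-15 2009-07-08 Xsprada Corporation Systems and methods for data storage and retrieval
CN101694952A (zh) * 2009-09-28 2010-04-14 Guodian Nanjing Automation Co., Ltd. Method for generating embedded telecontrol system device definitions from IEC 61850 SCD files
CN102222081A (zh) * 2010-04-13 2011-10-19 Microsoft Corporation Applying a model of a persona to search results
CN102243647A (zh) * 2010-05-11 2011-11-16 Microsoft Corporation Extracting higher-order knowledge from structured data
WO2011112960A3 (en) * 2010-03-12 2011-12-22 Microsoft Corporation Semantics update and adaptive interfaces in connection with information as a service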

Family Cites Families (37)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
US5448727A (en) 1991-04-30 1995-09-05 Hewlett-Packard Company Domain based partitioning and reclustering of relations in object-oriented relational database management systems
US7181438B1 (en) * 1999-07-21 2007-02-20 Alberti Anemometer, Llc Database access system
US6842758B1 (en) * 1999-07-30 2005-01-11 Computer Associates Think, Inc. Modular method and system for performing database queries
US6691140B1 (en) * 1999-07-30 2004-02-10 Computer Associates Think, Inc. Method and system for multidimensional storage model with interdimensional links
US7065451B2 (en) * 2001-05-24 2006-06-20 Board Of Regents, The University Of Texas System Computer-based method for creating collections of sequences from a dataset of sequence identifiers corresponding to natural complex biopolymer sequences and linked to corresponding annotations
US20030126144A1 (en) * 2001-08-10 2003-07-03 O'halloran Sharyn Method and apparatus for access, integration, and analysis of heterogeneous data sources via the manipulation of metadata objects
US6792414B2 (en) 2001-10-19 2004-09-14 Microsoft Corporation Generalized keyword matching for keyword based searching over relational databases
US20030083892A1 (en) * 2001-11-01 2003-05-01 Arun Ramachandran Process for one-stop shopping of all available license deals available using a usage based licensing server data structure
US20040088322A1 (en) * 2002-10-31 2004-05-06 International Business Machines Corporation System and method for determining connections between information aggregates
US20040088315A1 (en) * 2002-10-31 2004-05-06 International Business Machines Corporation System and method for determining membership of information aggregates
US7613700B1 (en) * 2003-09-18 2009-11-03 Matereality, LLC System and method for electronic submission, procurement, and access to highly varied material property data
US7333995B2 (en) * 2004-07-02 2008-02-19 Cognos, Incorporated Very large dataset representation system and method
GB2418499A (en) * 2004-09-24 2006-03-29 Advanced Forensic Solutions Lt Information analysis arrangement
FR2886217A1 (fr) * 2005-05-27 2006-12-01 Valeo Systemes Thermiques Autonomous air-conditioning module, in particular for the thermal treatment of a zone of a vehicle passenger compartment
US7917511B2 (en) * 2006-03-20 2011-03-29 Cannon Structures, Inc. Query system using iterative grouping and narrowing of query results
US7999809B2 (en) * 2006-04-19 2011-08-16 Tableau Software, Inc. Computer systems and methods for automatic generation of models for a dataset
US8856105B2 (en) * 2006-04-28 2014-10-07 Hewlett-Packard Development Company, L.P. Dynamic data navigation
US7912841B2 (en) * 2006-09-13 2011-03-22 I. Know Nv. Data processing based on data linking elements
US7912875B2 (en) * 2006-10-31 2011-03-22 Business Objects Software Ltd. Apparatus and method for filtering data using nested panels
US7685146B2 (en) 2006-11-03 2010-03-23 Business Objects, S.A. Apparatus and method for a collaborative semantic domain and data set based on combining data
US7698285B2 (en) * 2006-11-09 2010-04-13 International Business Machines Corporation Compression of multidimensional datasets
US8484252B2 (en) * 2006-11-30 2013-07-09 International Business Machines Corporation Generation of a multidimensional dataset from an associative database
US20090024590A1 (en) * 2007-03-15 2009-01-22 Sturge Timothy User contributed knowledge database
US8145677B2 (en) * 2007-03-27 2012-03-27 Faleh Jassem Al-Shameri Automated generation of metadata for mining image and text data
US8640056B2 (en) * 2007-07-05 2014-01-28 Oracle International Corporation Data visualization techniques
US8286100B2 (en) * 2007-07-05 2012-10-09 Oracle International Corporation Linking graphical elements of data visualizations
US8209665B2 (en) * 2008-04-08 2012-06-26 Infosys Limited Identification of topics in source code
US9104738B2 (en) * 2008-06-19 2015-08-11 Tropare, Inc. Leveraging collaborative cloud services to build and share apps
US8484215B2 (en) * 2008-10-23 2013-07-09 Ab Initio Technology Llc Fuzzy data operations
GB0906409D0 (en) * 2009-04-15 2009-05-20 Ipv Ltd Metadata browse
US8892540B2 (en) * 2009-04-24 2014-11-18 Rockwell Automation Technologies, Inc. Dynamic sustainability search engine
US8321390B2 (en) * 2009-06-11 2012-11-27 Vivek Swarnakar Methods and apparatus for organizing data in a database
US8543591B2 (en) * 2009-09-02 2013-09-24 Google Inc. Method and system for generating and sharing dataset segmentation schemes
US8862557B2 (en) 2009-12-23 2014-10-14 Adi, Llc System and method for rule-driven constraint-based generation of domain-specific data sets
US8447747B1 (en) * 2010-09-14 2013-05-21 Amazon Technologies, Inc. System for generating behavior-based associations for multiple domain-specific applications
US8943079B2 (en) * 2012-02-01 2015-01-27 Telefonaktiebolaget L M Ericsson (Publ) Apparatus and methods for anonymizing a data set
US8983954B2 (en) 2012-04-10 2015-03-17 Microsoft Technology Licensing, Llc Finding data in connected corpuses using examples

Also Published As

Publication number Publication date
CN104221017A (zh) 2014-12-17
WO2013154951A1 (en) 2013-10-17
US20130268531A1 (en) 2013-10-10
BR112014023495A8 (pt) 2017-12-12
EP2836935B1 (en) 2019-06-05
BR112014023495A2 2017-06-20
US20150193533A1 (en) 2015-07-09
US10140366B2 (en) 2018-11-27
BR112014023495B1 (pt) 2021-11-30
US8983954B2 (en) 2015-03-17
EP2836935A1 (en) 2015-02-18

Similar Documents

Publication Publication Date Title
CN104221017B (zh) Finding data in connected corpuses using examples
EP2482204B1 (en) System and method for information retrieval from object collections with complex interrelationships
US10146862B2 (en) Context-based metadata generation and automatic annotation of electronic media in a computer network
JP6014725B2 (ja) Method and system for search and information provision for natural language queries with simple/compound sentence structure
US10599643B2 (en) Template-driven structured query generation
Hoekstra et al. Data scopes for digital history research
US20210157857A1 (en) Systems, apparatuses, and methods to generate synthetic queries from customer data for training of document querying machine learning models
US11314819B2 (en) Systems, apparatuses, and method for document ingestion
US11347742B2 (en) Querying across a composite join of multiple database tables using a search engine index
CN111061828B (zh) Digital library knowledge retrieval method and device
US20130282693A1 (en) Object oriented data and metadata based search
US9275155B1 (en) Querying across a composite join of multiple database tables using a search engine index
CN106716402A (zh) Entity-centric knowledge discovery
EP3732587B1 (en) Systems and methods for context-independent database search paths
Thung et al. Webapirec: Recommending web apis to software projects via personalized ranking
CN104750776B (zh) Accessing information content in a database platform using metadata
US20210158209A1 (en) Systems, apparatuses, and methods of active learning for document querying machine learning models
CN102693320A (zh) Search method and device
Dixit Elasticsearch essentials
CN104462552A (zh) Method and device for extracting core words from question-and-answer pages
Hampson et al. Supporting personalized information exploration through subjective expert-created semantic attributes
Goyal et al. Concept based query recommendation
Dasgupta et al. Searching the Library Through Commands: Implementing an Open Source Command Line-Based Conversational Library Search System Using Model Context Protocol
Chung A study on varieties of subject access and usabilities of the national library of Korea subject headings
Smith Exploratory and faceted browsing, over heterogeneous and cross-domain data sources

Legal Events

Date Code Title Description
C06 Publication
PB01 Publication
C10 Entry into substantive examination
SE01 Entry into force of request for substantive examination
TA01 Transfer of patent application right

Effective date of registration: 20171016

Address after: Washington State

Applicant after: Microsoft Technology Licensing, LLC

Address before: Washington State

Applicant before: Microsoft Corp.

GR01 Patent grant