EP3117347B1 - Systems and methods for rapid data analysis - Google Patents

Systems and methods for rapid data analysis

Info

Publication number
EP3117347B1
EP3117347B1 EP15762364.6A EP15762364A EP3117347B1 EP 3117347 B1 EP3117347 B1 EP 3117347B1 EP 15762364 A EP15762364 A EP 15762364A EP 3117347 B1 EP3117347 B1 EP 3117347B1
Authority
EP
European Patent Office
Prior art keywords
data
query
shards
collecting
event
Prior art date
Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
Active
Application number
EP15762364.6A
Other languages
German (de)
English (en)
Other versions
EP3117347A4 (fr)
EP3117347A2 (fr)
Inventor
Robert Johnson
Lior Abraham
Ann Johnson
Boris Dimitrov
Don Fossgreen
Current Assignee (The listed assignees may be inaccurate. Google has not performed a legal analysis and makes no representation or warranty as to the accuracy of the list.)
Scuba Analytics Inc
Original Assignee
Interana Inc
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
Filing date
Publication date
Application filed by Interana Inc
Publication of EP3117347A2
Publication of EP3117347A4
Application granted
Publication of EP3117347B1
Legal status: Active
Anticipated expiration

Classifications

    • G PHYSICS
    • G06 COMPUTING; CALCULATING OR COUNTING
    • G06F ELECTRIC DIGITAL DATA PROCESSING
    • G06F16/00 Information retrieval; Database structures therefor; File system structures therefor
    • G06F16/20 Information retrieval; Database structures therefor; File system structures therefor of structured data, e.g. relational data
    • G06F16/24 Querying
    • G06F16/242 Query formulation
    • G06F16/2425 Iterative querying; Query formulation based on the results of a preceding query
    • G PHYSICS
    • G06 COMPUTING; CALCULATING OR COUNTING
    • G06F ELECTRIC DIGITAL DATA PROCESSING
    • G06F16/00 Information retrieval; Database structures therefor; File system structures therefor
    • G06F16/20 Information retrieval; Database structures therefor; File system structures therefor of structured data, e.g. relational data
    • G06F16/24 Querying
    • G06F16/245 Query processing
    • G06F16/2453 Query optimisation
    • G06F16/24534 Query rewriting; Transformation
    • G06F16/24542 Plan optimisation
    • G06F16/24545 Selectivity estimation or determination
    • G PHYSICS
    • G06 COMPUTING; CALCULATING OR COUNTING
    • G06F ELECTRIC DIGITAL DATA PROCESSING
    • G06F16/00 Information retrieval; Database structures therefor; File system structures therefor
    • G06F16/20 Information retrieval; Database structures therefor; File system structures therefor of structured data, e.g. relational data
    • G06F16/24 Querying
    • G06F16/245 Query processing
    • G06F16/2455 Query execution
    • G06F16/24553 Query execution of query operations
    • G06F16/24554 Unary operations; Data partitioning operations
    • G PHYSICS
    • G06 COMPUTING; CALCULATING OR COUNTING
    • G06F ELECTRIC DIGITAL DATA PROCESSING
    • G06F16/00 Information retrieval; Database structures therefor; File system structures therefor
    • G06F16/20 Information retrieval; Database structures therefor; File system structures therefor of structured data, e.g. relational data
    • G06F16/24 Querying
    • G06F16/245 Query processing
    • G06F16/2458 Special types of queries, e.g. statistical queries, fuzzy queries or distributed queries
    • G06F16/2462 Approximate or statistical queries
    • G PHYSICS
    • G06 COMPUTING; CALCULATING OR COUNTING
    • G06F ELECTRIC DIGITAL DATA PROCESSING
    • G06F16/00 Information retrieval; Database structures therefor; File system structures therefor
    • G06F16/20 Information retrieval; Database structures therefor; File system structures therefor of structured data, e.g. relational data
    • G06F16/24 Querying
    • G06F16/245 Query processing
    • G06F16/2458 Special types of queries, e.g. statistical queries, fuzzy queries or distributed queries
    • G06F16/2471 Distributed queries
    • G PHYSICS
    • G06 COMPUTING; CALCULATING OR COUNTING
    • G06F ELECTRIC DIGITAL DATA PROCESSING
    • G06F16/00 Information retrieval; Database structures therefor; File system structures therefor
    • G06F16/20 Information retrieval; Database structures therefor; File system structures therefor of structured data, e.g. relational data
    • G06F16/27 Replication, distribution or synchronisation of data between databases or within a distributed database system; Distributed database system architectures therefor
    • G06F16/278 Data partitioning, e.g. horizontal or vertical partitioning

Claims (20)

  1. A method for rapid data analysis comprising the steps of:
    • receiving and interpreting a query (S210), wherein interpreting the query includes translating strings of the query into integers using a string translation module (130), wherein interpreting the query further includes identifying a first set of data shards containing data relevant to the query;
    • collecting a first data sample from the first set of data shards (S220), wherein collecting the first data sample includes collecting data from each of the first set of data shards, wherein collecting data from each of the first set of data shards includes collecting only a subset of the data contained in each of the first set of data shards;
    • calculating an intermediate result for the query based on analysis of the first data sample (S230);
    • identifying a second set of data shards based on the intermediate result;
    • collecting a second data sample from the second set of data shards (S240), wherein collecting the second data sample includes collecting data from each of the second set of data shards, wherein collecting data from each of the second set of data shards includes collecting a full set of the data contained in each of the second set of data shards; and
    • calculating a final result for the query based on analysis of the second data sample (S250).
  2. The method of claim 1, wherein collecting the first data sample (S220) from the first set of data shards includes collecting data from columnar datasets of the first set of data shards.
  3. The method of claim 2, wherein the first set of data shards includes event data organized chronologically.
  4. The method of claim 1, wherein receiving and interpreting the query (S210) further includes interpreting references to implicit data.
  5. The method of claim 4, wherein receiving and interpreting the query (S210) further includes selecting at least one of an ordering function and a grouping function.
  6. The method of claim 1, wherein identifying a first set of data shards includes identifying node locations of the first set of data shards using a configuration database.
  7. The method of claim 1, wherein translating strings of the query into integers using a string translation module (130) includes translating strings using string identifiers stored in a format that enables prefix matching.
  8. The method of claim 1, wherein the query includes at least one time range and at least one event data source.
  9. The method of claim 8, wherein calculating the final result for the query (S250) further includes calculating confidence bands for the estimated accuracy of the result based on analysis of a statistical distribution of the sampled data.
  10. The method of claim 9, wherein calculating the final result for the query (S250) further includes returning both a cohort and aggregate data associated with the cohort as a query result.
  11. The method of claim 9, wherein calculating the intermediate result for the query (S230) further includes calculating confidence bands for the estimated accuracy of the result based on analysis of a statistical distribution of the sampled data.
  12. The method of claim 1,
    wherein identifying the second set of data shards based on the intermediate result and collecting the second data sample from the second set of data shards (S240) comprise performing a number of intermediate searches, each intermediate search comprising the steps of:
    ∘ identifying an additional set of data shards based on at least one of the intermediate result and additional intermediate results,
    ∘ collecting additional data samples from the additional set of data shards, and
    ∘ calculating additional intermediate results based on analysis of the additional data samples.
  13. The method of claim 12, wherein the number of intermediate searches is a fixed number.
  14. The method of claim 12, further comprising calculating confidence bands for each additional intermediate result based on analysis of a statistical distribution of sampled data.
  15. The method of claim 14, wherein performing a number of intermediate searches includes performing intermediate searches until a confidence band of an additional intermediate result passes a confidence threshold.
  16. The method of claim 15, wherein the confidence threshold is set automatically in response to a speed/accuracy variable.
  17. The method of claim 15, wherein receiving and interpreting the query further includes parsing SQL-like query strings into a query tree.
  18. The method of claim 16, wherein the speed/accuracy variable is passed as part of the query.
  19. The method of claim 18, wherein the query includes at least one time range and at least one event data source.
  20. The method of claim 14, further comprising notifying a user that the confidence bands are below a confidence threshold.
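
The two-pass flow of claim 1 can be illustrated with a short Python sketch. It is not the patented implementation: the `STRING_IDS` table, the `build_shards` demo data, the placement of each user in shard `user % num_shards`, the every-nth-row sampling, and the top-k purchase query are all assumptions introduced here for illustration.

```python
from collections import Counter

# Hypothetical string-to-integer table standing in for the string translation module (130).
STRING_IDS = {"view": 0, "purchase": 1}


def build_shards(num_shards=4):
    """Build demo shards of (user_id, event_id) rows; user u lives in shard u % num_shards."""
    shards = [[] for _ in range(num_shards)]
    purchases = {5: 40, 2: 12}  # made-up purchase counts; all other users only view
    for user in range(8):
        rows = [(user, STRING_IDS["purchase"])] * purchases.get(user, 0)
        rows += [(user, STRING_IDS["view"])] * 20
        shards[user % num_shards].extend(rows)
    return shards


def count_by_user(rows, event_id):
    """Count rows matching event_id per user."""
    return Counter(user for user, event in rows if event == event_id)


def run_query(query, shards, sample_step=4):
    # S210: translate the query string to an integer and pick the first shard set.
    # Assumption: any shard may hold relevant data, so the first set is every shard.
    event_id = STRING_IDS[query["event"]]
    first_set = range(len(shards))

    # S220/S230: read only every `sample_step`-th row of each shard, then aggregate.
    sample = [row for s in first_set for row in shards[s][::sample_step]]
    intermediate = count_by_user(sample, event_id)

    # Use the intermediate result to choose candidate users (over-selecting by 2x
    # to tolerate sampling error) and the shards that hold them.
    candidates = {user for user, _ in intermediate.most_common(query["top_k"] * 2)}
    second_set = sorted({user % len(shards) for user in candidates})

    # S240/S250: read the full contents of the selected shards only.
    full = [row for s in second_set for row in shards[s]]
    exact = count_by_user(full, event_id)
    final = [(u, c) for u, c in exact.most_common() if u in candidates]
    return intermediate, second_set, final[: query["top_k"]]


shards = build_shards()
intermediate, second_set, final = run_query({"event": "purchase", "top_k": 1}, shards)
print(second_set, final)  # [1, 2] [(5, 40)]
```

In this demo only two of the four shards are read in full. A variant following claims 12 to 15 might repeat the sampled pass until a confidence band meets a threshold rather than stopping after a single intermediate result.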
EP15762364.6A 2014-03-10 2015-03-10 Systems and methods for rapid data analysis Active EP3117347B1 (fr)

Applications Claiming Priority (2)

Application Number Priority Date Filing Date Title
US201461950827P 2014-03-10 2014-03-10
PCT/US2015/019765 WO2015138497A2 (fr) 2014-03-10 2015-03-10 Systems and methods for rapid data analysis

Publications (3)

Publication Number Publication Date
EP3117347A2 EP3117347A2 (fr) 2017-01-18
EP3117347A4 EP3117347A4 (fr) 2017-11-29
EP3117347B1 EP3117347B1 (fr) 2020-09-23

Family

ID=54017560

Family Applications (1)

Application Number Title Priority Date Filing Date
EP15762364.6A Active EP3117347B1 (fr) 2014-03-10 2015-03-10 Systems and methods for rapid data analysis

Country Status (5)

Country Link
US (5) US9323809B2 (fr)
EP (1) EP3117347B1 (fr)
JP (1) JP6307169B2 (fr)
CN (1) CN106462583B (fr)
WO (1) WO2015138497A2 (fr)

Families Citing this family (25)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
US9607045B2 (en) 2012-07-12 2017-03-28 Microsoft Technology Licensing, Llc Progressive query computation using streaming architectures
US8631325B1 (en) 2013-08-09 2014-01-14 Zoomdata, Inc. Real-time data visualization of streaming data
US9659079B2 (en) * 2014-05-30 2017-05-23 Wal-Mart Stores, Inc. Shard determination logic for scalable order and inventory management architecture with a sharded transactional database
US9251276B1 (en) 2015-02-27 2016-02-02 Zoomdata, Inc. Prioritization of retrieval and/or processing of data
US10331752B2 (en) * 2015-07-21 2019-06-25 Oath Inc. Methods and systems for determining query date ranges
US10339116B2 (en) * 2015-10-07 2019-07-02 Oracle International Corporation Composite sharding
US11442909B2 (en) * 2015-12-01 2022-09-13 Motorola Solutions, Inc. Data analytics system
US10740328B2 (en) * 2016-06-24 2020-08-11 Microsoft Technology Licensing, Llc Aggregate-query database system and processing
CN107798021B (zh) * 2016-09-07 2021-04-30 北京京东尚科信息技术有限公司 Data association processing method, system and electronic device
US11055273B1 (en) * 2016-11-04 2021-07-06 Amazon Technologies, Inc. Software container event monitoring systems
US9942312B1 (en) 2016-12-16 2018-04-10 Zoomdata, Inc. System and method for facilitating load reduction at a landing zone
US10552435B2 (en) 2017-03-08 2020-02-04 Microsoft Technology Licensing, Llc Fast approximate results and slow precise results
US10776361B2 (en) * 2017-04-07 2020-09-15 Salesforce.Com, Inc. Time series database search system
CN107665228B (zh) * 2017-05-10 2019-12-20 平安科技(深圳)有限公司 Associated information query method, terminal and device
JP6528807B2 (ja) * 2017-06-28 2019-06-12 オムロン株式会社 Control system, control device, coupling method and program
US20190163790A1 (en) * 2017-11-29 2019-05-30 Intuit Inc. System and method for generating aggregated statistics over sets of user data while enforcing data governance policy
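CN108280153B (zh) * 2018-01-08 2022-01-07 天津科技大学 Intelligent aggregation method for fragmented knowledge
CN111143393A (zh) * 2018-11-03 2020-05-12 广州市明领信息科技有限公司 Big data processing system
CN109471862B (zh) * 2018-11-12 2021-06-11 北京懿医云科技有限公司 Data processing method and apparatus, electronic device, and storage medium
CN109491989B (zh) * 2018-11-12 2021-08-31 北京懿医云科技有限公司 Data processing method and apparatus, electronic device, and storage medium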
US11354168B2 (en) 2019-01-18 2022-06-07 Salesforce.Com, Inc. Elastic data partitioning of a database
US11194773B2 (en) 2019-09-12 2021-12-07 Oracle International Corporation Integration of existing databases into a sharding environment
US11609886B2 (en) 2020-01-31 2023-03-21 Salesforce.Com, Inc. Mechanism for stream processing efficiency using probabilistic model to reduce data redundancy
US11537572B2 (en) 2020-01-31 2022-12-27 Salesforce.Com, Inc. Multidimensional partition of data to calculate aggregation at scale
US11886435B1 (en) * 2023-01-06 2024-01-30 Snark AI, Inc. Systems and methods for executing queries on tensor datasets

Family Cites Families (125)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
US5335345A (en) 1990-04-11 1994-08-02 Bell Communications Research, Inc. Dynamic query optimization using partial information
JP3865775B2 (ja) 1995-04-11 2007-01-10 キネテック インコーポレイテッド Identification of data in a data processing system
US5668987A (en) * 1995-08-31 1997-09-16 Sybase, Inc. Database system with subquery optimizer
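JP3144470B2 (ja) 1996-07-08 2001-03-12 矢崎総業株式会社 Impact detection device, shutoff valve control device, impact detection method, and shutoff valve control method
JPH10269248A (ja) * 1997-01-24 1998-10-09 Hitachi Ltd Method for random sampling of data in a database processing system, and database processing system based thereon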
US5893090A (en) * 1997-01-31 1999-04-06 Informix Software, Inc. Method and apparatus for performing an aggregate query in a database system
US6804663B1 (en) * 1998-09-21 2004-10-12 Microsoft Corporation Methods for optimizing the installation of a software product onto a target computer system
US20030195872A1 (en) * 1999-04-12 2003-10-16 Paul Senn Web-based information content analyzer and information dimension dictionary
US6681331B1 (en) 1999-05-11 2004-01-20 Cylant, Inc. Dynamic software system intrusion detection
US6766320B1 (en) * 2000-08-24 2004-07-20 Microsoft Corporation Search engine with natural language-based robust parsing for user query and relevance feedback learning
US7120624B2 (en) 2001-05-21 2006-10-10 Microsoft Corporation Optimization based method for estimating the results of aggregate queries
US6775681B1 (en) * 2002-02-26 2004-08-10 Oracle International Corporation Evaluation of grouping sets by reduction to group-by clause, with or without a rollup operator, using temporary tables
US7010521B2 (en) * 2002-05-13 2006-03-07 Netezza Corporation Optimized database appliance
US6920460B1 (en) * 2002-05-29 2005-07-19 Oracle International Corporation Systems and methods for managing partitioned indexes that are created and maintained by user-defined indexing schemes
US7213012B2 (en) * 2003-05-09 2007-05-01 Oracle International Corporation Optimizer dynamic sampling
US7043621B2 (en) 2003-05-16 2006-05-09 Hewlett-Packard Development Company, L.P. Sharding method and apparatus using directed graphs
EP1673744B1 (fr) 2003-09-05 2010-01-27 Sensitech Inc. Automatic conditioning of data accumulated by sensors monitoring supply chain processes
EP1676217B1 (fr) * 2003-09-15 2011-07-06 Ab Initio Technology LLC Data interconnection
US7587394B2 (en) * 2003-09-23 2009-09-08 International Business Machines Corporation Methods and apparatus for query rewrite with auxiliary attributes in query processing operations
US20050223019A1 (en) * 2004-03-31 2005-10-06 Microsoft Corporation Block-level sampling in statistics estimation
JP4484643B2 (ja) 2004-09-10 2010-06-16 独立行政法人科学技術振興機構 Program for determining anomalies in time-series data and method for discriminating anomalies in time-series data
US7932906B2 (en) 2004-09-15 2011-04-26 Hewlett-Packard Development Company, L.P. Constructing substantially equal-width pixel bar charts to enable visual data analysis
US8108579B2 (en) 2005-03-31 2012-01-31 Qualcomm Incorporated Mechanism and method for managing data storage
US7930752B2 (en) 2005-11-18 2011-04-19 Nexthink S.A. Method for the detection and visualization of anomalous behaviors in a computer network
US20070150949A1 (en) 2005-12-28 2007-06-28 At&T Corp. Anomaly detection methods for a computer network
US7650367B2 (en) 2006-01-13 2010-01-19 Tekelec Methods, systems, and computer program products for detecting and restoring missing or corrupted data in a distributed, scalable, redundant measurement platform database
US7593939B2 (en) 2006-04-07 2009-09-22 Google Inc. Generating specialized search results in response to patterned queries
US8239383B2 (en) * 2006-06-15 2012-08-07 International Business Machines Corporation System and method for managing execution of queries against database samples
US7930595B2 (en) 2006-06-22 2011-04-19 International Business Machines Corporation Method and apparatus for analyzing error conditions in a massively parallel computer system by identifying anomalous nodes within a communicator set
US20080091730A1 (en) 2006-09-29 2008-04-17 Searete Llc, A Limited Liability Corporation Of The State Of Delaware Computational systems for biomedical data
US9465823B2 (en) 2006-10-19 2016-10-11 Oracle International Corporation System and method for data de-duplication
US20100138919A1 (en) 2006-11-03 2010-06-03 Tao Peng System and process for detecting anomalous network traffic
US20080109423A1 (en) * 2006-11-08 2008-05-08 Eric Lawrence Barsness Apparatus and method for database partition elimination for sampling queries
WO2008121945A2 (fr) 2007-03-30 2008-10-09 Netqos, Inc. Method and system for statistical detection of a network anomaly
US9128877B1 (en) 2007-04-19 2015-09-08 Robert E. Cousins Systems, methods and computer program products including features of transforming data involving a secure format from which the data is recoverable
US20090070752A1 (en) 2007-09-06 2009-03-12 International Business Machines Corporation Method and system for optimization of an application
US20080215576A1 (en) 2008-03-05 2008-09-04 Quantum Intelligence, Inc. Fusion and visualization for multiple anomaly detection systems
US8000993B2 (en) 2008-04-14 2011-08-16 Tra, Inc. Using consumer purchase behavior for television targeting
EP2289030A4 (fr) 2008-04-29 2011-08-10 Sugarcrm Inc Business software application system and method
US8484162B2 (en) 2008-06-24 2013-07-09 Commvault Systems, Inc. De-duplication systems and methods for application-specific data
US8140522B2 (en) * 2008-08-12 2012-03-20 International Business Machines Corporation Method, apparatus, and computer program product for adaptive query parallelism partitioning with look-ahead probing and feedback
US8903973B1 (en) 2008-11-10 2014-12-02 Tanium Inc. Parallel distributed network management
US8380738B2 (en) 2009-03-17 2013-02-19 Nec Laboratories America, Inc. System and methods for database distribution and querying over key-based scalable storage
US8510538B1 (en) * 2009-04-13 2013-08-13 Google Inc. System and method for limiting the impact of stragglers in large-scale parallel data processing
US8108343B2 (en) 2009-04-23 2012-01-31 Microsoft Corporation De-duplication and completeness in multi-log based replication
US8918365B2 (en) 2009-06-19 2014-12-23 Blekko, Inc. Dedicating disks to reading or writing
EP2443574A4 (fr) 2009-06-19 2014-05-07 Blekko Inc Scalable clustered database
US9866426B2 (en) 2009-11-17 2018-01-09 Hawk Network Defense, Inc. Methods and apparatus for analyzing system events
US9323758B1 (en) 2009-12-22 2016-04-26 Emc Corporation Efficient migration of replicated files from a file server having a file de-duplication facility
US8527496B2 (en) 2010-02-11 2013-09-03 Facebook, Inc. Real time content searching in social network
US8458131B2 (en) 2010-02-26 2013-06-04 Microsoft Corporation Opportunistic asynchronous de-duplication in block level backups
US20120016633A1 (en) 2010-07-16 2012-01-19 Andreas Wittenstein System and method for automatic detection of anomalous recurrent behavior
US9246932B2 (en) 2010-07-19 2016-01-26 Sitelock, Llc Selective website vulnerability and infection testing
JP5598279B2 (ja) * 2010-11-16 2014-10-01 日本電気株式会社 Distributed memory database system, front database server, data processing method and program
US9424351B2 (en) 2010-11-22 2016-08-23 Microsoft Technology Licensing, Llc Hybrid-distribution model for search engine indexes
US8468132B1 (en) 2010-12-28 2013-06-18 Amazon Technologies, Inc. Data replication framework
US10198492B1 (en) 2010-12-28 2019-02-05 Amazon Technologies, Inc. Data replication framework
US9251097B1 (en) 2011-03-22 2016-02-02 Amazon Technologies, Inc. Redundant key management
US8983960B1 (en) 2011-03-28 2015-03-17 Google Inc. Opportunistic job processing
US9054864B2 (en) 2011-03-29 2015-06-09 Kaseya Limited Method and apparatus of securely processing data for file backup, de-duplication, and restoration
US8856169B2 (en) 2011-07-13 2014-10-07 Case Western Reserve University Multi-modality, multi-resource, information integration environment
US20140245337A1 (en) * 2011-07-27 2014-08-28 Google Inc. Proxy Analytics
US9392310B2 (en) * 2011-08-31 2016-07-12 Google Inc. Method and system for collecting and managing TV viewership data
US9305043B2 (en) 2011-09-12 2016-04-05 Hitachi, Ltd. Stream data anomaly detection method and device
US9129039B2 (en) 2011-10-18 2015-09-08 Ut-Battelle, Llc Scenario driven data modelling: a method for integrating diverse sources of data and data streams
US8356050B1 (en) 2011-11-21 2013-01-15 Yahoo! Inc. Method or system for spilling in query environments
US8756209B2 (en) * 2012-01-04 2014-06-17 International Business Machines Corporation Computing resource allocation based on query response analysis in a networked computing environment
US9239851B1 (en) * 2012-07-12 2016-01-19 Cross Commerce Media, Inc. Advanced database systems and methods
JP2013196565A (ja) * 2012-03-22 2013-09-30 Toshiba Corp Database processing method and database processing apparatus
US20130253888A1 (en) 2012-03-22 2013-09-26 Microsoft Corporation One-pass statistical computations
US10489412B2 (en) 2012-03-29 2019-11-26 Hitachi Vantara Corporation Highly available search index with storage node addition and removal
GB2514947B (en) 2012-05-04 2015-06-17 Box Inc Repository redundancy implementation of a system which incrementally updates clients with events that occured via a cloud-enabled platform
US8825752B1 (en) 2012-05-18 2014-09-02 Netapp, Inc. Systems and methods for providing intelligent automated support capable of self rejuvenation with respect to storage systems
US9015812B2 (en) 2012-05-22 2015-04-21 Hasso-Plattner-Institut Fur Softwaresystemtechnik Gmbh Transparent control of access invoking real-time analysis of the query history
US9218573B1 (en) * 2012-05-22 2015-12-22 Google Inc. Training a model using parameter server shards
WO2014008415A1 (fr) 2012-07-05 2014-01-09 Littelfuse, Inc. Voltage-limiting device for transient overvoltage circuit protection
US9792320B2 (en) * 2012-07-06 2017-10-17 Box, Inc. System and method for performing shard migration to support functions of a cloud-based service
US9904788B2 (en) 2012-08-08 2018-02-27 Amazon Technologies, Inc. Redundant key management
US20140108421A1 (en) * 2012-10-04 2014-04-17 Codefutures Corporation Partitioning database data in a sharded database
US9104786B2 (en) * 2012-10-12 2015-08-11 International Business Machines Corporation Iterative refinement of cohorts using visual exploration and data analytics
US9146830B2 (en) 2012-10-26 2015-09-29 Jsmapreduce Corporation Hybrid local/remote infrastructure for data processing with lightweight setup, powerful debuggability, controllability, integration, and productivity features
US9633022B2 (en) 2012-12-28 2017-04-25 Commvault Systems, Inc. Backup and restoration for a deduplicated file system
US9633033B2 (en) 2013-01-11 2017-04-25 Commvault Systems, Inc. High availability distributed deduplicated storage system
US10148548B1 (en) 2013-01-29 2018-12-04 Axway, Inc. System and method for real-time analysis of incoming data
US20140214886A1 (en) 2013-01-29 2014-07-31 ParElastic Corporation Adaptive multi-client saas database
US9424330B2 (en) 2013-03-15 2016-08-23 Tactile, Inc. Database sharding by shard levels
US8935272B2 (en) * 2013-03-17 2015-01-13 Alation, Inc. Curated answers community automatically populated through user query monitoring
WO2014174599A1 (fr) 2013-04-24 2014-10-30 株式会社日立製作所 Computing device, storage medium and data search method
US20140337491A1 (en) 2013-05-08 2014-11-13 Box, Inc. Repository redundancy implementation of a system which incrementally updates clients with events that occurred via a cloud-enabled platform
US9195826B1 (en) 2013-05-30 2015-11-24 Emc Corporation Graph-based method to detect malware command-and-control infrastructure
US9053167B1 (en) 2013-06-19 2015-06-09 Amazon Technologies, Inc. Storage device selection for database partition replicas
US9774401B1 (en) 2013-07-15 2017-09-26 Paul Borrill Entangled links, transactions and trees for distributed computing systems
US10394848B2 (en) 2013-07-29 2019-08-27 Amazon Technologies, Inc. Generating a multi-column index for relational databases by interleaving data bits for selectivity
US20150039749A1 (en) 2013-08-01 2015-02-05 Alcatel-Lucent Canada Inc. Detecting traffic anomalies based on application-aware rolling baseline aggregates
US20150067860A1 (en) 2013-08-29 2015-03-05 Barracuda Networks, Inc. Virus Detector Controlled Backup Apparatus and File Restoration
US9280591B1 (en) 2013-09-20 2016-03-08 Amazon Technologies, Inc. Efficient replication of system transactions for read-only nodes of a distributed database
US10171501B2 (en) 2013-09-20 2019-01-01 Open Text Sa Ulc System and method for remote wipe
US10311154B2 (en) 2013-09-21 2019-06-04 Oracle International Corporation Combined row and columnar storage for in-memory databases for OLTP and analytics workloads
US9483509B2 (en) 2013-10-02 2016-11-01 Google Inc. Dynamic shuffle reconfiguration
US9503465B2 (en) 2013-11-14 2016-11-22 At&T Intellectual Property I, L.P. Methods and apparatus to identify malicious activity in a network
US10331695B1 (en) 2013-11-25 2019-06-25 Amazon Technologies, Inc. Replication coordination service for data transfers between distributed databases
US9411862B1 (en) 2013-12-03 2016-08-09 Google Inc. Systems and methods for dynamic sharding of hierarchical data
US10235377B2 (en) 2013-12-23 2019-03-19 Sap Se Adaptive dictionary compression/decompression for column-store databases
US9697274B2 (en) 2013-12-27 2017-07-04 International Business Machines Corporation Stratified sampling using adaptive parallel data processing
US9633056B2 (en) 2014-03-17 2017-04-25 Commvault Systems, Inc. Maintaining a deduplication database
US10380072B2 (en) 2014-03-17 2019-08-13 Commvault Systems, Inc. Managing deletions from a deduplication database
US9667738B2 (en) 2014-03-24 2017-05-30 Tanium Inc. Local data caching for data transfers on a network of computational devices
US10025710B2 (en) 2014-04-30 2018-07-17 Walmart Apollo, Llc Pattern for integrating primary and secondary data stores in a sharded data domain
US9613122B2 (en) 2014-05-02 2017-04-04 Facebook, Inc. Providing eventual consistency for multi-shard transactions
US9672241B2 (en) 2014-05-28 2017-06-06 International Business Machines Corporation Representing an outlier value in a non-nullable column as null in metadata
US10230747B2 (en) 2014-07-15 2019-03-12 Cisco Technology, Inc. Explaining network anomalies using decision trees
US9973520B2 (en) 2014-07-15 2018-05-15 Cisco Technology, Inc. Explaining causes of network anomalies
US9753955B2 (en) 2014-09-16 2017-09-05 Commvault Systems, Inc. Fast deduplication data verification
EP2998881B1 (fr) 2014-09-18 2018-07-25 Amplidata NV Computer-implemented method for dynamic sharding
US9575673B2 (en) 2014-10-29 2017-02-21 Commvault Systems, Inc. Accessing a file system using tiered deduplication
US9288617B1 (en) 2014-10-30 2016-03-15 Deep Rock Ventures, Inc. Mobile media communications system
US9830342B2 (en) 2014-11-20 2017-11-28 International Business Machines Corporation Optimizing database deduplication
US20160171071A1 (en) 2014-12-11 2016-06-16 International Business Machines Corporation Dynamic creation and configuration of partitioned index through analytics based on existing data population
US10185730B2 (en) 2014-12-31 2019-01-22 Nexenta Systems, Inc. Methods and systems for key-value-tuple-encoded storage
EP3657905B1 (fr) 2015-01-06 2022-09-21 Battelle Memorial Institute Uniform heat distribution in resistive heaters for de-icing and anti-icing
US10353900B2 (en) 2015-07-24 2019-07-16 Oracle International Corporation System and method for creating an intelligent synopsis of a database using re-partitioning based sampling
US10339116B2 (en) 2015-10-07 2019-07-02 Oracle International Corporation Composite sharding
US11210279B2 (en) 2016-04-15 2021-12-28 Apple Inc. Distributed offline indexing
US11030169B1 (en) 2017-03-07 2021-06-08 Amazon Technologies, Inc. Data re-sharding
US11550505B1 (en) 2020-09-01 2023-01-10 Amazon Technologies, Inc. Intra-shard parallelization of data stream processing using virtual shards

Non-Patent Citations (1)

* Cited by examiner, † Cited by third party
Title
None *

Also Published As

Publication number Publication date
US20150254307A1 (en) 2015-09-10
WO2015138497A3 (fr) 2015-12-03
CN106462583A (zh) 2017-02-22
JP2017512344A (ja) 2017-05-18
US20160203179A1 (en) 2016-07-14
EP3117347A4 (fr) 2017-11-29
CN106462583B (zh) 2020-03-24
US9734202B2 (en) 2017-08-15
EP3117347A2 (fr) 2017-01-18
JP6307169B2 (ja) 2018-04-04
US20220284017A1 (en) 2022-09-08
WO2015138497A2 (fr) 2015-09-17
US9323809B2 (en) 2016-04-26
US10713240B2 (en) 2020-07-14
US11977541B2 (en) 2024-05-07
US20200311064A1 (en) 2020-10-01
US11372851B2 (en) 2022-06-28
US20170308570A1 (en) 2017-10-26

Similar Documents

Publication Publication Date Title
US11977541B2 (en) Systems and methods for rapid data analysis
US11263215B2 (en) Methods for enhancing rapid data analysis
US20220300496A1 (en) Automatic partitioning
RU2691595C2 (ru) Constructed data stream for improved event processing
US10042887B2 (en) Query optimization with zone map selectivity modeling
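KR101696338B1 (ko) System and method for efficiently processing and analyzing big data using a column-index data format
CN103186539A (zh) Method and system for determining user groups, querying information, and making recommendations
CN106933893B (zh) Query method and device for multi-dimensional data
CN106933906B (zh) Method and device for multi-dimensional querying of data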
US10176231B2 (en) Estimating most frequent values for a data set
US11928113B2 (en) Structure and method of aggregation index for improving aggregation query efficiency
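CN106933902B (zh) Query method and device for free multi-dimensional analysis of data
CN110737673B (zh) Data processing method and system
CN110874366A (zh) Data processing and query methods and devices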
US20190266163A1 (en) System and method for behavior-on-read query processing
CN106933909B (zh) Query method and device for multi-dimensional data
CN114943004B (zh) Property graph query method, property graph query device, and storage medium
US20140330768A1 (en) Incrementally updated sample tables

Legal Events

Date Code Title Description
STAA Information on the status of an ep patent application or granted ep patent

Free format text: STATUS: THE INTERNATIONAL PUBLICATION HAS BEEN MADE

PUAI Public reference made under article 153(3) epc to a published international application that has entered the european phase

Free format text: ORIGINAL CODE: 0009012

STAA Information on the status of an ep patent application or granted ep patent

Free format text: STATUS: REQUEST FOR EXAMINATION WAS MADE

17P Request for examination filed

Effective date: 20160816

AK Designated contracting states

Kind code of ref document: A2

Designated state(s): AL AT BE BG CH CY CZ DE DK EE ES FI FR GB GR HR HU IE IS IT LI LT LU LV MC MK MT NL NO PL PT RO RS SE SI SK SM TR

AX Request for extension of the european patent

Extension state: BA ME

RAP1 Party data changed (applicant data changed or rights of an application transferred)

Owner name: INTERANA, INC.

RIN1 Information on inventor provided before grant (corrected)

Inventor name: DIMITROV, BORIS

Inventor name: ABRAHAM, LIOR

Inventor name: JOHNSON, ROBERT

Inventor name: FOSSGREEN, DON

Inventor name: JOHNSON, ANN

DAV Request for validation of the european patent (deleted)
DAX Request for extension of the european patent (deleted)
A4 Supplementary search report drawn up and despatched

Effective date: 20171026

RIC1 Information provided on ipc code assigned before grant

Ipc: G06F 17/30 20060101AFI20171020BHEP

Ipc: G06F 17/28 20060101ALI20171020BHEP

RAP1 Party data changed (applicant data changed or rights of an application transferred)

Owner name: INTERANA, INC.

REG Reference to a national code

Ref country code: DE

Ref legal event code: R079

Ref document number: 602015059532

Country of ref document: DE

Free format text: PREVIOUS MAIN CLASS: G06F0017300000

Ipc: G06F0016245800

GRAP Despatch of communication of intention to grant a patent

Free format text: ORIGINAL CODE: EPIDOSNIGR1

STAA Information on the status of an ep patent application or granted ep patent

Free format text: STATUS: GRANT OF PATENT IS INTENDED

RIC1 Information provided on ipc code assigned before grant

Ipc: G06F 16/242 20190101ALI20200423BHEP

Ipc: G06F 16/2458 20190101AFI20200423BHEP

INTG Intention to grant announced

Effective date: 20200507

RIN1 Information on inventor provided before grant (corrected)

Inventor name: JOHNSON, ROBERT

Inventor name: JOHNSON, ANN

Inventor name: DIMITROV, BORIS

Inventor name: FOSSGREEN, DON

Inventor name: ABRAHAM, LIOR

GRAS Grant fee paid

Free format text: ORIGINAL CODE: EPIDOSNIGR3

GRAA (expected) grant

Free format text: ORIGINAL CODE: 0009210

STAA Information on the status of an ep patent application or granted ep patent

Free format text: STATUS: THE PATENT HAS BEEN GRANTED

AK Designated contracting states

Kind code of ref document: B1

Designated state(s): AL AT BE BG CH CY CZ DE DK EE ES FI FR GB GR HR HU IE IS IT LI LT LU LV MC MK MT NL NO PL PT RO RS SE SI SK SM TR

REG Reference to a national code

Ref country code: GB

Ref legal event code: FG4D

REG Reference to a national code

Ref country code: CH

Ref legal event code: EP

REG Reference to a national code

Ref country code: DE

Ref legal event code: R096

Ref document number: 602015059532

Country of ref document: DE

REG Reference to a national code

Ref country code: IE

Ref legal event code: FG4D

REG Reference to a national code

Ref country code: AT

Ref legal event code: REF

Ref document number: 1317101

Country of ref document: AT

Kind code of ref document: T

Effective date: 20201015

PG25 Lapsed in a contracting state [announced via postgrant information from national office to epo]

Ref country code: SE

Free format text: LAPSE BECAUSE OF FAILURE TO SUBMIT A TRANSLATION OF THE DESCRIPTION OR TO PAY THE FEE WITHIN THE PRESCRIBED TIME-LIMIT

Effective date: 20200923

Ref country code: BG

Free format text: LAPSE BECAUSE OF FAILURE TO SUBMIT A TRANSLATION OF THE DESCRIPTION OR TO PAY THE FEE WITHIN THE PRESCRIBED TIME-LIMIT

Effective date: 20201223

Ref country code: NO

Free format text: LAPSE BECAUSE OF FAILURE TO SUBMIT A TRANSLATION OF THE DESCRIPTION OR TO PAY THE FEE WITHIN THE PRESCRIBED TIME-LIMIT

Effective date: 20201223

Ref country code: GR

Free format text: LAPSE BECAUSE OF FAILURE TO SUBMIT A TRANSLATION OF THE DESCRIPTION OR TO PAY THE FEE WITHIN THE PRESCRIBED TIME-LIMIT

Effective date: 20201224

Ref country code: HR

Free format text: LAPSE BECAUSE OF FAILURE TO SUBMIT A TRANSLATION OF THE DESCRIPTION OR TO PAY THE FEE WITHIN THE PRESCRIBED TIME-LIMIT

Effective date: 20200923

Ref country code: FI

Free format text: LAPSE BECAUSE OF FAILURE TO SUBMIT A TRANSLATION OF THE DESCRIPTION OR TO PAY THE FEE WITHIN THE PRESCRIBED TIME-LIMIT

Effective date: 20200923

REG Reference to a national code

Ref country code: AT

Ref legal event code: MK05

Ref document number: 1317101

Country of ref document: AT

Kind code of ref document: T

Effective date: 20200923

PG25 Lapsed in a contracting state [announced via postgrant information from national office to epo]

Ref country code: RS

Free format text: LAPSE BECAUSE OF FAILURE TO SUBMIT A TRANSLATION OF THE DESCRIPTION OR TO PAY THE FEE WITHIN THE PRESCRIBED TIME-LIMIT

Effective date: 20200923

Ref country code: LV

Free format text: LAPSE BECAUSE OF FAILURE TO SUBMIT A TRANSLATION OF THE DESCRIPTION OR TO PAY THE FEE WITHIN THE PRESCRIBED TIME-LIMIT

Effective date: 20200923

REG Reference to a national code

Ref country code: NL

Ref legal event code: MP

Effective date: 20200923

REG Reference to a national code

Ref country code: LT

Ref legal event code: MG4D

PG25 Lapsed in a contracting state [announced via postgrant information from national office to epo]

Ref country code: NL

Free format text: LAPSE BECAUSE OF FAILURE TO SUBMIT A TRANSLATION OF THE DESCRIPTION OR TO PAY THE FEE WITHIN THE PRESCRIBED TIME-LIMIT

Effective date: 20200923

Ref country code: LT

Free format text: LAPSE BECAUSE OF FAILURE TO SUBMIT A TRANSLATION OF THE DESCRIPTION OR TO PAY THE FEE WITHIN THE PRESCRIBED TIME-LIMIT

Effective date: 20200923

Ref country code: PT

Free format text: LAPSE BECAUSE OF FAILURE TO SUBMIT A TRANSLATION OF THE DESCRIPTION OR TO PAY THE FEE WITHIN THE PRESCRIBED TIME-LIMIT

Effective date: 20210125

Ref country code: SM

Free format text: LAPSE BECAUSE OF FAILURE TO SUBMIT A TRANSLATION OF THE DESCRIPTION OR TO PAY THE FEE WITHIN THE PRESCRIBED TIME-LIMIT

Effective date: 20200923

Ref country code: RO

Free format text: LAPSE BECAUSE OF FAILURE TO SUBMIT A TRANSLATION OF THE DESCRIPTION OR TO PAY THE FEE WITHIN THE PRESCRIBED TIME-LIMIT

Effective date: 20200923

Ref country code: EE

Free format text: LAPSE BECAUSE OF FAILURE TO SUBMIT A TRANSLATION OF THE DESCRIPTION OR TO PAY THE FEE WITHIN THE PRESCRIBED TIME-LIMIT

Effective date: 20200923

Ref country code: CZ

Free format text: LAPSE BECAUSE OF FAILURE TO SUBMIT A TRANSLATION OF THE DESCRIPTION OR TO PAY THE FEE WITHIN THE PRESCRIBED TIME-LIMIT

Effective date: 20200923

PG25 Lapsed in a contracting state [announced via postgrant information from national office to epo]

Ref country code: AT

Free format text: LAPSE BECAUSE OF FAILURE TO SUBMIT A TRANSLATION OF THE DESCRIPTION OR TO PAY THE FEE WITHIN THE PRESCRIBED TIME-LIMIT

Effective date: 20200923

Ref country code: AL

Free format text: LAPSE BECAUSE OF FAILURE TO SUBMIT A TRANSLATION OF THE DESCRIPTION OR TO PAY THE FEE WITHIN THE PRESCRIBED TIME-LIMIT

Effective date: 20200923

Ref country code: ES

Free format text: LAPSE BECAUSE OF FAILURE TO SUBMIT A TRANSLATION OF THE DESCRIPTION OR TO PAY THE FEE WITHIN THE PRESCRIBED TIME-LIMIT

Effective date: 20200923

Ref country code: PL

Free format text: LAPSE BECAUSE OF FAILURE TO SUBMIT A TRANSLATION OF THE DESCRIPTION OR TO PAY THE FEE WITHIN THE PRESCRIBED TIME-LIMIT

Effective date: 20200923

Ref country code: IS

Free format text: LAPSE BECAUSE OF FAILURE TO SUBMIT A TRANSLATION OF THE DESCRIPTION OR TO PAY THE FEE WITHIN THE PRESCRIBED TIME-LIMIT

Effective date: 20210123

REG Reference to a national code

Ref country code: DE

Ref legal event code: R097

Ref document number: 602015059532

Country of ref document: DE

PG25 Lapsed in a contracting state [announced via postgrant information from national office to epo]

Ref country code: SK

Free format text: LAPSE BECAUSE OF FAILURE TO SUBMIT A TRANSLATION OF THE DESCRIPTION OR TO PAY THE FEE WITHIN THE PRESCRIBED TIME-LIMIT

Effective date: 20200923

PLBE No opposition filed within time limit

Free format text: ORIGINAL CODE: 0009261

STAA Information on the status of an ep patent application or granted ep patent

Free format text: STATUS: NO OPPOSITION FILED WITHIN TIME LIMIT

PG25 Lapsed in a contracting state [announced via postgrant information from national office to epo]

Ref country code: DK

Free format text: LAPSE BECAUSE OF FAILURE TO SUBMIT A TRANSLATION OF THE DESCRIPTION OR TO PAY THE FEE WITHIN THE PRESCRIBED TIME-LIMIT

Effective date: 20200923

Ref country code: SI

Free format text: LAPSE BECAUSE OF FAILURE TO SUBMIT A TRANSLATION OF THE DESCRIPTION OR TO PAY THE FEE WITHIN THE PRESCRIBED TIME-LIMIT

Effective date: 20200923

26N No opposition filed

Effective date: 20210624

PG25 Lapsed in a contracting state [announced via postgrant information from national office to epo]

Ref country code: IT

Free format text: LAPSE BECAUSE OF FAILURE TO SUBMIT A TRANSLATION OF THE DESCRIPTION OR TO PAY THE FEE WITHIN THE PRESCRIBED TIME-LIMIT

Effective date: 20200923

Ref country code: MC

Free format text: LAPSE BECAUSE OF FAILURE TO SUBMIT A TRANSLATION OF THE DESCRIPTION OR TO PAY THE FEE WITHIN THE PRESCRIBED TIME-LIMIT

Effective date: 20200923

REG Reference to a national code

Ref country code: CH

Ref legal event code: PL

REG Reference to a national code

Ref country code: BE

Ref legal event code: MM

Effective date: 20210331

PG25 Lapsed in a contracting state [announced via postgrant information from national office to epo]

Ref country code: FR

Free format text: LAPSE BECAUSE OF NON-PAYMENT OF DUE FEES

Effective date: 20210331

Ref country code: IE

Free format text: LAPSE BECAUSE OF NON-PAYMENT OF DUE FEES

Effective date: 20210310

Ref country code: CH

Free format text: LAPSE BECAUSE OF NON-PAYMENT OF DUE FEES

Effective date: 20210331

Ref country code: LI

Free format text: LAPSE BECAUSE OF NON-PAYMENT OF DUE FEES

Effective date: 20210331

Ref country code: LU

Free format text: LAPSE BECAUSE OF NON-PAYMENT OF DUE FEES

Effective date: 20210310

PG25 Lapsed in a contracting state [announced via postgrant information from national office to epo]

Ref country code: BE

Free format text: LAPSE BECAUSE OF NON-PAYMENT OF DUE FEES

Effective date: 20210331

PG25 Lapsed in a contracting state [announced via postgrant information from national office to epo]

Ref country code: HU

Free format text: LAPSE BECAUSE OF FAILURE TO SUBMIT A TRANSLATION OF THE DESCRIPTION OR TO PAY THE FEE WITHIN THE PRESCRIBED TIME-LIMIT; INVALID AB INITIO

Effective date: 20150310

PGFP Annual fee paid to national office [announced via postgrant information from national office to epo]

Ref country code: GB

Payment date: 20230119

Year of fee payment: 9

Ref country code: DE

Payment date: 20230110

Year of fee payment: 9

PG25 Lapsed in a contracting state [announced via postgrant information from national office to epo]

Ref country code: CY

Free format text: LAPSE BECAUSE OF FAILURE TO SUBMIT A TRANSLATION OF THE DESCRIPTION OR TO PAY THE FEE WITHIN THE PRESCRIBED TIME-LIMIT

Effective date: 20200923

PG25 Lapsed in a contracting state [announced via postgrant information from national office to epo]

Ref country code: MK

Free format text: LAPSE BECAUSE OF FAILURE TO SUBMIT A TRANSLATION OF THE DESCRIPTION OR TO PAY THE FEE WITHIN THE PRESCRIBED TIME-LIMIT

Effective date: 20200923

PGFP Annual fee paid to national office [announced via postgrant information from national office to epo]

Ref country code: DE

Payment date: 20240116

Year of fee payment: 10

Ref country code: GB

Payment date: 20240118

Year of fee payment: 10