CN106294853B - 用于处理相关数据集的方法及数据处理系统 - Google Patents

用于处理相关数据集的方法及数据处理系统 Download PDF

Info

Publication number
CN106294853B
CN106294853B CN201610703060.9A CN201610703060A CN106294853B CN 106294853 B CN106294853 B CN 106294853B CN 201610703060 A CN201610703060 A CN 201610703060A CN 106294853 B CN106294853 B CN 106294853B
Authority
CN
China
Prior art keywords
data set
record
data
transformation
field
Prior art date
Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
Active
Application number
CN201610703060.9A
Other languages
English (en)
Chinese (zh)
Other versions
CN106294853A (zh
Inventor
A.F.罗伯茨
Current Assignee (The listed assignees may be inaccurate. Google has not performed a legal analysis and makes no representation or warranty as to the accuracy of the list.)
Ab Initio Technology LLC
Original Assignee
Ab Initio Technology LLC
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
Filing date
Publication date
Application filed by Ab Initio Technology LLC filed Critical Ab Initio Technology LLC
Publication of CN106294853A publication Critical patent/CN106294853A/zh
Application granted granted Critical
Publication of CN106294853B publication Critical patent/CN106294853B/zh
Active legal-status Critical Current
Anticipated expiration legal-status Critical

Links

Classifications

    • GPHYSICS
    • G06COMPUTING OR CALCULATING; COUNTING
    • G06FELECTRIC DIGITAL DATA PROCESSING
    • G06F16/00Information retrieval; Database structures therefor; File system structures therefor
    • G06F16/20Information retrieval; Database structures therefor; File system structures therefor of structured data, e.g. relational data
    • G06F16/23Updating
    • G06F16/2365Ensuring data consistency and integrity
    • GPHYSICS
    • G06COMPUTING OR CALCULATING; COUNTING
    • G06FELECTRIC DIGITAL DATA PROCESSING
    • G06F16/00Information retrieval; Database structures therefor; File system structures therefor
    • G06F16/60Information retrieval; Database structures therefor; File system structures therefor of audio data
    • G06F16/61Indexing; Data structures therefor; Storage structures
    • GPHYSICS
    • G06COMPUTING OR CALCULATING; COUNTING
    • G06FELECTRIC DIGITAL DATA PROCESSING
    • G06F16/00Information retrieval; Database structures therefor; File system structures therefor
    • G06F16/20Information retrieval; Database structures therefor; File system structures therefor of structured data, e.g. relational data
    • G06F16/24Querying
    • G06F16/245Query processing
    • G06F16/2455Query execution
    • G06F16/24564Applying rules; Deductive queries
    • G06F16/24565Triggers; Constraints
    • GPHYSICS
    • G06COMPUTING OR CALCULATING; COUNTING
    • G06FELECTRIC DIGITAL DATA PROCESSING
    • G06F16/00Information retrieval; Database structures therefor; File system structures therefor
    • G06F16/20Information retrieval; Database structures therefor; File system structures therefor of structured data, e.g. relational data
    • G06F16/25Integrating or interfacing systems involving database management systems
    • GPHYSICS
    • G06COMPUTING OR CALCULATING; COUNTING
    • G06FELECTRIC DIGITAL DATA PROCESSING
    • G06F16/00Information retrieval; Database structures therefor; File system structures therefor
    • G06F16/70Information retrieval; Database structures therefor; File system structures therefor of video data
    • G06F16/71Indexing; Data structures therefor; Storage structures
    • GPHYSICS
    • G06COMPUTING OR CALCULATING; COUNTING
    • G06FELECTRIC DIGITAL DATA PROCESSING
    • G06F16/00Information retrieval; Database structures therefor; File system structures therefor
    • G06F16/80Information retrieval; Database structures therefor; File system structures therefor of semi-structured data, e.g. markup language structured data such as SGML, XML or HTML
    • G06F16/84Mapping; Conversion
    • G06F16/86Mapping to a database
    • GPHYSICS
    • G06COMPUTING OR CALCULATING; COUNTING
    • G06FELECTRIC DIGITAL DATA PROCESSING
    • G06F16/00Information retrieval; Database structures therefor; File system structures therefor
    • G06F16/90Details of database functions independent of the retrieved data types
    • GPHYSICS
    • G06COMPUTING OR CALCULATING; COUNTING
    • G06FELECTRIC DIGITAL DATA PROCESSING
    • G06F21/00Security arrangements for protecting computers, components thereof, programs or data against unauthorised activity
    • G06F21/60Protecting data
    • G06F21/62Protecting access to data via a platform, e.g. using keys or access control rules
    • G06F21/6218Protecting access to data via a platform, e.g. using keys or access control rules to a system of files or objects, e.g. local or distributed file system or database
    • GPHYSICS
    • G16INFORMATION AND COMMUNICATION TECHNOLOGY [ICT] SPECIALLY ADAPTED FOR SPECIFIC APPLICATION FIELDS
    • G16BBIOINFORMATICS, i.e. INFORMATION AND COMMUNICATION TECHNOLOGY [ICT] SPECIALLY ADAPTED FOR GENETIC OR PROTEIN-RELATED DATA PROCESSING IN COMPUTATIONAL MOLECULAR BIOLOGY
    • G16B50/00ICT programming tools or database systems specially adapted for bioinformatics
    • GPHYSICS
    • G16INFORMATION AND COMMUNICATION TECHNOLOGY [ICT] SPECIALLY ADAPTED FOR SPECIFIC APPLICATION FIELDS
    • G16CCOMPUTATIONAL CHEMISTRY; CHEMOINFORMATICS; COMPUTATIONAL MATERIALS SCIENCE
    • G16C20/00Chemoinformatics, i.e. ICT specially adapted for the handling of physicochemical or structural data of chemical particles, elements, compounds or mixtures
    • G16C20/40Searching chemical structures or physicochemical data
    • GPHYSICS
    • G16INFORMATION AND COMMUNICATION TECHNOLOGY [ICT] SPECIALLY ADAPTED FOR SPECIFIC APPLICATION FIELDS
    • G16CCOMPUTATIONAL CHEMISTRY; CHEMOINFORMATICS; COMPUTATIONAL MATERIALS SCIENCE
    • G16C20/00Chemoinformatics, i.e. ICT specially adapted for the handling of physicochemical or structural data of chemical particles, elements, compounds or mixtures
    • G16C20/90Programming languages; Computing architectures; Database systems; Data warehousing
    • GPHYSICS
    • G16INFORMATION AND COMMUNICATION TECHNOLOGY [ICT] SPECIALLY ADAPTED FOR SPECIFIC APPLICATION FIELDS
    • G16HHEALTHCARE INFORMATICS, i.e. INFORMATION AND COMMUNICATION TECHNOLOGY [ICT] SPECIALLY ADAPTED FOR THE HANDLING OR PROCESSING OF MEDICAL OR HEALTHCARE DATA
    • G16H70/00ICT specially adapted for the handling or processing of medical references
    • G16H70/40ICT specially adapted for the handling or processing of medical references relating to drugs, e.g. their side effects or intended usage

Landscapes

  • Engineering & Computer Science (AREA)
  • Theoretical Computer Science (AREA)
  • Databases & Information Systems (AREA)
  • Physics & Mathematics (AREA)
  • General Physics & Mathematics (AREA)
  • General Engineering & Computer Science (AREA)
  • Data Mining & Analysis (AREA)
  • Health & Medical Sciences (AREA)
  • Life Sciences & Earth Sciences (AREA)
  • Chemical & Material Sciences (AREA)
  • Bioinformatics & Cheminformatics (AREA)
  • Software Systems (AREA)
  • Computer Security & Cryptography (AREA)
  • General Health & Medical Sciences (AREA)
  • Computing Systems (AREA)
  • Bioinformatics & Computational Biology (AREA)
  • Crystallography & Structural Chemistry (AREA)
  • Bioethics (AREA)
  • Medical Informatics (AREA)
  • Computational Linguistics (AREA)
  • Multimedia (AREA)
  • Biotechnology (AREA)
  • Biophysics (AREA)
  • Spectroscopy & Molecular Physics (AREA)
  • Medicinal Chemistry (AREA)
  • Pharmacology & Pharmacy (AREA)
  • Toxicology (AREA)
  • Epidemiology (AREA)
  • Primary Health Care (AREA)
  • Public Health (AREA)
  • Evolutionary Biology (AREA)
  • Computer Hardware Design (AREA)
  • Information Retrieval, Db Structures And Fs Structures Therefor (AREA)
CN201610703060.9A 2010-06-22 2011-06-22 用于处理相关数据集的方法及数据处理系统 Active CN106294853B (zh)

Applications Claiming Priority (3)

Application Number Priority Date Filing Date Title
US35737610P 2010-06-22 2010-06-22
US61/357,376 2010-06-22
CN201180040706.5A CN103080932B (zh) 2010-06-22 2011-06-22 处理相关数据集

Related Parent Applications (1)

Application Number Title Priority Date Filing Date
CN201180040706.5A Division CN103080932B (zh) 2010-06-22 2011-06-22 处理相关数据集

Publications (2)

Publication Number Publication Date
CN106294853A CN106294853A (zh) 2017-01-04
CN106294853B true CN106294853B (zh) 2019-10-11

Family

ID=44533077

Family Applications (2)

Application Number Title Priority Date Filing Date
CN201610703060.9A Active CN106294853B (zh) 2010-06-22 2011-06-22 用于处理相关数据集的方法及数据处理系统
CN201180040706.5A Active CN103080932B (zh) 2010-06-22 2011-06-22 处理相关数据集

Family Applications After (1)

Application Number Title Priority Date Filing Date
CN201180040706.5A Active CN103080932B (zh) 2010-06-22 2011-06-22 处理相关数据集

Country Status (8)

Country Link
US (1) US8775447B2 (https=)
EP (1) EP2585949B1 (https=)
JP (1) JP5826260B2 (https=)
KR (2) KR20150042872A (https=)
CN (2) CN106294853B (https=)
AU (1) AU2011271002B2 (https=)
CA (1) CA2801079C (https=)
WO (1) WO2011163363A1 (https=)

Families Citing this family (38)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
WO2012103438A1 (en) 2011-01-28 2012-08-02 Ab Initio Technology Llc Generating data pattern information
US20130006961A1 (en) * 2011-06-29 2013-01-03 Microsoft Corporation Data driven natural interface for automated relational queries
CN104756107B (zh) 2012-10-22 2019-01-01 起元科技有限公司 采用位置信息剖析数据
US9087138B2 (en) * 2013-01-15 2015-07-21 Xiaofan Zhou Method for representing and storing hierarchical data in a columnar format
US9892026B2 (en) * 2013-02-01 2018-02-13 Ab Initio Technology Llc Data records selection
US9195470B2 (en) 2013-07-22 2015-11-24 Globalfoundries Inc. Dynamic data dimensioning by partial reconfiguration of single or multiple field-programmable gate arrays using bootstraps
EP3025247B1 (en) 2013-07-26 2018-10-24 Hewlett-Packard Enterprise Development LP Data view based on context
US9535936B2 (en) * 2013-09-05 2017-01-03 The Boeing Company Correlation of maximum configuration data sets
EP3055786A4 (en) * 2013-10-09 2017-05-17 Google, Inc. Automatic definition of entity collections
US11487732B2 (en) * 2014-01-16 2022-11-01 Ab Initio Technology Llc Database key identification
AU2015225694B2 (en) 2014-03-07 2019-06-27 Ab Initio Technology Llc Managing data profiling operations related to data type
US9317558B2 (en) * 2014-05-13 2016-04-19 Sap Se Intelligent unmasking in an in-memory database
CN107145344B (zh) 2014-09-02 2020-12-04 起元科技有限公司 在基于图的程序中指定组件
SG11201701584SA (en) 2014-09-02 2017-03-30 Ab Initio Technology Llc Compiling graph-based program specifications
US10007598B2 (en) 2014-09-08 2018-06-26 Ab Initio Technology Llc Data-driven testing framework
WO2016054491A1 (en) 2014-10-03 2016-04-07 Infinity Pharmaceuticals, Inc. Heterocyclic compounds and uses thereof
US10176234B2 (en) * 2014-11-05 2019-01-08 Ab Initio Technology Llc Impact analysis
US10360520B2 (en) * 2015-01-06 2019-07-23 International Business Machines Corporation Operational data rationalization
KR102281454B1 (ko) * 2015-05-27 2021-07-23 삼성에스디에스 주식회사 리버스 데이터 모델링 관계선 설정 방법 및 그 장치
WO2017068481A1 (en) * 2015-10-20 2017-04-27 Jayaram Sanjay System for managing data
US11989096B2 (en) * 2015-12-21 2024-05-21 Ab Initio Technology Llc Search and retrieval data processing system for computing near real-time data aggregations
US10169364B2 (en) * 2016-01-13 2019-01-01 International Business Machines Corporation Gauging accuracy of sampling-based distinct element estimation
US20170242876A1 (en) * 2016-02-22 2017-08-24 Ca, Inc. Maintaining Database Referential Integrity Using Different Primary and Foreign Key Values
CN107330796B (zh) * 2016-04-29 2021-01-29 泰康保险集团股份有限公司 组件化生成表单的数据处理方法及系统
US11243938B2 (en) * 2016-05-31 2022-02-08 Micro Focus Llc Identifying data constraints in applications and databases
WO2017214269A1 (en) 2016-06-08 2017-12-14 Infinity Pharmaceuticals, Inc. Heterocyclic compounds and uses thereof
US10311057B2 (en) 2016-08-08 2019-06-04 International Business Machines Corporation Attribute value information for a data extent
US10360240B2 (en) 2016-08-08 2019-07-23 International Business Machines Corporation Providing multidimensional attribute value information
US10657120B2 (en) * 2016-10-03 2020-05-19 Bank Of America Corporation Cross-platform digital data movement control utility and method of use thereof
US10593080B2 (en) * 2017-04-27 2020-03-17 Daegu Gyeongbuk Institute Of Science And Technology Graph generating method and apparatus
US10176217B1 (en) 2017-07-06 2019-01-08 Palantir Technologies, Inc. Dynamically performing data processing in a data pipeline system
US11055074B2 (en) 2017-11-13 2021-07-06 Ab Initio Technology Llc Key-based logging for processing of structured data items with executable logic
US11068540B2 (en) 2018-01-25 2021-07-20 Ab Initio Technology Llc Techniques for integrating validation results in data profiling and related systems and methods
US10838915B2 (en) * 2018-09-06 2020-11-17 International Business Machines Corporation Data-centric approach to analysis
US12197438B2 (en) * 2021-01-04 2025-01-14 Liveramp, Inc. Data manipulation language parser system and method for entity resolution
CN117015769A (zh) * 2021-01-31 2023-11-07 起元技术有限责任公司 用于数据处理系统的数据集多路复用器
JP7832951B2 (ja) * 2021-01-31 2026-03-18 アビニシオ テクノロジー エルエルシー データ処理システム用のデータセットマルチプレクサ
US20230403218A1 (en) * 2022-06-08 2023-12-14 Vmware, Inc. State consistency monitoring for plane-separation architectures

Citations (4)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
WO2004063943A2 (en) * 2003-01-15 2004-07-29 Luke Leonard Martin Porter Time in databases and applications of databases
CN101141754A (zh) * 2006-09-05 2008-03-12 中兴通讯股份有限公司 一种增值业务分析系统及其方法
CN101452072A (zh) * 2008-12-26 2009-06-10 东南大学 一种用于土地监测的电子信息化系统及其方法
CN102098175A (zh) * 2011-01-26 2011-06-15 浪潮通信信息系统有限公司 一种移动互联网告警关联规则获取方法

Family Cites Families (12)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
JPH05204727A (ja) * 1992-01-27 1993-08-13 Hitachi Ltd デ−タベ−ス管理方法およびそのシステム
US5966072A (en) 1996-07-02 1999-10-12 Ab Initio Software Corporation Executing computations expressed as graphs
WO2002079993A1 (en) * 2001-03-29 2002-10-10 Reallegal.Com Methods for synchronizing on-line and off-line transcript projects
CA2409079A1 (en) * 2002-10-21 2004-04-21 Ibm Canada Limited-Ibm Canada Limitee Creating multiple and cascading business interpretations from raw application data using transformation layering
US20050004918A1 (en) * 2003-07-02 2005-01-06 International Business Machines Corporation Populating a database using inferred dependencies
US8868580B2 (en) 2003-09-15 2014-10-21 Ab Initio Technology Llc Data profiling
US7181472B2 (en) * 2003-10-23 2007-02-20 Microsoft Corporation Method and system for synchronizing identity information
JP4343752B2 (ja) * 2004-03-31 2009-10-14 キヤノン株式会社 色処理装置およびその方法
GB2414337B (en) * 2004-05-19 2008-10-29 Macrovision Europ Ltd The copy protection of optical discs
US7716630B2 (en) 2005-06-27 2010-05-11 Ab Initio Technology Llc Managing parameters for graph-based computations
CN101911859B (zh) * 2008-01-11 2012-12-05 富士机械制造株式会社 部件安装系统及部件安装方法
JP4870700B2 (ja) * 2008-03-11 2012-02-08 株式会社リコー 通信システム

Patent Citations (4)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
WO2004063943A2 (en) * 2003-01-15 2004-07-29 Luke Leonard Martin Porter Time in databases and applications of databases
CN101141754A (zh) * 2006-09-05 2008-03-12 中兴通讯股份有限公司 一种增值业务分析系统及其方法
CN101452072A (zh) * 2008-12-26 2009-06-10 东南大学 一种用于土地监测的电子信息化系统及其方法
CN102098175A (zh) * 2011-01-26 2011-06-15 浪潮通信信息系统有限公司 一种移动互联网告警关联规则获取方法

Also Published As

Publication number Publication date
JP5826260B2 (ja) 2015-12-02
CA2801079C (en) 2016-05-03
EP2585949A1 (en) 2013-05-01
KR101781416B1 (ko) 2017-09-25
CN103080932A (zh) 2013-05-01
KR20130095250A (ko) 2013-08-27
US8775447B2 (en) 2014-07-08
JP2013529814A (ja) 2013-07-22
CN103080932B (zh) 2016-08-31
WO2011163363A1 (en) 2011-12-29
KR20150042872A (ko) 2015-04-21
EP2585949B1 (en) 2015-03-25
US20110313979A1 (en) 2011-12-22
AU2011271002B2 (en) 2015-08-20
CN106294853A (zh) 2017-01-04
CA2801079A1 (en) 2011-12-29
HK1179006A1 (en) 2013-09-19
AU2011271002A1 (en) 2012-12-13

Similar Documents

Publication Publication Date Title
CN106294853B (zh) 用于处理相关数据集的方法及数据处理系统
CN102982065B (zh) 数据处理方法、数据处理装置及计算机可读存储介质
CA2892301C (en) Data records selection
US8674993B1 (en) Graph database system and method for facilitating financial and corporate relationship analysis
US20110119300A1 (en) Method Of Generating An Analytical Data Set For Input Into An Analytical Model
JP2018536909A (ja) 表形式データから、多次元データベース環境に使用されるキューブスキーマを自動的に推論するためのシステムおよび方法
Pullokkaran Analysis of data virtualization & enterprise data standardization in business intelligence
Xie et al. Exploring Multi‐dimensional Data via Subset Embedding
CN101271472B (zh) 数据处理方法和数据处理系统
Faria Junior et al. Clustering analysis and frequent pattern mining for process profile analysis: an exploratory study for object-centric event logs
Adedayo et al. Schema reconstruction in database forensics
Meskine et al. A support architecture to MDA contribution for data mining
Muthoifin et al. Bibliometric Analysis of Development Maps and Research Directions in the Field of Islamic Currency in the Scopus Database 1965-2023
Prabhu Object–Oriented Database Systems: Approaches and Architectures
Salem et al. A Cloud-Based Data Integration Framework for E-Government Service.
El Seddawy et al. A proposed data mining technique to improve decision support system in an uncertain situation
Karban Relational Data Mining and GUHA.
HK1179006B (en) Processing related datasets
US20110029950A1 (en) Computer-readable media, methods and computer systems for designing a software application graphically
Fathima Sherin et al. An Efficient Method for Frequent Itemset Mining on Temporal Data
Bayer Data mining strategies in large-scale agent-based models with applications in econophysics
Feng et al. Frequent Pattern Mining for Massive XBRL Data in Internet Information Disclosure System
Pears A methodology for integrating and exploiting data mining techniques in the design of data warehouses
HK1212479B (en) Data records selection

Legal Events

Date Code Title Description
C06 Publication
PB01 Publication
C10 Entry into substantive examination
SE01 Entry into force of request for substantive examination
GR01 Patent grant
GR01 Patent grant