US20080177892A1 - Method for service oriented data extraction transformation and load - Google Patents

Method for service oriented data extraction transformation and load Download PDF

Info

Publication number
US20080177892A1
US20080177892A1 US11/624,893 US62489307A US2008177892A1 US 20080177892 A1 US20080177892 A1 US 20080177892A1 US 62489307 A US62489307 A US 62489307A US 2008177892 A1 US2008177892 A1 US 2008177892A1
Authority
US
United States
Prior art keywords
data
data source
transformation
information
source
Prior art date
Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
Abandoned
Application number
US11/624,893
Inventor
Alfredo Alba
Current Assignee (The listed assignees may be inaccurate. Google has not performed a legal analysis and makes no representation or warranty as to the accuracy of the list.)
International Business Machines Corp
Original Assignee
International Business Machines Corp
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
Filing date
Publication date
Application filed by International Business Machines Corp filed Critical International Business Machines Corp
Priority to US11/624,893 priority Critical patent/US20080177892A1/en
Assigned to INTERNATIONAL BUSINESS MACHINES CORPORATION reassignment INTERNATIONAL BUSINESS MACHINES CORPORATION ASSIGNMENT OF ASSIGNORS INTEREST (SEE DOCUMENT FOR DETAILS). Assignors: ALBA, ALFREDO
Priority to US12/129,211 priority patent/US8307025B2/en
Publication of US20080177892A1 publication Critical patent/US20080177892A1/en
Abandoned legal-status Critical Current

Links

Images

Classifications

    • HELECTRICITY
    • H04ELECTRIC COMMUNICATION TECHNIQUE
    • H04LTRANSMISSION OF DIGITAL INFORMATION, e.g. TELEGRAPHIC COMMUNICATION
    • H04L65/00Network arrangements, protocols or services for supporting real-time applications in data packet communication
    • H04L65/10Architectures or entities
    • H04L65/102Gateways
    • H04L65/1023Media gateways
    • H04L65/103Media gateways in the network
    • HELECTRICITY
    • H04ELECTRIC COMMUNICATION TECHNIQUE
    • H04LTRANSMISSION OF DIGITAL INFORMATION, e.g. TELEGRAPHIC COMMUNICATION
    • H04L65/00Network arrangements, protocols or services for supporting real-time applications in data packet communication
    • H04L65/60Network streaming of media packets
    • H04L65/75Media network packet handling
    • H04L65/765Media network packet handling intermediate

Definitions

  • This invention relates to methodologies for extracting data from data sources on a network and particularly to, methodologies for service oriented data extraction and data transformation.
  • large business enterprises typically implemented a plurality of dissimilar data sources within their operational networks, in addition to interacting on a daily basis with a wide variety of external business sources (such as business transactions or structured data acquisition processes).
  • the sharing, acquisition, transformation, and migration of managed data comprises significant costs that are associated with such activities. Incorporating new data feeds, or enabling new business transactions is usually a costly and lengthy process.
  • Automated tools that are utilized for managing a business enterprise's data efficiently are a necessity in today's business environment.
  • One classification of tools that is frequently used are stand-alone proprietary data transformation and schema mapping applications.
  • Such tools are used to provide a configurable data transformation processes for data migration, sharing, and reporting, in addition to performing mapping operations for business transactional operations.
  • These tools provide the means to achieve their configuration goals provided they are compatible with both the source and the destination data sources, in addition to being compatible with the available operating environment. While such automated tools provide valuable functionality, they remain proprietary; and further, they lack the flexibility and adaptability needed in today's business environment.
  • the shortcomings of the prior art are overcome and additional advantages are provided through the provision of a method for the configurable real time transformation of dissimilar data sources, the method further consisting of the steps of acquiring real time information pertaining to at least one data source, wherein the information comprises reference information that is associated with the data source, data transformation specification information that is associated with the data source, and scheduled event specification information that is associated with the data source, and maintaining the data source information.
  • the method further comprises the steps of acquiring data from the data source in accordance with a specified scheduled event, converting the acquired data into a predetermined standardized format, performing at least one data transformation function from the real time stream upon the converted data in accordance with the acquired data transformation specification information that was associated with the data source; and transmitting the transformed data to a destination data source.
  • FIG. 1 illustrates one example of aspects of the architecture of an embodiment for service oriented data extraction transformation.
  • FIG. 2 illustrates one example of a data flow diagram illustrating aspects of a scheduled data extraction functionality comprises within embodiments of the present invention.
  • FIG. 3 illustrates one example of a flow diagram detailing aspects of a change data flow functionality comprises within embodiments of the present invention.
  • aspects of the present invention relate to systems and methodologies for the configuration and implementation of data extraction transformation and data load tool solutions for business enterprises. As such, a flexible service oriented solution to reduce the overall cost associated with data migration, sharing, reporting and mapping is presented.
  • aspects of the present invention have the capability to provide a business enterprise with the capacity to adapt to an ever changing business environment at a much lower cost.
  • the present invention provides solutions that can be synergistically utilized to quickly enable new business transactions in addition to reducing the cost and time of migrations to more cost effective data management solutions.
  • transformation, schedule, and real time specification of remote or local data sources are created and maintained within the inventive system.
  • remote or locally stored data is extracted and uploaded to the destination data source(s).
  • the extracted data is normalized in a per fact basis into a XML document.
  • the normalized XML data/document is transformed into a desired format according to transformation specifications that have been associated with the extracted data. Thereafter, the transformed data is loaded to a destination data source.
  • the computer system as operated by a system user can embody a conventional personal computer on which a Web services based application that is configured to accomplish the objectives of the present methodologies is operating.
  • the computer system also includes other hardware and software elements that are conventionally included in personal computers.
  • FIG. 1 there is diagram detailing structural hardware and software components that may be implemented within embodiments of the present invention.
  • the present invention may embody a web services-based application 105 , wherein the web services-based application 105 comprises a plurality of software processing components.
  • the web service-based application 105 comprises a maintenance component 110 , wherein the function of the maintenance component 110 is to monitor the operations of the data source scheduler 115 , the transformation specification repository 140 , and the data source repository 130 .
  • Source data that is to be processed within the system is extracted form a source data source 145 , normalized into a predetermined data format at a normalized data store 125 , and eventually uploaded to the specified destination data source(s) 155 .
  • a comprehensive listing of data sources that the system is configured to extract data from is stored at the data source repository 130 .
  • the data extractor 120 acts to gather specified data from a data source 145 that is referenced at the data source repository 130 .
  • the extraction operation can either be initiated at the will of a system operator, or as part of a scheduled event that has been notated by the data source scheduler 115 .
  • a data extraction operation comprises the function of fetching data from a data source 145 .
  • the data is extracted from the specified data source 145 at a rate that complies with the performance limitations of the remote data source host 145 . Thereafter, the extracted data is transmitted to the normalized data store 125 , where the extracted data is normalized and stored into XML documents.
  • the normalized data is retrieved from the normalized data store 125 by the data transformation component 135 . Thereafter, the normalized data is transformed according to a set of data transformation rules that are contained within a data transformation specification.
  • Data transformation specifications are stored and maintained at the transformation specification repository 140 . Further, within embodiments of the present invention, data transformation specifications can be altered, or modified by a system operator at the time of the execution of the transformation of the normalized data. Essentially, data transformation is defined as the processes of converting normalized data into its final form prior to the data load component's 150 operation of transmitting (i.e., inserting or updating) the transformed data to its final destination data source.
  • data source references can be stored along with all the properties necessary to establish a proper connection for the data extraction operation, including any such credentials that are necessary for a successful extraction process.
  • FIG. 2 shows a flow diagram that illustrates aspects of a scheduled data extraction process that the present invention may be configured to perform.
  • a determination is made as to whether a data extraction event is scheduled.
  • the data extraction schedule 135 of the data source repository is accessed to confirm the inquiry.
  • the system goes into a sleep mode.
  • the data source 145 is contacted, and summarily the specified source data is extracted.
  • the extracted data is thereafter normalized (step 215 ), and transformed according to the data transformation specification instructions that are stored at the transformation specification repository 140 (step 220 ).
  • the transformed data is loaded to the destination data source 155 , and the extraction schedule 135 is updated in regard to the performed data transformation operation.
  • FIG. 3 shows a flowchart detailing aspects of a method for updating the information that is stored at the transformation specification repository 140 , the data source repository 130 , and the data extraction schedule 135 .
  • the modification information is initially submitted to the maintenance module 110 , wherein thereafter the appropriate registry is updated.
  • a configuration change has been initiated.
  • a determination is made to determine what type of configuration modification is to be made. In the event that the change is to be made to the data transformation specification, then the transformation repository 140 is accessed and the modifications are performed. In the event that the changes are to be made to the data source repository 130 or the extraction schedule 135 , then the elements are respectively accessed and the modification to the configuration for the relevant data is performed.
  • the capabilities of the present invention can be implemented in software, firmware, hardware or some combination thereof.
  • one or more aspects of the present invention can be included in an article of manufacture (e.g., one or more computer program products) having, for instance, computer usable media.
  • the media has embodied therein, for instance, computer readable program code means for providing and facilitating the capabilities of the present invention.
  • the article of manufacture can be included as a part of a computer system or sold separately.
  • At least one program storage device readable by a machine, tangibly embodying at least one program of instructions executable by the machine to perform the capabilities of the present invention can be provided.

Landscapes

  • Engineering & Computer Science (AREA)
  • Multimedia (AREA)
  • Computer Networks & Wireless Communication (AREA)
  • Signal Processing (AREA)
  • Management, Administration, Business Operations System, And Electronic Commerce (AREA)
  • Information Retrieval, Db Structures And Fs Structures Therefor (AREA)

Abstract

The present invention relates to a method for the configurable real time transformation of dissimilar data sources, the method further consisting of the steps of acquiring real time information pertaining to at least one data source, wherein the information comprises reference information that is associated with the data source, data transformation specification information that is associated with the data source, and scheduled event specification information that is associated with the data source, and maintaining the data source information. The method further comprises the steps of acquiring data from the data source in accordance with a specified scheduled event, converting the acquired data into a predetermined standardized format, performing at least one data transformation function from the real time stream upon the converted data in accordance with the acquired data transformation specification information that was associated with the data source; and transmitting the transformed data to a destination data source.

Description

    BACKGROUND OF THE INVENTION
  • 1. Field of the Invention
  • This invention relates to methodologies for extracting data from data sources on a network and particularly to, methodologies for service oriented data extraction and data transformation.
  • 2. Description of Background
  • Before our invention, large business enterprises typically implemented a plurality of dissimilar data sources within their operational networks, in addition to interacting on a daily basis with a wide variety of external business sources (such as business transactions or structured data acquisition processes). The sharing, acquisition, transformation, and migration of managed data comprises significant costs that are associated with such activities. Incorporating new data feeds, or enabling new business transactions is usually a costly and lengthy process. Once a business enterprise has decided on a specific product, or product line for data management, it is often very difficult to migrate to a simpler, better, or more cost effective solution because of configuration differences between the existing and proposed data schemas.
  • Automated tools that are utilized for managing a business enterprise's data efficiently are a necessity in today's business environment. One classification of tools that is frequently used are stand-alone proprietary data transformation and schema mapping applications. Such tools are used to provide a configurable data transformation processes for data migration, sharing, and reporting, in addition to performing mapping operations for business transactional operations. These tools provide the means to achieve their configuration goals provided they are compatible with both the source and the destination data sources, in addition to being compatible with the available operating environment. While such automated tools provide valuable functionality, they remain proprietary; and further, they lack the flexibility and adaptability needed in today's business environment.
  • Therefore, there exists a need for a flexible service oriented solution to reduce the overall cost that is associated with data migration, sharing, reporting and mapping.
  • SUMMARY OF THE INVENTION
  • The shortcomings of the prior art are overcome and additional advantages are provided through the provision of a method for the configurable real time transformation of dissimilar data sources, the method further consisting of the steps of acquiring real time information pertaining to at least one data source, wherein the information comprises reference information that is associated with the data source, data transformation specification information that is associated with the data source, and scheduled event specification information that is associated with the data source, and maintaining the data source information.
  • The method further comprises the steps of acquiring data from the data source in accordance with a specified scheduled event, converting the acquired data into a predetermined standardized format, performing at least one data transformation function from the real time stream upon the converted data in accordance with the acquired data transformation specification information that was associated with the data source; and transmitting the transformed data to a destination data source.
  • Computer program products corresponding to the above-summarized methods are also described and claimed herein.
  • Additional features and advantages are realized through the techniques of the present invention. Other embodiments and aspects of the invention are described in detail herein and are considered a part of the claimed invention. For a better understanding of the invention with advantages and features, refer to the description and to the drawings.
  • BRIEF DESCRIPTION OF THE DRAWINGS
  • The subject matter that is regarded as the invention is particularly pointed out and distinctly claimed in the claims at the conclusion of the specification. The foregoing and other objects, features, and advantages of the invention are apparent from the following detailed description taken in conjunction with the accompanying drawings in which:
  • FIG. 1 illustrates one example of aspects of the architecture of an embodiment for service oriented data extraction transformation.
  • FIG. 2 illustrates one example of a data flow diagram illustrating aspects of a scheduled data extraction functionality comprises within embodiments of the present invention.
  • FIG. 3 illustrates one example of a flow diagram detailing aspects of a change data flow functionality comprises within embodiments of the present invention.
  • The detailed description explains the preferred embodiments of the invention, together with advantages and features, by way of example with reference to the drawings.
  • DETAILED DESCRIPTION OF THE INVENTION
  • One or more exemplary embodiments of the invention are described below in detail. The disclosed embodiments are intended to be illustrative only since numerous modifications and variations therein will be apparent to those of ordinary skill in the art. In reference to the drawings, like numbers will indicate like parts continuously throughout the views.
  • Aspects of the present invention relate to systems and methodologies for the configuration and implementation of data extraction transformation and data load tool solutions for business enterprises. As such, a flexible service oriented solution to reduce the overall cost associated with data migration, sharing, reporting and mapping is presented. By allowing a dynamic, real-time reconfiguration, and operations of the extract, transform and load (ETL) processing landscape, aspects of the present invention have the capability to provide a business enterprise with the capacity to adapt to an ever changing business environment at a much lower cost. The present invention provides solutions that can be synergistically utilized to quickly enable new business transactions in addition to reducing the cost and time of migrations to more cost effective data management solutions.
  • Within aspects of the present invention reference, transformation, schedule, and real time specification of remote or local data sources are created and maintained within the inventive system. On the specified schedule, remote or locally stored data is extracted and uploaded to the destination data source(s). Specifically, within embodiments of the present invention, the extracted data is normalized in a per fact basis into a XML document. The normalized XML data/document is transformed into a desired format according to transformation specifications that have been associated with the extracted data. Thereafter, the transformed data is loaded to a destination data source.
  • Aspects of the present invention are carried out within a computing system environment. The computer system as operated by a system user can embody a conventional personal computer on which a Web services based application that is configured to accomplish the objectives of the present methodologies is operating. As is conventional, the computer system also includes other hardware and software elements that are conventionally included in personal computers.
  • Turning now to the drawings in greater detail, it will be seen that in FIG. 1 there is diagram detailing structural hardware and software components that may be implemented within embodiments of the present invention. As shown in FIG. 1, the present invention may embody a web services-based application 105, wherein the web services-based application 105 comprises a plurality of software processing components. Specifically, the web service-based application 105 comprises a maintenance component 110, wherein the function of the maintenance component 110 is to monitor the operations of the data source scheduler 115, the transformation specification repository 140, and the data source repository 130.
  • Further comprises, are a data extraction component 120, a data transformation component 135, and a data load component 150. Source data that is to be processed within the system is extracted form a source data source 145, normalized into a predetermined data format at a normalized data store 125, and eventually uploaded to the specified destination data source(s) 155.
  • A comprehensive listing of data sources that the system is configured to extract data from is stored at the data source repository 130. Within aspects of the present invention the data extractor 120 acts to gather specified data from a data source 145 that is referenced at the data source repository 130. The extraction operation can either be initiated at the will of a system operator, or as part of a scheduled event that has been notated by the data source scheduler 115. Specifically, a data extraction operation comprises the function of fetching data from a data source 145. The data is extracted from the specified data source 145 at a rate that complies with the performance limitations of the remote data source host 145. Thereafter, the extracted data is transmitted to the normalized data store 125, where the extracted data is normalized and stored into XML documents.
  • The normalized data is retrieved from the normalized data store 125 by the data transformation component 135. Thereafter, the normalized data is transformed according to a set of data transformation rules that are contained within a data transformation specification. Data transformation specifications are stored and maintained at the transformation specification repository 140. Further, within embodiments of the present invention, data transformation specifications can be altered, or modified by a system operator at the time of the execution of the transformation of the normalized data. Essentially, data transformation is defined as the processes of converting normalized data into its final form prior to the data load component's 150 operation of transmitting (i.e., inserting or updating) the transformed data to its final destination data source.
  • Within yet further aspects of the present invention, there are specific protocols in place to assist in the remote or local storage and maintenance of data source references. In particular, data source references can be stored along with all the properties necessary to establish a proper connection for the data extraction operation, including any such credentials that are necessary for a successful extraction process.
  • FIG. 2 shows a flow diagram that illustrates aspects of a scheduled data extraction process that the present invention may be configured to perform. At step 205, a determination is made as to whether a data extraction event is scheduled. As a result, the data extraction schedule 135 of the data source repository is accessed to confirm the inquiry. In the event that there is no scheduled data extraction to be performed, at step 206, the system goes into a sleep mode. In the event that it is determined that a data extraction operation is scheduled to be performed, then at step 210, the data source 145 is contacted, and summarily the specified source data is extracted. The extracted data is thereafter normalized (step 215), and transformed according to the data transformation specification instructions that are stored at the transformation specification repository 140 (step 220). At step 225, the transformed data is loaded to the destination data source 155, and the extraction schedule 135 is updated in regard to the performed data transformation operation.
  • FIG. 3 shows a flowchart detailing aspects of a method for updating the information that is stored at the transformation specification repository 140, the data source repository 130, and the data extraction schedule 135. Within aspects of the present invention, in the event of any change to a data extraction schedule, data source reference, or data transformation specification, the modification information is initially submitted to the maintenance module 110, wherein thereafter the appropriate registry is updated. As shown at step 305, a configuration change has been initiated. At step 310, a determination is made to determine what type of configuration modification is to be made. In the event that the change is to be made to the data transformation specification, then the transformation repository 140 is accessed and the modifications are performed. In the event that the changes are to be made to the data source repository 130 or the extraction schedule 135, then the elements are respectively accessed and the modification to the configuration for the relevant data is performed.
  • The capabilities of the present invention can be implemented in software, firmware, hardware or some combination thereof.
  • As one example, one or more aspects of the present invention can be included in an article of manufacture (e.g., one or more computer program products) having, for instance, computer usable media. The media has embodied therein, for instance, computer readable program code means for providing and facilitating the capabilities of the present invention. The article of manufacture can be included as a part of a computer system or sold separately.
  • Additionally, at least one program storage device readable by a machine, tangibly embodying at least one program of instructions executable by the machine to perform the capabilities of the present invention can be provided.
  • The flow diagrams depicted herein are just examples. There may be many variations to these diagrams or the steps (or operations) described therein without departing from the spirit of the invention. For instance, the steps may be performed in a differing order, or steps may be added, deleted or modified. All of these variations are considered a part of the claimed invention.
  • While the preferred embodiment to the invention has been described, it will be understood that those skilled in the art, both now and in the future, may make various improvements and enhancements which fall within the scope of the claims which follow. These claims should be construed to maintain the proper protection for the invention first described.

Claims (2)

1-5. (canceled)
6. A method for the configurable real time transformation of dissimilar data sources in a web service-based environment, the method comprising:
acquiring real time information pertaining to at least one data source, wherein the information comprises reference information that is associated with the data source, data transformation specification information that is associated with the data source, and scheduled event specification information that is associated with the data source;
maintaining the data source information;
acquiring data from the data source in accordance with a specified scheduled event;
converting the acquired data into a predetermined standardized format using standardized normalization and a conversion format;
performing at least one data transformation function identified from the real time information upon the converted data in accordance with the acquired data transformation specification information associated with the data source;
updating an extraction schedule; and
transmitting the transformed data to a destination data source;
wherein the standardized normalization and conversion format is managed using extensible markup language;
wherein the data sources is stored at a remote location;
wherein the data sources, transformation specification information, and scheduled event specification information associated with the data source are configured to be modified.
US11/624,893 2007-01-19 2007-01-19 Method for service oriented data extraction transformation and load Abandoned US20080177892A1 (en)

Priority Applications (2)

Application Number Priority Date Filing Date Title
US11/624,893 US20080177892A1 (en) 2007-01-19 2007-01-19 Method for service oriented data extraction transformation and load
US12/129,211 US8307025B2 (en) 2007-01-19 2008-05-29 Method for service oriented data extraction transformation and load

Applications Claiming Priority (1)

Application Number Priority Date Filing Date Title
US11/624,893 US20080177892A1 (en) 2007-01-19 2007-01-19 Method for service oriented data extraction transformation and load

Related Child Applications (1)

Application Number Title Priority Date Filing Date
US12/129,211 Continuation US8307025B2 (en) 2007-01-19 2008-05-29 Method for service oriented data extraction transformation and load

Publications (1)

Publication Number Publication Date
US20080177892A1 true US20080177892A1 (en) 2008-07-24

Family

ID=39642344

Family Applications (2)

Application Number Title Priority Date Filing Date
US11/624,893 Abandoned US20080177892A1 (en) 2007-01-19 2007-01-19 Method for service oriented data extraction transformation and load
US12/129,211 Expired - Fee Related US8307025B2 (en) 2007-01-19 2008-05-29 Method for service oriented data extraction transformation and load

Family Applications After (1)

Application Number Title Priority Date Filing Date
US12/129,211 Expired - Fee Related US8307025B2 (en) 2007-01-19 2008-05-29 Method for service oriented data extraction transformation and load

Country Status (1)

Country Link
US (2) US20080177892A1 (en)

Cited By (8)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
US20110238613A1 (en) * 2010-03-28 2011-09-29 International Business Machines Corporation Comparing data series associated with two systems to identify hidden similarities between them
CN103092866A (en) * 2011-11-03 2013-05-08 金蝶软件(中国)有限公司 Data monitoring method and monitoring device
CN104008172A (en) * 2014-06-04 2014-08-27 浪潮软件集团有限公司 Data exchange method
CN104361133A (en) * 2014-12-10 2015-02-18 用友软件股份有限公司 Data extraction device and method
CN106126612A (en) * 2016-06-22 2016-11-16 重庆秒银科技有限公司 A kind of big ETL process dynamically divides the data pick-up method of timeslice
CN106897325A (en) * 2015-12-21 2017-06-27 阿里巴巴集团控股有限公司 A kind of data load method and device
CN107967347A (en) * 2017-12-07 2018-04-27 湖北三新文化传媒有限公司 Batch data processing method, server, system and storage medium
CN110826096A (en) * 2019-10-29 2020-02-21 杭州趣链科技有限公司 Data source sharing method based on block chain

Families Citing this family (4)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
US7209906B2 (en) 2002-01-14 2007-04-24 International Business Machines Corporation System and method for implementing a metrics engine for tracking relationships over time
WO2009146558A1 (en) * 2008-06-05 2009-12-10 Gss Group Inc. System and method for building a data warehouse
US9626416B2 (en) 2014-05-30 2017-04-18 International Business Machines Corporation Performance checking component for an ETL job
US11360952B2 (en) 2020-08-03 2022-06-14 Bank Of America Corporation System and method for managing data migration based on analysis of relevant data

Citations (8)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
US6032158A (en) * 1997-05-02 2000-02-29 Informatica Corporation Apparatus and method for capturing and propagating changes from an operational database to data marts
US20040254948A1 (en) * 2003-06-12 2004-12-16 International Business Machines Corporation System and method for data ETL in a data warehouse environment
US20040267751A1 (en) * 2003-04-25 2004-12-30 Marcus Dill Performing a data analysis process
US20050240354A1 (en) * 2003-08-27 2005-10-27 Ascential Software Corporation Service oriented architecture for an extract function in a data integration platform
US20060069803A1 (en) * 2004-07-30 2006-03-30 Sbc Knowledge Ventures, L.P. System and method for flexible data transfer
US20060106856A1 (en) * 2004-11-04 2006-05-18 International Business Machines Corporation Method and system for dynamic transform and load of data from a data source defined by metadata into a data store defined by metadata
US20060136354A1 (en) * 2004-12-17 2006-06-22 Bell Christopher R Apparatus and method for data warehousing
US20070078877A1 (en) * 2005-04-20 2007-04-05 Howard Ungar XBRL data conversion

Family Cites Families (9)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
US6076088A (en) * 1996-02-09 2000-06-13 Paik; Woojin Information extraction system and method using concept relation concept (CRC) triples
US6349276B1 (en) * 1998-10-29 2002-02-19 International Business Machines Corporation Multilingual information retrieval with a transfer corpus
US6584464B1 (en) * 1999-03-19 2003-06-24 Ask Jeeves, Inc. Grammar template query system
US6701294B1 (en) * 2000-01-19 2004-03-02 Lucent Technologies, Inc. User interface for translating natural language inquiries into database queries and data presentations
US7117215B1 (en) * 2001-06-07 2006-10-03 Informatica Corporation Method and apparatus for transporting data for data warehousing applications that incorporates analytic data interface
US6604110B1 (en) * 2000-08-31 2003-08-05 Ascential Software, Inc. Automated software code generation from a metadata-based repository
US7403938B2 (en) * 2001-09-24 2008-07-22 Iac Search & Media, Inc. Natural language query processing
US7209906B2 (en) * 2002-01-14 2007-04-24 International Business Machines Corporation System and method for implementing a metrics engine for tracking relationships over time
US20030188264A1 (en) * 2002-03-29 2003-10-02 Full Degree, Inc. Method and apparatus for XML data normalization

Patent Citations (8)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
US6032158A (en) * 1997-05-02 2000-02-29 Informatica Corporation Apparatus and method for capturing and propagating changes from an operational database to data marts
US20040267751A1 (en) * 2003-04-25 2004-12-30 Marcus Dill Performing a data analysis process
US20040254948A1 (en) * 2003-06-12 2004-12-16 International Business Machines Corporation System and method for data ETL in a data warehouse environment
US20050240354A1 (en) * 2003-08-27 2005-10-27 Ascential Software Corporation Service oriented architecture for an extract function in a data integration platform
US20060069803A1 (en) * 2004-07-30 2006-03-30 Sbc Knowledge Ventures, L.P. System and method for flexible data transfer
US20060106856A1 (en) * 2004-11-04 2006-05-18 International Business Machines Corporation Method and system for dynamic transform and load of data from a data source defined by metadata into a data store defined by metadata
US20060136354A1 (en) * 2004-12-17 2006-06-22 Bell Christopher R Apparatus and method for data warehousing
US20070078877A1 (en) * 2005-04-20 2007-04-05 Howard Ungar XBRL data conversion

Cited By (9)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
US20110238613A1 (en) * 2010-03-28 2011-09-29 International Business Machines Corporation Comparing data series associated with two systems to identify hidden similarities between them
US8738563B2 (en) * 2010-03-28 2014-05-27 International Business Machines Corporation Comparing data series associated with two systems to identify hidden similarities between them
CN103092866A (en) * 2011-11-03 2013-05-08 金蝶软件(中国)有限公司 Data monitoring method and monitoring device
CN104008172A (en) * 2014-06-04 2014-08-27 浪潮软件集团有限公司 Data exchange method
CN104361133A (en) * 2014-12-10 2015-02-18 用友软件股份有限公司 Data extraction device and method
CN106897325A (en) * 2015-12-21 2017-06-27 阿里巴巴集团控股有限公司 A kind of data load method and device
CN106126612A (en) * 2016-06-22 2016-11-16 重庆秒银科技有限公司 A kind of big ETL process dynamically divides the data pick-up method of timeslice
CN107967347A (en) * 2017-12-07 2018-04-27 湖北三新文化传媒有限公司 Batch data processing method, server, system and storage medium
CN110826096A (en) * 2019-10-29 2020-02-21 杭州趣链科技有限公司 Data source sharing method based on block chain

Also Published As

Publication number Publication date
US8307025B2 (en) 2012-11-06
US20080294790A1 (en) 2008-11-27

Similar Documents

Publication Publication Date Title
US8307025B2 (en) Method for service oriented data extraction transformation and load
US8583603B2 (en) Employing user-context in connection with backup or restore of data
US20150026114A1 (en) System and method of automatically extracting data from plurality of data sources and loading the same to plurality of target databases
US10896226B2 (en) Data analysis system, data analysis method, and data analysis program
US10049142B1 (en) Multi-step code generation for bi processes
US10212256B2 (en) Delegating database queries
US9135454B2 (en) Systems and methods for enabling searchable encryption
US8745179B2 (en) Managing multiple application configuration versions in a heterogeneous network
US20200285622A1 (en) System and method for data structure migration control
US20160253401A1 (en) Synchronization adapter for synchronizing data to applications that do not directly support synchronization
US20180089293A1 (en) System and method for file management in data structures
US7756839B2 (en) Version tolerant serialization
US11757815B1 (en) Data aggregation from multiple entities
US10754868B2 (en) System for analyzing the runtime impact of data files on data extraction, transformation, and loading jobs
US8849755B2 (en) Configuration information management apparatus and dictionary generation method of configuration information management apparatus
US10795880B2 (en) System and method for inter-program file control communication
US11755613B2 (en) Streamlined data engineering
US9858323B2 (en) Intermediate data format for database population
US20140379650A1 (en) Managing a file-based versioning system
US20080168494A1 (en) Methods, arrangements and computer program products for digital media production
WO2017021186A1 (en) A computerized database management system
US8433699B1 (en) Object identity and addressability
US20100106746A1 (en) Modular interface for database conversion
US9208252B1 (en) Reducing multi-source feed reader content redundancy
WO2011053268A1 (en) Modular interface for database conversion

Legal Events

Date Code Title Description
AS Assignment

Owner name: INTERNATIONAL BUSINESS MACHINES CORPORATION, NEW Y

Free format text: ASSIGNMENT OF ASSIGNORS INTEREST;ASSIGNOR:ALBA, ALFREDO;REEL/FRAME:018856/0897

Effective date: 20070118

STCB Information on status: application discontinuation

Free format text: ABANDONED -- FAILURE TO RESPOND TO AN OFFICE ACTION