KR20150102207A - Method of generating data warehouse and storage medium for storing program therefor - Google Patents

Method of generating data warehouse and storage medium for storing program therefor Download PDF

Info

Publication number
KR20150102207A
KR20150102207A KR1020140023789A KR20140023789A KR20150102207A KR 20150102207 A KR20150102207 A KR 20150102207A KR 1020140023789 A KR1020140023789 A KR 1020140023789A KR 20140023789 A KR20140023789 A KR 20140023789A KR 20150102207 A KR20150102207 A KR 20150102207A
Authority
KR
South Korea
Prior art keywords
database
data warehouse
model
generating
information
Prior art date
Application number
KR1020140023789A
Other languages
Korean (ko)
Inventor
나문수
Original Assignee
(주)로직소프트
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
Filing date
Publication date
Application filed by (주)로직소프트 filed Critical (주)로직소프트
Priority to KR1020140023789A priority Critical patent/KR20150102207A/en
Publication of KR20150102207A publication Critical patent/KR20150102207A/en

Links

Images

Classifications

    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06FELECTRIC DIGITAL DATA PROCESSING
    • G06F16/00Information retrieval; Database structures therefor; File system structures therefor
    • G06F16/20Information retrieval; Database structures therefor; File system structures therefor of structured data, e.g. relational data
    • G06F16/25Integrating or interfacing systems involving database management systems
    • G06F16/252Integrating or interfacing systems involving database management systems between a Database Management System and a front-end application

Landscapes

  • Engineering & Computer Science (AREA)
  • Databases & Information Systems (AREA)
  • Theoretical Computer Science (AREA)
  • Data Mining & Analysis (AREA)
  • Physics & Mathematics (AREA)
  • General Engineering & Computer Science (AREA)
  • General Physics & Mathematics (AREA)
  • Information Retrieval, Db Structures And Fs Structures Therefor (AREA)

Abstract

A data warehouse creating method for creating a data warehouse from a database storing business information on a predetermined business field, the method comprising: (a) acquiring a model database according to the business field; (b) generating a multidimensional modeling structure by performing Extraction, Transformation & Transfer (ETL) on the model database; (c) mapping the task information to the model database to generate a reference database; And (d) generating the data warehouse from the reference database based on the multidimensional modeling structure.

Description

[0001] METHOD OF GENERATING DATA WAREHOUSE AND STORAGE MEDIUM FOR STORING PROGRAM THEREFOR [0002]

The present invention relates to a data warehouse generating method and a recording medium storing a data warehouse generating program. More particularly, the present invention relates to a data warehouse generating program, And a recording medium storing a program for executing the method.

Description of the Related Art [0002] In recent years, technologies utilizing business information of companies, banks, hospitals, schools, public institutions, organizations (hereinafter referred to as "enterprises, etc.") have been advanced. One example is Enterprise Resource Planning (ERP).

In this regard, companies are increasingly building data warehouses to improve the analytical capabilities of business information. In this case, it is necessary to build a data warehouse on the basis of business information stored in an existing database held by a company or the like.

As a specific method thereof, Non-Patent Document 1 discloses a technique of converting data of an existing database into a CSV (Comma-Separated Values) file and constructing a data warehouse using the converted data.

However, according to the above-described technology, there is a problem that a data warehouse requires a lot of time, manpower, and resources because of its huge workload.

Therefore, there is a need for technologies that can save time, manpower, and resources in building data warehouses from existing databases owned by companies and the like.

 "Implementing Oracle BI Applications using Universal Adapters" by Saurabh Verma on May 21, 2013, Oracle BI applications blog (https://blogs.oracle.com/biapps/entry/implementing_oracle_bi_applications_using)

An object of the present invention is to provide a method for constructing a data warehouse from an existing database possessed by a company or the like with only a small amount of time, resources, and manpower, and a recording medium storing a program for executing the method.

According to an aspect of the present invention, there is provided a data warehouse creating method for creating a data warehouse from a database storing work information on a predetermined business field, the method comprising: (a) acquiring a model database according to the business field; (b) generating a multidimensional modeling structure by performing Extraction, Transformation & Transfer (ETL) on the model database; (c) mapping the task information to the model database to generate a reference database; And (d) generating the data warehouse from the reference database based on the multidimensional modeling structure.

According to another aspect of the present invention, there is provided an information processing method comprising the steps of: (a) acquiring a plurality of model databases respectively corresponding to a plurality of business fields; (b) performing ETL on the plurality of model databases to generate a plurality of multidimensional modeling structures; (c) mapping an object database to one of the plurality of business fields; (d) generating a reference database by mapping the data of the target database to a model database corresponding to the one business field among the plurality of model databases; And (e) generating a data warehouse from the reference database based on a multidimensional modeling structure corresponding to the one of the plurality of multidimensional modeling structures.

According to another aspect of the present invention, there is provided a storage medium storing a program for creating a data warehouse from a database storing task information related to a predetermined business field, the program comprising: (a) ; (b) generating a multidimensional modeling structure by performing an ETL on the model database; (c) generating a reference database by mapping the task information to the model database; And (d) generating a data warehouse from the reference database based on the multidimensional modeling structure.

According to the data warehouse generating method and the data warehouse generating program according to the present invention, it is possible to save time, manpower, and resources in constructing a data warehouse from an existing database owned by a company or the like .

1 is a schematic diagram illustrating an outline of a data warehouse generating method according to the prior art.
2 is a schematic diagram illustrating an outline of a data warehouse generating method according to the present invention.
3 is a schematic diagram illustrating a process of generating a multidimensional modeling structure from a model database and providing a model OLAP in the data warehouse generating method according to the present invention.
4 is a flowchart illustrating a data warehouse generating method according to the first embodiment of the present invention.
5 is a flowchart illustrating a data warehouse generating method according to a second embodiment of the present invention.

Hereinafter, preferred embodiments of the present invention will be described in detail with reference to the accompanying drawings.

1 is a schematic diagram illustrating an outline of a data warehouse generating method according to the prior art.

Referring to FIG. 1, in the prior art described in Non-Patent Document 1, a CSV file is generated by coding each data stored in an existing database 100 held by a company or the like, .

However, since the amount of data stored in the database 100 is large, it takes a lot of time, manpower, and resources to code each of them to generate a CSV file.

2 is a schematic diagram illustrating an outline of a data warehouse generating method according to the present invention.

2, in the method of generating a data warehouse according to the present invention, a CSV file is not generated from the data stored in the database 100, but the data stored in the database 100 is stored in the model databases 110-1 and 110-2 , ... 110-n (the model database 110-2 in the example shown in FIG. 2) to generate the reference database 150, and the data warehouse 200 is generated based thereon.

2 exemplarily shows a plurality of model databases, but one model database may be used. Although the description herein refers to a case where the data stored in the database 100 is mapped to the model database 110-2 for the sake of convenience of description, Are mapped to those other than the model database 110-2 out of the models 110-1, 110-2, ... 110-n.

The model databases 110-1, 110-2, ..., 110-n may be constructed or prepared in advance for each business field. Business areas may include, for example, manufacturing, financial, medical, service, retail, education, and administration. In this case, for example, the model database 110-1 may be a standard database for manufacturing, the model database 110-2 may be a standard database for the financial industry, and the model database 110-3 may be a standard database for the medical industry .

The task of mapping the data stored in the database 100 to the model database 110-2 is simpler than the coding operation to generate the CSV file from the data stored in the database 100. [ Accordingly, the data warehouse creation method according to the present invention can save time, manpower, and resources for creating the data warehouse 200 compared to the prior art.

3 is a schematic diagram illustrating a process of generating a multidimensional modeling structure from a model database and providing a model OLAP (Online Analytical Processing) in a data warehouse generating method according to the present invention.

Referring to FIG. 3, the model database 110-2 performs an Extraction, Transformation & Transfer (ETL) to generate a multidimensional modeling structure 120, and a model OLAP 130 is generated based thereon.

Figure 3 shows a star schema with a fact table 121 and a dimension table 122 as an example of a multidimensional modeling structure 120. [ However, the present invention is not limited thereto, and the multidimensional modeling structure 120 may be, for example, a snowflake schema or a galaxy schema.

The model OLAP 130 may be, for example, a multi-dimensional OLAP, a relational OLAP, or a hybrid OLAP, and preferably includes a user interface.

4 is a flowchart illustrating a data warehouse generating method according to the first embodiment of the present invention.

Hereinafter, a data warehouse generating method according to the first embodiment will be described in detail with reference to FIGS. 2 to 4. FIG.

The method of generating a data warehouse according to the first embodiment is a method of generating a data warehouse 200 from a database 100 that stores business information related to a predetermined business field. The business field is not limited and may include at least one of, for example, manufacturing, financial, medical, service, retail, education, and administration.

First, the model database 110-2 according to the task field of the task information stored in the database 100 is acquired (step S110).

Next, the multi-dimensional modeling structure 120 is created by performing ETL on the model database 110-2 (step S120). The multidimensional modeling structure may include at least one of a star schema, a snow flake schema, and a Galaxy schema, and may be generated based on a key performance indicator (KPI).

If the multidimensional modeling structure 120 includes a star schema, the star schema includes a fact table 120-1 and a dimension table 120-2, and the dimension table may include at least one of time, organization, Dimensions.

After step S120, the model OLAP 130 is provided based on the multidimensional modeling structure 120 (step S130). FIG. 4 exemplarily illustrates that step S130 is performed between steps S120 and S140, but step S130 may be performed at any point in time prior to step S160 after step S120.

Next, the business information stored in the database 100 is mapped to the model database 110-2 to generate the reference database 150 (step S140). The task information may include at least one of financial information, delivery information, order information, inventory information, process information, evaluation information, and productivity information.

Next, the data warehouse 200 is created from the reference database 150 based on the multidimensional modeling structure 120 (step S150).

Next, OLAP for the data warehouse 200 is provided based on the model OLAP 130 (step S160).

5 is a flowchart illustrating a data warehouse generating method according to a second embodiment of the present invention.

Hereinafter, a data warehouse generating method according to the second embodiment will be described in detail with reference to FIGS. 2 and 5. FIG.

First, a plurality of model databases 110-1, 110-2,..., 110-n corresponding to a plurality of business fields are acquired (step S210). The business field is not limited and may include at least one of, for example, manufacturing, financial, medical, service, retail, education, and administration.

Next, ETL is performed on the plurality of model databases 110-1, 110-2,... 110-n to generate a plurality of multidimensional modeling structures (step S220). The plurality of multidimensional modeling structures may include at least one of a star schema, a snow flake schema, and a Galaxy schema, some or all of which may be generated based on a key performance indicator (KPI).

When at least one of the plurality of multidimensional modeling structures includes a star schema, the star schema includes a fact table and a dimension table, and the dimension table may include at least one dimension of time, organization, product, and region.

After step S220, a plurality of model OLAPs are provided based on a plurality of multidimensional modeling structures (step S230). FIG. 4 illustrates that step S230 is performed between step S220 and step S240. However, step S230 may be performed at any time prior to step S270.

Next, the database 100 is mapped to one of the plurality of business fields (S240). Specifically, the database 100 is corresponded to a business field (for example, financial business) closest to the field to which the data stored in the database 100 belongs among the plurality of business fields (for example, manufacturing industry, financial industry, .

Next, the reference database 150 is generated by mapping the data of the database 100 to a model database (for example, model database 110-2) corresponding to the one business field (for example, financial business) (Step S250). The data may include at least one of financial information, delivery information, order information, inventory information, process information, evaluation information, and productivity information.

Next, the data warehouse 200 is created from the reference database 150 based on the multidimensional modeling structure corresponding to the one business field (for example, financial business) among the plurality of multidimensional modeling structures (operation S260) .

Next, OLAP for the data warehouse 200 is provided based on a model OLAP corresponding to the one business field (for example, financial business) among the plurality of model OLAP provided in the step S230 (step S270).

Hereinafter, a data warehouse generating program according to the present invention and a recording medium storing the program will be described.

A data warehouse generating program according to the present invention is a program for causing a computer to execute a data warehouse generating method according to the first or second embodiment, wherein the recording medium according to the present invention is a computer- And store the program.

Specifically, the recording medium according to the present invention is a recording medium for storing a program for creating a data warehouse from a database storing business information related to a predetermined business field, the program comprising a model database Order to acquire; A step of performing an ETL on the model database to generate a multidimensional modeling structure; Generating a reference database by mapping the task information to the model database; And executing the order of generating the data warehouse from the reference database based on the multidimensional modeling structure.

Alternatively, the recording medium according to the present invention may comprise: a sequence of acquiring a plurality of model databases respectively corresponding to a plurality of business fields; Generating a plurality of multidimensional modeling structures by performing an ETL on the plurality of model databases; Mapping a target database to one of the plurality of business domains; Generating a reference database by mapping the data of the target database to a model database corresponding to the one business field among the plurality of model databases; And storing a program for causing a computer to execute a procedure of generating a data warehouse from the reference database based on a multidimensional modeling structure corresponding to the one business field among the plurality of multidimensional modeling structures.

Although the present invention has been described in detail, it should be understood that the present invention is not limited thereto. Those skilled in the art will appreciate that various modifications may be made without departing from the essential characteristics of the present invention. Will be possible.

Therefore, the embodiments disclosed in the present specification are intended to illustrate rather than limit the present invention, and the scope and spirit of the present invention are not limited by these embodiments. The scope of the present invention should be construed according to the following claims, and all the techniques within the scope of equivalents should be construed as being included in the scope of the present invention.

100: databases 110-1 to 110-n: model database
120: multidimensional modeling structure 120-1: fact table
120-2: Dimension table 130: Model OLAP
150: Reference database 200: Data warehouse

Claims (10)

A data warehouse generating method for generating a data warehouse from a database storing business information on a predetermined business field,
(a) obtaining a model database according to the task field;
(b) generating a multidimensional modeling structure by performing Extraction, Transformation & Transfer (ETL) on the model database;
(c) mapping the task information to the model database to generate a reference database; And
(d) generating the data warehouse from the reference database based on the multidimensional modeling structure
The data warehouse generating method comprising:
The method according to claim 1,
(e) providing a model OLAP (Online Analytical Processing) based on the multidimensional modeling structure after the step (b); And
(f) after step (d), providing OLAP for the data warehouse based on the model OLAP
The data warehouse generating method further comprising:
The method according to claim 1,
Wherein the multidimensional modeling structure comprises a star schema.
The method of claim 3,
The star schema includes a fact table and a dimension table,
Wherein the dimension table comprises at least one dimension of time, organization, product, and region.
The method according to claim 1,
In the step (b), the multidimensional modeling structure is generated based on a key performance indicator (KPI).
The method according to claim 1,
The business field includes at least one of manufacturing, financial, and medical care.
The method according to claim 1,
Wherein the business information includes at least one of financial information, shipping information, order information, inventory information, process information, evaluation information, and productivity information.
(a) acquiring a plurality of model databases corresponding respectively to a plurality of business fields;
(b) performing ETL on the plurality of model databases to generate a plurality of multidimensional modeling structures;
(c) mapping an object database to one of the plurality of business fields;
(d) generating a reference database by mapping the data of the target database to a model database corresponding to the one business field among the plurality of model databases; And
(e) generating a data warehouse from the reference database based on a multidimensional modeling structure corresponding to the one of the plurality of multidimensional modeling structures,
The data warehouse generating method comprising:
9. The method of claim 8,
(e) providing a plurality of model OLAPs based on the plurality of multidimensional modeling structures after the step (b); And
(f) providing OLAP for the data warehouse based on a model OLAP corresponding to the one business field among the plurality of model OLAPs after the step (e)
The data warehouse generating method further comprising:
A recording medium storing a program for creating a data warehouse from a database storing job information relating to a predetermined business field,
The program
(a) obtaining a model database according to the business field;
(b) generating a multidimensional modeling structure by performing an ETL on the model database;
(c) generating a reference database by mapping the task information to the model database; And
(d) a step of generating the data warehouse from the reference database based on the multidimensional modeling structure
Readable recording medium.
KR1020140023789A 2014-02-28 2014-02-28 Method of generating data warehouse and storage medium for storing program therefor KR20150102207A (en)

Priority Applications (1)

Application Number Priority Date Filing Date Title
KR1020140023789A KR20150102207A (en) 2014-02-28 2014-02-28 Method of generating data warehouse and storage medium for storing program therefor

Applications Claiming Priority (1)

Application Number Priority Date Filing Date Title
KR1020140023789A KR20150102207A (en) 2014-02-28 2014-02-28 Method of generating data warehouse and storage medium for storing program therefor

Publications (1)

Publication Number Publication Date
KR20150102207A true KR20150102207A (en) 2015-09-07

Family

ID=54243015

Family Applications (1)

Application Number Title Priority Date Filing Date
KR1020140023789A KR20150102207A (en) 2014-02-28 2014-02-28 Method of generating data warehouse and storage medium for storing program therefor

Country Status (1)

Country Link
KR (1) KR20150102207A (en)

Cited By (2)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN112148807A (en) * 2020-09-28 2020-12-29 中国电波传播研究所(中国电子科技集团公司第二十二研究所) Electromagnetic environment field data warehouse construction method
CN112328706A (en) * 2020-11-03 2021-02-05 成都中科大旗软件股份有限公司 Dimension modeling calculation method under digital bin system, computer equipment and storage medium

Cited By (3)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN112148807A (en) * 2020-09-28 2020-12-29 中国电波传播研究所(中国电子科技集团公司第二十二研究所) Electromagnetic environment field data warehouse construction method
CN112328706A (en) * 2020-11-03 2021-02-05 成都中科大旗软件股份有限公司 Dimension modeling calculation method under digital bin system, computer equipment and storage medium
CN112328706B (en) * 2020-11-03 2023-10-24 成都中科大旗软件股份有限公司 Dimension modeling calculation method under digital bin system, computer equipment and storage medium

Similar Documents

Publication Publication Date Title
US11537635B2 (en) Hadoop OLAP engine
US11042569B2 (en) System and method for load, aggregate and batch calculation in one scan in a multidimensional database environment
Aboutorabiª et al. Performance evaluation of SQL and MongoDB databases for big e-commerce data
Russom Big data analytics
US10546021B2 (en) Adjacency structures for executing graph algorithms in a relational database
Muntean et al. Agile BI-The Future of BI.
CN109997126A (en) Event-driven is extracted, transformation, loads (ETL) processing
US9348874B2 (en) Dynamic recreation of multidimensional analytical data
US20110208691A1 (en) Accessing Large Collection Object Tables in a Database
Agarwal et al. Approximate incremental big-data harmonization
US20190303506A1 (en) Adaptable adjacency structure for querying graph data
US8892505B2 (en) Method for scheduling a task in a data warehouse
Hanlin et al. Research on extract, transform and load (ETL) in land and resources star schema data warehouse
Kim et al. Big data security hardening methodology using attributes relationship
Yeung et al. Integrating machine learning technology to data analytics for e-commerce on cloud
Özkarabacak et al. A comparison analysis between ERP and EAI
CN107181729B (en) Data encryption in a multi-tenant cloud environment
US20180018382A1 (en) System for defining clusters for a set of objects
CN113287100A (en) System and method for generating in-memory table model database
KR20150102207A (en) Method of generating data warehouse and storage medium for storing program therefor
US20140304234A1 (en) System decommissioning through reverse archiving of data
Arputhamary et al. A review on big data integration
US11347796B2 (en) Eliminating many-to-many joins between database tables
KR20180000413A (en) Method of generating data warehouse and storage medium for storing program therefor
US10558652B2 (en) Merging multiproviders in a database calculation scenario

Legal Events

Date Code Title Description
E601 Decision to refuse application