KR20150102207A - Method of generating data warehouse and storage medium for storing program therefor - Google Patents
Method of generating data warehouse and storage medium for storing program therefor Download PDFInfo
- Publication number
- KR20150102207A KR20150102207A KR1020140023789A KR20140023789A KR20150102207A KR 20150102207 A KR20150102207 A KR 20150102207A KR 1020140023789 A KR1020140023789 A KR 1020140023789A KR 20140023789 A KR20140023789 A KR 20140023789A KR 20150102207 A KR20150102207 A KR 20150102207A
- Authority
- KR
- South Korea
- Prior art keywords
- database
- data warehouse
- model
- generating
- information
- Prior art date
Links
Images
Classifications
-
- G—PHYSICS
- G06—COMPUTING; CALCULATING OR COUNTING
- G06F—ELECTRIC DIGITAL DATA PROCESSING
- G06F16/00—Information retrieval; Database structures therefor; File system structures therefor
- G06F16/20—Information retrieval; Database structures therefor; File system structures therefor of structured data, e.g. relational data
- G06F16/25—Integrating or interfacing systems involving database management systems
- G06F16/252—Integrating or interfacing systems involving database management systems between a Database Management System and a front-end application
Landscapes
- Engineering & Computer Science (AREA)
- Databases & Information Systems (AREA)
- Theoretical Computer Science (AREA)
- Data Mining & Analysis (AREA)
- Physics & Mathematics (AREA)
- General Engineering & Computer Science (AREA)
- General Physics & Mathematics (AREA)
- Information Retrieval, Db Structures And Fs Structures Therefor (AREA)
Abstract
A data warehouse creating method for creating a data warehouse from a database storing business information on a predetermined business field, the method comprising: (a) acquiring a model database according to the business field; (b) generating a multidimensional modeling structure by performing Extraction, Transformation & Transfer (ETL) on the model database; (c) mapping the task information to the model database to generate a reference database; And (d) generating the data warehouse from the reference database based on the multidimensional modeling structure.
Description
The present invention relates to a data warehouse generating method and a recording medium storing a data warehouse generating program. More particularly, the present invention relates to a data warehouse generating program, And a recording medium storing a program for executing the method.
Description of the Related Art [0002] In recent years, technologies utilizing business information of companies, banks, hospitals, schools, public institutions, organizations (hereinafter referred to as "enterprises, etc.") have been advanced. One example is Enterprise Resource Planning (ERP).
In this regard, companies are increasingly building data warehouses to improve the analytical capabilities of business information. In this case, it is necessary to build a data warehouse on the basis of business information stored in an existing database held by a company or the like.
As a specific method thereof, Non-Patent Document 1 discloses a technique of converting data of an existing database into a CSV (Comma-Separated Values) file and constructing a data warehouse using the converted data.
However, according to the above-described technology, there is a problem that a data warehouse requires a lot of time, manpower, and resources because of its huge workload.
Therefore, there is a need for technologies that can save time, manpower, and resources in building data warehouses from existing databases owned by companies and the like.
An object of the present invention is to provide a method for constructing a data warehouse from an existing database possessed by a company or the like with only a small amount of time, resources, and manpower, and a recording medium storing a program for executing the method.
According to an aspect of the present invention, there is provided a data warehouse creating method for creating a data warehouse from a database storing work information on a predetermined business field, the method comprising: (a) acquiring a model database according to the business field; (b) generating a multidimensional modeling structure by performing Extraction, Transformation & Transfer (ETL) on the model database; (c) mapping the task information to the model database to generate a reference database; And (d) generating the data warehouse from the reference database based on the multidimensional modeling structure.
According to another aspect of the present invention, there is provided an information processing method comprising the steps of: (a) acquiring a plurality of model databases respectively corresponding to a plurality of business fields; (b) performing ETL on the plurality of model databases to generate a plurality of multidimensional modeling structures; (c) mapping an object database to one of the plurality of business fields; (d) generating a reference database by mapping the data of the target database to a model database corresponding to the one business field among the plurality of model databases; And (e) generating a data warehouse from the reference database based on a multidimensional modeling structure corresponding to the one of the plurality of multidimensional modeling structures.
According to another aspect of the present invention, there is provided a storage medium storing a program for creating a data warehouse from a database storing task information related to a predetermined business field, the program comprising: (a) ; (b) generating a multidimensional modeling structure by performing an ETL on the model database; (c) generating a reference database by mapping the task information to the model database; And (d) generating a data warehouse from the reference database based on the multidimensional modeling structure.
According to the data warehouse generating method and the data warehouse generating program according to the present invention, it is possible to save time, manpower, and resources in constructing a data warehouse from an existing database owned by a company or the like .
1 is a schematic diagram illustrating an outline of a data warehouse generating method according to the prior art.
2 is a schematic diagram illustrating an outline of a data warehouse generating method according to the present invention.
3 is a schematic diagram illustrating a process of generating a multidimensional modeling structure from a model database and providing a model OLAP in the data warehouse generating method according to the present invention.
4 is a flowchart illustrating a data warehouse generating method according to the first embodiment of the present invention.
5 is a flowchart illustrating a data warehouse generating method according to a second embodiment of the present invention.
Hereinafter, preferred embodiments of the present invention will be described in detail with reference to the accompanying drawings.
1 is a schematic diagram illustrating an outline of a data warehouse generating method according to the prior art.
Referring to FIG. 1, in the prior art described in Non-Patent Document 1, a CSV file is generated by coding each data stored in an existing
However, since the amount of data stored in the
2 is a schematic diagram illustrating an outline of a data warehouse generating method according to the present invention.
2, in the method of generating a data warehouse according to the present invention, a CSV file is not generated from the data stored in the
2 exemplarily shows a plurality of model databases, but one model database may be used. Although the description herein refers to a case where the data stored in the
The model databases 110-1, 110-2, ..., 110-n may be constructed or prepared in advance for each business field. Business areas may include, for example, manufacturing, financial, medical, service, retail, education, and administration. In this case, for example, the model database 110-1 may be a standard database for manufacturing, the model database 110-2 may be a standard database for the financial industry, and the model database 110-3 may be a standard database for the medical industry .
The task of mapping the data stored in the
3 is a schematic diagram illustrating a process of generating a multidimensional modeling structure from a model database and providing a model OLAP (Online Analytical Processing) in a data warehouse generating method according to the present invention.
Referring to FIG. 3, the model database 110-2 performs an Extraction, Transformation & Transfer (ETL) to generate a
Figure 3 shows a star schema with a fact table 121 and a dimension table 122 as an example of a
The model OLAP 130 may be, for example, a multi-dimensional OLAP, a relational OLAP, or a hybrid OLAP, and preferably includes a user interface.
4 is a flowchart illustrating a data warehouse generating method according to the first embodiment of the present invention.
Hereinafter, a data warehouse generating method according to the first embodiment will be described in detail with reference to FIGS. 2 to 4. FIG.
The method of generating a data warehouse according to the first embodiment is a method of generating a
First, the model database 110-2 according to the task field of the task information stored in the
Next, the
If the
After step S120, the model OLAP 130 is provided based on the multidimensional modeling structure 120 (step S130). FIG. 4 exemplarily illustrates that step S130 is performed between steps S120 and S140, but step S130 may be performed at any point in time prior to step S160 after step S120.
Next, the business information stored in the
Next, the
Next, OLAP for the
5 is a flowchart illustrating a data warehouse generating method according to a second embodiment of the present invention.
Hereinafter, a data warehouse generating method according to the second embodiment will be described in detail with reference to FIGS. 2 and 5. FIG.
First, a plurality of model databases 110-1, 110-2,..., 110-n corresponding to a plurality of business fields are acquired (step S210). The business field is not limited and may include at least one of, for example, manufacturing, financial, medical, service, retail, education, and administration.
Next, ETL is performed on the plurality of model databases 110-1, 110-2,... 110-n to generate a plurality of multidimensional modeling structures (step S220). The plurality of multidimensional modeling structures may include at least one of a star schema, a snow flake schema, and a Galaxy schema, some or all of which may be generated based on a key performance indicator (KPI).
When at least one of the plurality of multidimensional modeling structures includes a star schema, the star schema includes a fact table and a dimension table, and the dimension table may include at least one dimension of time, organization, product, and region.
After step S220, a plurality of model OLAPs are provided based on a plurality of multidimensional modeling structures (step S230). FIG. 4 illustrates that step S230 is performed between step S220 and step S240. However, step S230 may be performed at any time prior to step S270.
Next, the
Next, the
Next, the
Next, OLAP for the
Hereinafter, a data warehouse generating program according to the present invention and a recording medium storing the program will be described.
A data warehouse generating program according to the present invention is a program for causing a computer to execute a data warehouse generating method according to the first or second embodiment, wherein the recording medium according to the present invention is a computer- And store the program.
Specifically, the recording medium according to the present invention is a recording medium for storing a program for creating a data warehouse from a database storing business information related to a predetermined business field, the program comprising a model database Order to acquire; A step of performing an ETL on the model database to generate a multidimensional modeling structure; Generating a reference database by mapping the task information to the model database; And executing the order of generating the data warehouse from the reference database based on the multidimensional modeling structure.
Alternatively, the recording medium according to the present invention may comprise: a sequence of acquiring a plurality of model databases respectively corresponding to a plurality of business fields; Generating a plurality of multidimensional modeling structures by performing an ETL on the plurality of model databases; Mapping a target database to one of the plurality of business domains; Generating a reference database by mapping the data of the target database to a model database corresponding to the one business field among the plurality of model databases; And storing a program for causing a computer to execute a procedure of generating a data warehouse from the reference database based on a multidimensional modeling structure corresponding to the one business field among the plurality of multidimensional modeling structures.
Although the present invention has been described in detail, it should be understood that the present invention is not limited thereto. Those skilled in the art will appreciate that various modifications may be made without departing from the essential characteristics of the present invention. Will be possible.
Therefore, the embodiments disclosed in the present specification are intended to illustrate rather than limit the present invention, and the scope and spirit of the present invention are not limited by these embodiments. The scope of the present invention should be construed according to the following claims, and all the techniques within the scope of equivalents should be construed as being included in the scope of the present invention.
100: databases 110-1 to 110-n: model database
120: multidimensional modeling structure 120-1: fact table
120-2: Dimension table 130: Model OLAP
150: Reference database 200: Data warehouse
Claims (10)
(a) obtaining a model database according to the task field;
(b) generating a multidimensional modeling structure by performing Extraction, Transformation & Transfer (ETL) on the model database;
(c) mapping the task information to the model database to generate a reference database; And
(d) generating the data warehouse from the reference database based on the multidimensional modeling structure
The data warehouse generating method comprising:
(e) providing a model OLAP (Online Analytical Processing) based on the multidimensional modeling structure after the step (b); And
(f) after step (d), providing OLAP for the data warehouse based on the model OLAP
The data warehouse generating method further comprising:
Wherein the multidimensional modeling structure comprises a star schema.
The star schema includes a fact table and a dimension table,
Wherein the dimension table comprises at least one dimension of time, organization, product, and region.
In the step (b), the multidimensional modeling structure is generated based on a key performance indicator (KPI).
The business field includes at least one of manufacturing, financial, and medical care.
Wherein the business information includes at least one of financial information, shipping information, order information, inventory information, process information, evaluation information, and productivity information.
(b) performing ETL on the plurality of model databases to generate a plurality of multidimensional modeling structures;
(c) mapping an object database to one of the plurality of business fields;
(d) generating a reference database by mapping the data of the target database to a model database corresponding to the one business field among the plurality of model databases; And
(e) generating a data warehouse from the reference database based on a multidimensional modeling structure corresponding to the one of the plurality of multidimensional modeling structures,
The data warehouse generating method comprising:
(e) providing a plurality of model OLAPs based on the plurality of multidimensional modeling structures after the step (b); And
(f) providing OLAP for the data warehouse based on a model OLAP corresponding to the one business field among the plurality of model OLAPs after the step (e)
The data warehouse generating method further comprising:
The program
(a) obtaining a model database according to the business field;
(b) generating a multidimensional modeling structure by performing an ETL on the model database;
(c) generating a reference database by mapping the task information to the model database; And
(d) a step of generating the data warehouse from the reference database based on the multidimensional modeling structure
Readable recording medium.
Priority Applications (1)
Application Number | Priority Date | Filing Date | Title |
---|---|---|---|
KR1020140023789A KR20150102207A (en) | 2014-02-28 | 2014-02-28 | Method of generating data warehouse and storage medium for storing program therefor |
Applications Claiming Priority (1)
Application Number | Priority Date | Filing Date | Title |
---|---|---|---|
KR1020140023789A KR20150102207A (en) | 2014-02-28 | 2014-02-28 | Method of generating data warehouse and storage medium for storing program therefor |
Publications (1)
Publication Number | Publication Date |
---|---|
KR20150102207A true KR20150102207A (en) | 2015-09-07 |
Family
ID=54243015
Family Applications (1)
Application Number | Title | Priority Date | Filing Date |
---|---|---|---|
KR1020140023789A KR20150102207A (en) | 2014-02-28 | 2014-02-28 | Method of generating data warehouse and storage medium for storing program therefor |
Country Status (1)
Country | Link |
---|---|
KR (1) | KR20150102207A (en) |
Cited By (2)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
CN112148807A (en) * | 2020-09-28 | 2020-12-29 | 中国电波传播研究所(中国电子科技集团公司第二十二研究所) | Electromagnetic environment field data warehouse construction method |
CN112328706A (en) * | 2020-11-03 | 2021-02-05 | 成都中科大旗软件股份有限公司 | Dimension modeling calculation method under digital bin system, computer equipment and storage medium |
-
2014
- 2014-02-28 KR KR1020140023789A patent/KR20150102207A/en not_active Application Discontinuation
Cited By (3)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
CN112148807A (en) * | 2020-09-28 | 2020-12-29 | 中国电波传播研究所(中国电子科技集团公司第二十二研究所) | Electromagnetic environment field data warehouse construction method |
CN112328706A (en) * | 2020-11-03 | 2021-02-05 | 成都中科大旗软件股份有限公司 | Dimension modeling calculation method under digital bin system, computer equipment and storage medium |
CN112328706B (en) * | 2020-11-03 | 2023-10-24 | 成都中科大旗软件股份有限公司 | Dimension modeling calculation method under digital bin system, computer equipment and storage medium |
Similar Documents
Publication | Publication Date | Title |
---|---|---|
US11537635B2 (en) | Hadoop OLAP engine | |
US11042569B2 (en) | System and method for load, aggregate and batch calculation in one scan in a multidimensional database environment | |
Aboutorabiª et al. | Performance evaluation of SQL and MongoDB databases for big e-commerce data | |
Russom | Big data analytics | |
US10546021B2 (en) | Adjacency structures for executing graph algorithms in a relational database | |
Muntean et al. | Agile BI-The Future of BI. | |
CN109997126A (en) | Event-driven is extracted, transformation, loads (ETL) processing | |
US9348874B2 (en) | Dynamic recreation of multidimensional analytical data | |
US20110208691A1 (en) | Accessing Large Collection Object Tables in a Database | |
Agarwal et al. | Approximate incremental big-data harmonization | |
US20190303506A1 (en) | Adaptable adjacency structure for querying graph data | |
US8892505B2 (en) | Method for scheduling a task in a data warehouse | |
Hanlin et al. | Research on extract, transform and load (ETL) in land and resources star schema data warehouse | |
Kim et al. | Big data security hardening methodology using attributes relationship | |
Yeung et al. | Integrating machine learning technology to data analytics for e-commerce on cloud | |
Özkarabacak et al. | A comparison analysis between ERP and EAI | |
CN107181729B (en) | Data encryption in a multi-tenant cloud environment | |
US20180018382A1 (en) | System for defining clusters for a set of objects | |
CN113287100A (en) | System and method for generating in-memory table model database | |
KR20150102207A (en) | Method of generating data warehouse and storage medium for storing program therefor | |
US20140304234A1 (en) | System decommissioning through reverse archiving of data | |
Arputhamary et al. | A review on big data integration | |
US11347796B2 (en) | Eliminating many-to-many joins between database tables | |
KR20180000413A (en) | Method of generating data warehouse and storage medium for storing program therefor | |
US10558652B2 (en) | Merging multiproviders in a database calculation scenario |
Legal Events
Date | Code | Title | Description |
---|---|---|---|
E601 | Decision to refuse application |