CN112612778A - Enterprise data architecture method - Google Patents

Enterprise data architecture method Download PDF

Info

Publication number
CN112612778A
CN112612778A CN202011562598.5A CN202011562598A CN112612778A CN 112612778 A CN112612778 A CN 112612778A CN 202011562598 A CN202011562598 A CN 202011562598A CN 112612778 A CN112612778 A CN 112612778A
Authority
CN
China
Prior art keywords
data
physical
model
enterprise
business
Prior art date
Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
Pending
Application number
CN202011562598.5A
Other languages
Chinese (zh)
Inventor
胡盛行
刘彬
李春鸣
弘娅晖
陈帅
Current Assignee (The listed assignees may be inaccurate. Google has not performed a legal analysis and makes no representation or warranty as to the accuracy of the list.)
Shanghai Aviation Industrial Group Co ltd
Original Assignee
Shanghai Aviation Industrial Group Co ltd
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
Filing date
Publication date
Application filed by Shanghai Aviation Industrial Group Co ltd filed Critical Shanghai Aviation Industrial Group Co ltd
Priority to CN202011562598.5A priority Critical patent/CN112612778A/en
Publication of CN112612778A publication Critical patent/CN112612778A/en
Pending legal-status Critical Current

Links

Images

Classifications

    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06FELECTRIC DIGITAL DATA PROCESSING
    • G06F16/00Information retrieval; Database structures therefor; File system structures therefor
    • G06F16/20Information retrieval; Database structures therefor; File system structures therefor of structured data, e.g. relational data
    • G06F16/21Design, administration or maintenance of databases
    • G06F16/211Schema design and management
    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06FELECTRIC DIGITAL DATA PROCESSING
    • G06F16/00Information retrieval; Database structures therefor; File system structures therefor
    • G06F16/20Information retrieval; Database structures therefor; File system structures therefor of structured data, e.g. relational data
    • G06F16/28Databases characterised by their database models, e.g. relational or object models
    • G06F16/283Multi-dimensional databases or data warehouses, e.g. MOLAP or ROLAP

Landscapes

  • Engineering & Computer Science (AREA)
  • Databases & Information Systems (AREA)
  • Theoretical Computer Science (AREA)
  • Data Mining & Analysis (AREA)
  • Physics & Mathematics (AREA)
  • General Engineering & Computer Science (AREA)
  • General Physics & Mathematics (AREA)
  • Information Retrieval, Db Structures And Fs Structures Therefor (AREA)

Abstract

An enterprise data architecture method comprises the following steps of scanning an enterprise data warehouse, collecting metadata of the data warehouse, and obtaining an application data model; acquiring a cross mapping relation of a physical-application data model by using a data blood margin analysis tool; identifying and defining a business data object according to the cross mapping relation of the physical-application data model; and analyzing the cross mapping relation of the physical-application data model, establishing the relation between business data objects, and forming an enterprise logic data model.

Description

Enterprise data architecture method
Technical Field
The invention belongs to the field of manufacturing enterprise data management, and particularly relates to an enterprise data architecture method based on meta-model cross mapping.
Background
With the development of manufacturing industry, the enterprise becomes more and more complex from product design, development, production to business activities, and the operation and decision of the enterprise increasingly depend on efficient and high-quality data support. For the requirement, the current practice of enterprises is to collect, sort and gather the data of the whole enterprise business database into a data warehouse to support cross-department and cross-system data analysis and query services.
However, with the development of civil aircraft manufacturing activities by civil aircraft enterprises, the analysis requirements on data are more and more extensive and deeper, and the civil aircraft manufacturing business and technology are very complex, and have the characteristics of large data volume, wide distribution and multiple types. If the civil aircraft enterprise data architecture is unclear and lacks a data panoramic view, data redundancy is easily caused, data response is not timely, and the problem of inconsistent data supply is solved. If the data asset utilization efficiency of the civil aircraft enterprise is low, the improvement of the data sharing service capability is seriously restricted. Based on the above, the civil aircraft manufacturing industry needs to construct a data architecture with global height of an enterprise urgently to support enterprise data management and guarantee effective exertion of enterprise data value.
Disclosure of Invention
The invention aims to provide a data architecture method based on meta-model cross mapping, which is used for constructing a logical data model of the enterprise global height by comparing the mapping relation between a business system physical model and a data warehouse application data model, and provides powerful support for the fields of enterprise data management and data service.
In one embodiment of the present invention, an enterprise data architecture method includes the following steps,
scanning an enterprise data warehouse, and acquiring metadata of the data warehouse to obtain an application data model;
acquiring a cross mapping relation of a physical-application data model by using a data blood margin analysis tool;
identifying and defining a business data object according to the cross mapping relation of the physical-application data model;
and analyzing the cross mapping relation of the physical-application data model, establishing the relation between business data objects, and forming an enterprise logic data model.
Drawings
The above and other objects, features and advantages of exemplary embodiments of the present invention will become readily apparent from the following detailed description read in conjunction with the accompanying drawings. Several embodiments of the invention are illustrated by way of example, and not by way of limitation, in the figures of the accompanying drawings and in which:
FIG. 1 is a flowchart of a metadata cross-mapping based enterprise data architecture method, according to one embodiment of the invention.
Detailed Description
Currently, in the field of enterprise data architecture development, most enterprises are implemented by building data models. Conventional implementations typically have both top-down and bottom-up. The top-down mode depends on the familiarity with the business, modeling personnel need a great deal of business research, and although the obtained business information is accurate, the time and the labor are often consumed. The bottom-up method needs to start from a database to construct a mapping relation between data and services, however, system construction technical architectures of enterprises are not uniform, and the data conditions of the enterprises to the systems are not completely mastered, so that the data asset inventory result is difficult to fit the actual services. Therefore, enterprises need a data architecture combing method which has the advantages of small difficulty, low personnel requirement, standardized process, accurate and reliable obtained result and higher landing performance.
For the above problems, the existing solution is to collect technical metadata by using a metadata collection tool, collect corresponding service metadata from a system service scene as a source point, perform link analysis on the service metadata, integrate a data architecture, and provide a data panoramic view to support data problem tracking and data heat analysis. The method is based on the metadata angle, the data link is tracked by using a technical means, the data architecture is constructed, the labor cost is effectively reduced, and the developed data architecture has strong data analysis capability. However, the scheme carries out data architecture design based on a business system, the metadata integration process is complex, the technical dependence degree is high, and implementation is difficult to promote.
In order to solve the problems, the invention considers starting from data warehouse data integrated facing business 'theme', explores a method for efficiently constructing an enterprise global high-level data architecture, so as to promote enterprise data management, realize enterprise data asset manageability and traceability and ensure effective sharing and service of data among all business systems.
According to one or more embodiments, an enterprise data architecture method based on meta-model cross-mapping includes scanning a data warehouse, collecting meta-data of the data warehouse, and obtaining an application data model; acquiring a cross mapping relation of a physical-application data model by using a data blood margin analysis tool; identifying and defining a business data object according to the cross mapping relation of the physical-application data model; and analyzing the cross mapping relation of the physical-application data model, establishing the relation between service data objects, and forming a logic data model.
And the scanning data warehouse collects the metadata information of the data warehouse and obtains the application metadata model. The specific operation is as follows: collecting data warehouse metadata includes: the subject of the metadata, the name of the data table, the name of the field, and the like. In addition, considering that the data warehouse construction is application-driven, the model constructed by the metadata is called an application data model. The metadata collection is divided into a client and a server, the server is adapted to the data warehouse and the service system and collects the metadata, and the client integrates and visually displays the metadata.
The physical-application data model is obtained by using a data blood margin analysis tool, and the method comprises the following steps: with the aid of a data blood margin analysis tool, a data source is traced forward by taking metadata (application data model) of a data warehouse as a starting point, so as to clear where the current data comes from and what kind of processing is performed. And collecting source business system metadata (hereinafter, the source business system metadata is referred to as a physical data model) based on the blood-related analysis result, and constructing a mapping relation of the physical-application data model. Wherein collecting source service system metadata comprises: the information of database type, connection mode, Schema, data structure, database table name, field, description, etc.
The step of identifying and defining business data objects based on the cross-mapping relationship of the physical-application data model extracts data objects based on the data warehouse application data model. The reason is that the above-mentioned materials are,
the physical data model of the source system generally strictly follows the third paradigm to support the extensibility of the business, and metadata with close business relations needs to be further aggregated to form a data object. The data warehouse application model emphasizes data integration, and the design of the data warehouse form is based on the principle of 'high cohesion and low coupling', and in physical implementation, the integration is performed on the data warehouse form with tight business relation and small source system influence difference; the business relation is small, and the source system influences the big difference to divide and place. In contrast, identifying data objects from an application data model is more efficient. Moreover, the data warehouse application model usually adopts a star model, and is composed of a fact table and a dimension table, wherein the fact table records the full information of a certain business theme. Thus, the data warehouse fact table may be abstracted as a business data object.
In addition, in general, in a multi-bin implementation, there are multiple data tables to extract and integrate the same or similar source data, resulting in duplication or similarity of the multi-bin service metadata. Therefore, in the process of data asset combing, metadata pointing to the same business data object needs to be merged to simplify the analysis of the incidence relation of subsequent business data objects. The concrete implementation mode is that,
and calculating the similarity of the data sources of the two service data objects according to the cross mapping relation of the physical-application data model, and merging the two data objects if the similarity is higher than a certain threshold value.
And analyzing the cross mapping relation of the physical-application data model, establishing the relation between service data objects, and forming a logic data model. The method aims to construct the incidence relation among the business data objects by analyzing the data incidence relation of a physical data model of a source system. The reason is that the above-mentioned materials are,
the source system physical model typically designs a fact table to track business process detail data based on each business action in the business process, while data objects identified based on data warehouse data typically aggregate the subject information of a business process and keep the relationships between data objects loose. Therefore, the association relationship between the data objects needs to be retrieved by means of the physical data model, and the specific implementation manner is,
and aiming at the data objects with the homologous data, establishing an incidence relation between the corresponding data objects according to the calculation result. And inquiring whether any source data table between the two data objects has a main foreign key association relation or not aiming at the data objects without the homologous data, and if so, associating the two data objects.
According to the method, a physical model is quickly converted into a business level logic data model through analysis of a physical-application data model cross mapping relation. The method effectively solves the problems that the time period for constructing the logic data model from top to bottom is long, the database data is not known from bottom to top, and the combing result cannot fit the business fact.
In accordance with one or more embodiments, a flow diagram of a data architecture method based on meta-model cross-mapping is shown in FIG. 1. The method comprises the following steps:
step 101, scanning a data warehouse, and collecting metadata of the data warehouse to obtain an application data model;
here, the data warehouse is scanned according to the account number of the data warehouse, and all metadata information of the data warehouse is obtained. Collecting data warehouse metadata includes: the subject of the metadata, fact table, table field information.
The metadata management unit is divided into a client and a server, the server is matched with the data warehouse and the service system and collects data, and the client integrates and visually displays the metadata.
102, acquiring a cross mapping relation of a physical-application data model by using a data blood margin analysis tool;
with the aid of a data blood margin analysis tool, a data source is traced forward by taking metadata (application data model) of a data warehouse as a starting point, so as to clear where the current data comes from and what kind of processing is performed.
And collecting source business system metadata (physical data model) based on the blood relationship analysis result, and constructing a mapping relation of the physical-application data model. Wherein recording the source system metadata comprises: the information of database type, connection mode, Schema, data structure, database table name, field, description, etc.
Step 103, identifying and defining a business data object according to the cross mapping relation of the physical-application data model;
here, the data object is extracted based on the data warehouse application data model and the data warehouse fact table is abstracted as a business data object. Particularly, for a data object with high similarity at the source of the data, the data object is considered to be merged into one data object, specifically:
assume two data objects are O1,O2If the data object O is in a physical-application data model cross-mapping relationship1,O2If homologous data exists, the similarity of the data sources is calculated. Suppose O1,O2The number of the mapped source data forms is n respectively1,n2(except for the main data sheet), the number of the homologous forms is m (except for the main data sheet), and then O1,O2The data source similarity is as follows:
Sim(O1,O2)=m/min(n1,n2) (1)
a) when Sim (O)1,O2) Epsilon (epsilon preset threshold), business data object O1,O2Are merged into one data object O1
b) When Sim (O)1,O2) < ε (ε Preset threshold), service data object O1,O2No merging is required.
And 104, analyzing the cross mapping relation of the physical-application data model, establishing the relation between service data objects, and forming a logic data model.
On one hand: and aiming at the data objects with the homologous data, establishing an incidence relation between the corresponding data objects according to the calculation result in the step 103.
On the other hand: for a data object without homologous data, whether a data table mapped to a physical model of a source system by the data object has a main foreign key association relation or not needs to be inquired, and if so, association of the two data objects is carried out;
and integrating the association relationship combing results of the data objects, importing a set template, and generating a global logic data model under a business view angle. The data management and analysis method is used for determining the data range and the data service capability of the enterprise and supporting data management, data query and analysis.
While the invention has been described with reference to specific embodiments, the invention is not limited thereto, and various equivalent modifications and substitutions can be easily made by those skilled in the art within the technical scope of the invention. Therefore, the protection scope of the present invention shall be subject to the protection scope of the claims.

Claims (6)

1. An enterprise data architecture method, comprising the steps of,
scanning an enterprise data warehouse, and acquiring metadata of the data warehouse to obtain an application data model;
acquiring a cross mapping relation of a physical-application data model by using a data blood margin analysis tool;
identifying and defining a business data object according to the cross mapping relation of the physical-application data model;
and analyzing the cross mapping relation of the physical-application data model, establishing the relation between business data objects, and forming an enterprise logic data model.
2. The enterprise data architecture method of claim 1, wherein the step of scanning the data warehouse collects metadata of the data warehouse including a subject, a data table name, a field name of the metadata,
the server is adapted to the data warehouse and the business system, metadata is collected, and the metadata is integrated and visually displayed through the client.
3. The enterprise data architecture method of claim 2, wherein the step of utilizing a data consanguinity analysis tool comprises,
by means of a data blood relationship analysis tool, taking an application data model of a data warehouse as a starting point, tracing back a data source forwards,
and collecting source service system metadata based on the blood relationship analysis result, defining the source service system metadata model as a physical data model, and constructing a mapping relation of the physical-application data model.
4. The enterprise data architecture method of claim 3, wherein the step of identifying and defining business data objects based on physical-to-application data model cross-mappings comprises,
extracting data objects based on the data warehouse application data model;
and calculating the similarity of the data sources of the business data objects according to the cross mapping relation of the physical-application data model, and merging the data objects if the similarity is higher than a certain threshold value.
5. The enterprise data architecture method of claim 4, wherein the step of analyzing the cross-mapping relationships of the physical-application data models, establishing relationships between business data objects, and forming the logical data model constructs relationships between the business data objects by analyzing data relationships of the physical data models of the source system, the step being performed by the process of analyzing the data relationships of the physical data models of the source system,
aiming at the data objects with the same source data, establishing an incidence relation between the corresponding data objects;
and inquiring whether any source data table between the two data objects has a main foreign key association relation or not aiming at the data objects without the homologous data, and if so, associating the two data objects.
6. The enterprise data architecture method of claim 1, wherein the enterprise is a civil aircraft manufacturing enterprise.
CN202011562598.5A 2020-12-25 2020-12-25 Enterprise data architecture method Pending CN112612778A (en)

Priority Applications (1)

Application Number Priority Date Filing Date Title
CN202011562598.5A CN112612778A (en) 2020-12-25 2020-12-25 Enterprise data architecture method

Applications Claiming Priority (1)

Application Number Priority Date Filing Date Title
CN202011562598.5A CN112612778A (en) 2020-12-25 2020-12-25 Enterprise data architecture method

Publications (1)

Publication Number Publication Date
CN112612778A true CN112612778A (en) 2021-04-06

Family

ID=75247815

Family Applications (1)

Application Number Title Priority Date Filing Date
CN202011562598.5A Pending CN112612778A (en) 2020-12-25 2020-12-25 Enterprise data architecture method

Country Status (1)

Country Link
CN (1) CN112612778A (en)

Citations (17)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CA2318299A1 (en) * 2000-09-01 2002-03-01 Cognos Incorporated Metadata exchange
US20060112109A1 (en) * 2004-11-23 2006-05-25 Chowdhary Pawan R Adaptive data warehouse meta model
CN101566981A (en) * 2008-04-24 2009-10-28 长沙创智天马财务软件有限公司 Method for establishing dynamic virtual data base in analyzing and processing system
CN102023979A (en) * 2009-09-09 2011-04-20 中国工商银行股份有限公司 Meta-data management method and system
CN102402522A (en) * 2010-09-09 2012-04-04 中国移动通信集团上海有限公司 Data query system and method
CN102662994A (en) * 2012-03-14 2012-09-12 北京久其软件股份有限公司 Method and system for establishing data warehouse utilizing virtual multidimensional data set
WO2012169807A2 (en) * 2011-06-07 2012-12-13 Baek Seung Ho Method and system for building database using data warehouse
CN104252506A (en) * 2013-06-28 2014-12-31 易保网络技术(上海)有限公司 Method and system for synchronously constructing and mapping service model and data warehouse model
CN106157132A (en) * 2016-06-20 2016-11-23 中国工商银行股份有限公司 Credit risk monitoring system and method
CN108733713A (en) * 2017-04-21 2018-11-02 阿里巴巴集团控股有限公司 Data query method and device in data warehouse
US20180356800A1 (en) * 2017-06-08 2018-12-13 Rockwell Automation Technologies, Inc. Predictive maintenance and process supervision using a scalable industrial analytics platform
CN109376469A (en) * 2018-11-21 2019-02-22 中国航空无线电电子研究所 A kind of modeling method of the avionics system component meta-model based on AADL
CN109558393A (en) * 2018-11-28 2019-04-02 中国海洋石油集团有限公司 A kind of data model construction method, device, equipment and storage medium
CN109582695A (en) * 2018-12-04 2019-04-05 用友网络科技股份有限公司 Data general-purpose querying method and inquiry system under a kind of micro services framework
CN111159184A (en) * 2019-12-25 2020-05-15 上海中信信息发展股份有限公司 Metadata tracing method and device and server
CN111611458A (en) * 2020-06-09 2020-09-01 普元信息技术股份有限公司 Method for realizing system data architecture combing based on metadata and data analysis technology in big data management
CN111625579A (en) * 2019-02-27 2020-09-04 阿里巴巴集团控股有限公司 Information processing method, device and system

Patent Citations (18)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CA2318299A1 (en) * 2000-09-01 2002-03-01 Cognos Incorporated Metadata exchange
US20060112109A1 (en) * 2004-11-23 2006-05-25 Chowdhary Pawan R Adaptive data warehouse meta model
CN101566981A (en) * 2008-04-24 2009-10-28 长沙创智天马财务软件有限公司 Method for establishing dynamic virtual data base in analyzing and processing system
CN102023979A (en) * 2009-09-09 2011-04-20 中国工商银行股份有限公司 Meta-data management method and system
CN102402522A (en) * 2010-09-09 2012-04-04 中国移动通信集团上海有限公司 Data query system and method
WO2012169807A2 (en) * 2011-06-07 2012-12-13 Baek Seung Ho Method and system for building database using data warehouse
CN102662994A (en) * 2012-03-14 2012-09-12 北京久其软件股份有限公司 Method and system for establishing data warehouse utilizing virtual multidimensional data set
US20150006467A1 (en) * 2013-06-28 2015-01-01 eBao Tech Corporation Method and system for designing business domain model, data warehouse model and mapping therebetween synchronously
CN104252506A (en) * 2013-06-28 2014-12-31 易保网络技术(上海)有限公司 Method and system for synchronously constructing and mapping service model and data warehouse model
CN106157132A (en) * 2016-06-20 2016-11-23 中国工商银行股份有限公司 Credit risk monitoring system and method
CN108733713A (en) * 2017-04-21 2018-11-02 阿里巴巴集团控股有限公司 Data query method and device in data warehouse
US20180356800A1 (en) * 2017-06-08 2018-12-13 Rockwell Automation Technologies, Inc. Predictive maintenance and process supervision using a scalable industrial analytics platform
CN109376469A (en) * 2018-11-21 2019-02-22 中国航空无线电电子研究所 A kind of modeling method of the avionics system component meta-model based on AADL
CN109558393A (en) * 2018-11-28 2019-04-02 中国海洋石油集团有限公司 A kind of data model construction method, device, equipment and storage medium
CN109582695A (en) * 2018-12-04 2019-04-05 用友网络科技股份有限公司 Data general-purpose querying method and inquiry system under a kind of micro services framework
CN111625579A (en) * 2019-02-27 2020-09-04 阿里巴巴集团控股有限公司 Information processing method, device and system
CN111159184A (en) * 2019-12-25 2020-05-15 上海中信信息发展股份有限公司 Metadata tracing method and device and server
CN111611458A (en) * 2020-06-09 2020-09-01 普元信息技术股份有限公司 Method for realizing system data architecture combing based on metadata and data analysis technology in big data management

Non-Patent Citations (5)

* Cited by examiner, † Cited by third party
Title
刘彬;梁建交;: "基于复杂组织体架构方法的企业数据中心总体设计", 智能制造, no. 08 *
史松;冀亚林;蔡娜;王霞;马成龙;: "数据架构模型在企业管理中的应用", 价值工程, no. 33 *
宋利康;崔德刚;周儒荣;: "企业模型与ERP系统间映射技术", 航空学报, no. 06 *
张国蕊;程燕;徐保文;: "航空工业企业架构构建方法研究", 航空制造技术, no. 19 *
李春鸣: "数据架构设计方法研究", IT技术论坛 *

Similar Documents

Publication Publication Date Title
CN113010506B (en) Multi-source heterogeneous water environment big data management system
CN112183036B (en) Format document generation method, device, equipment and storage medium
CN104679827A (en) Big data-based public information association method and mining engine
CN112183916B (en) Land reserve life cycle management system
Shahid et al. Application of data warehouse in real life: State-of-the-art survey from user preferences’ perspective
CN116384889A (en) Intelligent analysis method for information big data based on natural language processing technology
CN114880405A (en) Data lake-based data processing method and system
CN112732949A (en) Service data labeling method and device, computer equipment and storage medium
US10628421B2 (en) Managing a single database management system
CN111459900B (en) Big data life cycle setting method, device, storage medium and server
JP4989761B2 (en) Event history storage device, event history tracking device, event history storage method, and event history storage program
CN111061793B (en) Data processing system and method
CN112363996A (en) Method, system, and medium for building a physical model of a power grid knowledge graph
CN116881430A (en) Industrial chain identification method and device, electronic equipment and readable storage medium
CN112182259A (en) Drawing parameter processing method and device
CN112612778A (en) Enterprise data architecture method
CN112699245A (en) Construction method and device and application method and device of budget management knowledge graph
CN107004036A (en) Method and system to search for the daily record for including a large amount of entries
CN112667736B (en) Digital operation management system
CN115934693A (en) Dynamic calculation method for regional real population
CN114925042A (en) Method for constructing metadata relation based on graphic database
CN114021887A (en) Method for collecting enterprise employee work information
CN107391695A (en) A kind of information extracting method based on big data
CN114281795A (en) Data model construction method based on service data standard table
CN1588405A (en) Data processing device and method for risk control system

Legal Events

Date Code Title Description
PB01 Publication
PB01 Publication
SE01 Entry into force of request for substantive examination
SE01 Entry into force of request for substantive examination