CN109684486A - Construction method, device, computer equipment and the storage medium of metadata schema - Google Patents

Construction method, device, computer equipment and the storage medium of metadata schema Download PDF

Info

Publication number
CN109684486A
CN109684486A CN201811599601.3A CN201811599601A CN109684486A CN 109684486 A CN109684486 A CN 109684486A CN 201811599601 A CN201811599601 A CN 201811599601A CN 109684486 A CN109684486 A CN 109684486A
Authority
CN
China
Prior art keywords
data
attribute
metadata
incidence relation
type field
Prior art date
Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
Pending
Application number
CN201811599601.3A
Other languages
Chinese (zh)
Inventor
余芸
陈彬
衡星辰
王志英
邹文景
穆文杰
董召杰
吴丹
Current Assignee (The listed assignees may be inaccurate. Google has not performed a legal analysis and makes no representation or warranty as to the accuracy of the list.)
Dingxin Information Technology Co Ltd
China Southern Power Grid Co Ltd
Original Assignee
Dingxin Information Technology Co Ltd
China Southern Power Grid Co Ltd
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
Filing date
Publication date
Application filed by Dingxin Information Technology Co Ltd, China Southern Power Grid Co Ltd filed Critical Dingxin Information Technology Co Ltd
Priority to CN201811599601.3A priority Critical patent/CN109684486A/en
Publication of CN109684486A publication Critical patent/CN109684486A/en
Pending legal-status Critical Current

Links

Landscapes

  • Information Retrieval, Db Structures And Fs Structures Therefor (AREA)

Abstract

This application involves a kind of construction method of metadata schema, device, computer equipment and storage mediums, the type field that computer equipment passes through the unstructured data of acquisition each business domains of enterprise, and the first incidence relation between each type field, then according to business domains, type field and the first incidence relation, data correlation model is constructed;And according to preset tetrahedral model, the metadata of unstructured data is obtained, then above-mentioned metadata is added in data correlation model, constructs metadata schema.Using the above method, metadata schema integrated level can be made higher, it is easier to safeguard, and the building efficiency of lift scheme.

Description

Construction method, device, computer equipment and the storage medium of metadata schema
Technical field
The present invention relates to data technique fields, more particularly to the construction method, device, computer of a kind of metadata schema Equipment and storage medium.
Background technique
With the development of network technology, electric power enterprise produces a large amount of unstructured data, including text, video sound Frequently, the different-formats file such as picture is managed unstructured data using metadata, in the true of maintenance unstructured data It is real, complete, can be used, be appreciated that etc. extensive concern has been obtained.Wherein, metadata is a kind of data of structuring, is used It is described in unstructured data, unstructured data is allowed to be understood and handled by computer equipment.Non- knot The design of structure data metadata model is the key point that metadata is applied to unstructured data management.
Currently, the design method of existing metadata schema, mainly computer equipment are to unstructured platform institute, enterprise Some data entities are analyzed, and the attribute type of data entity is taken out, and are then classified to above-mentioned attribute type, merger Deng processing, the building of metadata schema is completed.
But when constructing metadata schema using the above method, the metadata schema structure for constructing completion is more complicated, leads Cause maintenance difficulties higher.
Summary of the invention
Based on this, it is necessary in view of the above technical problems, provide the construction method, device, calculating of a kind of metadata schema Machine equipment and storage medium.
A kind of construction method of metadata schema, which comprises
The first association obtained between the type field and each type field of the unstructured data of each business domains of enterprise is closed System;
According to the business domains, the type field and first incidence relation, data correlation model is constructed;
According to preset tetrahedral model, the metadata of the unstructured data is obtained;The metadata is used for institute The attribute for stating unstructured data is described;
It adds the metadata in the data correlation model, constructs metadata schema.
The tetrahedral model includes unstructured data, operating information system and industry in one of the embodiments, Three dimensions of service type, and the four attribute faces formed according to three dimensions;
Four attribute faces are used to describe each attribute of unstructured data.
Four attribute faces include: data service system characteristic attribute face, data service in one of the embodiments, Attribute face, data essential attribute face and data value characteristic attribute face;
Data service system characteristic attribute face be used for attribute of the unstructured data in operation system into Row description;The data traffic attributes face is for being described the unstructured data service attribute;The data are basic Attribute face is for being described the essential attribute of the unstructured data;Data value characteristic attribute face is used for described The attribute of the data value of unstructured data is described.
Sub- metadata of each attribute bread containing the metadata in one of the embodiments, the sub- metadata are Tertiary structure, in which: the first level structure includes at least one element set;Second level structure includes at least one in the element set A element and corresponding component identification;Third layer structure includes that at least one of the element limits element and corresponding Limit component identification.
First incidence relation obtained between each type field in one of the embodiments, comprising:
Obtain the second incidence relation between each attribute in same type domain;
Obtain the third incidence relation between the attribute in different type domain;
According to second incidence relation and the third incidence relation, first incidence relation is obtained.
It is described according to the business domains, the type field and first incidence relation, structure in one of the embodiments, Build data correlation model, comprising:
Table data store lattice are established according to the type field;
According between the type field and the business domains hierarchical relationship and first incidence relation, generate close It is supporting paper;
According to the table data store lattice and the relationship supporting paper, the data correlation model is constructed.
It is described in one of the embodiments, to add the metadata in the data correlation model, construct first number According to model, comprising:
According to the corresponding type field of the unstructured data, the metadata of the unstructured data is added to described In the corresponding table data store lattice of type field.
A kind of construction device of metadata schema, described device include:
First obtains module, for obtaining the type field of the unstructured data that each business domains of enterprise generate and each Incidence relation between type field;According to the business domains, the type field and first incidence relation, data correlation is constructed Model;
Second obtains module, for obtaining the metadata of the unstructured data according to preset tetrahedral model;Institute Metadata is stated for the attribute of the unstructured data to be described;
Module is constructed, for adding the metadata in the data correlation model, constructs metadata schema.
A kind of computer equipment, including memory and processor, the memory are stored with computer program, the processing Device realizes the step of construction method of above-mentioned metadata schema when executing the computer program.
A kind of computer readable storage medium, is stored thereon with computer program, and the computer program is held by processor The step of construction method of above-mentioned metadata schema is realized when row.
Construction method, device, computer equipment and the storage medium of above-mentioned metadata schema, computer equipment pass through acquisition The first incidence relation between the type field and each type field of the unstructured data of each business domains of enterprise, then basis Business domains, type field and the first incidence relation construct data correlation model;And according to preset tetrahedral model, non-knot is obtained Then above-mentioned metadata is added in data correlation model by the metadata of structure data, construct metadata schema.Due to calculating Machine equipment constructs data correlation model according to the business domains of enterprise, type field and the first incidence relation, so that the data are closed Gang mould type is more suitable for the business structure of the enterprise, and structure integrated level is high, it is easier to safeguard;Further, computer equipment root The metadata that unstructured data is obtained according to tetrahedral model, can carry out multi-faceted whole description to unstructured data, Allow to further improve the collection of metadata schema using unified description scheme to various types of unstructured datas Cheng Du makes metadata schema be easier to safeguard;In addition, computer equipment first constructs data correlation model, then by metadata It is added in data correlation model, improves metadata schema building efficiency.
Detailed description of the invention
Fig. 1 is the applied environment figure of the construction method of metadata schema in one embodiment;
Fig. 2 is the flow diagram of the construction method of metadata schema in one embodiment;
Fig. 3 a is the schematic diagram of tetrahedral model in one embodiment;
Fig. 3 b is the schematic diagram of metadata structure in one embodiment;
Fig. 4 is the flow diagram of the construction method of metadata schema in another embodiment;
Fig. 5 is the flow diagram of the construction method of metadata schema in another embodiment;
Fig. 6 is the structural block diagram of the construction method of metadata schema in one embodiment;
Fig. 7 is the structural block diagram of the construction method of metadata schema in another embodiment;
Fig. 8 is the structural block diagram of the construction method of metadata schema in another embodiment;
Fig. 9 is the internal structure chart of computer equipment in one embodiment.
Specific embodiment
It is with reference to the accompanying drawings and embodiments, right in order to which the objects, technical solutions and advantages of the application are more clearly understood The application is further elaborated.It should be appreciated that specific embodiment described herein is only used to explain the application, not For limiting the application.
The construction method of metadata schema provided by the embodiments of the present application can be applied to application environment as shown in Figure 1 In.Wherein, computer equipment 120 constructs metadata schema according to unstructured data 110.Wherein, computer equipment can with but It is not limited to various personal computers, laptop, smart phone and tablet computer etc..
In one embodiment, it as shown in Fig. 2, providing a kind of construction method of metadata schema, applies in this way It is illustrated for computer equipment in Fig. 1, the above method includes:
S101, obtain each business domains of enterprise unstructured data type field and each type field between first Incidence relation.
Wherein, above-mentioned business domains refer to is analyzed by the business structure to enterprise, the type of service of the enterprise of acquisition, For example, the business domains of electric power enterprise may include strategic management, planning construction, safety in production, the marketing, human resources, wealth It is engaged in management, handling of goods and materials, information management and integrated management totally nine types of service;In addition, business domains can also include to upper It states nine types of service to be finely divided, first-level class, secondary traffic classification and operation flow of acquisition etc..Such as in business domains Human resources can also include recruitment, discharge etc. and business classification, and further segment secondary traffic classification etc..On The type for stating the unstructured data that type field refers to that each business domains of enterprise generate, for example, human resources business domain is non- The type field of structural data may include the file types such as contract, list, opinion and video;Marketing business domains it is non- The type field of structural data may include the file types such as contract, bidding documents, customer profile.
Specifically, computer equipment, can be by each when obtaining the type field of unstructured data of each business domains The file type for including in the operation flow of business domains extracts the type fields of the business domains;It can also be directly existing from enterprise It is directly acquired in database, for example, type can be obtained from the common information model (abbreviation CIM model) in electric power enterprise Domain, wherein CIM model is an abstract model, is mainly used for describing all main objects of electric power enterprise.For the above-mentioned type The acquisition modes in domain, it is not limited here.
Wherein, above-mentioned first incidence relation is used to describe the association between above-mentioned each type field, for example, contract and employee First incidence relation of list, the contract of can be described as refer to the registration contract of employee;Client's list is associated with the first of account Relationship can be described as the account situation etc. of account reflection client.
Specifically, Unified Modeling Language (Unified Modeling Language, abbreviation can be used in computer equipment UML), the first incidence relation is established between each type field, wherein above-mentioned UML is the standardization modeling language of object-oriented, Association between each type field, type field can be made graphically to be depicted and.
S102, according to business domains, type field and the first incidence relation, construct data correlation model.
Computer equipment can construct on the basis of obtaining above-mentioned business domains, type field and the first incidence relation Data correlation model, above-mentioned data correlation model are the frame model of metadata schema, and computer equipment is by the non-structural of enterprise The metadata for changing data is added in said frame model, and the building of metadata schema can be completed.
Above-mentioned data correlation model may include the storage location of metadata, between each business domains and each type field Hierarchical relationship, such as first layer are nine business domains of electric power enterprise, and the second layer is the type field that each business domains respectively contain, with And the first incidence relation between each business domains.User can pass through above-mentioned data when retrieving to above-mentioned metadata Hierarchical relationship and the first incidence relation in correlation model, to obtain metadata relevant to content is retrieved.
Specifically, computer equipment, can be according to the different business domains of enterprise, to create when constructing data correlation model Different table data store lattice are built, the different type fields that can also include according to business domains create different table data stores Lattice are not limited thereto the building mode of above-mentioned data correlation model.
S103, according to preset tetrahedral model, obtain the metadata of unstructured data;Metadata is used for non-structural The attribute for changing data is described.
Wherein, metadata is to describe the data of unstructured data, is described for the attribute to unstructured data, A kind of machine understandable data are established for digital information source, allows computer equipment by reading metadata, comes Obtain the information of above-mentioned unstructured data;It is that identifying, evaluate and track unstructured data is using using purpose Variation in the process, realization simply and efficiently manage unstructured data.
Above-mentioned tetrahedral model is used to carry out unstructured data comprehensive description of multiple dimensions, above-mentioned tetrahedral model Four attribute faces that dimension is constituted, aforementioned four attribute are described including three different description dimensions, and by above three Face describes the different types of attribute an of unstructured data respectively.Computer equipment can be come according to different type fields Determine the different attribute types in upper four attribute faces of above-mentioned tetrahedral model.For example, being the non-knot of video for type field Four attribute faces of structure data, tetrahedral model may include essential attribute face, for describing the general property of video file, Including title, founder, creation time etc.;Further include semantic feature attribute face, for describing the semantic attribute of video file, wraps Include subject specification etc.;It further include low-level image feature attribute face, for describing color, the Texture eigenvalue attribute of video file;May be used also To include initial data characteristic attribute face, the attributes such as size for describing video file.It can obtain according to different needs Three dimensions for taking above-mentioned tetrahedral model, are not limited thereto the particular content of above-mentioned tetrahedral model.
Computer equipment, can be first according to the corresponding class of unstructured data when obtaining the metadata of unstructured data Type domain obtains the metadata of unstructured data according to above-mentioned different attribute to determine the different attribute in tetrahedral model.
S104, it adds metadata in data correlation model, constructs metadata schema.
Computer equipment is creating data correlation model, and after obtaining the metadata of unstructured data, by first number According to being added in above-mentioned data correlation model, the building of metadata schema can be completed.Wherein, metadata schema refers to that storage is non- The data model of the metadata of structural data.Computer equipment can be according to the corresponding type field of unstructured data, will The metadata of unstructured data is added in the corresponding table data store lattice in the type domain, completes the building of metadata schema; It can also be according to the different attribute type of the corresponding type field of unstructured data, by the description different attribute of unstructured data Metadata, be added in corresponding table data store lattice, complete the building of metadata schema;For above-mentioned metadata schema Building mode is not limited thereto.
The construction method of above-mentioned metadata schema, the unstructured data that computer equipment passes through acquisition each business domains of enterprise Type field and each type field between the first incidence relation, be then associated with according to business domains, type field with first System constructs data correlation model;And according to preset tetrahedral model, the metadata of unstructured data is obtained, it then will be upper It states metadata to be added in data correlation model, constructs metadata schema.Business domains, class due to computer equipment according to enterprise Type domain and the first incidence relation construct data correlation model, so that the data correlation model is more suitable for the business of the enterprise Framework, structure integrated level are high, it is easier to safeguard;Further, computer equipment obtains unstructured number according to tetrahedral model According to metadata, multi-faceted whole description can be carried out to unstructured data, is allowed to various types of non-structural Change data and use unified description scheme, further improve the integrated level of metadata schema, metadata schema is made to be easier to tie up Shield;In addition, computer equipment first constructs data correlation model, then add metadata in data correlation model, is promoted Metadata schema building efficiency.
In one embodiment, as shown in Figure 3a, above-mentioned tetrahedral model includes unstructured data, operating information system And three dimensions of type of service, and the four attribute faces formed according to three dimensions.
Further, aforementioned four attribute face may include: data service system characteristic attribute face, data traffic attributes Face, data essential attribute face and data value characteristic attribute face.Wherein, above-mentioned data service system characteristic attribute face is used for non- Attribute of the structural data in operation system is described, for example, being contract for type field in human resources business domain Unstructured data, the attribute in operation system may include contract approval time, the contract in human resources business system The attributes such as the version of process code and the contract in systems in system.Above-mentioned data traffic attributes face is used for unstructured Data traffic attributes are described, and above-mentioned service attribute refers to above-mentioned unstructured data, and the structuring number are generated in enterprise According to the relevant attribute of vocational work, continue by taking human resources business domain as an example, type field be contract unstructured data, Service attribute may include the attributes such as the department name for signing the contract, the upper layer department of the department.Above-mentioned data essential attribute Face is for being described the essential attribute of unstructured data, for example, the essential attribute of said contract may include signature pair As, sign date, signature key message etc..Above-mentioned data value characteristic attribute face is used for the category to the data value of unstructured data Property be described, for example, occupied storage sky when the attribute of the data value of said contract may include contract documents storage Between size etc..By above three dimension and four attribute faces, to all unstructured datas of enterprise, according to above-mentioned Attribute is described, and can make the structure of metadata schema apparent.
In one embodiment, as shown in Figure 3b, sub- metadata of each attribute bread containing metadata, sub- metadata are three Level structure, in which: the first level structure includes at least one element set;Second level structure includes at least one of element set member Element and corresponding component identification;Third layer structure includes at least one the restriction element and corresponding restriction element of element Mark.
Wherein, computer equipment the attribute on each attribute face can be described, and generate different sub- metadata, In, the corresponding sub- metadata in each attribute face;Above-mentioned unstructured data can also include multiple categories on each attribute face Property, description of the computer equipment to above-mentioned multiple attributes can form the element set of sub- metadata, that is, the of sub- metadata Primary structure, such as a sub- metadata may include element set A and element set B;Multiple attributes on the attribute face, with element The multiple elements concentrated correspond, and constitute the second level structure of sub- metadata, for example, element set A includes elements A 1 and member Plain A2;Further, above-mentioned element is further divided into different restriction elements, constitutes the third level structure of sub- metadata, for example, Elements A 1 includes to limit elements A 11 and limit elements A 12, and elements A 2 is comprising limiting elements A 21 and limiting elements A 22.Above-mentioned In the tertiary structure of sub- metadata, also comprising component identification and limit component identification, for above-mentioned element and limit element into Row unique identification, such as can be above-mentioned element or limit the English description of element.
Element by taking type field is the unstructured data of video as an example, in tetrahedral model on data essential attribute face Concentration may include three elements: save entity attribute, owner's attribute and document entity attribute;Wherein, entity category is saved The coding and the attribute in terms of encryption safe that property can be used for describing video;Above-mentioned owner's attribute can be used for describing video The attribute of owner's relevant information;Above-mentioned document entity attribute can be used for description and video file technological accumulation and inheritance attribute etc..Into One step, the restriction element of above-mentioned document entity attribute may include: media formats, data format, file extension, original wound Build environment and carrier expiration time etc..The above-mentioned English description for limiting component identification to limit element, such as media formats Component identification is limited as Media Format.
The construction method of above-mentioned metadata schema, computer equipment construct the son on each attribute face by tertiary structure Metadata can make the structure integrated level of metadata schema higher, and metadata schema is made to be easier to safeguard.
Fig. 4 is the flow diagram of the construction method of metadata schema in another embodiment, and the present embodiment is related to calculating Machine equipment obtains a kind of concrete mode of the first incidence relation, on the basis of the above embodiments, as shown in figure 4, above-mentioned S101 Include:
S201, obtain same type domain each attribute between the second incidence relation.
Specifically, computer equipment can be obtained first same when obtaining the first incidence relation between each type field The second incidence relation between each attribute of a type field;By taking type field is video as an example, the available view of computer equipment Incidence relation between the data format and storage size of frequency file.
S202, obtain different type domain attribute between third incidence relation.
Computer equipment can also obtain the third incidence relation between the attribute of different type fields, for example, for people Two type field contracts and wages in power resource system, may exist between the signature object of contract and the granting object of wages Incidence relation.
S203, according to the second incidence relation and third incidence relation, obtain the first incidence relation.
On the basis of obtaining above-mentioned second incidence relation and third incidence relation, above-mentioned first incidence relation includes The second incidence relation between the different attribute in same type domain, also between the attribute in the different type domain including same business domains Third incidence relation, further include the third incidence relation between the attribute of the type field of different business domains, therefore, Ke Yijian Erect the first incidence relation between all types domain of enterprise.
The construction method of above-mentioned metadata schema, computer equipment obtain type field according to the different attribute of type field Between the first incidence relation, the integrated level of metadata schema can be made higher, inhomogeneity can be realized in metadata schema Associative search between type domain.
Fig. 5 is the flow diagram of the construction method of metadata schema in another embodiment, and the present embodiment is related to one kind Computer equipment constructs the concrete mode of data correlation model, on the basis of the above embodiments, as shown in figure 5, above-mentioned S102 Include:
S301, table data store lattice are established according to type field.
Specifically, computer equipment can be deposited when constructing data correlation model according to each type field to establish data Table is stored up, may include the corresponding each attribute in the type domain in above-mentioned table data store lattice, for storing unstructured data Metadata in, metadata entity that different attributes is described.By taking type field is video as an example, the table data store lattice In the first row can be the different attribute type of video file, remaining every a line is for storing a unstructured data Metadata.
S302, according to the hierarchical relationship and the first incidence relation between type field and business domains, production Methods explanation File.
Computer equipment is closed according to the level between the first incidence relation of acquisition and the business domains and type field of enterprise System, can be generated relationship supporting paper, for example, graphically showing above-mentioned first incidence relation and hierarchical relationship.
S303, according to table data store lattice and relationship supporting paper, construct data correlation model.
Computer equipment is establishing table data store lattice according to type field, and after generating relationship supporting paper, obtains Obtained the relationship expository writing of the relation on attributes between multiple table data store lattice and each table in the different type domain of enterprise Part completes the building of metadata schema.
Further, computer equipment, can be according to unstructured number after obtaining the metadata of unstructured data According to corresponding type field, the metadata of unstructured data is added in the corresponding table data store lattice of type field.
The construction method of above-mentioned metadata schema, computer equipment establishes table data store lattice according to type field, and builds Vertical relationship supporting paper, can make the metadata of the unstructured data of enterprise, be stored according to type field, make metadata The structure of model is apparent.
It should be understood that although each step in the flow chart of Fig. 2, Fig. 4, Fig. 5 is successively shown according to the instruction of arrow Show, but these steps are not that the inevitable sequence according to arrow instruction successively executes.Unless expressly state otherwise herein, this There is no stringent sequences to limit for the execution of a little steps, these steps can execute in other order.Moreover, Fig. 2, Fig. 4, figure At least part step in 5 may include that perhaps these sub-steps of multiple stages or stage be not necessarily for multiple sub-steps It is to execute completion in synchronization, but can execute at different times, the execution sequence in these sub-steps or stage It is not necessarily and successively carries out, but can be at least part wheel of the sub-step or stage of other steps or other steps Stream alternately executes.
In one embodiment, as shown in fig. 6, providing a kind of construction device of metadata schema, comprising: first obtains Module 10, first constructs module 20, second and obtains module 30 and the second building module 40, in which:
First obtains module 10, for obtaining the type field and each class of the unstructured data of each business domains of enterprise The first incidence relation between type domain.
First building module 20, for constructing number according to the business domains, the type field and first incidence relation According to correlation model.
Second obtains module 30, for obtaining the metadata of the unstructured data according to preset tetrahedral model; Metadata is for being described the attribute of unstructured data.
Second building module 40 constructs metadata schema for adding metadata in data correlation model.
The construction device of the metadata schema of above-mentioned offer can execute above method embodiment, realization principle and skill Art effect is similar, and details are not described herein.
In one embodiment, tetrahedral model includes unstructured data, operating information system and type of service three A dimension, and the four attribute faces formed according to three dimensions;Four attribute faces are for describing the various of unstructured data Attribute.
In one embodiment, four attribute faces include: data service system characteristic attribute face, data traffic attributes face, Data essential attribute face and data value characteristic attribute face;Data service system characteristic attribute face is for existing to unstructured data Attribute in operation system is described;Data traffic attributes face is for being described unstructured data service attribute;Number According to essential attribute face for the essential attribute of unstructured data to be described;Data value characteristic attribute face is used for non-structural The attribute for changing the data value of data is described.
In one embodiment, sub- metadata of each attribute bread containing metadata, sub- metadata are tertiary structure, In: the first level structure includes at least one element set;Second level structure includes at least one element in element set, and corresponding Component identification;Third layer structure includes at least one the restriction element and corresponding restriction component identification of element.
In one embodiment, as shown in fig. 7, on the basis of the above embodiments, the first acquisition module 10 includes:
First acquisition unit 101, the second incidence relation between each attribute for obtaining same type domain.
Second acquisition unit 102, the third incidence relation between attribute for obtaining different type domain.
Third acquiring unit 103, for obtaining the first incidence relation according to the second incidence relation and third incidence relation.
In one embodiment, as shown in figure 8, on the basis of the above embodiments, the first building module 20 includes:
Unit 201 is established, for establishing table data store lattice according to type field.
Generation unit 202, for according to the hierarchical relationship and the first incidence relation between type field and business domains, life At relationship supporting paper.
Construction unit 203, for constructing data correlation model according to table data store lattice and relationship supporting paper.
In one embodiment, on the basis of the above embodiments, the second building module 40 is specifically used for: according to non-structural Change the corresponding type field of data, the metadata of unstructured data is added in the corresponding table data store lattice of type field.
The construction device of metadata schema provided by the embodiments of the present application can execute above method embodiment, realize Principle is similar with technical effect, and details are not described herein.
The specific of construction device about metadata schema limits the building that may refer to above for metadata schema The restriction of method, details are not described herein.Modules in the construction device of above-mentioned metadata schema can be fully or partially through Software, hardware and combinations thereof are realized.Above-mentioned each module can be embedded in the form of hardware or independently of the place in computer equipment It manages in device, can also be stored in a software form in the memory in computer equipment, in order to which processor calls execution or more The corresponding operation of modules.
In one embodiment, a kind of computer equipment is provided, which can be terminal, internal structure Figure can be as shown in Figure 9.The computer equipment includes processor, the memory, network interface, display connected by system bus Screen and input unit.Wherein, the processor of the computer equipment is for providing calculating and control ability.The computer equipment is deposited Reservoir includes non-volatile memory medium, built-in storage.The non-volatile memory medium is stored with operating system and computer journey Sequence.The built-in storage provides environment for the operation of operating system and computer program in non-volatile memory medium.The calculating The network interface of machine equipment is used to communicate with external terminal by network connection.When the computer program is executed by processor with Realize a kind of construction method of metadata schema.The display screen of the computer equipment can be liquid crystal display or electric ink Display screen, the input unit of the computer equipment can be the touch layer covered on display screen, be also possible to outside computer equipment Key, trace ball or the Trackpad being arranged on shell can also be external keyboard, Trackpad or mouse etc..
It will be understood by those skilled in the art that structure shown in Fig. 9, only part relevant to application scheme is tied The block diagram of structure does not constitute the restriction for the computer equipment being applied thereon to application scheme, specific computer equipment It may include perhaps combining certain components or with different component layouts than more or fewer components as shown in the figure.
In one embodiment, a kind of computer equipment, including memory and processor are provided, is stored in memory Computer program, the processor perform the steps of when executing computer program
The first association obtained between the type field and each type field of the unstructured data of each business domains of enterprise is closed System;
According to business domains, type field and the first incidence relation, data correlation model is constructed;
According to preset tetrahedral model, the metadata of unstructured data is obtained;Metadata is used for unstructured number According to attribute be described;
It adds metadata in data correlation model, constructs metadata schema.
In one embodiment, tetrahedral model includes unstructured data, operating information system and type of service three A dimension, and the four attribute faces formed according to three dimensions;Four attribute faces are for describing the various of unstructured data Attribute.
In one embodiment, four attribute faces include: data service system characteristic attribute face, data traffic attributes face, Data essential attribute face and data value characteristic attribute face;
Data service system characteristic attribute face is for being described attribute of the unstructured data in operation system;Number According to service attribute face for unstructured data service attribute to be described;Data essential attribute face is used for unstructured number According to essential attribute be described;Data value characteristic attribute face is used to retouch the attribute of the data value of unstructured data It states.
In one embodiment, sub- metadata of each attribute bread containing metadata, sub- metadata are tertiary structure, In: the first level structure includes at least one element set;Second level structure includes at least one element in element set, and corresponding Component identification;Third layer structure includes at least one the restriction element and corresponding restriction component identification of element.
In one embodiment, it is also performed the steps of when processor executes computer program and obtains same type domain The second incidence relation between each attribute;Obtain the third incidence relation between the attribute in different type domain;It is closed according to second Connection relationship and third incidence relation obtain the first incidence relation.
In one embodiment, it is also performed the steps of when processor executes computer program and number is established according to type field According to storage table;According to the hierarchical relationship and the first incidence relation between type field and business domains, production Methods expository writing Part;According to table data store lattice and relationship supporting paper, data correlation model is constructed.
In one embodiment, it also performs the steps of when processor executes computer program according to unstructured data The metadata of unstructured data is added in the corresponding table data store lattice of type field by corresponding type field.
In one embodiment, a kind of computer readable storage medium is provided, computer program is stored thereon with, is calculated Machine program performs the steps of when being executed by processor
The first association obtained between the type field and each type field of the unstructured data of each business domains of enterprise is closed System;
According to business domains, type field and the first incidence relation, data correlation model is constructed;
According to preset tetrahedral model, the metadata of unstructured data is obtained;Metadata is used for unstructured number According to attribute be described;
It adds metadata in data correlation model, constructs metadata schema.
In one embodiment, tetrahedral model includes unstructured data, operating information system and type of service three A dimension, and the four attribute faces formed according to three dimensions;Four attribute faces are for describing the various of unstructured data Attribute.
In one embodiment, four attribute faces include: data service system characteristic attribute face, data traffic attributes face, Data essential attribute face and data value characteristic attribute face;Data service system characteristic attribute face is for existing to unstructured data Attribute in operation system is described;Data traffic attributes face is for being described unstructured data service attribute;Number According to essential attribute face for the essential attribute of unstructured data to be described;Data value characteristic attribute face is used for non-structural The attribute for changing the data value of data is described.
In one embodiment, sub- metadata of each attribute bread containing metadata, sub- metadata are tertiary structure, In: the first level structure includes at least one element set;Second level structure includes at least one element in element set, and corresponding Component identification;Third layer structure includes at least one the restriction element and corresponding restriction component identification of element.
In one embodiment, it is also performed the steps of when computer program is executed by processor and obtains same type domain Each attribute between the second incidence relation;Obtain the third incidence relation between the attribute in different type domain;According to second Incidence relation and third incidence relation obtain the first incidence relation.
In one embodiment, it also performs the steps of when computer program is executed by processor and is established according to type field Table data store lattice;According to the hierarchical relationship and the first incidence relation between type field and business domains, production Methods explanation File;According to table data store lattice and relationship supporting paper, data correlation model is constructed.
In one embodiment, it also performs the steps of when computer program is executed by processor according to unstructured number According to corresponding type field, the metadata of unstructured data is added in the corresponding table data store lattice of type field.
Computer readable storage medium provided in this embodiment, implementing principle and technical effect and above method embodiment Similar, details are not described herein.
Those of ordinary skill in the art will appreciate that realizing all or part of the process in above-described embodiment method, being can be with Relevant hardware is instructed to complete by computer program, the computer program can be stored in a non-volatile computer In read/write memory medium, the computer program is when being executed, it may include such as the process of the embodiment of above-mentioned each method.Wherein, To any reference of memory, storage, database or other media used in each embodiment provided herein, Including non-volatile and/or volatile memory.Nonvolatile memory may include read-only memory (ROM), programming ROM (PROM), electrically programmable ROM (EPROM), electrically erasable ROM (EEPROM) or flash memory.Volatile memory may include Random access memory (RAM) or external cache.By way of illustration and not limitation, RAM is available in many forms, Such as static state RAM (SRAM), dynamic ram (DRAM), synchronous dram (SDRAM), double data rate sdram (DDRSDRAM), enhancing Type SDRAM (ESDRAM), synchronization link (Synchlink) DRAM (SLDRAM), memory bus (Rambus) direct RAM (RDRAM), direct memory bus dynamic ram (DRDRAM) and memory bus dynamic ram (RDRAM) etc..
Each technical characteristic of above embodiments can be combined arbitrarily, for simplicity of description, not to above-described embodiment In each technical characteristic it is all possible combination be all described, as long as however, the combination of these technical characteristics be not present lance Shield all should be considered as described in this specification.
The several embodiments of the application above described embodiment only expresses, the description thereof is more specific and detailed, but simultaneously It cannot therefore be construed as limiting the scope of the patent.It should be pointed out that coming for those of ordinary skill in the art It says, without departing from the concept of this application, various modifications and improvements can be made, these belong to the protection of the application Range.Therefore, the scope of protection shall be subject to the appended claims for the application patent.

Claims (10)

1. a kind of construction method of metadata schema, which is characterized in that the described method includes:
Obtain the first incidence relation between the type field and each type field of the unstructured data of each business domains of enterprise;
According to the business domains, the type field and first incidence relation, data correlation model is constructed;
According to preset tetrahedral model, the metadata of the unstructured data is obtained;The metadata is used for described non- The attribute of structural data is described;
It adds the metadata in the data correlation model, constructs metadata schema.
2. the method according to claim 1, wherein the tetrahedral model includes unstructured data, business Three dimensions of information system and type of service, and the four attribute faces formed according to three dimensions;
Four attribute faces are used to describe each attribute of unstructured data.
3. according to the method described in claim 2, it is characterized in that, four attribute faces include: data service system feature Attribute face, data traffic attributes face, data essential attribute face and data value characteristic attribute face;
Data service system characteristic attribute face is for retouching attribute of the unstructured data in operation system It states;The data traffic attributes face is for being described the unstructured data service attribute;The data essential attribute Face is for being described the essential attribute of the unstructured data;Data value characteristic attribute face is used for the non-knot The attribute of the data value of structure data is described.
4. according to the method described in claim 3, it is characterized in that, sub- metadata of each attribute bread containing the metadata, The sub- metadata is tertiary structure, in which: the first level structure includes at least one element set;Second level structure includes the member At least one element and corresponding component identification in element collection;Third layer structure includes at least one restriction of the element Element and corresponding restriction component identification.
5. method according to claim 1-4, which is characterized in that first obtained between each type field Incidence relation, comprising:
Obtain the second incidence relation between each attribute in same type domain;
Obtain the third incidence relation between the attribute in different type domain;
According to second incidence relation and the third incidence relation, first incidence relation is obtained.
6. method according to claim 1-4, which is characterized in that described according to the business domains, the type Domain and first incidence relation construct data correlation model, comprising:
Table data store lattice are established according to the type field;
According between the type field and the business domains hierarchical relationship and first incidence relation, production Methods say Prescribed paper;
According to the table data store lattice and the relationship supporting paper, the data correlation model is constructed.
7. according to the method described in claim 6, it is characterized in that, described add the metadata to the data correlation mould In type, metadata schema is constructed, comprising:
According to the corresponding type field of the unstructured data, the metadata of the unstructured data is added to the type In the corresponding table data store lattice in domain.
8. a kind of construction device of metadata schema, which is characterized in that described device includes:
First obtain module, for obtain the unstructured data of each business domains of enterprise type field and each type field it Between the first incidence relation;
First building module, for constructing data correlation according to the business domains, the type field and first incidence relation Model;
Second obtains module, for obtaining the metadata of the unstructured data according to preset tetrahedral model;The member Data are for being described the attribute of the unstructured data;
Second building module constructs metadata schema for adding the metadata in the data correlation model.
9. a kind of computer equipment, including memory and processor, the memory are stored with computer program, feature exists In the step of processor realizes any one of claims 1 to 7 the method when executing the computer program.
10. a kind of computer readable storage medium, is stored thereon with computer program, which is characterized in that the computer program The step of method described in any one of claims 1 to 7 is realized when being executed by processor.
CN201811599601.3A 2018-12-26 2018-12-26 Construction method, device, computer equipment and the storage medium of metadata schema Pending CN109684486A (en)

Priority Applications (1)

Application Number Priority Date Filing Date Title
CN201811599601.3A CN109684486A (en) 2018-12-26 2018-12-26 Construction method, device, computer equipment and the storage medium of metadata schema

Applications Claiming Priority (1)

Application Number Priority Date Filing Date Title
CN201811599601.3A CN109684486A (en) 2018-12-26 2018-12-26 Construction method, device, computer equipment and the storage medium of metadata schema

Publications (1)

Publication Number Publication Date
CN109684486A true CN109684486A (en) 2019-04-26

Family

ID=66189465

Family Applications (1)

Application Number Title Priority Date Filing Date
CN201811599601.3A Pending CN109684486A (en) 2018-12-26 2018-12-26 Construction method, device, computer equipment and the storage medium of metadata schema

Country Status (1)

Country Link
CN (1) CN109684486A (en)

Cited By (4)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN110222202A (en) * 2019-05-28 2019-09-10 北京信远通科技有限公司 Loose coupling metadata schema design method and system based on information technology standard
CN110377568A (en) * 2019-07-26 2019-10-25 北京明略软件系统有限公司 A kind of metadata acquisition method and device
CN111831720A (en) * 2020-07-15 2020-10-27 北京思特奇信息技术股份有限公司 Data display method and system and electronic equipment
CN112214490A (en) * 2020-10-14 2021-01-12 上海妙一生物科技有限公司 Business object storage method and device and computer readable storage medium

Citations (3)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN102591896A (en) * 2011-01-05 2012-07-18 北京大用科技有限责任公司 System, implementation, application, and query language for a tetrahedral data model for unstructured data
CN104298705A (en) * 2014-08-20 2015-01-21 龙国良 Converting method of relational data and unstructured data
CN108763324A (en) * 2018-05-03 2018-11-06 苏州朗动网络科技有限公司 Recognition methods, device, storage medium and the computer equipment of business data

Patent Citations (3)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN102591896A (en) * 2011-01-05 2012-07-18 北京大用科技有限责任公司 System, implementation, application, and query language for a tetrahedral data model for unstructured data
CN104298705A (en) * 2014-08-20 2015-01-21 龙国良 Converting method of relational data and unstructured data
CN108763324A (en) * 2018-05-03 2018-11-06 苏州朗动网络科技有限公司 Recognition methods, device, storage medium and the computer equipment of business data

Non-Patent Citations (2)

* Cited by examiner, † Cited by third party
Title
张新阳: ""企业非结构化数据元数据模型设计"", 《云南电力技术》 *
王志强 等: ""基于公共模型技术的非结构化元数据管理技术研究与应用"", 《工业仪表与自动化装置》 *

Cited By (5)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN110222202A (en) * 2019-05-28 2019-09-10 北京信远通科技有限公司 Loose coupling metadata schema design method and system based on information technology standard
CN110222202B (en) * 2019-05-28 2022-03-01 北京信远通科技有限公司 Information technology standard-based loose coupling metadata model design method and system
CN110377568A (en) * 2019-07-26 2019-10-25 北京明略软件系统有限公司 A kind of metadata acquisition method and device
CN111831720A (en) * 2020-07-15 2020-10-27 北京思特奇信息技术股份有限公司 Data display method and system and electronic equipment
CN112214490A (en) * 2020-10-14 2021-01-12 上海妙一生物科技有限公司 Business object storage method and device and computer readable storage medium

Similar Documents

Publication Publication Date Title
US20210192650A1 (en) System and method for managing data state across linked electronic resources
CN109684486A (en) Construction method, device, computer equipment and the storage medium of metadata schema
CN113792159B (en) Knowledge graph data fusion method and system
US11973760B2 (en) Hierarchical permissions model within a document
US8340995B2 (en) Method and system of using artifacts to identify elements of a component business model
CN111061475B (en) Software code generating method, device, computer equipment and storage medium
US20120016805A1 (en) Generating Machine-Understandable Representations of Content
US11599719B2 (en) System and method for electronic document interaction with external resources
Petrasch et al. Data integration and interoperability: Towards a model-driven and pattern-oriented approach
da Silva et al. SPReaD: service-oriented process for reengineering and DevOps: Developing microservices for a Brazilian state department of taxation
CN116029273A (en) Text processing method, device, computer equipment and storage medium
Wang et al. Policy-Driven Process Mapping (PDPM): Discovering process models from business policies
Lüder et al. Description means for information artifacts throughout the life cycle of CPPS
Prado-Romero et al. Developing and evaluating graph counterfactual explanation with GRETEL
Oliveira et al. ETL standard processes modelling-a novel BPMN approach
El Beggar et al. CIM for data warehouse requirements using an UML profile
CN111444368A (en) Method and device for constructing user portrait, computer equipment and storage medium
US20230205498A1 (en) Restructuring enterprise application
US11688027B2 (en) Generating actionable information from documents
CN111241089B (en) ERP system secondary development method, system, device and readable storage medium
Masuda et al. Direction of digital it and enterprise architecture
Belo et al. Automatic generation of ETL physical systems from BPMN conceptual models
Gamito From Rigorous Requirements and User Interfaces Specifications into Software Business Applications: The ASL Approach
Herzog et al. Cooperating and Competing Digital Twins for Industrie 4.0 in Urban Planning Contexts
Volk et al. Towards an Automatized Way for Modeling Big Data System Architectures

Legal Events

Date Code Title Description
PB01 Publication
PB01 Publication
SE01 Entry into force of request for substantive examination
SE01 Entry into force of request for substantive examination
RJ01 Rejection of invention patent application after publication
RJ01 Rejection of invention patent application after publication

Application publication date: 20190426