CN113918774A - Data management method, device, equipment and storage medium - Google Patents
Data management method, device, equipment and storage medium Download PDFInfo
- Publication number
- CN113918774A CN113918774A CN202111266686.5A CN202111266686A CN113918774A CN 113918774 A CN113918774 A CN 113918774A CN 202111266686 A CN202111266686 A CN 202111266686A CN 113918774 A CN113918774 A CN 113918774A
- Authority
- CN
- China
- Prior art keywords
- data
- standard
- generating
- determining
- metadata
- Prior art date
- Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
- Pending
Links
- 238000000034 method Methods 0.000 title claims abstract description 63
- 238000013523 data management Methods 0.000 title abstract description 12
- 230000000694 effects Effects 0.000 claims abstract description 74
- 238000005516 engineering process Methods 0.000 claims abstract description 19
- 238000004590 computer program Methods 0.000 claims description 11
- 230000008569 process Effects 0.000 abstract description 13
- 238000007726 management method Methods 0.000 abstract description 8
- 238000011161 development Methods 0.000 abstract description 7
- 238000004458 analytical method Methods 0.000 abstract description 4
- 238000004891 communication Methods 0.000 abstract description 4
- 230000004888 barrier function Effects 0.000 abstract description 3
- 238000012545 processing Methods 0.000 description 9
- 238000003058 natural language processing Methods 0.000 description 6
- 230000003287 optical effect Effects 0.000 description 6
- 238000013461 design Methods 0.000 description 4
- 230000009471 action Effects 0.000 description 3
- 238000010586 diagram Methods 0.000 description 3
- 230000006870 function Effects 0.000 description 3
- 238000013473 artificial intelligence Methods 0.000 description 2
- 230000008901 benefit Effects 0.000 description 2
- 238000012986 modification Methods 0.000 description 2
- 230000004048 modification Effects 0.000 description 2
- 230000002093 peripheral effect Effects 0.000 description 2
- 230000000644 propagated effect Effects 0.000 description 2
- 230000004075 alteration Effects 0.000 description 1
- 238000003491 array Methods 0.000 description 1
- 238000012550 audit Methods 0.000 description 1
- 238000013474 audit trail Methods 0.000 description 1
- 230000006399 behavior Effects 0.000 description 1
- 230000008859 change Effects 0.000 description 1
- 239000002131 composite material Substances 0.000 description 1
- 230000010354 integration Effects 0.000 description 1
- 239000013307 optical fiber Substances 0.000 description 1
- 230000008520 organization Effects 0.000 description 1
- 230000000750 progressive effect Effects 0.000 description 1
- 230000004044 response Effects 0.000 description 1
- 239000004065 semiconductor Substances 0.000 description 1
- 230000000007 visual effect Effects 0.000 description 1
Images
Classifications
-
- G—PHYSICS
- G06—COMPUTING; CALCULATING OR COUNTING
- G06F—ELECTRIC DIGITAL DATA PROCESSING
- G06F16/00—Information retrieval; Database structures therefor; File system structures therefor
- G06F16/90—Details of database functions independent of the retrieved data types
- G06F16/901—Indexing; Data structures therefor; Storage structures
- G06F16/9024—Graphs; Linked lists
Landscapes
- Engineering & Computer Science (AREA)
- Databases & Information Systems (AREA)
- Theoretical Computer Science (AREA)
- Software Systems (AREA)
- Data Mining & Analysis (AREA)
- Physics & Mathematics (AREA)
- General Engineering & Computer Science (AREA)
- General Physics & Mathematics (AREA)
- Information Retrieval, Db Structures And Fs Structures Therefor (AREA)
Abstract
The application relates to the field of data management, and provides a data management method, a device, equipment and a storage medium, wherein the method comprises the following steps: acquiring target data and determining metadata in the target data; determining an incidence relation between the target data, and generating a data relation map according to the incidence relation; generating standard data according to the data relation map, and determining a treatment effect under the current treatment rule according to the standard data; when the treatment effect does not meet the standard, adjusting the current treatment rule according to the treatment effect; wherein, the treatment effect does not reach the standard, namely the treatment value is not in the preset range. Therefore, the resource utilization condition of the data platform can be monitored through data relation map analysis, reasonable management is automatically made, the development process of the data platform is optimized, and the quality of data products is provided; by combining the graph database and the NLP technology, the barrier of IT and business can be opened, the communication is efficient, and the flow from the demand side to the development side is standardized.
Description
Technical Field
The present application relates to the field of data management, and in particular, to a data management method, apparatus, device, and storage medium.
Background
Data governance is a whole set of administrative activities in an organization that involve the use of data. Initiated and enforced by enterprise data governance departments is a series of policies and procedures on how to formulate and enforce business applications and technical management for data throughout the enterprise.
The key to the success of data governance is metadata management, i.e., a frame of reference that gives context and meaning to the data. The metadata, effectively administered, can provide a view of the data flow, the performance of impact analysis, accountability of the universal business vocabulary and its terms and definitions, and ultimately an audit trail for meeting compliance. Metadata management becomes an important function for IT (Internet Technology) departments to monitor changes in complex data integration environments while delivering trusted, secure data. Therefore, a good metadata management tool plays a central role in global data governance.
With the popularization of big data and AI (Artificial Intelligence) applications, the amount of structured and unstructured data owned by both emerging internet companies and enterprises transitioning from the traditional mode to the field of IT technology has increased dramatically, and the data types are complex and diverse. While maintaining high resource cost, enterprises do not know how the owned data has asset value, and cannot acquire enough information from non-standardized data to support business or make business decisions. Under the background, an effective data management method and an effective data management system are needed, massive disordered data are managed, and a data management platform and a service are enabled.
Disclosure of Invention
In view of the above problems, the present application is provided to provide an effective data management method, apparatus, device and storage medium for managing a large amount of disordered data, and managing a problem of enabling a service by using a data management platform, including:
a data governance method, comprising:
acquiring target data and determining metadata in the target data; wherein the metadata type comprises a technology type, an operation type and a service type;
determining an incidence relation between the target data, and generating a data relation map according to the incidence relation;
generating standard data according to the data relation map, and determining a treatment effect under the current treatment rule according to the standard data;
when the treatment effect does not meet the standard, adjusting the current treatment rule according to the treatment effect; wherein, the treatment effect does not reach the standard, namely the treatment value is not in the preset range.
Further, the step of determining the association relationship between the target data and generating a data relationship map according to the association relationship includes:
determining specification information corresponding to the target data; wherein the specification information types are in one-to-one correspondence with the metadata types;
generating the association relation according to the specification information of each type and the corresponding metadata;
and generating the data relation map according to the target data and the incidence relation.
Further, the step of generating the association relationship according to each type of the specification information and the corresponding metadata includes:
generating node information and attribute information according to the metadata of each type and the graph database;
generating a corresponding relation according to the specification information of each type;
and generating the association relation according to the node information, the attribute information and the corresponding relation.
Further, the step of generating standard data according to the data relationship map and determining the treatment effect under the current treatment rule according to the standard data includes:
acquiring a data object according to the data relation map;
generating standard data according to the data object and the service requirement;
and determining the treatment effect under the current treatment rule according to the standard data.
Further, the step of generating standard data according to the data object and the business requirement includes:
determining data information according to the data object; wherein the data information comprises data object tags, data object attributes and data object relationships;
and generating the standard data according to the data object label, the data object attribute, the data object relation and the service requirement.
Further, the step of generating the standard data according to the data object tag, the data object attribute, the data object relationship, and the service requirement includes:
generating standard coding information according to the data object label, the data object attribute, the data object relation and the service requirement;
acquiring a standard index, a standard dimension and a standard label according to the standard coding information;
and generating the standard data according to the standard indexes, the standard dimensions and the standard labels.
Further, said step of determining said abatement effect under said current abatement rule in dependence upon said criteria data comprises:
determining the current governing rule according to the standard data; wherein the current governance rules comprise resource utilization rate and compliance rate;
and generating the treatment effect according to the standard data, the resource utilization rate and the compliance rate.
An embodiment of the present invention further discloses a data management device, which includes:
the acquisition module is used for acquiring target data and determining metadata in the target data; wherein the metadata type comprises a technology type, an operation type and a service type;
the generation module is used for determining the incidence relation among the target data and generating a data relation map according to the incidence relation;
the determining module is used for generating standard data according to the data relation map and determining the treatment effect under the current treatment rule according to the standard data;
the adjusting module is used for adjusting the current treatment rule according to the treatment effect when the treatment effect does not meet the standard; wherein, the treatment effect does not reach the standard, namely the treatment value is not in the preset range.
An embodiment of the present invention further discloses a computer device, which includes a processor, a memory, and a computer program stored in the memory and capable of running on the processor, and when the computer program is executed by the processor, the steps of the data governance method are implemented.
An embodiment of the present invention further discloses a computer-readable storage medium, on which a computer program is stored, and when the computer program is executed by a processor, the steps of the data governance method are implemented.
The application has the following advantages:
in the embodiment of the application, target data is obtained, and metadata in the target data is determined; wherein the metadata type comprises a technology type, an operation type and a service type; determining an incidence relation between the target data, and generating a data relation map according to the incidence relation; generating standard data according to the data relation map, and determining a treatment effect under the current treatment rule according to the standard data; when the treatment effect does not meet the standard, adjusting the current treatment rule according to the treatment effect; wherein, the treatment effect does not reach the standard, namely the treatment value is not in the preset range. Therefore, the resource utilization condition of the data platform can be monitored through data relation map analysis, reasonable management is automatically made, the development process of the data platform is optimized, and the quality of data products is provided; by combining a graph database and an NLP (Natural Language Processing) technology, IT and business barriers can be opened, communication can be achieved efficiently, and the flow from a demand side to a development side can be standardized.
Drawings
In order to more clearly illustrate the technical solutions of the present application, the drawings needed to be used in the description of the present application will be briefly introduced below, and it is apparent that the drawings in the following description are only some embodiments of the present application, and it is obvious for those skilled in the art that other drawings can be obtained according to the drawings without inventive labor.
FIG. 1 is a flow chart illustrating steps of a data governance method according to an embodiment of the present disclosure;
FIG. 2 is a flow chart illustrating steps of a data governance method according to an embodiment of the present application;
FIG. 3 is a flow chart illustrating steps of a data governance method according to an embodiment of the present application;
FIG. 4 is a flow chart illustrating steps of a data governance method according to an embodiment of the present application;
FIG. 5 is a flow chart illustrating steps of a data governance method according to an embodiment of the present application;
FIG. 6 is a flow chart illustrating steps of a data governance method according to an embodiment of the present application;
FIG. 7 is a flow chart illustrating steps of a data governance method according to an embodiment of the present application;
FIG. 8 is a block diagram of a data governance device according to an embodiment of the present disclosure;
fig. 9 is a schematic structural diagram of a computer device according to an embodiment of the present invention.
Detailed Description
In order to make the aforementioned objects, features and advantages of the present application more comprehensible, the present application is described in further detail with reference to the accompanying drawings and the detailed description. It is to be understood that the embodiments described are only a few embodiments of the present application and not all embodiments. All other embodiments, which can be derived by a person skilled in the art from the embodiments given herein without making any creative effort, shall fall within the protection scope of the present application.
Referring to fig. 1, a flow chart of steps of a data governance method according to an embodiment of the present application is shown;
a method of data governance, the method comprising:
s110, acquiring target data and determining metadata in the target data; wherein the metadata type comprises a technology type, an operation type and a service type;
s120, determining the incidence relation among the target data, and generating a data relation map according to the incidence relation;
s130, generating standard data according to the data relation map, and determining a treatment effect under the current treatment rule according to the standard data;
s140, when the treatment effect does not meet the standard, adjusting the current treatment rule according to the treatment effect; wherein, the treatment effect does not reach the standard, namely the treatment value is not in the preset range.
In the embodiment of the application, target data is obtained, and metadata in the target data is determined; wherein the metadata type comprises a technology type, an operation type and a service type; determining an incidence relation between the target data, and generating a data relation map according to the incidence relation; generating standard data according to the data relation map, and determining a treatment effect under the current treatment rule according to the standard data; when the treatment effect does not meet the standard, adjusting the current treatment rule according to the treatment effect; wherein, the treatment effect does not reach the standard, namely the treatment value is not in the preset range. Therefore, the resource utilization condition of the data platform can be monitored through data relation map analysis, reasonable management is automatically made, the development process of the data platform is optimized, and the quality of data products is provided; by combining the graph database and the NLP technology, the barrier of IT and business can be opened, the communication is efficient, and the flow from the demand side to the development side is standardized.
Next, a data governance method in the present exemplary embodiment will be further described.
Acquiring target data and determining metadata in the target data as described in the step S110; wherein the metadata types include technology type, operation type, and business type.
It should be noted that, by acquiring target data, metadata in the target data is determined, where the metadata type at least includes a technology type, i.e., technology metadata, an operation type, i.e., operation metadata, and a service type, i.e., service metadata; that is, metadata can be divided into technical metadata, operational metadata, and business metadata.
Determining the association relationship between the target data, and generating a data relationship map according to the association relationship, as described in step S120.
It should be noted that the data relationship map is generated through the management relationship among the target data, that is, the data relationship map is generated through the association relationship among the technical metadata, the operation metadata and the business metadata and the three themselves.
In an embodiment of the present invention, the specific process of "determining the association relationship between the target data and generating the data relationship map according to the association relationship" in step S120 may be further described with reference to the following description.
Referring to fig. 2, a flow chart of steps of a data governance method according to an embodiment of the present application is shown;
as will be described in the following steps,
s210, determining the corresponding specification information of the target data; wherein the specification information types are in one-to-one correspondence with the metadata types;
s220, generating the incidence relation according to the specification information of each type and the corresponding metadata;
and S230, generating the data relation map according to the target data and the incidence relation.
It should be noted that specification information corresponding to the target data is determined, where the specification information type corresponds to the metadata type one to one, that is, the technical metadata, the operation metadata, and the service metadata all have specification information corresponding to them one to one, where the specification information is divided into the technical metadata specification information, the operation metadata specification information, and the service metadata specification information according to the metadata type;
in a specific implementation, specification information is determined according to technical metadata, operation metadata and business metadata, wherein the specification information comprises technical metadata specification information, operation metadata specification information and business metadata specification information; specifically, the service metadata specification information, i.e., the service metadata design principle, follows the theme grouping, the theme domain, the service object, the logic entity, and the attribute, and specifies the data standard; the operation metadata specification information, namely the operation metadata is acquired by an audit log or a system buried point and needs to contain the information of system events, user behaviors and the like which are specified; the technical metadata specification information, namely the technical metadata, is composed of a database, a physical table, a view and the like;
it should be noted that the association relationship is generated according to each type of the specification information and the corresponding metadata, that is, the association relationship among the technical metadata, the operation metadata and the service metadata is determined according to the technical metadata specification information, the operation metadata specification information and the service metadata specification information;
in a specific implementation, the association relationship, i.e. the design specification, needs to satisfy the above metadata to directly achieve accurate and unique association, such as: the logical entity of the service metadata corresponds to the physical table of the technical metadata, the attribute needs to be covered in the metadata design specification with the operation metadata corresponding to the field and the related design of the technical metadata, such as the corresponding relation of each database table and the field, the corresponding relation of the field, the dimension and the index of an Online Analytical Processing (OLAP), and the corresponding relation of an Extract-Transform-Load (ETL) task and the table;
it should be noted that the data relationship map is generated according to the target data and the association relationship, that is, the data relationship map is generated according to the technical metadata, the operation metadata, the service metadata and the association relationship;
in a specific implementation, the technical metadata, the operation metadata and the service metadata are accessed and communicated, the graph database is utilized to convert each metadata into nodes (points) and attributes in the graph database, and then a relationship (edge) is constructed according to the association relationship to form a data relationship graph.
In an embodiment of the present invention, a specific process of "generating the association relationship according to each type of the specification information and the corresponding metadata" in step S220 may be further described with reference to the following description.
Referring to fig. 3, a flow chart of steps of a data governance method according to an embodiment of the present application is shown;
as will be described in the following steps,
s310, generating node information and attribute information according to the metadata of each type and the graph database;
s320, generating a corresponding relation according to the specification information of each type;
s330, generating the association relationship according to the node information, the attribute information and the corresponding relationship.
It should be noted that node information and attribute information are generated according to the metadata and the graph database of each type, that is, the node information and the attribute information are generated through technical metadata, operation metadata, service metadata and the graph database;
it should be noted that, a corresponding relationship is generated according to each type of the specification information, that is, a corresponding relationship is generated among the technical metadata specification information, the operation metadata specification information, and the service metadata specification information;
it should be noted that the association relationship is generated according to the node information, the attribute information, and the correspondence, that is, the association relationship is generated according to the node information, the attribute information, and the correspondence;
in a specific implementation, the technical metadata, the operation metadata and the service metadata are accessed and communicated, the technical metadata, the operation metadata and the service metadata are converted into nodes (points) and attributes in a graph database by utilizing the graph database, and then a relationship (edge) is constructed according to an incidence relationship to form a data relationship graph.
And determining the treatment effect under the current treatment rule according to the data relation map as described in the step S130.
It should be noted that standard data is generated according to the data relationship map, and a treatment effect is generated according to the standard data and the current treatment rule.
In an embodiment of the present invention, the specific process of "generating standard data according to the data relationship map and determining the abatement effect under the current abatement rule" in step S130 may be further described with reference to the following description.
Referring to fig. 4, a flow chart illustrating steps of a data governance method according to an embodiment of the present application is shown;
as will be described in the following steps,
s410, acquiring a data object according to the data relation map;
s420, generating standard data according to the data object and the service requirement;
and S430, determining the treatment effect under the current treatment rule according to the standard data.
It should be noted that, data objects are obtained according to the data relationship map, that is, a plurality of data objects are obtained according to the data relationship map;
it should be noted that, standard data is generated according to the data objects and the service requirements, that is, standard data is generated by acquiring a plurality of data objects in a data relation map and generating standard data according to the service requirements and the data objects; the standard data is convenient for IT and business users to query the needed data target by using respective visual angles;
it should be noted that the abatement effect under the current abatement rule is determined according to the standard data, that is, the abatement effect that can be achieved by the current abatement rule is determined according to the standard data.
In an embodiment of the present invention, the specific process of "generating standard data according to the data object and the business requirement" in step S420 can be further described with reference to the following description.
Referring to fig. 5, a flowchart illustrating steps of a data governance method according to an embodiment of the present application is shown;
as will be described in the following steps,
s510, determining data information according to the data object; wherein the data information comprises data object tags, data object attributes and data object relationships;
s520, generating the standard data according to the data object label, the data object attribute, the data object relation and the service requirement.
It should be noted that, the data information is determined according to the data object; the data information at least comprises a data object label, a data object attribute and a data object relation; and generating standard data according to the data object label, the data object attribute, the data object relation and the service requirement.
In an embodiment of the present invention, the specific process of "generating the standard data according to the data object tag, the data object attribute, the data object relationship, and the business requirement" in step S520 may be further described with reference to the following description.
Referring to fig. 6, a flow chart illustrating steps of a data governance method according to an embodiment of the present application is shown;
as will be described in the following steps,
s610, generating standard coding information according to the data object label, the data object attribute, the data object relation and the service requirement;
s620, acquiring a standard index, a standard dimension and a standard label according to the standard coding information;
s630, generating the standard data according to the standard indexes, the standard dimensions and the standard labels.
It should be noted that, standard coding information is generated according to the data object tag, the data object attribute, the data object relationship and the service requirement, that is, standard coding information is generated by performing natural language processing on the data object tag, the data object attribute, the data object relationship and the service requirement;
it should be noted that the standard index, the standard dimension and the standard label are obtained according to the standard encoding information, that is, the standard index, the standard dimension and the standard label are obtained according to the standard encoding information;
it should be noted that the standard data is generated according to the standard index, the standard dimension, and the standard label, that is, the standard data is generated through the standard index, the standard dimension, and the standard label.
In a specific implementation, a standard index, a standard dimension and a standard label are obtained according to the standard coding information; generating the standard data according to the standard indexes, the standard dimensions and the standard labels; the method comprises the steps of converting service descriptions (namely service requirements) such as the number of signing clients, real-time insurance fee, good owner response and the like into standard indexes (namely standard codes) such as operation indexes, standard dimensions such as composite dimensions and standard labels such as group labels of standard coding information (namely standard codes) by an NLP (natural language processing) technology, and converting the standard indexes into standard data such as tables and fields of logic storage, resource operation generated in a data processing process, and data products such as API (Application Programming Interface), reports and the like.
In an embodiment of the present invention, the specific process of "determining the abatement effect under the current abatement rule according to the standard data" in step S430 may be further described with reference to the following description.
Referring to fig. 7, a flowchart illustrating steps of a data governance method according to an embodiment of the present application is shown;
as will be described in the following steps,
s710, determining the current governing rule according to the standard data; wherein the current governance rules comprise resource utilization rate and compliance rate;
s720, generating the treatment effect according to the standard data, the resource utilization rate and the compliance rate.
It should be noted that, the current governing rule is confirmed according to the standard data; wherein, the current governing rule comprises a resource utilization rate and a compliance rate; generating a treatment effect according to the standard data, the resource utilization rate and the compliance rate; the method comprises the steps of getting through a data processing link, and designating current treatment rules for standard data, namely procedural data such as files, a data lake table, a yarn (Another Resource coordinator) task, an ETL task and data products such as a market table, a label table, a dimension field, an index field, an API (application programming interface) and a report, so that the development efficiency and quality are improved, and data resources (namely compliance rate) touching the red line are automatically monitored; tracking the change of metadata (namely resource utilization rate) to measure the effect of the governing rule, and also reversely measuring the reasonability of data specification, updating the specification or accessing other effective metadata; the platform is administered in iterations.
If the abatement effect does not meet the standard in step S140, adjusting the current abatement rule according to the abatement effect; wherein, the treatment effect does not reach the standard, namely the treatment value is not in the preset range.
It should be noted that, the treatment effect not meeting the standard means that the treatment value is not within the preset range, that is, when the treatment effect is determined not to reach the expected target or effect, the treatment effect is in the expected target or effect by adjusting the current treatment rule; when the treatment value is within the preset range, the current treatment rule does not need to be adjusted; whether to adjust the current treatment rule can be judged through the treatment effect.
For the device embodiment, since it is basically similar to the method embodiment, the description is simple, and for the relevant points, refer to the partial description of the method embodiment.
Referring to fig. 8, a block diagram of a data governance device according to an embodiment of the present disclosure is shown;
a data governance device, the device specifically includes:
an obtaining module 810, configured to obtain target data and determine metadata in the target data; wherein the metadata type comprises a technology type, an operation type and a service type;
a generating module 820, configured to determine an association relationship between the target data, and generate a data relationship map according to the association relationship;
a determining module 830, configured to generate standard data according to the data relationship map, and determine a treatment effect under a current treatment rule according to the standard data;
an adjusting module 840, configured to adjust a current governing rule according to the governing effect when the governing effect does not meet a standard; wherein, the treatment effect does not reach the standard, namely the treatment value is not in the preset range.
In an embodiment of the present invention, the generating module 820 includes:
the first determining submodule is used for determining the corresponding specification information of the target data; wherein the specification information types are in one-to-one correspondence with the metadata types;
the first generation submodule is used for generating the incidence relation according to the specification information of each type and the corresponding metadata;
and the second generation submodule is used for generating the data relation map according to the target data and the incidence relation.
In an embodiment of the present invention, the first generation submodule includes:
the first generation unit is used for generating node information and attribute information according to the metadata of each type and the graph database;
the second generating unit is used for generating a corresponding relation according to the specification information of each type;
and the third generating unit is used for generating the association relation according to the node information, the attribute information and the corresponding relation.
In an embodiment of the present invention, the determining module 830 includes:
the first obtaining submodule is used for obtaining a data object according to the data relation map;
the third generation submodule is used for generating standard data according to the data object and the service requirement;
and the second determining submodule is used for determining the treatment effect under the current treatment rule according to the standard data.
In an embodiment of the present invention, the second generating sub-module includes:
a first determining unit for determining data information according to the data object; wherein the data information comprises data object tags, data object attributes and data object relationships;
and the fourth generating unit is used for generating the standard data according to the data object label, the data object attribute, the data object relation and the service requirement.
In an embodiment of the present invention, the fourth generating unit includes:
the first generating subunit is used for generating standard coding information according to the data object label, the data object attribute, the data object relationship and the service requirement;
the first acquisition subunit is used for acquiring a standard index, a standard dimension and a standard label according to the standard coding information;
and the second generation subunit is used for generating the standard data according to the standard indexes, the standard dimensions and the standard labels.
In an embodiment of the present invention, the second determining sub-module includes:
the second determining unit is used for determining the current governing rule according to the standard data; wherein the current governance rules comprise resource utilization rate and compliance rate;
and the fifth generating unit is used for generating the treatment effect according to the standard data, the resource utilization rate and the compliance rate.
Referring to fig. 9, a computer device of a data governance method according to the present invention is shown, which may specifically include the following:
the computer device 12 described above is embodied in the form of a general purpose computing device, and the components of the computer device 12 may include, but are not limited to: one or more processors or processing units 16, a system memory 28, and a bus 18 that couples various system components including the system memory 28 and the processing unit 16.
The system memory 28 may include computer system readable media in the form of volatile memory, such as Random Access Memory (RAM)30 and/or cache memory 32. Computer device 12 may further include other removable/non-removable, volatile/nonvolatile computer system storage media. By way of example only, storage system 34 may be used to read from and write to non-removable, nonvolatile magnetic media (commonly referred to as "hard drives"). Although not shown in FIG. 9, a magnetic disk drive for reading from and writing to a removable, nonvolatile magnetic disk (e.g., a "floppy disk") and an optical disk drive for reading from or writing to a removable, nonvolatile optical disk (e.g., a CD-ROM, DVD-ROM, or other optical media) may be provided. In these cases, each drive may be connected to bus 18 by one or more data media interfaces. The memory may include at least one program product having a set (e.g., at least one) of program modules 42, with the program modules 42 configured to carry out the functions of embodiments of the invention.
A program/utility 40 having a set (at least one) of program modules 42 may be stored, for example, in memory, such program modules 42 including, but not limited to, an operating system, one or more application programs, other program modules 42, and program data, each of which examples or some combination thereof may comprise an implementation of a network environment. Program modules 42 generally carry out the functions and/or methodologies of the described embodiments of the invention.
The processing unit 16 executes various functional applications and data processing by running programs stored in the system memory 28, for example, to implement a data governance method provided by the embodiments of the present invention.
That is, the processing unit 16 implements, when executing the program,: acquiring target data and determining metadata in the target data; wherein the metadata type comprises a technology type, an operation type and a service type; determining an incidence relation between the target data, and generating a data relation map according to the incidence relation; generating standard data according to the data relation map, and determining a treatment effect under the current treatment rule according to the standard data; when the treatment effect does not meet the standard, adjusting the current treatment rule according to the treatment effect; wherein, the treatment effect does not reach the standard, namely the treatment value is not in the preset range.
In an embodiment of the present invention, the present invention further provides a computer-readable storage medium, on which a computer program is stored, and when the computer program is executed by a processor, the computer program implements a data governance method as provided in all embodiments of the present application:
that is, the program when executed by the processor implements: acquiring target data and determining metadata in the target data; wherein the metadata type comprises a technology type, an operation type and a service type; determining an incidence relation between the target data, and generating a data relation map according to the incidence relation; generating standard data according to the data relation map, and determining a treatment effect under the current treatment rule according to the standard data; when the treatment effect does not meet the standard, adjusting the current treatment rule according to the treatment effect; wherein, the treatment effect does not reach the standard, namely the treatment value is not in the preset range.
Any combination of one or more computer-readable media may be employed. The computer readable medium may be a computer readable signal medium or a computer readable storage medium. A computer readable storage medium may be, for example, but not limited to, an electronic, magnetic, optical, electromagnetic, infrared, or semiconductor system, apparatus, or device, or any combination of the foregoing. More specific examples (a non-exhaustive list) of the computer readable storage medium would include the following: an electrical connection having one or more wires, a portable computer diskette, a hard disk, a Random Access Memory (RAM), a read-only memory (ROM), an erasable programmable read-only memory (EPROM or flash memory), an optical fiber, a portable compact disc read-only memory (CD-ROM), an optical storage device, a magnetic storage device, or any suitable combination of the foregoing. In the context of this document, a computer readable storage medium may be any tangible medium that can contain, or store a program for use by or in connection with an instruction execution system, apparatus, or device.
A computer readable signal medium may include a propagated data signal with computer readable program code embodied therein, for example, in baseband or as part of a carrier wave. Such a propagated data signal may take any of a variety of forms, including, but not limited to, electro-magnetic, optical, or any suitable combination thereof. A computer readable signal medium may also be any computer readable medium that is not a computer readable storage medium and that can communicate, propagate, or transport a program for use by or in connection with an instruction execution system, apparatus, or device.
Computer program code for carrying out operations for aspects of the present invention may be written in any combination of one or more programming languages, including an object oriented programming language such as Java, Smalltalk, C + + or the like and conventional procedural programming languages, such as the "C" programming language or similar programming languages. The program code may execute entirely on the operator's computer, partly on the operator's computer, as a stand-alone software package, partly on the operator's computer and partly on a remote computer or entirely on the remote computer or server. In the case of a remote computer, the remote computer may be connected to the operator's computer through any type of network, including a Local Area Network (LAN) or a Wide Area Network (WAN), or the connection may be made to an external computer (for example, through the Internet using an Internet service provider). The embodiments in the present specification are described in a progressive manner, each embodiment focuses on differences from other embodiments, and the same and similar parts among the embodiments are referred to each other.
While preferred embodiments of the present application have been described, additional variations and modifications of these embodiments may occur to those skilled in the art once they learn of the basic inventive concepts. Therefore, it is intended that the appended claims be interpreted as including the preferred embodiment and all such alterations and modifications as fall within the true scope of the embodiments of the application.
Finally, it should also be noted that, herein, relational terms such as first and second, and the like may be used solely to distinguish one entity or action from another entity or action without necessarily requiring or implying any actual such relationship or order between such entities or actions. Also, the terms "comprises," "comprising," or any other variation thereof, are intended to cover a non-exclusive inclusion, such that a process, method, article, or terminal that comprises a list of elements does not include only those elements but may include other elements not expressly listed or inherent to such process, method, article, or terminal. Without further limitation, an element defined by the phrase "comprising an … …" does not exclude the presence of other like elements in a process, method, article, or terminal that comprises the element.
The data governance method, device, equipment and storage medium provided by the present application are introduced in detail, and a specific example is applied in the present application to explain the principle and the implementation of the present application, and the description of the above embodiment is only used to help understand the method and the core idea of the present application; meanwhile, for a person skilled in the art, according to the idea of the present application, there may be variations in the specific embodiments and the application scope, and in summary, the content of the present specification should not be construed as a limitation to the present application.
Claims (10)
1. A data governance method, comprising:
acquiring target data and determining metadata in the target data; wherein the metadata type comprises a technology type, an operation type and a service type;
determining an incidence relation between the target data, and generating a data relation map according to the incidence relation;
generating standard data according to the data relation map, and determining a treatment effect under the current treatment rule according to the standard data;
when the treatment effect does not meet the standard, adjusting the current treatment rule according to the treatment effect; wherein, the treatment effect does not reach the standard, namely the treatment value is not in the preset range.
2. The method according to claim 1, wherein the step of determining the association relationship between the target data and generating a data relationship map according to the association relationship comprises:
determining specification information corresponding to the target data; wherein the specification information types are in one-to-one correspondence with the metadata types;
generating the association relation according to the specification information of each type and the corresponding metadata;
and generating the data relation map according to the target data and the incidence relation.
3. The method according to claim 2, wherein the step of generating the association relationship according to each type of the specification information and the corresponding metadata comprises:
generating node information and attribute information according to the metadata of each type and the graph database;
generating a corresponding relation according to the specification information of each type;
and generating the association relation according to the node information, the attribute information and the corresponding relation.
4. The method of claim 1, wherein said step of generating standard data from said data relationship map and determining abatement effect under current abatement rules from said standard data comprises:
acquiring a data object according to the data relation map;
generating standard data according to the data object and the service requirement;
and determining the treatment effect under the current treatment rule according to the standard data.
5. The method of claim 4, wherein the step of generating standard data based on the data objects and business requirements comprises:
determining data information according to the data object; wherein the data information comprises data object tags, data object attributes and data object relationships;
and generating the standard data according to the data object label, the data object attribute, the data object relation and the service requirement.
6. The method of claim 5, wherein the step of generating the standard data from the data object tags, the data object attributes, the data object relationships, and the business requirements comprises:
generating standard coding information according to the data object label, the data object attribute, the data object relation and the service requirement;
acquiring a standard index, a standard dimension and a standard label according to the standard coding information;
and generating the standard data according to the standard indexes, the standard dimensions and the standard labels.
7. The method of claim 4, wherein said step of determining said abatement effect under said current abatement rule in dependence upon said criteria data comprises:
determining the current governing rule according to the standard data; wherein the current governance rules comprise resource utilization rate and compliance rate;
and generating the treatment effect according to the standard data, the resource utilization rate and the compliance rate.
8. A data governance device, comprising:
the acquisition module is used for acquiring target data and determining metadata in the target data; wherein the metadata type comprises a technology type, an operation type and a service type;
the generation module is used for determining the incidence relation among the target data and generating a data relation map according to the incidence relation;
the determining module is used for generating standard data according to the data relation map and determining the treatment effect under the current treatment rule according to the standard data;
the adjusting module is used for adjusting the current treatment rule according to the treatment effect when the treatment effect does not meet the standard; wherein, the treatment effect does not reach the standard, namely the treatment value is not in the preset range.
9. A computer device comprising a processor, a memory, and a computer program stored on the memory and capable of running on the processor, the computer program, when executed by the processor, implementing the method of any one of claims 1 to 7.
10. A computer-readable storage medium, on which a computer program is stored which, when being executed by a processor, carries out the method according to any one of claims 1 to 7.
Priority Applications (1)
Application Number | Priority Date | Filing Date | Title |
---|---|---|---|
CN202111266686.5A CN113918774A (en) | 2021-10-28 | 2021-10-28 | Data management method, device, equipment and storage medium |
Applications Claiming Priority (1)
Application Number | Priority Date | Filing Date | Title |
---|---|---|---|
CN202111266686.5A CN113918774A (en) | 2021-10-28 | 2021-10-28 | Data management method, device, equipment and storage medium |
Publications (1)
Publication Number | Publication Date |
---|---|
CN113918774A true CN113918774A (en) | 2022-01-11 |
Family
ID=79243397
Family Applications (1)
Application Number | Title | Priority Date | Filing Date |
---|---|---|---|
CN202111266686.5A Pending CN113918774A (en) | 2021-10-28 | 2021-10-28 | Data management method, device, equipment and storage medium |
Country Status (1)
Country | Link |
---|---|
CN (1) | CN113918774A (en) |
Citations (10)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
CN109213747A (en) * | 2018-08-08 | 2019-01-15 | 麒麟合盛网络技术股份有限公司 | A kind of data managing method and device |
CN109299083A (en) * | 2018-10-16 | 2019-02-01 | 全球能源互联网研究院有限公司 | A kind of data governing system |
CN109344133A (en) * | 2018-08-27 | 2019-02-15 | 成都四方伟业软件股份有限公司 | A kind of data administer driving data and share exchange system and its working method |
CN110019150A (en) * | 2019-04-11 | 2019-07-16 | 软通动力信息技术有限公司 | A kind of data administering method, system and electronic equipment |
CN110119395A (en) * | 2019-05-27 | 2019-08-13 | 普元信息技术股份有限公司 | The method that data standard and quality of data association process are realized based on metadata in big data improvement |
CN111159191A (en) * | 2019-12-30 | 2020-05-15 | 深圳博沃智慧科技有限公司 | Data processing method, device and interface |
CN111597177A (en) * | 2020-05-14 | 2020-08-28 | 重庆农村商业银行股份有限公司 | Data governance method for improving data quality |
CN112231315A (en) * | 2020-12-16 | 2021-01-15 | 武汉凡松科技有限公司 | Data management method based on big data |
CN112286916A (en) * | 2020-10-22 | 2021-01-29 | 北京锐安科技有限公司 | Data processing method, device, equipment and storage medium |
CN113392227A (en) * | 2021-05-31 | 2021-09-14 | 交控科技股份有限公司 | Metadata knowledge map engine system facing rail transit field |
-
2021
- 2021-10-28 CN CN202111266686.5A patent/CN113918774A/en active Pending
Patent Citations (10)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
CN109213747A (en) * | 2018-08-08 | 2019-01-15 | 麒麟合盛网络技术股份有限公司 | A kind of data managing method and device |
CN109344133A (en) * | 2018-08-27 | 2019-02-15 | 成都四方伟业软件股份有限公司 | A kind of data administer driving data and share exchange system and its working method |
CN109299083A (en) * | 2018-10-16 | 2019-02-01 | 全球能源互联网研究院有限公司 | A kind of data governing system |
CN110019150A (en) * | 2019-04-11 | 2019-07-16 | 软通动力信息技术有限公司 | A kind of data administering method, system and electronic equipment |
CN110119395A (en) * | 2019-05-27 | 2019-08-13 | 普元信息技术股份有限公司 | The method that data standard and quality of data association process are realized based on metadata in big data improvement |
CN111159191A (en) * | 2019-12-30 | 2020-05-15 | 深圳博沃智慧科技有限公司 | Data processing method, device and interface |
CN111597177A (en) * | 2020-05-14 | 2020-08-28 | 重庆农村商业银行股份有限公司 | Data governance method for improving data quality |
CN112286916A (en) * | 2020-10-22 | 2021-01-29 | 北京锐安科技有限公司 | Data processing method, device, equipment and storage medium |
CN112231315A (en) * | 2020-12-16 | 2021-01-15 | 武汉凡松科技有限公司 | Data management method based on big data |
CN113392227A (en) * | 2021-05-31 | 2021-09-14 | 交控科技股份有限公司 | Metadata knowledge map engine system facing rail transit field |
Similar Documents
Publication | Publication Date | Title |
---|---|---|
Stvilia et al. | A framework for information quality assessment | |
US8527451B2 (en) | Business semantic network build | |
CA2736622C (en) | A computer implemented system and method for providing a community and collaboration platform around knowledge transfer, expertise, innovation, tangible assets, intangible assets and information assets | |
Schintler et al. | Encyclopedia of big data | |
US9015118B2 (en) | Determining and presenting provenance and lineage for content in a content management system | |
US20140012988A1 (en) | Provisioning computer resources on a network | |
US20140067942A1 (en) | Determining recommended recipients of a communication | |
US20110131247A1 (en) | Semantic Management Of Enterprise Resourses | |
CN111125266A (en) | Data processing method, device, equipment and storage medium | |
CN115619364A (en) | Recruitment information publishing method, device and system based on artificial intelligence | |
CN115617776A (en) | Data management system and method | |
CN110807016A (en) | Data warehouse construction method and device applied to financial business and electronic equipment | |
Alghushairy et al. | Data storage | |
CN113297139B (en) | Metadata query method and system and electronic equipment | |
CN106156904B (en) | Cross-platform virtual asset tracing method based on eID | |
Li et al. | Optimal Design of an Information Management System for Government: A Bridge between Government and Citizens | |
CN111753020A (en) | Method and device for establishing relational network model | |
CN113918774A (en) | Data management method, device, equipment and storage medium | |
CN114925050A (en) | Data verification method and device based on knowledge base, electronic equipment and storage medium | |
CN112346938B (en) | Operation auditing method and device, server and computer readable storage medium | |
US10409871B2 (en) | Apparatus and method for searching information | |
Yaremchuck et al. | Search of similar programs using code metrics and big data-based assessment of software reliability | |
US20140143278A1 (en) | Application programming interface layers for analytical applications | |
Zhang et al. | [Retracted] Exploring Artificial Intelligence Architecture in Data Cleaning Based on Bayesian Networks | |
CN109325725A (en) | A kind of information layered acquisition management system of corporate client |
Legal Events
Date | Code | Title | Description |
---|---|---|---|
PB01 | Publication | ||
PB01 | Publication | ||
SE01 | Entry into force of request for substantive examination | ||
SE01 | Entry into force of request for substantive examination |