CN109241179A - Data administering method, system and computer equipment based on data space - Google Patents

Data administering method, system and computer equipment based on data space Download PDF

Info

Publication number
CN109241179A
CN109241179A CN201810864946.0A CN201810864946A CN109241179A CN 109241179 A CN109241179 A CN 109241179A CN 201810864946 A CN201810864946 A CN 201810864946A CN 109241179 A CN109241179 A CN 109241179A
Authority
CN
China
Prior art keywords
data
space
mentioned
source
layers
Prior art date
Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
Pending
Application number
CN201810864946.0A
Other languages
Chinese (zh)
Inventor
胡博
Current Assignee (The listed assignees may be inaccurate. Google has not performed a legal analysis and makes no representation or warranty as to the accuracy of the list.)
Collaborative Data Technology (shenzhen) Co Ltd
Original Assignee
Collaborative Data Technology (shenzhen) Co Ltd
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
Filing date
Publication date
Application filed by Collaborative Data Technology (shenzhen) Co Ltd filed Critical Collaborative Data Technology (shenzhen) Co Ltd
Priority to CN201810864946.0A priority Critical patent/CN109241179A/en
Publication of CN109241179A publication Critical patent/CN109241179A/en
Pending legal-status Critical Current

Links

Landscapes

  • Management, Administration, Business Operations System, And Electronic Commerce (AREA)

Abstract

Present invention discloses a kind of data administering method, system and computer equipment based on data space, comprising steps of building data space source data layer;Data space BI layers is constructed according to the data of data space source data layer and object module;Data space AI layers is constructed according to the data of data space BI layers of building of data, object module and data spatial source data Layer;Corresponding data service is provided according to AI layers of data space of data for third-party application.Data administering method based on data space of the invention, system and computer equipment have the beneficial effect that through building data space source data layer, BI layers of data space and AI layers of data space, form the space with abstraction hierarchy, make to be extended from depth, range and time dimension when more interoperability of system or bigger business occur and meet the needs of data service.

Description

Data administering method, system and computer equipment based on data space
Technical field
The present invention relates to big data field, especially relates to a kind of data administering method based on data space, is System and computer equipment.
Background technique
Interoperability refers to that different computer systems, network, operating system and application program work together and share letter The ability of breath is to determine that information system moves towards the key foundation of intelligent use from data management.However for a long time, mostly System is established on the basis of certain business demands, not compatible with other systems from bigger global consideration, is caused Data standard according to different, business bore disunity, data consistency and availability it is particularly thorny.
The target that data are administered is so that data can be exchanged and be executed with cross-system, and existing data Treatment process method is big It is consider that the mode that " downward to pushing up " is taken by method or metadata management system based on data standard is realized more, that is, it needs The multi-side system for participating in interoperability first reaches an agreement to metadata, final to realize the not homologous ray understanding common to data. Traditional data administering method is also being continuously increased with the increase for participating in inter-operation system, data, it is difficult in different systems Between reach the consistent of data standard and source data.
Summary of the invention
The main object of the present invention is to provide a kind of data administering method, system and computer based on data space to set It is standby, the technical issues of to solve mentioned in above-mentioned background technique.
The present invention proposes a kind of data administering method based on data space, comprising steps of
Construct data space source data layer;
Data space BI layers is constructed according to the data of above-mentioned data space source data layer and object module;
According to BI layers of above-mentioned building data space of data, the number of above-mentioned object module and above-mentioned data space source data layer According to data space AI layers of building;
Corresponding data service is provided according to above-mentioned data space AI layers of data for third-party application.
Further, above-mentioned according to above-mentioned data space source number in the data administering method above-mentioned based on data space According to layer data and object module construct data space BI layers the step of comprising steps of
Corresponding above-mentioned object module is obtained according to the type of service of above-mentioned third-party application and data target type;
According to the data of above-mentioned data space source data layer and above-mentioned data space BI layers of the building of above-mentioned object module.
Further, above-mentioned according to above-mentioned building data space in the data administering method above-mentioned based on data space The step of data of BI layers of data, above-mentioned object module and above-mentioned data space source data layer construct data space AI layers include Step:
It makes inferences to obtain unknown data according to BI layers of data of above-mentioned building data space;
Above-mentioned data are constructed according to above-mentioned unknown data, the data of above-mentioned data space source data layer and above-mentioned object module Space AI layers.
Further, in the data administering method above-mentioned based on data space, above-mentioned building data space source data layer The step of before, further comprise the steps of:
The native data of different business systems is obtained, data lake is formed;
According to the type of service of above-mentioned third-party application and data target type to the native data in above-mentioned data lake into The identification of row data and cleaning, obtain source data.
Further, in the data administering method above-mentioned based on data space, above-mentioned building data space source data layer The step of comprising steps of
Body data warehouse is formed according to above-mentioned source data, and constructs above-mentioned data space source data layer.
Further, in the data administering method above-mentioned based on data space, in the above-mentioned step for obtaining above-mentioned source data After rapid, further comprise the steps of:
Above-mentioned source data is tracked, and grade is carried out to above-mentioned source data according to the source-information of above-mentioned source data and is commented Estimate.
The present invention also proposes a kind of data governing system based on data space, comprising:
Source data layer building module, for constructing data space source data layer;
BI layer building module, for constructing data space according to the data and object module of above-mentioned data space source data layer BI layers;
AI layer building module, for according to BI layers of above-mentioned building data space of data, above-mentioned object module and above-mentioned number Data space AI layers is constructed according to the data of spatial source data Layer;
API module, for providing corresponding data service according to above-mentioned data space AI layers of data for third-party application.
Further, the above-mentioned data governing system based on data space, further includes:
Modeling tool and model library module, for being obtained according to the type of service and data target type of above-mentioned third-party application Take corresponding above-mentioned object module;
Inquiry and inference engine module, it is unknown for making inferences to obtain according to BI layers of above-mentioned building data space of data Data.
Further, the above-mentioned data governing system based on data space, further includes:
Data lake module forms data lake for obtaining the native data of different business systems;
Identification and cleaning module, for the type of service and data target type according to above-mentioned third-party application to above-mentioned number Data identification and cleaning are carried out according to the native data in lake, obtains source data;
Data blood relationship tracing module, is tracked above-mentioned source data, and according to the source-information of above-mentioned source data to upper It states source data and carries out grade assessment.
The present invention also proposes a kind of computer equipment, including memory, processor and storage are on a memory and can be The computer program run on processor, above-mentioned processor are realized as described in any one of embodiment when executing above procedure Method.
Data administering method based on data space, system and having the beneficial effect that for computer equipment of the invention passes through Building data space source data layer, BI layers of data space and AI layers of data space form the space with abstraction hierarchy, make When more interoperability of system or bigger business occur, it can be extended and meet from depth, range and time dimension The demand of data service;And data space AI is improved on the basis of data space source data layer and I layers of data space B of data The data of layer, find and infer deeper data, so that the known knowledge contained from data infers unknown knowledge.
Detailed description of the invention
Fig. 1 is the flow diagram of the data administering method based on data space of one embodiment of the invention;
Fig. 2 is the flow diagram of the data administering method based on data space of one embodiment of the invention;
Fig. 3 is the flow diagram of the data administering method based on data space of one embodiment of the invention;
Fig. 4 is the flow diagram of the data administering method based on data space of one embodiment of the invention;
Fig. 5 is the flow diagram of the data administering method based on data space of one embodiment of the invention;
Fig. 6 is the flow diagram of the data administering method based on data space of one embodiment of the invention;
Fig. 7 is the structural schematic diagram of the data governing system based on data space of one embodiment of the invention;
Fig. 8 is a kind of structural schematic diagram of computer equipment of one embodiment of the invention.
1, source data layer building module;2, BI layer building module;3, AI layer building module;4, API module;5, modeling tool And model library module;6, inquiry and inference engine module;7, data lake module;8, identification and cleaning module;9, data blood relationship chases after Track module;10, computer equipment;11, external equipment;12, processing unit;13, bus;14, network adapter;15, (I/O) connects Mouthful;16, display;17, system storage;18, random access memory (RAM);19, cache memory;20, storage system System;21, program/utility;22, program module.
The embodiments will be further described with reference to the accompanying drawings for the realization, the function and the advantages of the object of the present invention.
Specific embodiment
Following will be combined with the drawings in the embodiments of the present invention, and technical solution in the embodiment of the present invention carries out clear, complete Site preparation description, it is clear that described embodiment is only a part of the embodiments of the present invention, instead of all the embodiments.Base Embodiment in the present invention, it is obtained by those of ordinary skill in the art without making creative efforts it is all its His embodiment, shall fall within the protection scope of the present invention.
It in addition, the technical solution between each embodiment can be combined with each other, but must be with ordinary skill Based on personnel can be realized, this technical side will be understood that when the combination of technical solution appearance is conflicting or cannot achieve The combination of case is not present, also not the present invention claims protection scope within.
Referring to Fig.1, in embodiments of the present invention, the present invention proposes a kind of data administering method based on data space, packet Include step:
S1, building data space source data layer;
S2, data space BI layers is constructed according to the data and object module of above-mentioned data space source data layer;
S3, according to BI layers of above-mentioned building data space of data, above-mentioned object module and above-mentioned data space source data layer Data construct data space AI layers;
S4, corresponding data service is provided for third-party application according to above-mentioned data space AI layers of data.
Such as above-mentioned steps S1, data space source data layer is constructed, different numbers is extracted from different types of operation system According to, and integration processing is carried out to data, to construct above-mentioned data space source data layer, wherein above-mentioned data space source number It is based on the databases such as Apache Hadoop, MySQL, MangoDB according to layer.
Such as above-mentioned steps S2, data space BI is constructed according to the data of above-mentioned data space source data layer and object module Layer, when constructing above-mentioned data space BI layers using different types of data, above-mentioned object module is also different, can to construct Above-mentioned data space BI layers of the data of the above-mentioned data space source data layer of alignment processing, wherein the algorithm of above-mentioned object module It generally comprises: linear regression, cluster, classification, time series, decision tree, nerve center method, improper detection, relevance, factor Analysis, text mining, support vector machines, sequential mining, naive Bayesian, random forest or TensorFlow.
It is empty according to BI layers of above-mentioned building data space of data, above-mentioned object module and above-mentioned data such as above-mentioned steps S3 Between source data layer data construct data space AI layers, to be improved on the basis of above-mentioned data space BI layers of data above-mentioned The data that AI layers of data space find and infer deeper data, thus the known knowledge reasoning contained from data Unknown knowledge out, above-mentioned data space source data layer, above-mentioned data space BI layers and above-mentioned data space AI layers of combination formation one A space with abstraction hierarchy makes when more interoperability of system or bigger business occur, can from depth, range and Time dimension is extended and meets the needs of data service.
Such as above-mentioned steps S4, corresponding data are provided for third-party application according to above-mentioned data space AI layers of data and are taken Business makes above-mentioned third-party application can use above-mentioned data space source data layer, above-mentioned data space BI layers and above-mentioned data sky Between AI layers of realization data administer, above-mentioned third-party application can be based on above-mentioned data space source data layer, above-mentioned data space BI Layer and the building of above-mentioned data space AI layers of data meet the business service of more application scenarios.
Referring to Fig. 2, in the present embodiment, above-mentioned above-mentioned source data and target mould according to above-mentioned data space source data layer Type construct data space BI layers the step of comprising steps of
S5, corresponding above-mentioned object module is obtained according to the type of service and data target type of above-mentioned third-party application;
S6, above-mentioned data space BI layers are constructed according to the data of above-mentioned data space source data layer and above-mentioned object module.
Such as above-mentioned steps S5, obtained according to the type of service of above-mentioned third-party application and data target type corresponding above-mentioned Object module is constructed according to different types of service to the above-mentioned object module for being applied to extract with calculate, and above-mentioned data The type of the data of spatial source data Layer is corresponding with the type of service of above-mentioned third-party application and data target type, wherein The algorithm of above-mentioned object module generally comprises: linear regression, cluster, classification, time series, decision tree, nerve center method, it is non-just Normal detection, relevance, factor analysis, text mining, support vector machines, sequential mining, naive Bayesian, random forest or TensorFlow。
Such as above-mentioned steps S6, above-mentioned data are constructed according to the data of above-mentioned data space source data layer and above-mentioned object module I layers of space B, to construct above-mentioned data space BI layers of the data for capableing of the above-mentioned data space source data layer of alignment processing.
Referring to Fig. 3, in the present embodiment, the above-mentioned data according to BI layers of above-mentioned building data space and above-mentioned object module Building data space AI layers the step of comprising steps of
S7, it makes inferences to obtain unknown data according to BI layers of above-mentioned building data space of data,;
S8, according to above-mentioned unknown data, the data of above-mentioned data space source data layer and above-mentioned object module construct it is above-mentioned AI layers of data space.
Such as above-mentioned steps S7, make inferences to obtain unknown data according to BI layers of data of above-mentioned building data space, thus Go out above-mentioned unknown data according to above-mentioned data space source data layer and above-mentioned data space BI layers of relation inference, wherein above-mentioned Unknown data includes qualitative data and quantitative data.
Such as above-mentioned steps S8, according to above-mentioned unknown data, the data of above-mentioned data space source data layer and above-mentioned target mould Above-mentioned data space AI layers of type building, thus according to above-mentioned unknown data and on the basis of above-mentioned data space BI layers of data Above-mentioned data space AI layers of data are improved, find and infer deeper data, thus known to containing from data Knowledge reasoning goes out unknown knowledge, wherein the algorithm of above-mentioned object module generally comprises: linear regression, cluster, classification, time system Column, decision tree, nerve center method, improper detection, relevance, factor analysis, text mining, support vector machines, sequential mining, Naive Bayesian, random forest or TensorFlow.
Referring to Fig. 4, in the present embodiment, the step of above-mentioned building data space source data layer before, further comprise the steps of:
S9, the native data for obtaining different business systems form data lake;
S10, according to the type of service and data target type of above-mentioned third-party application to the primary number in above-mentioned data lake According to data identification and cleaning is carried out, source data is obtained.
Such as above-mentioned steps S9, the native data of different business systems is obtained, forms data lake, using data extraction tool, Above-mentioned native data is directly extracted from different operation systems, and is converged in above-mentioned data lake, wherein in above-mentioned data lake In data be above-mentioned initial data real-time or near real-time mirror image.
Such as above-mentioned steps S10, according to the type of service of above-mentioned third-party application and data target type to above-mentioned data lake In native data carry out data identification and cleaning, obtain source data, wherein the native data in above-mentioned data lake is to be based on The realization of the tools such as Kafka, Sqoop is obtained from different business systems, and above-mentioned source data is using the methods of ETL to above-mentioned number The identification of data is carried out according to the initial data in lake and cleaning obtains.
Referring to Fig. 5, in the present embodiment, the step of above-mentioned building data space source data layer comprising steps of
S11, body data warehouse is formed according to above-mentioned source data, and constructs above-mentioned data space source data layer.
Such as above-mentioned steps S11, body data warehouse is formed according to above-mentioned source data, and constructs above-mentioned data space source data Layer, different source datas is extracted from different types of operation system, and carry out being integrally formed above-mentioned master according to above-mentioned source data Volume data warehouse, so that the aforementioned body data warehouse of different levels and type is formed, to construct above-mentioned data space source Data Layer, wherein above-mentioned data space source data layer is based on the databases such as Apache Hadoop, MySQL, MangoDB.
It after above-mentioned the step of obtaining above-mentioned source data, is further comprised the steps of: in the present embodiment referring to Fig. 6
S12, above-mentioned source data is tracked, and according to the source-information of above-mentioned source data above-mentioned source data is carried out etc. Grade assessment.
Such as above-mentioned steps S12, above-mentioned source data is tracked, and according to the source-information of above-mentioned source data to above-mentioned source Data carry out grade assessment, and the feasibility degree of the above-mentioned source data of different stage has differences, and above-mentioned source data source is closer The higher grade of source data, above-mentioned source data, and confidence level is also higher.
Referring to Fig.1-6, in the present embodiment, the data administering method based on data space, comprising steps of
S9, the native data for obtaining different business systems form data lake;
S10, according to the type of service and data target type of above-mentioned third-party application to the primary number in above-mentioned data lake According to data identification and cleaning is carried out, above-mentioned source data is obtained;
S12, above-mentioned source data is tracked, and according to the source-information of above-mentioned source data above-mentioned source data is carried out etc. Grade assessment;
S11, data space source data layer is constructed according to above-mentioned source data;
S5, corresponding above-mentioned object module is obtained according to the type of service and data target type of above-mentioned third-party application;
S6, above-mentioned data space BI layers are constructed according to above-mentioned source data and above-mentioned object module;
S7, it makes inferences to obtain unknown data according to BI layers of above-mentioned building data space of data;
S8, above-mentioned data space AI layers are constructed according to above-mentioned unknown data, above-mentioned source data and above-mentioned object module;
S4, corresponding data service is provided for third-party application according to above-mentioned data space AI layers of data.
Referring to Fig. 7, the present invention also proposes a kind of data governing system based on data space, comprising:
Source data layer building module 1 is extracted for constructing data space source data layer from different types of operation system Different data, and integration processing is carried out to data, to construct above-mentioned data space source data layer, wherein above-mentioned data Spatial source data Layer is based on the databases such as Apache Hadoop, MySQL, MangoDB;
BI layer building module 2, it is empty for constructing data according to the data and object module of above-mentioned data space source data layer Between BI layers, using different types of data construct above-mentioned data space BI layers when, above-mentioned object module is also different, to construct It is capable of above-mentioned data space BI layers of the data of the above-mentioned data space source data layer of alignment processing, wherein above-mentioned object module Algorithm generally comprises: linear regression, cluster, classification, time series, decision tree, nerve center method, improper detection, relevance, Factor analysis, text mining, support vector machines, sequential mining, naive Bayesian, random forest or TensorFlow;
AI layer building module 3, for according to BI layers of above-mentioned building data space of data, above-mentioned object module and above-mentioned number Data space AI layers is constructed according to the data of spatial source data Layer, thus perfect on the basis of above-mentioned data space BI layers of data Above-mentioned data space AI layers of data find and infer deeper data, thus the known knowledge contained from data Infer unknown knowledge, above-mentioned data space source data layer, above-mentioned data space BI layers and above-mentioned data space AI layers of combination shape At a space with abstraction hierarchy, make when more interoperability of system or bigger business occur, it can be from depth, wide Degree and time dimension are extended and meet the needs of data service;
API module 4 takes for providing corresponding data according to above-mentioned data space AI layers of data for third-party application Business makes above-mentioned third-party application can use above-mentioned data space source data layer, above-mentioned data space BI layers and above-mentioned data sky Between AI layers of realization data administer, above-mentioned third-party application can be based on above-mentioned data space source data layer, above-mentioned data space BI Layer and the building of above-mentioned data space AI layers of data meet the business service of more application scenarios.
In the present embodiment, further includes:
Modeling tool and model library module 5, for the type of service and data target type according to above-mentioned third-party application Corresponding above-mentioned object module is obtained, according to different type of service buildings to the above-mentioned target mould for being applied to extract with calculate Type, and the type of service and data target class of the type of the data of above-mentioned data space source data layer and above-mentioned third-party application Type is corresponding, wherein the algorithm of above-mentioned object module generally comprises: linear regression, cluster, classification, time series, decision tree, Nerve center method, improper detection, relevance, factor analysis, text mining, support vector machines, sequential mining, simple pattra leaves This, random forest or TensorFlow;
Inquiry and inference engine module 6, for making inferences to obtain not according to BI layers of above-mentioned building data space of data Primary data, to go out above-mentioned unknown number according to above-mentioned data space source data layer and above-mentioned data space BI layers of relation inference According to, wherein above-mentioned unknown data includes qualitative data and quantitative data.
In the present embodiment, further includes:
Data lake module 7 forms data lake, utilizes data pick-up work for obtaining the native data of different business systems Tool, directly extracts above-mentioned native data, and converge in above-mentioned data lake, wherein in above-mentioned number from different operation systems It is the real-time or near real-time mirror image of above-mentioned initial data according to the data in lake;
Identification and cleaning module 8, for the type of service and data target type according to above-mentioned third-party application to above-mentioned Native data in data lake carries out data identification and cleaning, obtains source data, wherein the native data in above-mentioned data lake is It is realized based on tools such as Kafka, Sqoop and is obtained from different business systems, and above-mentioned source data is using the methods of ETL to upper State identification and cleaning acquisition that the initial data in data lake carries out data;
Data blood relationship tracing module 9, is tracked above-mentioned source data, and according to the source-information of above-mentioned source data to upper It states source data and carries out grade assessment, the feasibility degree of the above-mentioned source data of different stage has differences, and above-mentioned source data source is got over Close to source data, the higher grade of above-mentioned source data, and confidence level is also higher.
In the present embodiment, further includes:
Secondary BI layer building module, for according to the data of above-mentioned data space source data layer and the building of above-mentioned object module BI layers of data space are stated, to construct the above-mentioned data space for capableing of the data of the above-mentioned data space source data layer of alignment processing BI layers;
Secondary AI layer building module, for according to the data of above-mentioned unknown data, above-mentioned data space source data layer and above-mentioned Above-mentioned data space AI layers of object module building, thus according to above-mentioned unknown data and in above-mentioned data space BI layers of data On the basis of improve above-mentioned data space AI layers of data, find and infer deeper data, to contain from data Known knowledge infer unknown knowledge, wherein the algorithm of above-mentioned object module generally comprises: linear regression, cluster, classification, Time series, decision tree, nerve center method, improper detection, relevance, factor analysis, text mining, support vector machines, sequence Arrange excavation, naive Bayesian, random forest or TensorFlow;
Body data warehouse module, for forming body data warehouse according to above-mentioned source data, and it is empty to construct above-mentioned data Between source data layer, different source datas is extracted from different types of operation system, and integration shape is carried out according to above-mentioned source data At aforementioned body data warehouse, so that the aforementioned body data warehouse of different levels and type is formed, to construct above-mentioned number According to spatial source data Layer, wherein above-mentioned data space source data layer is based on the data such as Apache Hadoop, MySQL, MangoDB Library;
Quality of data monitoring instrument module, for monitoring the quality of collected data, once monitor above-mentioned data Quality changes, and just gives and warns in advance.
Data security protecting tool desensitization protection, the production peace such as encrypts for carrying out to the data for being related to secret or privacy Full data copy or setting data access authority.
Referring to Fig. 8, in embodiments of the present invention, the present invention also provides a kind of computer equipment, above-mentioned computer equipment 10 It is showed in the form of universal computing device, the component of computer equipment 10 can include but is not limited to: one or more processing Device or processing unit 10, system storage 17 connect different system components (including system storage 17 and processing unit 12) Bus 13;
Bus 13 indicates one of a few class bus structures or a variety of, including memory bus or Memory Controller, Peripheral bus, graphics acceleration port, processor or the local bus using any bus structures in a variety of bus structures.It lifts For example, these architectures include but is not limited to industry standard architecture (ISA) bus, microchannel architecture (MAC) Bus, enhanced isa bus, Video Electronics Standards Association (VESA) local bus and peripheral component interconnection (PCI) bus.
Computer equipment 10 typically comprises a variety of computer system readable media.These media can be it is any can be by The usable medium that computer equipment 10 accesses, including volatile and non-volatile media, moveable and immovable medium.
System storage 17 may include the computer system readable media of form of volatile memory, such as arbitrary access Memory (RAM) 18 and/or cache memory 19.Computer equipment 10 may further include other movement/it is not removable Dynamic, volatile/non-volatile computer decorum storage medium.Only as an example, storage system 20 can be used for read and write can not Mobile, non-volatile magnetic media (commonly referred to as " hard disk drive ").Although being not shown in Fig. 8, can provide for can The disc driver of mobile non-volatile magnetic disk (such as " floppy disk ") read-write, and to removable anonvolatile optical disk (such as CD~ ROM, DVD~ROM or other optical mediums) read-write CD drive.In these cases, each driver can pass through one A or multiple data media interfaces are connected with bus 13.Memory may include at least one program product, the program product With one group of (for example, at least one) program module 22, these program modules 22 are configured to perform the function of various embodiments of the present invention Energy.
Program/utility 21 with one group of (at least one) program module 22, can store in memory, for example, Such program module 22 includes --- but being not limited to --- operating system, one or more application program, other program moulds It may include the realization of network environment in block and program data, each of these examples or certain combination.Program module 22 usually execute function and/or method in embodiment described in the invention.
Computer equipment 10 can also with one or more external equipments 11 (such as keyboard, sensing equipment, display 16, Camera etc.) communication, the equipment interacted with the computer equipment 10 can be also enabled a user to one or more to be communicated, and/ Or with the computer equipment 10 is communicated with one or more of the other calculating equipment any equipment (such as network interface card, Modem etc.) communication.This communication can be carried out by input/output (I/O) interface 15.Also, computer equipment 10 can also by network adapter 14 and one or more network (such as local area network (LAN)), wide area network (WAN) and/or Public network (such as internet) communication.As shown, network adapter 14 passes through other of bus 13 and computer equipment 10 Module communication.It should be understood that although being not shown in Fig. 8 other hardware and/or software mould can be used in conjunction with computer equipment 10 Block, including but not limited to: microcode, device driver, redundant processing unit, external disk drive array, RAID system, tape Driver and data backup storage system etc..
Processing unit 12 by the program that is stored in system storage 17 of operation, thereby executing various function application and Data processing, such as realize the data administering method based on data space provided by the embodiment of the present invention.
That is, above-mentioned processing unit 12 is realized when executing above procedure: building data space source data layer, and according to above-mentioned The data and object module of data space source data layer construct data space BI layers, further according to BI layers of above-mentioned building data space The data of data, above-mentioned object module and above-mentioned data space source data layer construct data space AI layers, also, according to above-mentioned number Corresponding data service is provided according to space AI layers of data for third-party application.
Data administering method based on data space, system and having the beneficial effect that for computer equipment of the invention passes through Construct above-mentioned data space source data layer, data space BI layers and data space AI layers above-mentioned above-mentioned, formed one have it is abstract The space of level makes when more interoperability of system or bigger business occur, can from depth, range and time dimension into Row extends and meets the needs of data service;And in above-mentioned data space source data layer and above-mentioned data space BI layers of data On the basis of improve above-mentioned data space AI layers of data, find and infer deeper data, to contain from data Known knowledge infer unknown knowledge.
The above description is only a preferred embodiment of the present invention, is not intended to limit the scope of the invention, all utilizations Equivalent structure or equivalent flow shift made by description of the invention and accompanying drawing content is applied directly or indirectly in other correlations Technical field, be included within the scope of the present invention.

Claims (10)

1. a kind of data administering method based on data space, which is characterized in that comprising steps of
Construct data space source data layer;
Data space BI layers is constructed according to the data of the data space source data layer and object module;
According to the data of data space BI layers of the building, the data structure of the object module and the data space source data layer Build AI layers of data space;
Corresponding data service is provided according to data space AI layers of the data for third-party application.
2. the data administering method according to claim 1 based on data space, which is characterized in that described according to the number According to spatial source data Layer data and object module construct data space BI layers the step of comprising steps of
The corresponding object module is obtained according to the type of service of the third-party application and data target type;
According to the data of the data space source data layer and described data space BI layers of object module building.
3. the data administering method according to claim 1 based on data space, which is characterized in that described according to the structure Build data space AI layers of the data building of BI layers of data space of data, the object module and the data space source data layer The step of comprising steps of
It makes inferences to obtain unknown data according to the data of data space BI layers of the building;
The data space is constructed according to the unknown data, the data of the data space source data layer and the object module AI layers.
4. the data administering method according to claim 1 based on data space, which is characterized in that the building data are empty Between source data layer the step of before, further comprise the steps of:
The native data of different business systems is obtained, data lake is formed;
The native data in the data lake is counted according to the type of service of the third-party application and data target type According to identification and cleaning, source data is obtained.
5. the data administering method according to claim 4 based on data space, which is characterized in that the building data are empty Between source data layer the step of comprising steps of
Body data warehouse is formed according to the source data, and constructs the data space source data layer.
6. the data administering method according to claim 4 based on data space, which is characterized in that described in described obtain After the step of source data, further comprise the steps of:
The source data is tracked, and grade assessment is carried out to the source data according to the source-information of the source data.
7. a kind of data governing system based on data space characterized by comprising
Source data layer building module, for constructing data space source data layer;
BI layer building module, for constructing data space BI according to the data and object module of the data space source data layer Layer;
AI layer building module, it is empty for data, the object module and the data according to data space BI layers of the building Between source data layer data construct data space AI layers;
API module, for providing corresponding data service according to data space AI layers of the data for third-party application.
8. the data governing system according to claim 7 based on data space, which is characterized in that further include:
Modeling tool and model library module, for according to the acquisition pair of the type of service and data target type of the third-party application The object module answered;
Inquiry and inference engine module, for making inferences to obtain unknown number according to the data of data space BI layers of the building According to.
9. the data governing system according to claim 7 based on data space, which is characterized in that further include:
Data lake module forms data lake for obtaining the native data of different business systems;
Identification and cleaning module, for the type of service and data target type according to the third-party application to the data lake In native data carry out data identification and cleaning, obtain source data;
Data blood relationship tracing module, is tracked the source data, and according to the source-information of the source data to the source Data carry out grade assessment.
10. a kind of computer equipment, can run on a memory and on a processor including memory, processor and storage Computer program, which is characterized in that the processor is realized when executing described program such as any one of claim 1~6 institute The method stated.
CN201810864946.0A 2018-08-01 2018-08-01 Data administering method, system and computer equipment based on data space Pending CN109241179A (en)

Priority Applications (1)

Application Number Priority Date Filing Date Title
CN201810864946.0A CN109241179A (en) 2018-08-01 2018-08-01 Data administering method, system and computer equipment based on data space

Applications Claiming Priority (1)

Application Number Priority Date Filing Date Title
CN201810864946.0A CN109241179A (en) 2018-08-01 2018-08-01 Data administering method, system and computer equipment based on data space

Publications (1)

Publication Number Publication Date
CN109241179A true CN109241179A (en) 2019-01-18

Family

ID=65073417

Family Applications (1)

Application Number Title Priority Date Filing Date
CN201810864946.0A Pending CN109241179A (en) 2018-08-01 2018-08-01 Data administering method, system and computer equipment based on data space

Country Status (1)

Country Link
CN (1) CN109241179A (en)

Cited By (4)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN110263229A (en) * 2019-06-27 2019-09-20 北京中油瑞飞信息技术有限责任公司 A kind of data administering method and device based on data lake
CN110457405A (en) * 2019-08-20 2019-11-15 上海观安信息技术股份有限公司 A kind of database audit method based on genetic connection
CN111897864A (en) * 2020-08-13 2020-11-06 创智和宇信息技术股份有限公司 Expert database data extraction method and system based on Internet AI outbound
CN118379026A (en) * 2024-06-21 2024-07-23 广州信安数据有限公司 Intelligent data management system based on consultation planning

Citations (7)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
US20020070953A1 (en) * 2000-05-04 2002-06-13 Barg Timothy A. Systems and methods for visualizing and analyzing conditioned data
CN1967579A (en) * 2006-02-13 2007-05-23 湖南大学 Aid excavating analysis system of financial risk management
CN101075304A (en) * 2006-05-18 2007-11-21 河北全通通信有限公司 Method for constructing decision supporting system of telecommunication industry based on database
CN102231869A (en) * 2011-06-27 2011-11-02 北京邮电大学 Realization method for refinement operation system architecture of valued-added service
CN102523246A (en) * 2011-11-23 2012-06-27 陈刚 Cloud computation treating system and method
CN106339509A (en) * 2016-10-26 2017-01-18 国网山东省电力公司临沂供电公司 Power grid operation data sharing system based on large data technology
CN106570081A (en) * 2016-10-18 2017-04-19 同济大学 Semantic net based large scale offline data analysis framework

Patent Citations (7)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
US20020070953A1 (en) * 2000-05-04 2002-06-13 Barg Timothy A. Systems and methods for visualizing and analyzing conditioned data
CN1967579A (en) * 2006-02-13 2007-05-23 湖南大学 Aid excavating analysis system of financial risk management
CN101075304A (en) * 2006-05-18 2007-11-21 河北全通通信有限公司 Method for constructing decision supporting system of telecommunication industry based on database
CN102231869A (en) * 2011-06-27 2011-11-02 北京邮电大学 Realization method for refinement operation system architecture of valued-added service
CN102523246A (en) * 2011-11-23 2012-06-27 陈刚 Cloud computation treating system and method
CN106570081A (en) * 2016-10-18 2017-04-19 同济大学 Semantic net based large scale offline data analysis framework
CN106339509A (en) * 2016-10-26 2017-01-18 国网山东省电力公司临沂供电公司 Power grid operation data sharing system based on large data technology

Cited By (7)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN110263229A (en) * 2019-06-27 2019-09-20 北京中油瑞飞信息技术有限责任公司 A kind of data administering method and device based on data lake
CN110263229B (en) * 2019-06-27 2020-06-02 北京中油瑞飞信息技术有限责任公司 Data lake-based data management method and device
CN110457405A (en) * 2019-08-20 2019-11-15 上海观安信息技术股份有限公司 A kind of database audit method based on genetic connection
CN110457405B (en) * 2019-08-20 2021-09-21 上海观安信息技术股份有限公司 Database auditing method based on blood relationship
CN111897864A (en) * 2020-08-13 2020-11-06 创智和宇信息技术股份有限公司 Expert database data extraction method and system based on Internet AI outbound
CN118379026A (en) * 2024-06-21 2024-07-23 广州信安数据有限公司 Intelligent data management system based on consultation planning
CN118379026B (en) * 2024-06-21 2024-09-24 广州信安数据有限公司 Intelligent data management system based on consultation planning

Similar Documents

Publication Publication Date Title
CN110797124B (en) Model multiterminal collaborative training method, medical risk prediction method and device
US11727311B2 (en) Classifying user behavior as anomalous
Choudhary et al. Distributed mapping with privacy and communication constraints: Lightweight algorithms and object-based models
Tran et al. Shape grammar approach to 3D modeling of indoor environments using point clouds
CN109241179A (en) Data administering method, system and computer equipment based on data space
CN105653559B (en) Method and apparatus for scanning in the database
CN104794151A (en) Spatial knowledge service system building method based on collaborative plotting technology
CN104298785B (en) Searching method for public searching resources
DE102016204322B4 (en) Securing a unit using graphical analysis
Li et al. Integration Methods and Advantages of Machine Learning with Cloud Data Warehouses
CN111737364A (en) Safe multi-party data fusion and federal sharing method, device, equipment and medium
CN107346315A (en) The structure and application process of object data association index system and the system
Desarkar et al. Big-data analytics, machine learning algorithms and scalable/parallel/distributed algorithms
US11042749B2 (en) Augmented reality mapping systems and related methods
Xuqin Application of network protocol improvement and image content search in mathematical calculus 3D modeling video analysis
EP3249557B1 (en) Computer implemented and computer controlled method, computer program product and platform for arranging data for processing and storage at a data storage engine
US20150234859A1 (en) Managing Inferred Data
WO2020001614A1 (en) Ontology construction method and apparatus
Ma et al. [Retracted] The Construction of Big Data Computational Intelligence System for E‐Government in Cloud Computing Environment and Its Development Impact
US20190171756A1 (en) Cognitive decision system for security and log analysis using associative memory mapping in graph database
RU2739473C1 (en) Data structure formation method and system based on multilayer graph
US20220398331A1 (en) Property-level visibilities for knowledge-graph objects
CN114331774A (en) Student sleep judging method and device, electronic equipment and storage medium
US20240020286A1 (en) Database management apparatus and database management method
US20220075803A1 (en) Graph framework (database methods) to analyze trillion cell reservoir and basin simulation results

Legal Events

Date Code Title Description
PB01 Publication
PB01 Publication
SE01 Entry into force of request for substantive examination
SE01 Entry into force of request for substantive examination