CN108228747A - Data genetic connection visualized graphs system in data improvement - Google Patents

Data genetic connection visualized graphs system in data improvement Download PDF

Info

Publication number
CN108228747A
CN108228747A CN201711383801.0A CN201711383801A CN108228747A CN 108228747 A CN108228747 A CN 108228747A CN 201711383801 A CN201711383801 A CN 201711383801A CN 108228747 A CN108228747 A CN 108228747A
Authority
CN
China
Prior art keywords
data
node
visualized graphs
genetic connection
stream compression
Prior art date
Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
Pending
Application number
CN201711383801.0A
Other languages
Chinese (zh)
Inventor
朱科支
欧阳翔
Current Assignee (The listed assignees may be inaccurate. Google has not performed a legal analysis and makes no representation or warranty as to the accuracy of the list.)
Jiangsu Number Plus Data Technology Co Ltd
Original Assignee
Jiangsu Number Plus Data Technology Co Ltd
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
Filing date
Publication date
Application filed by Jiangsu Number Plus Data Technology Co Ltd filed Critical Jiangsu Number Plus Data Technology Co Ltd
Priority to CN201711383801.0A priority Critical patent/CN108228747A/en
Publication of CN108228747A publication Critical patent/CN108228747A/en
Pending legal-status Critical Current

Links

Classifications

    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06FELECTRIC DIGITAL DATA PROCESSING
    • G06F16/00Information retrieval; Database structures therefor; File system structures therefor
    • G06F16/20Information retrieval; Database structures therefor; File system structures therefor of structured data, e.g. relational data
    • G06F16/21Design, administration or maintenance of databases
    • G06F16/215Improving data quality; Data cleansing, e.g. de-duplication, removing invalid entries or correcting typographical errors
    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06FELECTRIC DIGITAL DATA PROCESSING
    • G06F16/00Information retrieval; Database structures therefor; File system structures therefor
    • G06F16/20Information retrieval; Database structures therefor; File system structures therefor of structured data, e.g. relational data
    • G06F16/22Indexing; Data structures therefor; Storage structures
    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06FELECTRIC DIGITAL DATA PROCESSING
    • G06F16/00Information retrieval; Database structures therefor; File system structures therefor
    • G06F16/20Information retrieval; Database structures therefor; File system structures therefor of structured data, e.g. relational data
    • G06F16/25Integrating or interfacing systems involving database management systems
    • G06F16/254Extract, transform and load [ETL] procedures, e.g. ETL data flows in data warehouses
    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06FELECTRIC DIGITAL DATA PROCESSING
    • G06F16/00Information retrieval; Database structures therefor; File system structures therefor
    • G06F16/20Information retrieval; Database structures therefor; File system structures therefor of structured data, e.g. relational data
    • G06F16/26Visual data mining; Browsing structured data

Landscapes

  • Engineering & Computer Science (AREA)
  • Theoretical Computer Science (AREA)
  • Databases & Information Systems (AREA)
  • Data Mining & Analysis (AREA)
  • Physics & Mathematics (AREA)
  • General Engineering & Computer Science (AREA)
  • General Physics & Mathematics (AREA)
  • Software Systems (AREA)
  • Quality & Reliability (AREA)
  • Medical Treatment And Welfare Office Work (AREA)

Abstract

The present invention provides the data genetic connection visualized graphs system during a kind of data are administered, and including information node, also includes with lower module:Stream compression circuit:Refer to the path of the stream compression;Extract at least one of polices node, cleaning rule node, transformation rule node, loading rule node and processing regular node node;The extraction polices node is used to illustrate how data extract;The cleaning rule node is used to represent the screening criteria of the data during the stream compression;The transformation rule node is used to represent the variation standard of the data during the stream compression;The loading rule node is used to illustrate how data are put in storage;The processing regular node is used to represent the data filing or destruction.The present invention is by the genetic connections of different levels, and the understanding data that can be will be apparent that migrate circulation, and the management of assessment, data for data value provides foundation.

Description

Data genetic connection visualized graphs system in data improvement
Technical field
The present invention relates to enterprise's portrait field, the data genetic connection visualized graphs system in particularly a kind of data improvement System.
Background technology
In human society, genetic connection refers to the interpersonal relationships generated by marriage or fertility.Such as parent and child Relationship, siblings' relationship and other kinships derived from therefrom.It is the inborn inherent relationship of people, Human society is just existing at the beginning of generating, and is a kind of social relationships formed earliest.
In the current big data epoch, data burst increases, and magnanimity, various types of data are quickly generating. The data information of these bulky complex is merged by marriage, converts transformation, circulation circulation, and generate new data, pooled The ocean of data.
The generations of data, processing fusion, circulation circulation, it is natural between data to form a kind of relationship to final extinction. We use for reference a kind of relationship similar in human society to express this relationship between data, and the referred to as blood relationship of data is closed System.Different from the genetic connection in human society, the genetic connection of data further comprises some distinctive features:
1. belongingness.In general, specifically tissue or individual, data have belongingness to specific attribution data.
2. polyphyly.Same data can have multiple sources(Multiple fathers).One data generation can be multiple numbers According to what is generated by processing, and this process can be multiple.
3. trackability.The genetic connection of data embodies the life cycle of data, embodies data from generating extinction Whole process, have trackability.
Hierarchy.The genetic connection of data has levels.The description to data such as classification, conclusion, summary to data Information forms new data, and different degrees of description information forms the level of data.
How the visualization of data genetic connection is become everybody and compare focus of attention.
One is disclosed in sohu.com《Have relationship by blood between data-data must not administer the genetic connection that is ignorant of and comb Method》(Network address is http://www.sohu.com/a/161142366_99934777), disclose a kind of data genetic connection Visual presentation system, within the system, including information node, stream compression circuit, cleaning rule node, transformation rule section Regular node is destroyed in point and data filing, can clearly show data genetic connection.On the one hand the invention lacks for each The setting of kind rule so that the judgement of cleaning, conversion and destruction is susceptible to problem, and still further aspect lacks the extraction to data The description of method and loading strategy easily makes the visualized graphs of data genetic connection generate deviation.
Invention content
To solve the above-mentioned problems, the data genetic connection visualized graphs system in being administered the present invention provides a kind of data For system by the genetic connection of different levels, the understanding data that can be will be apparent that migrate circulation, are assessment, the data of data value Management provide foundation.Detailed rule is set, data are preferably combed with facilitating.
The specific technical solution of the present invention is as follows:
The present invention proposes the data genetic connection visualized graphs system during a kind of data are administered, and including information node, also wraps Containing with lower module:
Stream compression circuit:Refer to the path of the stream compression;
Extract polices node, cleaning rule node, transformation rule node, transformation rule node, processing regular node and loading rule Then at least one of node node;
The extraction polices node is used to illustrate how data extract;
The cleaning rule node is used to represent the screening criteria of the data during the stream compression;
The transformation rule node is used to represent the variation standard of the data during the stream compression;
The loading rule node is used to illustrate how data are put in storage;
The processing regular node is used to represent the data filing or destruction.
Preferably, described information node is used to show the owner of data and data hierarchical information or end message.
In said program preferably, described information node includes host node, data flow egress and data inflow section At least one of point.
In said program preferably, the host node is located at the center of the visualized graphs, is the visualization The core node of figure.
In said program preferably, the data flow ingress is the father node of the host node, represents that data are come Source includes at least one data flow ingress in the visualized graphs.
In said program preferably, the data flow egress is the child node of the host node, represents data Whereabouts includes at least one data flow egress in the visualized graphs.
In said program preferably, terminal node is a kind of special data flow egress, and mark data is not It circulates down again.
In said program preferably, the stream compression circuit is come out from the data flow ingress toward the main section Point convergence, and flow out toward the data flow egress and spread from the host node.
In said program preferably, the data spread circuit table and reveal direction, data update magnitude and data more The information of at least one dimension in the new frequency.
In said program preferably, in the visualized graphs, the data update magnitude passes through the thick of lines It is thin to show.
In said program preferably, in the visualized graphs, length that the data update frequency passes through lines It spends to show.
In said program preferably, described collect is divided into full dose extraction and increment extraction.
In said program preferably, the cleaning refers to data receiving according to the requirement of data to filter access The process of data.
In said program preferably, the cleaning rule node is located on the stream compression circuit, described in expression Data standard described in data fit on stream compression circuit can continue circulation and go down.
In said program preferably, the conversion refers to data to need to carry out specially treated to meet demand data The process of the requirement of side.
In said program preferably, the transformation rule node is located on the stream compression circuit, represents data Circulate the variation or transformation occurred in the process.
In said program preferably, the transformation rule is included according to total score, and practical scoring is converted to what is specified Divide system.
In said program preferably, the loading is divided into full dose loading and step increment method.
In said program preferably, the loading rule includes the following contents:
1)Paging full dose extracts data, and step increment method is realized according to the period of loading policy definition;
2)Comparison loading data and target data still do not update target data according to the update of the loading strategy decision of configuration.
In said program preferably, the processing refers to the mistake data filing or destruction according to processing rule Journey.
In said program preferably, the processing rule refers to, according to enterprise requirements setting processing condition, work as data After meeting the treatment conditions, which is handled.
Data genetic connection visualized graphs system in data improvement proposed by the present invention, is to utilize computer graphics And image processing techniques, it converts the data into figure or image is shown on the screen, and carry out the theory of interaction process, side Method and technology.Visually meaning is to transmit signal fasterly, and image intuitively shows data and its relationship Come, user is facilitated to inquire into, essence is explored, pinpoints the problems.
Description of the drawings
Fig. 1 is a preferred embodiment of the data genetic connection visualized graphs system during data according to the invention are administered Visualization model schematic diagram.
Fig. 2 is the structural data genetic connection of enterprise's portrait system according to the invention based on multi objective dimensional model An embodiment hierarchical chart.
Fig. 3 is the file server data blood relationship of enterprise's portrait system according to the invention based on multi objective dimensional model The hierarchical chart of one embodiment of relationship.
Specific embodiment
Embodiment 1
Visualization from technological concept, is using computer graphics and image processing techniques, converts the data into figure Or image is shown on the screen, and carries out the theory of interaction process, methods and techniques.Visual meaning is rapid fast Signal is transmitted promptly, and image intuitively shows data and its relationship, and user is facilitated to inquire into, explore essence, finds to ask Topic.
For the genetic connection of data, visualization is particularly important.Only by visualizing, what genetic connection can be just apparent from Show in front of the user.
According to the characteristics of data genetic connection, we devise the genetic connection visualized graphs of data.
According to the difference of performance meaning, the visualized graphs of genetic connection include 5 kinds of visualized elements, are distributed in figure Different location.As shown in Figure 1, visualized elements include father node 100, data flow egress 101 and the data in information node Node 102 is flowed into, stream compression circuit 110 is further included, extracts polices node 120, cleaning rule node 130, transformation rule section Point 140, loading rule node 150 and processing regular node 160.
1st, information node
Information node is used for showing the owner of data and data hierarchical information or end message.According to genetic connection level not Together, data information different from.There was only the information of the owner in owner's level, other levels then include owner information and Data hierarchy information or end message, such as the genetic connection of the interfield of relational database, the description information of the node is just It is:Owner's database tables of data data fields.
There are three types of types for information node:Host node 100, data flow egress 101 and data flow into node 102.
There are one host nodes 100, is the core node of visualized graphs positioned at the centre of whole figure.Figure is shown Genetic connection be exactly this node genetic connection, other genetic connections unrelated with this node do not show on figure, with Ensure the simple, clear of figure.
Data flow ingress 101 can have multiple, be the father node of host node, data source be represented, positioned at whole figure Left side.
Data flow egress 102 can also have multiple, be the child node of host node, the whereabouts of data represented, positioned at entire The right side of figure.Data flow egress includes a kind of special node, i.e., terminal node, terminal node 103 are a kind of special Data flow egress represents that data no longer circulate down, and this data are generally used to visualize.
2nd, stream compression circuit 110
What stream compression circuit showed is the circulation path of data, is from left to right circulated.Stream compression circuit is flowed into from data and is saved The convergence of dealing host node is pointed out, and is flowed out from host node toward the diffusion of data flow egress.
Stream compression circuit is demonstrated by the information of three dimensions, is direction, data update magnitude, data update frequency respectively It is secondary.
The manifestation mode in direction, does not do special design, and acquiescence from left to right circulates;
The magnitude of data update is showed by the thickness of lines.Lines are thicker to represent that data magnitude is bigger, and lines get over detailed rules and regulations table Registration is smaller according to magnitude.
The frequency of data update is showed using the length of lines middle conductor.Line segment is shorter to represent that the update frequency is higher, line Duan Yuechang represents the newer frequency the end of month, and a solid line then represents only to circulate primary.
3rd, polices node 120 is extracted
Polices node is extracted for illustrating data are how to extract.The extraction of data is divided into full dose extraction and increment extraction.
Strategy is extracted to be represented with a circle for indicating capitalization " E " on visualized graphs.Check specific extraction Policing action is also very simple, and mouse is moved on the circle for indicating capitalization " E ", then can automatic Display extraction strategy.
4th, cleaning rule node 130
Cleaning rule node is used for showing the screening criteria during stream compression.A large amount of data distribution in different places, Requirement of each place to the quality of data is different, and data receiving can filter access according to oneself requirement to data Data, these requirements just form data standard, and do data cleansing according to these standards.
Cleaning rule might have a variety of.Such as requirement cannot be null value, require to meet certain form.In visualized graphs On, cleaning rule is represented with a circle for indicating capitalization " C ", various rules is simplified expression, to ensure figure Succinctly, clearly.Check that the operation of Rule content is also very simple, mouse is moved on the circle for indicating capitalization " C ", then can be certainly Dynamic displaying standard schedule list.
In on stream compression circuit, representing the data fit to circulate on the circuit, these are marked the simple pattern bit of cleaning rule Standard could continue circulation and go down.
5th, transformation rule node 140
Transformation rule node is similar to cleaning rule node in the form of expression, the circle table for indicating bigger alphabetical " T " with one Show.On stream compression circuit, for showing the variation occurred during stream compression, transformation.
The data come out from data providing, it is sometimes necessary to demand data side can be just linked by carrying out specially treated, this Kind processing may be fairly simple, such as:Only intercept first four of source data.May also be extremely complex, it is special to need to use Formula.In terms of visualization, in order to ensure the succinct, clear of figure, simple processing has been done.Check that data have passed through those conversions Rule is also very simple, and mouse is moved on the circle for indicating capitalization " T ", then can automatic Display transformation rule inventory.
6th, loading rule node 150
Polices node is loaded for illustrating data are how to be put in storage.The loading of data is divided into full dose loading and step increment method.
Loading strategy is represented on visualized graphs with a circle for indicating capitalization " L ".Check specific loading Policing action is also very simple, and mouse is moved on the circle for indicating capitalization " L ", then can automatic Display loading strategy.
7th, regular node 160 is handled
Data have life cycle, and when data no longer have use value, his life just finishes either filing or straight Outbound is ruined.
It is extremely difficult to judge whether data are also equipped with use value, needs to design some conditions, when these conditions are met After, it is considered as data and is not having use value, can file or destroy.
On visualized graphs, we devise a circle for indicating capitalization " R ", for simple expression data Filing and destruction rule.Mouse is moved on the circle for indicating capitalization " R ", then can automatic Display filing and to destroy rule clear It is single.
Embodiment 2
The visualization of genetic connection is a more complicated process, can be referred to currently without molding visualized graphs, I This genetic connection visualized graphs component for designing, the genetic connections of data can be clearly expressed, to the data of tissue It administers helpful.
The effect of data genetic connection, to sum up having following aspects:
1. data are traced to the source
It traces to the source, refers to seeking foundation, source.The data that we analyze and process, possible source is very extensive, there is government Data have the data of internet, there is the data obtained by data trade from third party, also owned data.It is different The data in source, the quality of data is irregular, is also not quite similar on the result influence of analyzing and processing.When data are abnormal, I Need to track abnormal the reason of occurring, risk control in appropriate level.
The genetic connection of data embodies the ins and outs of data, us can be helped to track the source of data, tracks data Processing procedure.On the genetic connection visualized graphs of data, the left side of host node is exactly data source node, very clearly, It is very clear.Data have passed through those conversions and can also be found out from visualized graphs, the analysis to abnormal data producing cause It is very useful.
2. assess data value
The value of data is very important in data trade field, is related to the price of data.Data value is assessed, Require foundation.Data genetic connection can provide foundation in terms of several to the assessment of data value:
1), data audient.On genetic connection figure, the data flow egress on the right represents audient, that is, demand data side, data Party in request's more multilist shows that data value is bigger;
2), data update magnitude.In data genetic connection figure, the lines of stream compression circuit are thicker, represent the amount of data update Grade is bigger, has reacted the size of data value to a certain extent;
3), the data update frequency.Data update is more frequent, represents that data are more fresh and alive, value is higher.On genetic connection figure, number Line segment according to circulation circuit is shorter, and update is more frequent.
3. data quality accessment
From the genetic connection figure of data, it may be convenient to see the standard schedule of data cleansing, this inventory has reacted logarithm According to the requirement of quality.
4. data filing, the reference destroyed
If data, without audient, data just lose the value used.From the genetic connection figure of data, rightmost does not have There is back end, it is possible to go whether the data representated by assessment host node will file or destroy.
Embodiment 3
Cleaning rule is as follows:
1. judging whether there is Chinese character in content, if there is Chinese character, true is returned, otherwise returns to false;
2. judging whether content is integer, if it is integer, true is returned, otherwise returns to false;
3. judging whether content is percentage, if it is percentage, true is returned, otherwise returns to false;
4. if Json character strings are not [] and unserializing success, true is returned, otherwise returns to false;
5. judge that content whether by entering the Character segmentation of ginseng, if the Character segmentation by entering ginseng, returns to true, otherwise returns false;
6. judging numerical value whether in specified value domain, if in specified value domain, true is returned, otherwise returns to false;
7. registered permanent residence classification verifies, if value returns to TRUE in the range of code value and otherwise returns to false;
8. personnel's Employment verifies, if value returns to TRUE in the range of code value and otherwise returns to false;
9. labour capacity classification check, if value returns to TRUE in the range of code value and otherwise returns to false;
10. family relationship classification check, if value returns to TRUE in the range of code value and otherwise returns to false;
11. health status classification check, if value returns to TRUE in the range of code value and otherwise returns to false;
12. organization's coding checkout, if value returns to TRUE in the range of code value and otherwise returns to false;
13. power classification verifies, if value returns to TRUE in the range of code value and otherwise returns to false;
14. political affiliation code check, if value returns to TRUE in the range of code value and otherwise returns to false;
15. academic code check, if value returns to TRUE in the range of code value and otherwise returns to false;
16. types of identity documents verifies, if value returns to TRUE in the range of code value and otherwise returns to false;
17. accounting profession technical qualification type checking, if value returns to TRUE in the range of code value and otherwise returns to false;
18. Unit code verification is made a report on, if value returns to TRUE in the range of code value and otherwise returns to false;
19. administrative penalty current state verifies, if value returns to TRUE in the range of code value and otherwise returns to false;
20. grain index verifies, if value returns to TRUE in the range of code value and otherwise returns to false;
21. degree of breaking one's promise verifies, if value returns to TRUE in the range of code value and otherwise returns to false;
If return is TRUE, data receiver is sent the data to, if it is false to return, by the data mistake Filter, is not sent to data receiving.
Embodiment 4
Transformation rule is as follows:
1st, the Chinese and number in character string are detached, returns to the character string of specified type;
2nd, number is converted into currency style, returns to the number of currency style;
3rd, text repeats to show multiple, returns to text repeatedly;
4th, the blank character at removal character string both ends, if becoming empty or null, returns to null, otherwise returns to former character string;
5th, clear character string if former character string is ended up with clear character string, removes this ending, after being then back to removal Character string, otherwise return to original character string;
6th, conversion time character string is specified format, returns to the character string of specified format;
7th, value type is converted into percentage, returns to the percentage of specified digit;
8th, by value type(Comprising, e, E)Standard attribute is converted into, returns to the number of standard attribute;
9th, null will be converted to for null value in database table;
10th, the character string is directly taken if character string is digit strings, if the data that scientific notation mode records It needs to be converted to corresponding data character string;
11st, gender information is obtained according to ID card No., returns to the gender information after obtaining;
12nd, date of birth information is obtained according to ID card No., returns to the date of birth information after obtaining;
13rd, according to total score, practical scoring is converted to point system specified.
Embodiment 5
The loading rule includes as follows:
1st, paging full dose extracts data, and step increment method is realized according to the period of loading policy definition;
2nd, comparison loading data and target data still do not update target data according to the update of the loading strategy decision of configuration.
Embodiment 6
As shown in Fig. 2, the hierarchical structure of structural data genetic connection being stored in database of description, is most typical A kind of hierarchical structure of genetic connection.For different types of data, the hierarchical structure of genetic connection has fine distinction.
In general, data, which all belong to some tissue or someone, data, the owner.Data are in different institutes It circulates, merge between the person of having, form a kind of relationship connected between the owner by data, be the one of data genetic connection Kind, top layer is in hierarchical structure.This relationship has clearly showed supplier and the demander of data.
Database, table and field are the storage organizations of data.Different types of data have different storage organizations.Storage The structures shape hierarchical structure of genetic connection.So some difference of the genetic connection hierarchical structure of different types of data.Example Such as, for the data stored with file server, the hierarchical structure of genetic connection is as shown in Figure 3.
The genetic connection of different levels data embodies different meanings.Owner's level embodies the provider of data And party in request, other levels then embody the ins and outs of data.By the genetic connection of different levels, can will be apparent that Understand data migrates circulation, and the management of assessment, data for data value provides foundation.
For a better understanding of the present invention, it is described in detail above in association with specific embodiments of the present invention, but is not Limitation of the present invention.Any simple modification made to the above embodiment of every technical spirit according to the present invention, still belongs to In the range of technical solution of the present invention.In this specification the highlights of each of the examples are it is different from other embodiments it Locate, the same or similar part cross-reference between each embodiment.For system embodiment, due to itself and method Embodiment corresponds to substantially, so description is fairly simple, the relevent part can refer to the partial explaination of embodiments of method.

Claims (10)

1. a kind of data genetic connection visualized graphs system in data improvement, including information node, which is characterized in that also wrap Containing with lower module:
Stream compression circuit:Refer to the path of the stream compression;
It extracts at least one in polices node, cleaning rule node, transformation rule node, loading rule node and processing regular node Kind node;
The extraction polices node is used to illustrate how data extract;
The cleaning rule node is used to represent the screening criteria of the data during the stream compression;
The transformation rule node is used to represent the variation standard of the data during the stream compression;
The loading rule node is used to illustrate how data are put in storage;
The processing regular node is used to represent the data filing or destruction.
2. the data genetic connection visualized graphs system in data improvement according to claim 1, it is characterised in that:Institute Information node is stated for showing the owner of data and data hierarchical information or end message.
3. the data genetic connection visualized graphs system in data improvement according to claim 2, it is characterised in that:Institute It states information node and includes at least one of host node, data flow egress and data inflow node.
4. the data genetic connection visualized graphs system in data improvement according to claim 3, it is characterised in that:Institute The center that host node is located at the visualized graphs is stated, is the core node of the visualized graphs.
5. the data genetic connection visualized graphs system in data improvement according to claim 4, it is characterised in that:Institute The father node that data flow ingress is the host node is stated, data source is represented, includes at least one in the visualized graphs A data flow ingress.
6. the data genetic connection visualized graphs system in data improvement according to claim 5, it is characterised in that:Institute The child node that data flow egress is the host node is stated, the whereabouts of data is represented, includes at least in the visualized graphs One data flow egress.
7. the data genetic connection visualized graphs system in data improvement according to claim 6, it is characterised in that:Eventually End node is a kind of special data flow egress, and mark data no longer circulates down.
8. the data genetic connection visualized graphs system in data improvement according to claim 6, it is characterised in that:Institute It states stream compression circuit out to converge toward the host node from the data flow ingress, and is flowed out from the host node toward described Data flow egress is spread.
9. the data genetic connection visualized graphs system in data improvement according to claim 8, it is characterised in that:Institute It states data and spreads the information that circuit table reveals at least one dimension in direction, data update magnitude and the data update frequency.
10. the data genetic connection visualized graphs system in data improvement according to claim 9, it is characterised in that: In the visualized graphs, the data update magnitude is showed by the thickness of lines.
CN201711383801.0A 2017-12-20 2017-12-20 Data genetic connection visualized graphs system in data improvement Pending CN108228747A (en)

Priority Applications (1)

Application Number Priority Date Filing Date Title
CN201711383801.0A CN108228747A (en) 2017-12-20 2017-12-20 Data genetic connection visualized graphs system in data improvement

Applications Claiming Priority (1)

Application Number Priority Date Filing Date Title
CN201711383801.0A CN108228747A (en) 2017-12-20 2017-12-20 Data genetic connection visualized graphs system in data improvement

Publications (1)

Publication Number Publication Date
CN108228747A true CN108228747A (en) 2018-06-29

Family

ID=62652621

Family Applications (1)

Application Number Title Priority Date Filing Date
CN201711383801.0A Pending CN108228747A (en) 2017-12-20 2017-12-20 Data genetic connection visualized graphs system in data improvement

Country Status (1)

Country Link
CN (1) CN108228747A (en)

Cited By (10)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN109241142A (en) * 2018-09-07 2019-01-18 南京中新赛克科技有限责任公司 Data genetic connection calculation method based on flow engine
CN109815378A (en) * 2019-01-31 2019-05-28 三盟科技股份有限公司 A kind of data tracing method and system based on metadata link
CN110083639A (en) * 2019-04-25 2019-08-02 中电科嘉兴新型智慧城市科技发展有限公司 A kind of method and device that the data blood relationship based on clustering is intelligently traced to the source
CN110704699A (en) * 2019-09-06 2020-01-17 中国平安财产保险股份有限公司 Data image construction method and device, computer equipment and storage medium
CN110807026A (en) * 2019-10-24 2020-02-18 北京中科捷信信息技术有限公司 Automatic capture system for analyzing financial big data blood relationship
CN111754123A (en) * 2020-06-28 2020-10-09 深圳壹账通智能科技有限公司 Data monitoring method and device, computer equipment and storage medium
CN112463978A (en) * 2020-11-13 2021-03-09 上海逸迅信息科技有限公司 Method and device for generating data blood relationship
CN113722310A (en) * 2021-09-16 2021-11-30 北京航空航天大学 Blood relationship information visual representation method
CN114428822A (en) * 2022-01-27 2022-05-03 云启智慧科技有限公司 Data processing method and device, electronic equipment and storage medium
CN115203179A (en) * 2022-05-16 2022-10-18 北京航空航天大学 Data cleaning method and device

Citations (6)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN101075304A (en) * 2006-05-18 2007-11-21 河北全通通信有限公司 Method for constructing decision supporting system of telecommunication industry based on database
CN102325170A (en) * 2011-08-24 2012-01-18 无锡中科方德软件有限公司 Data extraction and integration method and system thereof
CN103714133A (en) * 2013-12-17 2014-04-09 华为软件技术有限公司 Data operation and maintenance management method and device
WO2015041714A1 (en) * 2013-09-19 2015-03-26 Platfora, Inc. Interest-driven business intelligence systems including event-oriented data
CN104881427A (en) * 2015-04-01 2015-09-02 北京科东电力控制系统有限责任公司 Data blood relationship analyzing method for power grid regulation and control running
CN107256247A (en) * 2017-06-07 2017-10-17 九次方大数据信息集团有限公司 Big data data administering method and device

Patent Citations (6)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN101075304A (en) * 2006-05-18 2007-11-21 河北全通通信有限公司 Method for constructing decision supporting system of telecommunication industry based on database
CN102325170A (en) * 2011-08-24 2012-01-18 无锡中科方德软件有限公司 Data extraction and integration method and system thereof
WO2015041714A1 (en) * 2013-09-19 2015-03-26 Platfora, Inc. Interest-driven business intelligence systems including event-oriented data
CN103714133A (en) * 2013-12-17 2014-04-09 华为软件技术有限公司 Data operation and maintenance management method and device
CN104881427A (en) * 2015-04-01 2015-09-02 北京科东电力控制系统有限责任公司 Data blood relationship analyzing method for power grid regulation and control running
CN107256247A (en) * 2017-06-07 2017-10-17 九次方大数据信息集团有限公司 Big data data administering method and device

Non-Patent Citations (1)

* Cited by examiner, † Cited by third party
Title
数据大家: ""数据之间有血缘关系?数据治理不得不懂的血缘关系梳理方法"", 《HTTPS://WWW.SOHU.COM/A/161142366_99934777》 *

Cited By (12)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN109241142A (en) * 2018-09-07 2019-01-18 南京中新赛克科技有限责任公司 Data genetic connection calculation method based on flow engine
CN109815378A (en) * 2019-01-31 2019-05-28 三盟科技股份有限公司 A kind of data tracing method and system based on metadata link
CN110083639A (en) * 2019-04-25 2019-08-02 中电科嘉兴新型智慧城市科技发展有限公司 A kind of method and device that the data blood relationship based on clustering is intelligently traced to the source
CN110083639B (en) * 2019-04-25 2023-03-10 中电科嘉兴新型智慧城市科技发展有限公司 Intelligent data blood source tracing method and device based on cluster analysis
CN110704699A (en) * 2019-09-06 2020-01-17 中国平安财产保险股份有限公司 Data image construction method and device, computer equipment and storage medium
CN110807026A (en) * 2019-10-24 2020-02-18 北京中科捷信信息技术有限公司 Automatic capture system for analyzing financial big data blood relationship
CN111754123A (en) * 2020-06-28 2020-10-09 深圳壹账通智能科技有限公司 Data monitoring method and device, computer equipment and storage medium
CN112463978A (en) * 2020-11-13 2021-03-09 上海逸迅信息科技有限公司 Method and device for generating data blood relationship
CN113722310A (en) * 2021-09-16 2021-11-30 北京航空航天大学 Blood relationship information visual representation method
CN114428822A (en) * 2022-01-27 2022-05-03 云启智慧科技有限公司 Data processing method and device, electronic equipment and storage medium
CN114428822B (en) * 2022-01-27 2022-07-29 云启智慧科技有限公司 Data processing method and device, electronic equipment and storage medium
CN115203179A (en) * 2022-05-16 2022-10-18 北京航空航天大学 Data cleaning method and device

Similar Documents

Publication Publication Date Title
CN108228747A (en) Data genetic connection visualized graphs system in data improvement
Rousseau et al. Not so different after all: A cross-discipline view of trust
Song et al. Cluster analysis of the intellectual structure of PPP research
Brandes et al. Network analysis of collaboration structure in Wikipedia
Rosenbloom et al. Interface terminologies: facilitating direct entry of clinical data into electronic health record systems
Smith Governing data and data for governance: the everyday practice of Indigenous sovereignty
Sun et al. Understanding health information technology adoption: A synthesis of literature from an activity perspective
CN110019176A (en) Improve the data improvement control system that data administer service success rate
An et al. Topical evolution patterns and temporal trends of microblogs on public health emergencies: an exploratory study of Ebola on Twitter and Weibo
Griffiths Making connections: studies of the social organisation of healthcare
Savola et al. A visualization and modeling tool for security metrics and measurements management
Sun et al. Patient cluster divergence based healthcare insurance fraudster detection
Kaufman et al. Cognitive differences in human and AI explanation
Ganguly et al. A review of the role of causality in developing trustworthy ai systems
Li et al. The evolution of collaborative networks: A social network analysis of Chinese environmental protection policy
Liu et al. Analysis of influencing factors in emergency management based on an integrated methodology
Brahimi et al. Mapping the Scientific Landscape of Metaverse Using VOSviewer and Bibliometrix
Owda et al. Financial discussion boards irregularities detection system (fdbs-ids) using information extraction
CN107680690B (en) Clinical information system based on metadata
Odell et al. Detecting shifts in metropolitan structure: A spatial network perspective
US9286381B2 (en) Disjoint partial-area based taxonomy abstraction network
Bodnari et al. MCORES: a system for noun phrase coreference resolution for clinical records
Pur et al. Primary health‐care network monitoring: a hierarchical resource allocation modeling approach
CN114612246A (en) Object set identification method and device, computer equipment and storage medium
Noon et al. Implementation of blockchain in healthcare: a systematic review

Legal Events

Date Code Title Description
PB01 Publication
PB01 Publication
SE01 Entry into force of request for substantive examination
SE01 Entry into force of request for substantive examination
RJ01 Rejection of invention patent application after publication

Application publication date: 20180629

RJ01 Rejection of invention patent application after publication