A kind of administrative object distribution formula big data association analysis method based on body
Technical field
The present invention relates to E-government Information Resource Development technical field, more particularly to a kind of administrative object based on body
Distributed big data association analysis method.
Background technology
First, the ontology originating from philosophy (ontology) is received significant attention in information science field, its importance
Embody in many aspects, such as knowledge engineering, database design and integrated, information retrieval and acquisition, soft project, natural language
Speech processing etc..Especially application of the ontology on Web causes the birth of Semantic Web, is expected to solve language when Web information is shared
Adopted problem, realizes worldwide Knowledge information integration;Ontology mature itself, but also it is short of the sheet to administrative object
Volume modeling and analysis.
Second, for administrative object big data analysis technology, mainly include at this stage:
A) administrative object:The objective subjects such as legal person corresponding with government administration behavior, natural person;
B) big data analysis technology:Current big data research has become the great of future technology and socio-economic development
Strategic field, big data analysis have the characteristics that data volume is big, query analysis is complicated compared to traditional data warehouse applications;
By the e-government development of nearly 20 years, government information resources rapid expansion, by taking Shanghai City as an example, a medium district level is administrative
The information resources of unit have reached tens GB even more than, whole district county's government data estimates nearly 50TB, whole city's data then mistake
PB, and the trend of exponential increase is also presented in data, these government information resources by scattered storage, it is independent utilize, it is difficult to integrate point
Analysis, the utilization rate of government information resources is with there is very big vacancy using depth.
In conclusion currently with technical difficulty mainly have at 3 points:
A) data disperse to store, because the odjective causes such as administrative barrier exist, it is difficult to be integrated;
B) data deficiency top layer framework, Data Identification, data structure and attribute description disunity, cause can not directly with
Unified administration object coding mode accesses use;
C) data volume is big, increases soon, and existing database, data warehouse technology have been difficult to analyze available data.
The major defect of prior art includes:Bulk process is not introduced into administrative object information modeling;It will not divide
Cloth big data analysis is introduced into administrative object big data association analysis;Not by bulk process and distributed big data analysis method
With reference to.
Thus, in view of the above, needing effectively to innovate the prior art.
The content of the invention
For disadvantages described above, the present invention provides one kind and can reduce enforcement difficulty and implementation cost, can solve administrative object letter
Cease correlation model underlying issue and failure is active again, is conducive to improve the administrative object distribution formula based on body of efficiency of the practice
Big data association analysis method.
To achieve the above object, the present invention uses following technical scheme:
A kind of administrative object distribution formula big data association analysis method based on body, comprises the steps of:
(1) administrative subject body modeling is carried out respectively:Including natural person and legal person;
(2) host node is constructed in order to realize each distributed semantic gateway connection, big data processing unified management function, specifically
Mode is:In configuration management department design host node top layer semantic net, host node big data process demand, distributed computing platform
Set, downstream site interface;
Host node semantic net therein is realized the semantic information associative search function of each node level semantic net;
Host node big data process demand therein realizes the main algorithm of big data analysis, calculates and appoints to the distribution of each partial node
Business, recycles result of calculation;
(3) it is last, child node is constructed, in two level administrative department design node level semantic net, node big data processing platform,
Receive and feed back host node and calculate demand.
Node level semantic net therein realizes this node semantic information retrieval function.
Node big data processing platform therein is used for realization big data analysis node algorithm, receives host node and calculates and appoints
Business, submits result of calculation.
Administrative object distribution formula big data association analysis method of the present invention based on body has the beneficial effect that:
(1) the big concentration of government data need not be implemented, reduce enforcement difficulty and implementation cost;
(2) be conducive to establish administrative object unifying identifier, unified structure and unified attribute description method, solve administrative object
Information association model underlying issue, while failure is not active again;
(3), using the distributed big data analysis processing frame with reference to body, it is big that the lower administrative object information of scattered storage is solved
Data analysis problems, reduce overall software and hardware capital investment requirements, improve efficiency of the practice.
Brief description of the drawings
The present invention is described in further detail below according to attached drawing.
Fig. 1 is the flow of the administrative object distribution formula big data association analysis method based on body described in the embodiment of the present invention
Schematic diagram.
Embodiment
As shown in Figure 1, the administrative object distribution formula big data association analysis side based on body described in the embodiment of the present invention
Method, mainly comprises the steps of:
(1) by taking civil administration administration object as an example, the modeling of civil administration administration subject body is carried out respectively:Including natural person, legal person;
(2) host node is constructed in order to realize each distributed semantic gateway connection, big data processing unified management function, specifically
Mode is:In configuration management department of Department of Civil Affairs design host node top layer semantic net, host node big data process demand, distributed meter
Calculate model setting, downstream site interface;
Host node semantic net therein is realized the semantic information associative search function of each node level semantic net;
Host node big data process demand therein realizes the main algorithm of big data analysis, calculates and appoints to the distribution of each partial node
Business, recycles result of calculation;
(3) it is last, child node is constructed, in two level administrative department of Department of Civil Affairs design node level semantic net, the processing of node big data
Platform, receives and feeds back host node calculating demand, and node level semantic net therein realizes this node semantic information retrieval function, its
In node big data processing platform be used for realization big data analysis node algorithm, receive host node calculating task, submit and calculate
As a result.
Ontology information and file in figure, all kinds of ontology informations and file for being stored in host node;Administrative unit's node, it is real
The now function such as this node semantic net, big data processing;Associated interface, the information bridges and interoperability for realizing host node and child node connect
Mouthful.
Above example is one kind of the present invention more preferably embodiment, and those skilled in the art are in the technical program
In the range of the usual variations and alternatives that carry out should include it is within the scope of the present invention.