CN108345660A - A data analysis method based on government data
- Publication number
- CN108345660A (application number CN201810096097.9A)
- Authority
- CN
- China
- Prior art keywords
- data
- government
- analysis
- field
- analysing method
- Prior art date
- Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
- Pending
Classifications
-
- G—PHYSICS
- G06—COMPUTING; CALCULATING OR COUNTING
- G06F—ELECTRIC DIGITAL DATA PROCESSING
- G06F16/00—Information retrieval; Database structures therefor; File system structures therefor
- G06F16/30—Information retrieval; Database structures therefor; File system structures therefor of unstructured textual data
- G06F16/35—Clustering; Classification
-
- G—PHYSICS
- G06—COMPUTING; CALCULATING OR COUNTING
- G06F—ELECTRIC DIGITAL DATA PROCESSING
- G06F16/00—Information retrieval; Database structures therefor; File system structures therefor
- G06F16/20—Information retrieval; Database structures therefor; File system structures therefor of structured data, e.g. relational data
- G06F16/24—Querying
- G06F16/245—Query processing
- G06F16/2458—Special types of queries, e.g. statistical queries, fuzzy queries or distributed queries
- G06F16/2462—Approximate or statistical queries
-
- G—PHYSICS
- G06—COMPUTING; CALCULATING OR COUNTING
- G06F—ELECTRIC DIGITAL DATA PROCESSING
- G06F16/00—Information retrieval; Database structures therefor; File system structures therefor
- G06F16/20—Information retrieval; Database structures therefor; File system structures therefor of structured data, e.g. relational data
- G06F16/24—Querying
- G06F16/245—Query processing
- G06F16/2458—Special types of queries, e.g. statistical queries, fuzzy queries or distributed queries
- G06F16/2465—Query processing support for facilitating data mining operations in structured databases
-
- G—PHYSICS
- G06—COMPUTING; CALCULATING OR COUNTING
- G06F—ELECTRIC DIGITAL DATA PROCESSING
- G06F16/00—Information retrieval; Database structures therefor; File system structures therefor
- G06F16/30—Information retrieval; Database structures therefor; File system structures therefor of unstructured textual data
- G06F16/36—Creation of semantic tools, e.g. ontology or thesauri
- G06F16/367—Ontology
-
- G—PHYSICS
- G06—COMPUTING; CALCULATING OR COUNTING
- G06Q—INFORMATION AND COMMUNICATION TECHNOLOGY [ICT] SPECIALLY ADAPTED FOR ADMINISTRATIVE, COMMERCIAL, FINANCIAL, MANAGERIAL OR SUPERVISORY PURPOSES; SYSTEMS OR METHODS SPECIALLY ADAPTED FOR ADMINISTRATIVE, COMMERCIAL, FINANCIAL, MANAGERIAL OR SUPERVISORY PURPOSES, NOT OTHERWISE PROVIDED FOR
- G06Q50/00—Information and communication technology [ICT] specially adapted for implementation of business processes of specific business sectors, e.g. utilities or tourism
- G06Q50/10—Services
- G06Q50/26—Government or public services
Landscapes
- Engineering & Computer Science (AREA)
- Theoretical Computer Science (AREA)
- Physics & Mathematics (AREA)
- Databases & Information Systems (AREA)
- General Physics & Mathematics (AREA)
- General Engineering & Computer Science (AREA)
- Data Mining & Analysis (AREA)
- Business, Economics & Management (AREA)
- Computational Linguistics (AREA)
- Probability & Statistics with Applications (AREA)
- Tourism & Hospitality (AREA)
- Fuzzy Systems (AREA)
- Mathematical Physics (AREA)
- Software Systems (AREA)
- General Health & Medical Sciences (AREA)
- General Business, Economics & Management (AREA)
- Strategic Management (AREA)
- Primary Health Care (AREA)
- Marketing (AREA)
- Human Resources & Organizations (AREA)
- Economics (AREA)
- Health & Medical Sciences (AREA)
- Educational Administration (AREA)
- Development Economics (AREA)
- Life Sciences & Earth Sciences (AREA)
- Animal Behavior & Ethology (AREA)
- Management, Administration, Business Operations System, And Electronic Commerce (AREA)
Abstract
The invention discloses a data analysis method based on government data, belonging to the technical field of big data information processing. The method overlays, compares, and performs association analysis on government data catalogs from different dimensions and multiple angles, carries out data analysis and mining, explores the associations between data, discovers new patterns and value among the data, and forms an analysis report that provides a basis for government decision-making. Working from open and shared government data, the method can explore associations between data, discover the value among the data, and form an analysis report that provides supporting opinions for government decision-making, and it has good application value.
Description
Technical field
The present invention relates to the technical field of big data information processing, and specifically provides a data analysis method based on government data.
Background art
With the rapid development of society and the economy, demand for data in every sector of society keeps growing, and the big data era has quietly arrived. With its arrival, government departments increasingly emphasize using technical means to mine the deep value of data resources, in order to meet growing needs for fine-grained, scientific management. The volume of data that governments open and share keeps increasing, and so does the demand for analyzing, mining, and exploring the value of that data. However, the prior art still lacks a method for effectively analyzing government data.
Summary of the invention
The technical task of the present invention is to address the above problem by providing a data analysis method based on government data that can explore the associations between data on the basis of open and shared government data, discover the value among the data, and form an analysis report, thereby providing supporting opinions for government decision-making.
To achieve the above object, the present invention provides the following technical solution:
A data analysis method based on government data, in which government data catalogs are overlaid, compared, and subjected to association analysis from different dimensions and multiple angles; data analysis and mining are carried out, the associations between data are explored, new patterns and value among the data are discovered, and an analysis report is formed that provides a basis for government decision-making.
Preferably, the government data catalog is a data catalog that the government has opened or shared.
Preferably, two or more data catalogs are selected from the open or shared government data catalogs, the data are divided into several data sets, and the association relationships between the data catalogs are analyzed from different dimensions.
Preferably, the data catalogs are selected on the basis of data classification, industry classification, department classification, and subject classification.
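The selection step described above can be sketched in Python. This is a minimal illustration under stated assumptions, not the patented implementation: the `CatalogEntry` record, its field names, the `select_catalogs` helper, and the sample entries are all hypothetical and only mirror the four classifications named in the text.

```python
from dataclasses import dataclass


@dataclass(frozen=True)
class CatalogEntry:
    """Hypothetical metadata record for one government data catalog."""
    name: str
    data_class: str   # data classification
    industry: str     # industry classification
    department: str   # department classification
    subject: str      # subject classification


def select_catalogs(entries, **criteria):
    """Return the entries whose attributes equal every given criterion."""
    return [
        entry for entry in entries
        if all(getattr(entry, key) == value for key, value in criteria.items())
    ]


# Made-up sample catalogs, used only for illustration.
catalogs = [
    CatalogEntry("bus_routes", "table", "transport", "transport_bureau", "mobility"),
    CatalogEntry("road_works", "table", "transport", "housing_bureau", "mobility"),
    CatalogEntry("clinics", "table", "health", "health_commission", "public_health"),
]

# Pick the catalogs that share an industry and a subject classification.
selected = select_catalogs(catalogs, industry="transport", subject="mobility")
print([entry.name for entry in selected])
```

Filtering on exact attribute matches is the simplest reading of "selection on the basis of classification"; a real catalog service might instead support hierarchical or multi-valued classifications.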
Preferably, the different dimensions include label graph, structure comparison, field graph, trend prediction, content comparison, distribution overlay, feature clustering, association analysis, and location clustering:
(1) Label graph: labels are defined for each data catalog in advance to state the content it mainly contains, so that associated data sets can be found quickly;
(2) Structure comparison: structural differences are compared across the basic information, data format, time frequency, and calling method of the data catalogs;
(3) Field graph: identical fields are analyzed and displayed with visualization charts;
(4) Trend prediction: multi-field statistics on the data sets, trend analysis of continuous data, and overlaid visual display;
(5) Content comparison: primary-key setting, and comparative analysis of different fields between one data set and another;
(6) Distribution overlay: multi-field statistics on the data sets, analysis of discrete distributions, and overlaid visual display;
(7) Feature clustering: cluster analysis on a single dimension of the data or on specific data vectors;
(8) Association analysis: labeling (calibration) of discrete data and model training, for automatic classification;
(9) Location clustering: map overlay and statistics by region on multiple types of geographic coordinate fields.
Preferably, the visualization charts in the field graph include chord diagrams, force-directed graphs, relationship network graphs, and tree diagrams.
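Items (1) and (2) of the list above can be illustrated with a short Python sketch. It assumes each catalog carries a set of predefined labels plus a few structural attributes; the catalog names, the attribute keys (`format`, `frequency`, `access`), and both helper functions are hypothetical and are not taken from the patent.

```python
from itertools import combinations

# Hypothetical catalogs: predefined labels plus structural metadata.
catalogs = {
    "bus_routes": {
        "labels": {"transport", "geo"},
        "format": "csv", "frequency": "monthly", "access": "download",
    },
    "metro_stations": {
        "labels": {"transport", "geo", "facilities"},
        "format": "json", "frequency": "yearly", "access": "api",
    },
    "clinics": {
        "labels": {"health", "facilities"},
        "format": "csv", "frequency": "yearly", "access": "download",
    },
}


def label_graph(catalogs):
    """Map each pair of catalogs to the labels they share.

    Pairs with no label in common are omitted, so the result is the edge
    list of a graph linking associated data sets.
    """
    edges = {}
    for a, b in combinations(sorted(catalogs), 2):
        shared = catalogs[a]["labels"] & catalogs[b]["labels"]
        if shared:
            edges[(a, b)] = shared
    return edges


def structure_diff(catalogs, a, b, keys=("format", "frequency", "access")):
    """Return the structural attributes on which catalogs a and b differ."""
    return {
        key: (catalogs[a][key], catalogs[b][key])
        for key in keys
        if catalogs[a][key] != catalogs[b][key]
    }


edges = label_graph(catalogs)
diff = structure_diff(catalogs, "bus_routes", "clinics")
print(edges)
print(diff)
```

The edge list produced by `label_graph` is the kind of input a chord diagram or force-directed graph could be drawn from, although the sketch stops short of any rendering.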
Compared with the prior art, the data analysis method based on government data of the present invention has the following notable beneficial effects: the method selects two or more data catalogs from the open or shared government data catalogs, performs association analysis on them from different dimensions and multiple angles, explores the associations between data, discovers the value among the data, and forms an analysis report, thereby providing supporting opinions for government decision-making; it has good value for popularization and application.
Brief description of the drawings
Fig. 1 is a flow chart of the data analysis method based on government data according to the present invention.
Detailed description of the embodiments
The data analysis method based on government data of the present invention is described in further detail below with reference to an embodiment.
Embodiment
As shown in Fig. 1, the data analysis method based on government data of the present invention selects two or more data catalogs from the open or shared government data catalogs and divides the data into several data sets. It then overlays, compares, and performs association analysis on the open or shared government data catalogs from different dimensions and multiple angles, carries out data analysis and mining, explores the associations between data, discovers new patterns and value among the data, and forms an analysis report that provides a basis for government decision-making.
The data catalogs are selected on the basis of data classification, industry classification, department classification, and subject classification.
The different dimensions include label graph, structure comparison, field graph, trend prediction, content comparison, distribution overlay, feature clustering, association analysis, and location clustering, wherein:
(1) Label graph: labels are defined for each data catalog in advance to state the content it mainly contains, so that associated data sets can be found quickly;
(2) Structure comparison: structural differences are compared across the basic information, data format, time frequency, and calling method of the data catalogs;
(3) Field graph: identical fields are analyzed and displayed with visualization charts; the visualization charts include chord diagrams, force-directed graphs, relationship network graphs, and tree diagrams;
(4) Trend prediction: multi-field statistics on the data sets, trend analysis of continuous data, and overlaid visual display;
(5) Content comparison: primary-key setting, and comparative analysis of different fields between one data set and another;
(6) Distribution overlay: multi-field statistics on the data sets, analysis of discrete distributions, and overlaid visual display;
(7) Feature clustering: cluster analysis on a single dimension of the data or on specific data vectors;
(8) Association analysis: labeling (calibration) of discrete data and model training, for automatic classification;
(9) Location clustering: map overlay and statistics by region on multiple types of geographic coordinate fields.
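Two of the dimensions listed above, content comparison (5) and location clustering (9), lend themselves to a brief Python sketch. The patent does not specify any algorithm, so everything here is an assumption made for illustration: the records, the field names, the coordinates, the `compare_by_key` and `count_by_grid_cell` helpers, and the use of a fixed square grid as a stand-in for "statistics by region".

```python
import math
from collections import Counter


def compare_by_key(left, right, key, fields):
    """Join two data sets on a primary key and report fields whose values differ."""
    right_index = {row[key]: row for row in right}
    differences = {}
    for row in left:
        other = right_index.get(row[key])
        if other is None:
            continue  # key present only in the left data set
        changed = {f: (row[f], other[f]) for f in fields if row[f] != other[f]}
        if changed:
            differences[row[key]] = changed
    return differences


def count_by_grid_cell(points, cell_size):
    """Count (longitude, latitude) points per square grid cell."""
    return Counter(
        (math.floor(lon / cell_size), math.floor(lat / cell_size))
        for lon, lat in points
    )


# Hypothetical data sets that share the primary key "id".
registry = [
    {"id": 1, "name": "Clinic A", "beds": 20},
    {"id": 2, "name": "Clinic B", "beds": 35},
]
survey = [
    {"id": 1, "name": "Clinic A", "beds": 25},
    {"id": 2, "name": "Clinic B", "beds": 35},
    {"id": 3, "name": "Clinic C", "beds": 10},
]
diffs = compare_by_key(registry, survey, key="id", fields=("name", "beds"))

# Hypothetical coordinate fields from two different data sets, overlaid
# on one grid before counting.
schools = [(117.02, 36.65), (117.08, 36.61)]
clinics = [(117.05, 36.68), (117.31, 36.72)]
cells = count_by_grid_cell(schools + clinics, cell_size=0.1)

print(diffs)
print(cells)
```

A fixed grid keeps the example dependency-free; administrative boundaries or a density-based clustering method would be equally plausible readings of the text.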
The embodiment described above is only a preferred specific implementation of the present invention; ordinary changes and substitutions made by those skilled in the art within the scope of the technical solution of the present invention shall all fall within the scope of protection of the present invention.
Claims (6)
1. A data analysis method based on government data, characterized in that: the data analysis method overlays, compares, and performs association analysis on government data catalogs from different dimensions and multiple angles, carries out data analysis and mining, explores the associations between data, discovers new patterns and value among the data, and forms an analysis report that provides a basis for government decision-making.
2. The data analysis method based on government data according to claim 1, characterized in that: the government data catalog is a data catalog that the government has opened or shared.
3. The data analysis method based on government data according to claim 2, characterized in that: two or more data catalogs are selected from the open or shared government data catalogs, the data are divided into several data sets, and the association relationships between the data catalogs are analyzed from different dimensions.
4. The data analysis method based on government data according to claim 3, characterized in that: the data catalogs are selected on the basis of data classification, industry classification, department classification, and subject classification.
5. The data analysis method based on government data according to claim 4, characterized in that: the different dimensions include label graph, structure comparison, field graph, trend prediction, content comparison, distribution overlay, feature clustering, association analysis, and location clustering:
(1) Label graph: labels are defined for each data catalog in advance to state the content it mainly contains, so that associated data sets can be found quickly;
(2) Structure comparison: structural differences are compared across the basic information, data format, time frequency, and calling method of the data catalogs;
(3) Field graph: identical fields are analyzed and displayed with visualization charts;
(4) Trend prediction: multi-field statistics on the data sets, trend analysis of continuous data, and overlaid visual display;
(5) Content comparison: primary-key setting, and comparative analysis of different fields between one data set and another;
(6) Distribution overlay: multi-field statistics on the data sets, analysis of discrete distributions, and overlaid visual display;
(7) Feature clustering: cluster analysis on a single dimension of the data or on specific data vectors;
(8) Association analysis: labeling (calibration) of discrete data and model training, for automatic classification;
(9) Location clustering: map overlay and statistics by region on multiple types of geographic coordinate fields.
6. The data analysis method based on government data according to claim 5, characterized in that: the visualization charts in the field graph include chord diagrams, force-directed graphs, relationship network graphs, and tree diagrams.
Priority Applications (1)
Application Number | Priority Date | Filing Date | Title |
---|---|---|---|
CN201810096097.9A CN108345660A (en) | 2018-01-31 | 2018-01-31 | A kind of data analysing method based on government data |
Publications (1)
Publication Number | Publication Date |
---|---|
CN108345660A true CN108345660A (en) | 2018-07-31 |
Family
ID=62961008
Family Applications (1)
Application Number | Status | Publication Number | Priority Date | Filing Date | Title |
---|---|---|---|---|---|
CN201810096097.9A | Pending | CN108345660A (en) | 2018-01-31 | 2018-01-31 | A kind of data analysing method based on government data |
Country Status (1)
Country | Link |
---|---|
CN (1) | CN108345660A (en) |
Cited By (4)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
CN110096529A (en) * | 2019-04-16 | 2019-08-06 | 中科金联(北京)科技有限公司 | Network data mining method and system based on multidimensional vector data |
CN111753926A (en) * | 2020-07-07 | 2020-10-09 | 广州驰兴通用技术研究有限公司 | Data sharing method and system for smart city |
CN111754040A (en) * | 2020-06-23 | 2020-10-09 | 邢冠南 | Information processing and pushing method based on user requirements |
CN112699549A (en) * | 2020-12-28 | 2021-04-23 | 南京工程学院 | CDFS structure-containing aeroengine nonlinear model modeling system and modeling method |
Citations (4)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
CN104636864A (en) * | 2015-01-28 | 2015-05-20 | 贵州省邮电规划设计院有限公司 | Government affair information resource management system based on cloud computation |
CN105740339A (en) * | 2016-01-25 | 2016-07-06 | 河北中科恒运软件科技股份有限公司 | Civil administration big data fusion and management system |
CN106855962A (en) * | 2015-12-09 | 2017-06-16 | 星际空间(天津)科技发展有限公司 | A kind of method for building government affairs big data platform |
CN107247788A (en) * | 2017-06-15 | 2017-10-13 | 山东浪潮云服务信息科技有限公司 | A kind of method of the comprehensive regulation service based on government data |
Similar Documents
Publication | Publication Date | Title |
---|---|---|
CN108345660A (en) | A kind of data analysing method based on government data | |
CN104767692B (en) | A kind of net flow assorted method | |
CN105391815B (en) | internet IP address resource acquisition and centralized management method | |
CA2534448A1 (en) | Auto-ip traffic optimization in mobile telecommunications systems | |
CN106332052B (en) | Micro-area public security early warning method based on mobile communication terminal | |
CN104156729B (en) | A kind of classroom demographic method | |
CN111242096B (en) | People number gradient-based people group distinguishing method | |
CN113688490A (en) | Network co-construction sharing processing method, device, equipment and storage medium | |
CN110377605A (en) | A kind of Sensitive Attributes identification of structural data and classification stage division | |
CN112019500B (en) | Encrypted traffic identification method based on deep learning and electronic device | |
CN103634829B (en) | A kind of section screening technique based on drive test information and equipment | |
CN110572441A (en) | Ultra-large-scale DPI data processing system and method based on edge calculation | |
CN113037567A (en) | Network attack behavior simulation system and method for power grid enterprise | |
CN116527362A (en) | Data protection method based on LayerCFL intrusion detection | |
CN101075303A (en) | Data unearch model for pedicting service potential customers | |
CN108509426B (en) | A kind of depth various dimensions flow semantic analysis | |
CN112583820B (en) | Power attack testing system based on attack topology | |
CN111510438B (en) | Management and control method for data classification of power internet of things terminal | |
CN108241874A (en) | Video text area positioning method based on BP neural network and spectrum analysis | |
CN111737318A (en) | Screening method for phishing susceptible population | |
CN104202407A (en) | Video file synchronization method and video file synchronization device | |
Zhao et al. | Realization of intrusion detection system based on the improved data mining technology | |
CN111614786A (en) | System and method for processing data at high speed by remote server based on block chain | |
CN113919415A (en) | Abnormal group detection method based on unsupervised algorithm | |
CN113919493A (en) | Fraud behavior analysis system based on neural network model |
Legal Events
Date | Code | Title | Description |
---|---|---|---|
PB01 | Publication | ||
SE01 | Entry into force of request for substantive examination | ||
RJ01 | Rejection of invention patent application after publication | Application publication date: 20180731 |