CN108345660A - A kind of data analysing method based on government data - Google Patents

A kind of data analysing method based on government data Download PDF

Info

Publication number
CN108345660A
CN108345660A CN201810096097.9A CN201810096097A CN108345660A CN 108345660 A CN108345660 A CN 108345660A CN 201810096097 A CN201810096097 A CN 201810096097A CN 108345660 A CN108345660 A CN 108345660A
Authority
CN
China
Prior art keywords
data
government
analysis
field
analysing method
Prior art date
Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
Pending
Application number
CN201810096097.9A
Other languages
Chinese (zh)
Inventor
张峰
张兆勇
李娜
顾晶
Current Assignee (The listed assignees may be inaccurate. Google has not performed a legal analysis and makes no representation or warranty as to the accuracy of the list.)
Shandong Hui Trade Electronic Port Co Ltd
Shandong Huimao Electronic Port Co Ltd
Original Assignee
Shandong Hui Trade Electronic Port Co Ltd
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
Filing date
Publication date
Application filed by Shandong Hui Trade Electronic Port Co Ltd filed Critical Shandong Hui Trade Electronic Port Co Ltd
Priority to CN201810096097.9A priority Critical patent/CN108345660A/en
Publication of CN108345660A publication Critical patent/CN108345660A/en
Pending legal-status Critical Current

Links

Classifications

    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06FELECTRIC DIGITAL DATA PROCESSING
    • G06F16/00Information retrieval; Database structures therefor; File system structures therefor
    • G06F16/30Information retrieval; Database structures therefor; File system structures therefor of unstructured textual data
    • G06F16/35Clustering; Classification
    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06FELECTRIC DIGITAL DATA PROCESSING
    • G06F16/00Information retrieval; Database structures therefor; File system structures therefor
    • G06F16/20Information retrieval; Database structures therefor; File system structures therefor of structured data, e.g. relational data
    • G06F16/24Querying
    • G06F16/245Query processing
    • G06F16/2458Special types of queries, e.g. statistical queries, fuzzy queries or distributed queries
    • G06F16/2462Approximate or statistical queries
    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06FELECTRIC DIGITAL DATA PROCESSING
    • G06F16/00Information retrieval; Database structures therefor; File system structures therefor
    • G06F16/20Information retrieval; Database structures therefor; File system structures therefor of structured data, e.g. relational data
    • G06F16/24Querying
    • G06F16/245Query processing
    • G06F16/2458Special types of queries, e.g. statistical queries, fuzzy queries or distributed queries
    • G06F16/2465Query processing support for facilitating data mining operations in structured databases
    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06FELECTRIC DIGITAL DATA PROCESSING
    • G06F16/00Information retrieval; Database structures therefor; File system structures therefor
    • G06F16/30Information retrieval; Database structures therefor; File system structures therefor of unstructured textual data
    • G06F16/36Creation of semantic tools, e.g. ontology or thesauri
    • G06F16/367Ontology
    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06QINFORMATION AND COMMUNICATION TECHNOLOGY [ICT] SPECIALLY ADAPTED FOR ADMINISTRATIVE, COMMERCIAL, FINANCIAL, MANAGERIAL OR SUPERVISORY PURPOSES; SYSTEMS OR METHODS SPECIALLY ADAPTED FOR ADMINISTRATIVE, COMMERCIAL, FINANCIAL, MANAGERIAL OR SUPERVISORY PURPOSES, NOT OTHERWISE PROVIDED FOR
    • G06Q50/00Information and communication technology [ICT] specially adapted for implementation of business processes of specific business sectors, e.g. utilities or tourism
    • G06Q50/10Services
    • G06Q50/26Government or public services

Landscapes

  • Engineering & Computer Science (AREA)
  • Theoretical Computer Science (AREA)
  • Physics & Mathematics (AREA)
  • Databases & Information Systems (AREA)
  • General Physics & Mathematics (AREA)
  • General Engineering & Computer Science (AREA)
  • Data Mining & Analysis (AREA)
  • Business, Economics & Management (AREA)
  • Computational Linguistics (AREA)
  • Probability & Statistics with Applications (AREA)
  • Tourism & Hospitality (AREA)
  • Fuzzy Systems (AREA)
  • Mathematical Physics (AREA)
  • Software Systems (AREA)
  • General Health & Medical Sciences (AREA)
  • General Business, Economics & Management (AREA)
  • Strategic Management (AREA)
  • Primary Health Care (AREA)
  • Marketing (AREA)
  • Human Resources & Organizations (AREA)
  • Economics (AREA)
  • Health & Medical Sciences (AREA)
  • Educational Administration (AREA)
  • Development Economics (AREA)
  • Life Sciences & Earth Sciences (AREA)
  • Animal Behavior & Ethology (AREA)
  • Management, Administration, Business Operations System, And Electronic Commerce (AREA)

Abstract

The invention discloses a kind of data analysing methods based on government data, belong to big data technical field of information processing.The data analysing method based on government data of the present invention, government data catalogue is overlapped from different dimensions, multiple angles, is compared, association analysis, it carries out data analysis and excavates, association between heuristic data, it was found that the new rule between data and value, analysis report is formed, foundation is provided for government decision.The data analysing method based on government data of the invention can be according to the association between open and shared government data heuristic data, it was found that the value between data, analysis report is formed, to provide complementary opinion for government decision, there is good application value.

Description

A kind of data analysing method based on government data
Technical field
The present invention relates to big data technical field of information processing, specifically provide a kind of data analysis side based on government data Method.
Background technology
With society and economic rapid development, the demand of each field of society and data is increasing, and the big data epoch are quiet So lead.With the arriving in big data epoch, government department increasingly focuses on application technology means and carries out depth to data resource Value excavate, to meet growing fining, scientific management needs.Government develops and shared data volume increasingly increases Add, the demand of the analysis mining of data value also increasingly increases, and the demand to the value of heuristic data steps up.But it is existing Have in technology, still lacks the method effectively analyzed government data.
Invention content
The technical assignment of the present invention be in view of the above problems, provide it is a kind of can be according to open and shared government's number According to the association between heuristic data, the value between data is found, form analysis report, it is complementary to be provided for government decision The data analysing method based on government data of opinion.
To achieve the above object, the present invention provides following technical solutions:
A kind of data analysing method based on government data, the data analysing method is from different dimensions, multiple angles to government Data directory is overlapped, compares, association analysis, carries out data analysis and excavates, and the association between heuristic data finds data Between new rule with value, formed analysis report, provide foundation for government decision.
Preferably, the government data catalogue is the open or shared data directory of government.
Preferably, two or more data directories are screened from open or shared government data catalogue, it will Data are divided into several data sets, are analyzed from incidence relation of the different dimensions between data directory.
Preferably, the foundation of garbled data catalogue is data classification, trade classification, department's classification and subject classification.
Preferably, the different dimensions include label collection of illustrative plates, structure alignment, field collection of illustrative plates, trend prediction, content pair It is clustered than, distribution superposition, feature clustering, association analysis and position,
(1)Label collection of illustrative plates:Label is defined to data directory in advance, for stating its content for including mainly, can be looked for rapidly To associated data set;
(2)Structure alignment:By between the essential information of data directory, data format, temporal frequency, method of calling comparison structure Difference;
(3)Field collection of illustrative plates:By the analysis of same field, with visualization icon shows;
(4)Trend prediction:The multi-field of data set counts, continuous data trend analysis, visual means superposition displaying;
(5)Content compares:Major key setting, the comparative analysis of different field between data set and data set;
(6)Distribution superposition:The multi-field of data set counts, discrete distributional analysis, visual means superposition displaying;
(7)Feature clustering:Clustering is carried out to data single dimension or specific data vector;
(8)Association analysis:The calibration of discrete data and model training, it is automatic to classify;
(9)Position clusters:Map superposition, area dividing statistics are carried out to multiple types geographical position coordinates field.
Preferably, it includes closing string figure, power guiding figure, network of personal connections figure, dendrogram to visualize icon in field collection of illustrative plates.
Compared with prior art, the data analysing method of the invention based on government data has beneficial effect following prominent Fruit:The data analysing method based on government data screens two or two from open or shared government data catalogue Above data directory is associated analysis, the association between heuristic data, hair from different dimensions, multi-angle to data directory Value between existing data, forms analysis report, to provide complementary opinion for government decision, has good popularization and application Value.
Description of the drawings
Fig. 1 is the flow chart of the data analysing method of the present invention based on government data.
Specific implementation mode
Below in conjunction with embodiment, the data analysing method based on government data of the present invention is made further specifically It is bright.
Embodiment
As shown in Figure 1, the data analysing method based on government data of the present invention, from open or shared government data mesh Two or more data directories are screened in record, split data into several data sets, it is split from different dimensions, multiple angles Put or the government data catalogue shared be overlapped, compare, association analysis, carry out data analysis and excavate, between heuristic data Association, find data between new rule with value, formed analysis report, provide foundation for government decision.
The foundation of garbled data catalogue is data classification, trade classification, department's classification and subject classification.
Different dimensions include label collection of illustrative plates, structure alignment, field collection of illustrative plates, trend prediction, content comparison, distribution superposition, spy Sign cluster, association analysis and position cluster, wherein:
(1)Label collection of illustrative plates:Label is defined to data directory in advance, for stating its content for including mainly, can be looked for rapidly To associated data set;
(2)Structure alignment:By between the essential information of data directory, data format, temporal frequency, method of calling comparison structure Difference;
(3)Field collection of illustrative plates:By the analysis of same field, with visualization icon shows, visualization icon is led including conjunction string figure, power Xiang Tu, network of personal connections figure, dendrogram.
(4)Trend prediction:The multi-field of data set counts, continuous data trend analysis, visual means superposition displaying;
(5)Content compares:Major key setting, the comparative analysis of different field between data set and data set;
(6)Distribution superposition:The multi-field of data set counts, discrete distributional analysis, visual means superposition displaying;
(7)Feature clustering:Clustering is carried out to data single dimension or specific data vector;
(8)Association analysis:The calibration of discrete data and model training, it is automatic to classify;
(9)Position clusters:Map superposition, area dividing statistics are carried out to multiple types geographical position coordinates field.
Embodiment described above, the only present invention more preferably specific implementation mode, those skilled in the art is at this The usual variations and alternatives carried out within the scope of inventive technique scheme should be all included within the scope of the present invention.

Claims (6)

1. a kind of data analysing method based on government data, it is characterised in that:The data analysing method is from different dimensions, more Kind of angle is overlapped government data catalogue, compares, association analysis, carries out data analysis and excavates, between heuristic data Association finds the new rule between data and value, forms analysis report, foundation is provided for government decision.
2. the data analysing method according to claim 1 based on government data, it is characterised in that:The government data mesh Record is the open or shared data directory of government.
3. the data analysing method according to claim 2 based on government data, it is characterised in that:From open or shared Two or more data directories are screened in government data catalogue, split data into several data sets, from different dimensions pair Incidence relation between data directory is analyzed.
4. the data analysing method according to claim 3 based on government data, it is characterised in that:Garbled data catalogue According to for data classification, trade classification, department classifies and subject classification.
5. the data analysing method according to claim 4 based on government data, it is characterised in that:The different dimensions packet Include label collection of illustrative plates, structure alignment, field collection of illustrative plates, trend prediction, content comparison, distribution superposition, feature clustering, association analysis and position Cluster is set,
(1)Label collection of illustrative plates:Label is defined to data directory in advance, for stating its content for including mainly, can be looked for rapidly To associated data set;
(2)Structure alignment:By between the essential information of data directory, data format, temporal frequency, method of calling comparison structure Difference;
(3)Field collection of illustrative plates:By the analysis of same field, with visualization icon shows;
(4)Trend prediction:The multi-field of data set counts, continuous data trend analysis, visual means superposition displaying;
(5)Content compares:Major key setting, the comparative analysis of different field between data set and data set;
(6)Distribution superposition:The multi-field of data set counts, discrete distributional analysis, visual means superposition displaying;
(7)Feature clustering:Clustering is carried out to data single dimension or specific data vector;
(8)Association analysis:The calibration of discrete data and model training, it is automatic to classify;
(9)Position clusters:Map superposition, area dividing statistics are carried out to multiple types geographical position coordinates field.
6. the data analysing method according to claim 5 based on government data, it is characterised in that:It is visual in field collection of illustrative plates It includes closing string figure, power guiding figure, network of personal connections figure, dendrogram to change icon.
CN201810096097.9A 2018-01-31 2018-01-31 A kind of data analysing method based on government data Pending CN108345660A (en)

Priority Applications (1)

Application Number Priority Date Filing Date Title
CN201810096097.9A CN108345660A (en) 2018-01-31 2018-01-31 A kind of data analysing method based on government data

Applications Claiming Priority (1)

Application Number Priority Date Filing Date Title
CN201810096097.9A CN108345660A (en) 2018-01-31 2018-01-31 A kind of data analysing method based on government data

Publications (1)

Publication Number Publication Date
CN108345660A true CN108345660A (en) 2018-07-31

Family

ID=62961008

Family Applications (1)

Application Number Title Priority Date Filing Date
CN201810096097.9A Pending CN108345660A (en) 2018-01-31 2018-01-31 A kind of data analysing method based on government data

Country Status (1)

Country Link
CN (1) CN108345660A (en)

Cited By (4)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN110096529A (en) * 2019-04-16 2019-08-06 中科金联(北京)科技有限公司 Network data mining method and system based on multidimensional vector data
CN111753926A (en) * 2020-07-07 2020-10-09 广州驰兴通用技术研究有限公司 Data sharing method and system for smart city
CN111754040A (en) * 2020-06-23 2020-10-09 邢冠南 Information processing and pushing method based on user requirements
CN112699549A (en) * 2020-12-28 2021-04-23 南京工程学院 CDFS structure-containing aeroengine nonlinear model modeling system and modeling method

Citations (4)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN104636864A (en) * 2015-01-28 2015-05-20 贵州省邮电规划设计院有限公司 Government affair information resource management system based on cloud computation
CN105740339A (en) * 2016-01-25 2016-07-06 河北中科恒运软件科技股份有限公司 Civil administration big data fusion and management system
CN106855962A (en) * 2015-12-09 2017-06-16 星际空间(天津)科技发展有限公司 A kind of method for building government affairs big data platform
CN107247788A (en) * 2017-06-15 2017-10-13 山东浪潮云服务信息科技有限公司 A kind of method of the comprehensive regulation service based on government data

Patent Citations (4)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN104636864A (en) * 2015-01-28 2015-05-20 贵州省邮电规划设计院有限公司 Government affair information resource management system based on cloud computation
CN106855962A (en) * 2015-12-09 2017-06-16 星际空间(天津)科技发展有限公司 A kind of method for building government affairs big data platform
CN105740339A (en) * 2016-01-25 2016-07-06 河北中科恒运软件科技股份有限公司 Civil administration big data fusion and management system
CN107247788A (en) * 2017-06-15 2017-10-13 山东浪潮云服务信息科技有限公司 A kind of method of the comprehensive regulation service based on government data

Cited By (4)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN110096529A (en) * 2019-04-16 2019-08-06 中科金联(北京)科技有限公司 Network data mining method and system based on multidimensional vector data
CN111754040A (en) * 2020-06-23 2020-10-09 邢冠南 Information processing and pushing method based on user requirements
CN111753926A (en) * 2020-07-07 2020-10-09 广州驰兴通用技术研究有限公司 Data sharing method and system for smart city
CN112699549A (en) * 2020-12-28 2021-04-23 南京工程学院 CDFS structure-containing aeroengine nonlinear model modeling system and modeling method

Similar Documents

Publication Publication Date Title
CN108345660A (en) A kind of data analysing method based on government data
CN104767692B (en) A kind of net flow assorted method
CN105391815B (en) internet IP address resource acquisition and centralized management method
CA2534448A1 (en) Auto-ip traffic optimization in mobile telecommunications systems
CN106332052B (en) Micro-area public security early warning method based on mobile communication terminal
CN104156729B (en) A kind of classroom demographic method
CN111242096B (en) People number gradient-based people group distinguishing method
CN113688490A (en) Network co-construction sharing processing method, device, equipment and storage medium
CN110377605A (en) A kind of Sensitive Attributes identification of structural data and classification stage division
CN112019500B (en) Encrypted traffic identification method based on deep learning and electronic device
CN103634829B (en) A kind of section screening technique based on drive test information and equipment
CN110572441A (en) Ultra-large-scale DPI data processing system and method based on edge calculation
CN113037567A (en) Network attack behavior simulation system and method for power grid enterprise
CN116527362A (en) Data protection method based on LayerCFL intrusion detection
CN101075303A (en) Data unearch model for pedicting service potential customers
CN108509426B (en) A kind of depth various dimensions flow semantic analysis
CN112583820B (en) Power attack testing system based on attack topology
CN111510438B (en) Management and control method for data classification of power internet of things terminal
CN108241874A (en) Video text area positioning method based on BP neural network and spectrum analysis
CN111737318A (en) Screening method for phishing susceptible population
CN104202407A (en) Video file synchronization method and video file synchronization device
Zhao et al. Realization of intrusion detection system based on the improved data mining technology
CN111614786A (en) System and method for processing data at high speed by remote server based on block chain
CN113919415A (en) Abnormal group detection method based on unsupervised algorithm
CN113919493A (en) Fraud behavior analysis system based on neural network model

Legal Events

Date Code Title Description
PB01 Publication
PB01 Publication
SE01 Entry into force of request for substantive examination
SE01 Entry into force of request for substantive examination
RJ01 Rejection of invention patent application after publication
RJ01 Rejection of invention patent application after publication

Application publication date: 20180731