CN109977125A - A kind of big data safety analysis plateform system based on network security - Google Patents

A kind of big data safety analysis plateform system based on network security Download PDF

Info

Publication number
CN109977125A
CN109977125A CN201910280131.2A CN201910280131A CN109977125A CN 109977125 A CN109977125 A CN 109977125A CN 201910280131 A CN201910280131 A CN 201910280131A CN 109977125 A CN109977125 A CN 109977125A
Authority
CN
China
Prior art keywords
data
module
analysis
network security
mining
Prior art date
Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
Pending
Application number
CN201910280131.2A
Other languages
Chinese (zh)
Inventor
张晶
Current Assignee (The listed assignees may be inaccurate. Google has not performed a legal analysis and makes no representation or warranty as to the accuracy of the list.)
Fujian Singularity Space-Time Digital Technology Co Ltd
Original Assignee
Fujian Singularity Space-Time Digital Technology Co Ltd
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
Filing date
Publication date
Application filed by Fujian Singularity Space-Time Digital Technology Co Ltd filed Critical Fujian Singularity Space-Time Digital Technology Co Ltd
Priority to CN201910280131.2A priority Critical patent/CN109977125A/en
Publication of CN109977125A publication Critical patent/CN109977125A/en
Pending legal-status Critical Current

Links

Landscapes

  • Information Retrieval, Db Structures And Fs Structures Therefor (AREA)

Abstract

A kind of big data safety analysis plateform system based on network security, including data acquisition module, data memory module, data mining analysis module and data visualize module;The data acquisition module is connect with the data memory module;The data memory module is connect with the data mining analysis module;The data mining analysis module is connect with the data visualization display module;The data acquisition module is configured as acquisition structural data, unstructured data and semi-structured data;The data memory module is configured as storing the data of the data collecting module collected;The data mining analysis module is configured as and mining analysis for statistical analysis to data;The data visualization display module is configured as to user feedback data.Present invention reduces data carrying costs, improve data storage capacity, improve Network Safety Analysis treatment effeciency.

Description

A kind of big data safety analysis plateform system based on network security
Technical field
The present invention relates to technical field of network security more particularly to a kind of big data safety analysis based on network security are flat Platform system.
Background technique
Network security refers to that the data in the hardware, software and its system of network system are protected, not because accidental or The reason of person's malice and by destruction, change or leakage, system can be continuous, reliable and be normally run, in network service not It is disconnected.Network analysis system, which is one, allows network manager, can be in various network security problems, the network management suited the remedy to the case Scheme, it detects the data of transmission all in network, is analyzed, is diagnosed, and helps user to exclude network contingency, evades safety Risk improves network performance, increases network availability value.
Big data (big data), refer to can not be captured within the scope of certain time with conventional software tool, manage and The data acquisition system of processing is to need new tupe that could have stronger decision edge, see clearly discovery power and process optimization ability Magnanimity, high growth rate and diversified information assets.Big data has the characteristics that 5V, i.e. Volume (a large amount of), Velocity are (high Speed), Variety (multiplicity), Value (low value density) and Veracity (authenticity).Big data processing mainly includes following Several stages: acquisition, importing/pretreatment, statistics/analysis and excavation.
Rapid development of information technology, the network architecture is increasingly sophisticated, and network security data amount of analysis speedup and increment are huge, number According to abundance, data category multiplicity, for promptly and accurately detect security risk and attack, information need to be improved and adopted The rate of collection and transmission;Have the shortcomings that carrying cost height and amount of storage are small using traditional structured database storing data, And the inefficiency of analysis and complex query is carried out to mass data, it is difficult to meet Network Safety Analysis demand at this stage.
Summary of the invention
(1) goal of the invention
To solve technical problem present in background technique, the present invention proposes a kind of big data safety based on network security Analysis platform system reduces data carrying cost, improves data storage capacity, improves Network Safety Analysis treatment effeciency.
(2) technical solution
To solve the above problems, the present invention provides a kind of big data safety analysis plateform system based on network security, Module is visualized including data acquisition module, data memory module, data mining analysis module and data;
The data acquisition module is connect with the data memory module;The data memory module and the data mining Analysis module connection;The data mining analysis module is connect with the data visualization display module;
The data acquisition module is configured as acquisition structural data, unstructured data and semi-structured data;
The data memory module is configured as storing the data of the data collecting module collected;
The data mining analysis module is configured as and mining analysis for statistical analysis to data;
The data visualization display module is configured as to user feedback data.
Preferably, the offline statistical analysis of data is realized using Hive;The real-time online point of data is realized using Storm Analysis.
Preferably, it for acquisition, polymerization and the transmission of distributed and High Availabitity massive logs, is realized using Flume;It is right In the processing for enlivening stream data, cached using Kafka;Storage analysis for real time data, is realized using Storm and is divided Cloth real-time message processing.
Preferably, it is stored using the HDFS distributed file system with high fault tolerance and high-throughput data access characteristics The mass data of the data collecting module collected;The data storage management of efficient big data quantity is carried out using Hbase.
Preferably, using MapReduce carries out data division, calculating task scheduling and distributed computing;Using Hive into The statistical analysis of row data;It is carried out using Mahout based on HadoopMachine learning and data mining.
Above-mentioned technical proposal of the invention has a following beneficial technical effect: data collecting module collected daily record data and Flow information data, and the data of acquisition are transmitted to data memory module;Data memory module stores different type and purposes Daily record data and flow information data, including for query and search log data and flow information data, for counting According to the daily record data and flow information data after the standardization of mining analysis and the log number analyzed in real time for data According to flow information data, the data of storage are transmitted to data mining analysis module by data memory module;Data mining analysis Module carries out parallel batching, statistical analysis and mining analysis, data mining analysis module to data and passes data analysis result Transport to data visualization display module;Data analysis result is fed back to user by data visualization display module.To sum up, of the invention Data carrying cost is reduced, data storage capacity is improved, improves Network Safety Analysis treatment effeciency.
Detailed description of the invention
Fig. 1 is the structural schematic diagram of the big data safety analysis plateform system proposed by the present invention based on network security.
Specific embodiment
In order to make the objectives, technical solutions and advantages of the present invention clearer, With reference to embodiment and join According to attached drawing, the present invention is described in more detail.It should be understood that these descriptions are merely illustrative, and it is not intended to limit this hair Bright range.In addition, in the following description, descriptions of well-known structures and technologies are omitted, to avoid this is unnecessarily obscured The concept of invention.
As shown in Figure 1, a kind of big data safety analysis plateform system based on network security proposed by the present invention, including number Module is visualized according to acquisition module, data memory module, data mining analysis module and data;
The data acquisition module is connect with the data memory module;The data memory module and the data mining Analysis module connection;The data mining analysis module is connect with the data visualization display module;
The data acquisition module is configured as acquisition structural data, unstructured data and semi-structured data;
The data memory module is configured as storing the data of the data collecting module collected;
The data mining analysis module is configured as and mining analysis for statistical analysis to data;
The data visualization display module is configured as to user feedback data.
In the present invention, data collecting module collected daily record data and flow information data, and the data of acquisition are transmitted to Data memory module;Data memory module stores the daily record data and flow information data of different type and purposes, including is used for The log data and flow information data of query and search, for the daily record data after the standardization of data mining analysis The daily record data and flow information data analyzed in real time with flow information data and for data, data memory module will store Data be transmitted to data mining analysis module;Data mining analysis module to data carry out parallel batching, statistical analysis and Data analysis result is transmitted to data visualization display module by mining analysis, data mining analysis module;Data visualization exhibition Show that data analysis result is fed back to user by module.To sum up, present invention reduces data carrying costs, improve data storage Amount, improves Network Safety Analysis treatment effeciency.
In an alternative embodiment, the offline statistical analysis of data is realized using Hive;Data are realized using Storm On-line analysis.
It should be noted that platform is broadly divided into two classes to the analysis of daily record data and flow information data: offline statistics Analysis and on-line analysis, offline statistical analysis realize that on-line analysis is realized by Storm by Hive.
In an alternative embodiment, it for acquisition, polymerization and the transmission of distributed and High Availabitity massive logs, adopts It is realized with Flume;For enlivening the processing of stream data, cached using Kafka;Storage analysis for real time data, Distributed real-time message processing is realized using Storm.
It should be noted that Flume supports to customize Various types of data sender in log system, for collecting data;Together When, Flume, which is provided, carries out simple process to data, and writes the ability of various data receivings;Flume can will be applied and be generated Data store such as the HDFS and HBase into any pooled storage;When the speed for collecting data is more than that data will be written When, the information of collection is very big, be even more than the write-in data capability of system, at this point, Flume can in data producer and It is adjusted between data receptacle, guarantees that it can provide stable data therebetween;Flume can provide context routing Feature;The pipeline of Flume is based on affairs, ensure that consistency of the data in transmission and reception;Flume is reliable, fault-tolerant Property is high, scalable, manageable and customized.
Kafka is that a kind of distributed post of high-throughput subscribes to message system, its purpose be by Hadoop and Row load mechanism provides real-time message by cluster come the Message Processing unified on offline and line.
Storm is free and open source a distributed real time computation system, can accomplish reliably to handle using Storm Unlimited data flow, as Hadoop batch processing big data, Storm can be can be used with real-time processing data, Storm to be appointed What programming language, Storm have a characteristic that simple programming, high-performance, distribution, expansible, fault-tolerant and message save.
In an alternative embodiment, it is distributed using with the HDFS of high fault tolerance and high-throughput data access characteristics Formula file system stores the mass data of the data collecting module collected;The data of efficient big data quantity are carried out using Hbase Storage management.
It should be noted that HDFS has the characteristics that high fault tolerance, it is designed to be deployed on cheap hardware, it passes through height The characteristic of handling capacity carrys out the data of access application, is suitble to the application program of super large data set;HDFS passes through with data text The NameSpace of the back end management file system of part store function.
HBase is a high reliability, high-performance, towards column, telescopic distributed memory system, HBase relationship number According to library tool there are two types of feature, first feature is that it is the database for being suitable for unstructured data storage, second Feature be it is per-column rather than based on capable mode;All data files in HBase are stored in Hadoop HDFS It mainly include two types: Hfile and HLog File in file system.
In an alternative embodiment, data division, calculating task scheduling and distributed meter are carried out using MapReduce It calculates;The statistical analysis of data is carried out using Hive;Machine learning and data mining based on Hadoop are carried out using Mahout.
It should be noted that MapReduce is computation model, frame and platform towards big data parallel processing, it has Three layers of meaning: first layer meaning is that MapReduce is the high performance parallel computation platform based on cluster, it is realizing one It, can be only with common commercial server realization when a distribution and parallel computing trunking comprising tens of, hundreds of to many thousands of nodes; Second layer meaning is that MapReduce is a parallel computation and runs software frame, it is by providing a huge but design Superior parallel computation software frame is appointed to be automatically performed parallelization processing, automatic division calculating data and the calculating of calculating task Business distributes and executes automatically task on clustered node and collect calculated result, by distributed data storage, data communication, appearance The ins and outs for many system bottoms that the parallel computations such as fault reason are related to transfers to system to be responsible for processing;Third layer meaning is MapReduce is a Parallel programming model and method, with two function programming realities of Map (mapping) and Reduce (reduction) Now basic parallel computation task provides abstract operation and multiple programming interface, simply and easily to complete extensive number According to programming and calculation processing.
Hive is built upon the data warehouse base frame on Hadoop, have a characteristic that support creation index with Semantic inspection is executed when optimizing data query, storage files in different types, storing metadata in relational database to reduce inquiry Time for looking into, can directly using the data being stored in Hadoop system, support user to extend looking into for UDF function and class SQL Inquiry mode.
Mahout includes a variety of realizations, including cluster, classification, recommendation filtering and frequent subitem excavate, by using The library Apache Hadoop, Mahout can be effectively extended in cloud, and in terms of creating intelligent application, Mahout can be effective Ground help developer realizes the creation of program more conveniently.
It should be understood that above-mentioned specific embodiment of the invention is used only for exemplary illustration or explains of the invention Principle, but not to limit the present invention.Therefore, that is done without departing from the spirit and scope of the present invention is any Modification, equivalent replacement, improvement etc., should all be included in the protection scope of the present invention.In addition, appended claims purport of the present invention Covering the whole variations fallen into attached claim scope and boundary or this range and the equivalent form on boundary and is repairing Change example.

Claims (5)

1. a kind of big data safety analysis plateform system based on network security, which is characterized in that including data acquisition module, number Module is visualized according to memory module, data mining analysis module and data;
The data acquisition module is connect with the data memory module;The data memory module and the data mining analysis Module connection;The data mining analysis module is connect with the data visualization display module;
The data acquisition module is configured as acquisition structural data, unstructured data and semi-structured data;
The data memory module is configured as storing the data of the data collecting module collected;
The data mining analysis module is configured as and mining analysis for statistical analysis to data;
The data visualization display module is configured as to user feedback data.
2. the big data safety analysis plateform system according to claim 1 based on network security, which is characterized in that use The offline statistical analysis of Hive realization data;The on-line analysis of data is realized using Storm.
3. the big data safety analysis plateform system according to claim 2 based on network security, which is characterized in that for Acquisition, polymerization and the transmission of distributed and High Availabitity massive logs, are realized using Flume;For enlivening the place of stream data Reason, is cached using Kafka;Storage analysis for real time data realizes distributed real-time message processing using Storm.
4. the big data safety analysis plateform system according to claim 1 based on network security, which is characterized in that use The data acquisition module is stored with the HDFS distributed file system of high fault tolerance and high-throughput data access characteristics to adopt The mass data of collection;The data storage management of efficient big data quantity is carried out using Hbase.
5. the big data safety analysis plateform system according to claim 1 based on network security, which is characterized in that use MapReduce carries out data division, calculating task scheduling and distributed computing;The statistical analysis of data is carried out using Hive;It adopts Machine learning and data mining based on Hadoop are carried out with Mahout.
CN201910280131.2A 2019-04-09 2019-04-09 A kind of big data safety analysis plateform system based on network security Pending CN109977125A (en)

Priority Applications (1)

Application Number Priority Date Filing Date Title
CN201910280131.2A CN109977125A (en) 2019-04-09 2019-04-09 A kind of big data safety analysis plateform system based on network security

Applications Claiming Priority (1)

Application Number Priority Date Filing Date Title
CN201910280131.2A CN109977125A (en) 2019-04-09 2019-04-09 A kind of big data safety analysis plateform system based on network security

Publications (1)

Publication Number Publication Date
CN109977125A true CN109977125A (en) 2019-07-05

Family

ID=67083580

Family Applications (1)

Application Number Title Priority Date Filing Date
CN201910280131.2A Pending CN109977125A (en) 2019-04-09 2019-04-09 A kind of big data safety analysis plateform system based on network security

Country Status (1)

Country Link
CN (1) CN109977125A (en)

Cited By (5)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN111090646A (en) * 2019-10-21 2020-05-01 中国科学院信息工程研究所 Electromagnetic data processing platform
CN111753008A (en) * 2020-06-30 2020-10-09 珠海迈越信息技术有限公司 Set top box viewing method and system based on big data analysis
CN112163060A (en) * 2020-09-16 2021-01-01 安徽龙运智能科技有限公司 System for processing mass GPS data by big data technology
CN113742413A (en) * 2021-09-10 2021-12-03 湖南强智科技发展有限公司 High-accuracy big data analysis system based on multi-form processing
CN114268503A (en) * 2021-12-24 2022-04-01 湘潭大学 Network space data security situation sensing and early warning method and system

Citations (7)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN104820670A (en) * 2015-03-13 2015-08-05 国家电网公司 Method for acquiring and storing big data of power information
CN105553957A (en) * 2015-12-09 2016-05-04 国家电网公司 Network safety situation awareness early-warning method and system based big data
CN107145587A (en) * 2017-05-11 2017-09-08 成都四方伟业软件股份有限公司 A kind of anti-fake system of medical insurance excavated based on big data
CN107181612A (en) * 2017-05-08 2017-09-19 深圳市众泰兄弟科技发展有限公司 A kind of visual network method for safety monitoring based on big data
CN107196910A (en) * 2017-04-18 2017-09-22 国网山东省电力公司电力科学研究院 Threat early warning monitoring system, method and the deployment framework analyzed based on big data
CN107515927A (en) * 2017-08-24 2017-12-26 深圳市云房网络科技有限公司 A kind of real estate user behavioural analysis platform
CN109150870A (en) * 2018-08-14 2019-01-04 中国电子科技集团公司电子科学研究院 Network safety situation analysis and application method and system

Patent Citations (7)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN104820670A (en) * 2015-03-13 2015-08-05 国家电网公司 Method for acquiring and storing big data of power information
CN105553957A (en) * 2015-12-09 2016-05-04 国家电网公司 Network safety situation awareness early-warning method and system based big data
CN107196910A (en) * 2017-04-18 2017-09-22 国网山东省电力公司电力科学研究院 Threat early warning monitoring system, method and the deployment framework analyzed based on big data
CN107181612A (en) * 2017-05-08 2017-09-19 深圳市众泰兄弟科技发展有限公司 A kind of visual network method for safety monitoring based on big data
CN107145587A (en) * 2017-05-11 2017-09-08 成都四方伟业软件股份有限公司 A kind of anti-fake system of medical insurance excavated based on big data
CN107515927A (en) * 2017-08-24 2017-12-26 深圳市云房网络科技有限公司 A kind of real estate user behavioural analysis platform
CN109150870A (en) * 2018-08-14 2019-01-04 中国电子科技集团公司电子科学研究院 Network safety situation analysis and application method and system

Cited By (6)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN111090646A (en) * 2019-10-21 2020-05-01 中国科学院信息工程研究所 Electromagnetic data processing platform
CN111090646B (en) * 2019-10-21 2023-07-28 中国科学院信息工程研究所 Electromagnetic data processing platform
CN111753008A (en) * 2020-06-30 2020-10-09 珠海迈越信息技术有限公司 Set top box viewing method and system based on big data analysis
CN112163060A (en) * 2020-09-16 2021-01-01 安徽龙运智能科技有限公司 System for processing mass GPS data by big data technology
CN113742413A (en) * 2021-09-10 2021-12-03 湖南强智科技发展有限公司 High-accuracy big data analysis system based on multi-form processing
CN114268503A (en) * 2021-12-24 2022-04-01 湘潭大学 Network space data security situation sensing and early warning method and system

Similar Documents

Publication Publication Date Title
CN109977125A (en) A kind of big data safety analysis plateform system based on network security
CN104820670B (en) A kind of acquisition of power information big data and storage method
Gupta et al. Cloud computing and big data analytics: what is new from databases perspective?
CN111435344B (en) Big data-based drilling acceleration influence factor analysis model
JP2022078130A (en) System and method for mata data-driven external interface generation of application programming interface
CN109272155A (en) A kind of corporate behavior analysis system based on big data
CN103631882B (en) Semantization service generation system and method based on graph mining technique
CN106339509A (en) Power grid operation data sharing system based on large data technology
CN107943668A (en) Computer server cluster daily record monitoring method and monitor supervision platform
CN106815338A (en) A kind of real-time storage of big data, treatment and inquiry system
CN107766402A (en) A kind of building dictionary cloud source of houses big data platform
CN110347719A (en) A kind of enterprise's foreign trade method for prewarning risk and system based on big data
CN105139281A (en) Method and system for processing big data of electric power marketing
CN112116488A (en) Water conservancy big data comprehensive maintenance system
CN106571960A (en) Log collection and management system and log collection and management method
Fu et al. Real-time data infrastructure at uber
CN109754219A (en) A kind of big data management and analysis platform system based on city management
CN102902813A (en) Log collection system
CN112148718A (en) Big data support management system for city-level data middling station
Caldarola et al. Big data: A survey-the new paradigms, methodologies and tools
Roth et al. Event data warehousing for complex event processing
CN107103064A (en) Data statistical approach and device
CN112181960A (en) Intelligent operation and maintenance framework system based on AIOps
CN111459900B (en) Big data life cycle setting method, device, storage medium and server
Mesiti et al. Towards a user-friendly loading system for the analysis of big data in the internet of things

Legal Events

Date Code Title Description
PB01 Publication
PB01 Publication
SE01 Entry into force of request for substantive examination
SE01 Entry into force of request for substantive examination
RJ01 Rejection of invention patent application after publication
RJ01 Rejection of invention patent application after publication

Application publication date: 20190705