CN109344333A - A kind of internet big data analysis extracting method and system - Google Patents

A kind of internet big data analysis extracting method and system Download PDF

Info

Publication number
CN109344333A
CN109344333A CN201811315842.0A CN201811315842A CN109344333A CN 109344333 A CN109344333 A CN 109344333A CN 201811315842 A CN201811315842 A CN 201811315842A CN 109344333 A CN109344333 A CN 109344333A
Authority
CN
China
Prior art keywords
data
unit
screening
internet big
keyword
Prior art date
Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
Withdrawn
Application number
CN201811315842.0A
Other languages
Chinese (zh)
Inventor
钟文洪
Current Assignee (The listed assignees may be inaccurate. Google has not performed a legal analysis and makes no representation or warranty as to the accuracy of the list.)
Letian Artificial Intelligence Technology (guangzhou) Co Ltd
Original Assignee
Letian Artificial Intelligence Technology (guangzhou) Co Ltd
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
Filing date
Publication date
Application filed by Letian Artificial Intelligence Technology (guangzhou) Co Ltd filed Critical Letian Artificial Intelligence Technology (guangzhou) Co Ltd
Priority to CN201811315842.0A priority Critical patent/CN109344333A/en
Publication of CN109344333A publication Critical patent/CN109344333A/en
Withdrawn legal-status Critical Current

Links

Classifications

    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06FELECTRIC DIGITAL DATA PROCESSING
    • G06F21/00Security arrangements for protecting computers, components thereof, programs or data against unauthorised activity
    • G06F21/60Protecting data
    • G06F21/602Providing cryptographic facilities or services
    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06FELECTRIC DIGITAL DATA PROCESSING
    • G06F21/00Security arrangements for protecting computers, components thereof, programs or data against unauthorised activity
    • G06F21/60Protecting data
    • G06F21/606Protecting data by securing the transmission between two devices or processes

Abstract

The invention discloses a kind of internet big data analysis extracting method and system, extracting method is analyzed the following steps are included: A, data acquisition unit acquire internet big data;B, the data of acquisition are sent to control unit after data screening module is screened and are handled;C, control unit sends to data characteristics extraction module and instructs, and carries out feature extraction according to pre-set keyword to the data after screening;D, the data after feature extraction store after being encrypted by DEU data encryption unit;E, the internet data after finally extracting is transmitted to background terminal by data transmission unit, the method that the present invention uses is easy to operate, and security performance is high, in addition, by being screened to data, feature extraction is carried out after screening, can be improved the analysis extraction efficiency and accuracy of internet big data, and reduces cost of labor, it improves work efficiency, furthermore it is possible to realize to take in row encryption after store, it is ensured that Information Security.

Description

A kind of internet big data analysis extracting method and system
Technical field
The present invention relates to data abstraction techniques field, specially a kind of internet big data analysis extracting method and system.
Background technique
People just know very well the importance of information since ancient times, and possessing more can obtain advantage with more accurately information, with Society be constantly progressive, information is more and more diversified, and quantity is also growth at double, and the concept of big data also mentions therewith Out, from numerous and jumbled data, required data is extracted and are analyzed, intuitive information is obtained, first chance is occupied with this, obtains benefit Benefit.
Big data, or mass data, mass data;Be by enormous amount, structure is complicated, numerous types data are constituted Data acquisition system is data processing and application model based on cloud computing, passes through the integrated shared of data, the intelligence of intersection multiplexing formation Power resource and knowledge services ability.Research institution is so defined " big data ": " big data " is to need new tupe ability With stronger decision edge, see clearly discovery power and process optimization ability magnanimity, high growth rate and diversified information assets.From It says in a way, big data is the cutting edge technology of data analysis.In brief, it from the data of various type, quickly obtains The ability for obtaining valuable information, is exactly big data technology.Internet big data analysis and extraction are generally by mutual in the prior art Networking data platform is extracted and is analyzed automatically, although can be realized the extraction to internet big data and analysis, Internet data platform feature is single, causes extraction efficiency low, and poor safety performance, be easy to cause leaking data, therefore, have Necessity improves.
Summary of the invention
The purpose of the present invention is to provide a kind of internet big data analysis extracting method and systems, to solve above-mentioned background The problem of being proposed in technology.
To achieve the above object, the invention provides the following technical scheme: a kind of internet big data analysis extracting method, divides Analyse extracting method the following steps are included:
A, data acquisition unit acquires internet big data;
B, the data of acquisition are sent to control unit after data screening module is screened and are handled;
C, control unit to data characteristics extraction module send instruct, to the data after screening according to pre-set keyword into Row feature extraction;
D, the data after feature extraction store after being encrypted by DEU data encryption unit;
E, the internet data after finally extracting is transmitted to background terminal by data transmission unit.
Preferably, data screening modular approach is as follows in the step B:
A, multiple noise datas are extracted from data to be screened as sample data;
B, conversion process is carried out to each sample data, obtains the transformation data of each sample data;
C, by preparatory trained data classification model, it is pre- that label is carried out to each sample data and each transformation data It surveys, determines the target labels and target labels probability of each sample data;
D, according to the target labels of each sample data and target labels probability, each sample data is screened, obtains number of targets According to.
Preferably, data ciphering method is as follows in the step D:
A, cleaning operation is carried out to data to be encrypted first;
B, the operation of AES encryption algorithm is carried out to the data after cleaning later, obtains an encrypted ciphertext data;
C, hyperchaos cryptographic calculation is carried out to a ciphertext data later again, obtains secondary ciphertext data;
D, finally secondary ciphertext data are carried out carrying out des encryption operation, the final encryption of complete paired data.
Preferably, the data characteristics extraction module feature extracting method is as follows:
A, data set is established, multiple Sub Data Sets to feature extraction are wherein included in data set;
B, feature training is carried out to data set, obtains training pattern;
C, the first keyword and the second keyword in data set are extracted;
D, each Sub Data Set in cyclic search data set, using the first keyword and the second keyword as primary condition, subdata Collection scans for;
E, search is matched to the first keyword or the second keyword in each Sub Data Set, then extracts to data.
Preferably, a kind of internet big data analysis extraction system, including control unit, data acquisition unit, data sieve Menu member, data characteristics extraction unit, data encryption storage unit and data outputting unit, the data acquisition unit pass through number Control unit is connected according to screening unit, described control unit is separately connected data characteristics extraction unit, data encryption storage unit And data outputting unit;Wherein, the data acquisition unit is for acquiring internet big data;The data screening unit is used for Internet big data is screened, interference information is removed;The data characteristics extraction unit be used for the data after screening into Row feature extraction;The data encryption storage unit is for carrying out encryption storage to the internet big data after feature extraction.
Compared with prior art, the beneficial effects of the present invention are:
(1) method that the present invention uses is easy to operate, and security performance is high, in addition, being carried out after screening by screening to data Feature extraction, can be improved the analysis extraction efficiency and accuracy of internet big data, and reduce cost of labor, improve Working efficiency, furthermore it is possible to realize to take in row encryption after store, it is ensured that Information Security.
(2) the data screening module that the present invention uses can carry out automatically data screening according to computer program, and it is convenient to operate And it is time-consuming short, human resources can either be saved, and be able to ascend data screening efficiency, further improve the extraction effect of data Rate.
(3) data ciphering method that the present invention uses can transmit data and carry out multi-enciphering, improve the peace of data Full property and confidentiality.
It (4), can by the first keyword of search and the second keyword in the data characteristics extracting method that the present invention uses It reduces and extracts difficulty, improve feature extraction precision.
Detailed description of the invention
Fig. 1 is flow chart of the present invention;
Fig. 2 is encryption method flow chart of the present invention;
Fig. 3 is apparatus of the present invention functional block diagram.
Specific embodiment
Following will be combined with the drawings in the embodiments of the present invention, and technical solution in the embodiment of the present invention carries out clear, complete Site preparation description, it is clear that described embodiments are only a part of the embodiments of the present invention, instead of all the embodiments.It is based on Embodiment in the present invention, it is obtained by those of ordinary skill in the art without making creative efforts every other Embodiment shall fall within the protection scope of the present invention.
Fig. 1-3 is please referred to, the present invention provides a kind of technical solution: the invention provides the following technical scheme: a kind of internet Big data analysis extracting method, analysis extracting method the following steps are included:
A, data acquisition unit acquires internet big data;
B, the data of acquisition are sent to control unit after data screening module is screened and are handled;
C, control unit to data characteristics extraction module send instruct, to the data after screening according to pre-set keyword into Row feature extraction;
D, the data after feature extraction store after being encrypted by DEU data encryption unit;
E, the internet data after finally extracting is transmitted to background terminal by data transmission unit.
The present invention needs to use screening, feature extraction and the cryptographic operation to data, the present invention in analysis extraction process Data screening module can carry out automatically data screening according to computer program, it is convenient and time-consuming short to operate, and can either save people Power resource, and it is able to ascend data screening efficiency;Pass through the first keyword of search and the second keyword, energy in feature extracting method Enough reduce extracts difficulty, improves feature extraction precision;Data ciphering method can transmit data and carry out multi-enciphering, improve The safety and confidentiality of data by the processing of three above step can accelerate the analysis extraction efficiency of data, and It can ensure the safety of data.
In the present invention, data screening modular approach is as follows in step B:
A, multiple noise datas are extracted from data to be screened as sample data;
B, conversion process is carried out to each sample data, obtains the transformation data of each sample data;
C, by preparatory trained data classification model, it is pre- that label is carried out to each sample data and each transformation data It surveys, determines the target labels and target labels probability of each sample data;
D, according to the target labels of each sample data and target labels probability, each sample data is screened, obtains number of targets According to.
The data screening module that the present invention uses can carry out automatically data screening according to computer program, operate convenient and consume When it is short, human resources can either be saved, and be able to ascend data screening efficiency.
In the present invention, data ciphering method is as follows in step D:
A, cleaning operation is carried out to data to be encrypted first;
B, the operation of AES encryption algorithm is carried out to the data after cleaning later, obtains an encrypted ciphertext data;
C, hyperchaos cryptographic calculation is carried out to a ciphertext data later again, obtains secondary ciphertext data;
D, finally secondary ciphertext data are carried out carrying out des encryption operation, the final encryption of complete paired data.
The data ciphering method that the present invention uses can transmit data and carry out multi-enciphering, improve the safety of data And confidentiality.
In the present invention, data characteristics extraction module feature extracting method is as follows:
A, data set is established, multiple Sub Data Sets to feature extraction are wherein included in data set;
B, feature training is carried out to data set, obtains training pattern;
C, the first keyword and the second keyword in data set are extracted;
D, each Sub Data Set in cyclic search data set, using the first keyword and the second keyword as primary condition, subdata Collection scans for;
E, search is matched to the first keyword or the second keyword in each Sub Data Set, then extracts to data.
By the first keyword of search and the second keyword in the data characteristics extracting method that the present invention uses, can reduce Difficulty is extracted, feature extraction precision is improved.
In addition, the invention also discloses a kind of internet big data analysis extraction system, including control unit 1, data are adopted Collect unit 2, data screening unit 3, data characteristics extraction unit 4, data encryption storage unit 5 and data outputting unit 6, it is described Data acquisition unit 2 connects control unit 1 by data screening unit 3, and described control unit 1 is separately connected data characteristics extraction Unit 4, data encryption storage unit 5 and data outputting unit 6;Wherein, the data acquisition unit 2 is big for acquiring internet Data;The data screening unit 3 removes interference information for screening to internet big data;The data characteristics mentions Take unit 4 for carrying out feature extraction to the data after screening;After the data encryption storage unit 5 is used for feature extraction Internet big data carries out encryption storage.
In conclusion the method that the present invention uses is easy to operate, security performance is high, in addition, by being screened to data, Feature extraction is carried out after screening, can be improved the analysis extraction efficiency and accuracy of internet big data, and is reduced artificial Cost improves work efficiency, furthermore it is possible to realize to take in row encryption after store, it is ensured that Information Security.
It although an embodiment of the present invention has been shown and described, for the ordinary skill in the art, can be with A variety of variations, modification, replacement can be carried out to these embodiments without departing from the principles and spirit of the present invention by understanding And modification, the scope of the present invention is defined by the appended.

Claims (5)

1. a kind of internet big data analysis extracting method, it is characterised in that: analysis extracting method the following steps are included:
A, data acquisition unit acquires internet big data;
B, the data of acquisition are sent to control unit after data screening module is screened and are handled;
C, control unit to data characteristics extraction module send instruct, to the data after screening according to pre-set keyword into Row feature extraction;
D, the data after feature extraction store after being encrypted by DEU data encryption unit;
E, the internet data after finally extracting is transmitted to background terminal by data transmission unit.
2. a kind of internet big data analysis extracting method according to claim 1, it is characterised in that: in the step B Data screening modular approach is as follows:
A, multiple noise datas are extracted from data to be screened as sample data;
B, conversion process is carried out to each sample data, obtains the transformation data of each sample data;
C, by preparatory trained data classification model, it is pre- that label is carried out to each sample data and each transformation data It surveys, determines the target labels and target labels probability of each sample data;
D, according to the target labels of each sample data and target labels probability, each sample data is screened, obtains number of targets According to.
3. a kind of internet big data analysis extracting method according to claim 1, it is characterised in that: in the step D Data ciphering method is as follows:
A, cleaning operation is carried out to data to be encrypted first;
B, the operation of AES encryption algorithm is carried out to the data after cleaning later, obtains an encrypted ciphertext data;
C, hyperchaos cryptographic calculation is carried out to a ciphertext data later again, obtains secondary ciphertext data;
D, finally secondary ciphertext data are carried out carrying out des encryption operation, the final encryption of complete paired data.
4. a kind of internet big data analysis extracting method according to claim 1, it is characterised in that: the data characteristics Extraction module feature extracting method is as follows:
A, data set is established, multiple Sub Data Sets to feature extraction are wherein included in data set;
B, feature training is carried out to data set, obtains training pattern;
C, the first keyword and the second keyword in data set are extracted;
D, each Sub Data Set in cyclic search data set, using the first keyword and the second keyword as primary condition, subdata Collection scans for;
E, search is matched to the first keyword or the second keyword in each Sub Data Set, then extracts to data.
5. a kind of internet big data analysis extraction system, it is characterised in that: including control unit (1), data acquisition unit (2), data screening unit (3), data characteristics extraction unit (4), data encryption storage unit (5) and data outputting unit (6), The data acquisition unit (2) is separately connected by data screening unit (3) connection control unit (1), described control unit (1) Data characteristics extraction unit (4), data encryption storage unit (5) and data outputting unit (6);Wherein, the data acquisition is single First (2) are for acquiring internet big data;The data screening unit (3) removes for screening to internet big data Interference information;The data characteristics extraction unit (4) is used to carry out feature extraction to the data after screening;The data encryption is deposited Storage unit (5) is for carrying out encryption storage to the internet big data after feature extraction.
CN201811315842.0A 2018-11-07 2018-11-07 A kind of internet big data analysis extracting method and system Withdrawn CN109344333A (en)

Priority Applications (1)

Application Number Priority Date Filing Date Title
CN201811315842.0A CN109344333A (en) 2018-11-07 2018-11-07 A kind of internet big data analysis extracting method and system

Applications Claiming Priority (1)

Application Number Priority Date Filing Date Title
CN201811315842.0A CN109344333A (en) 2018-11-07 2018-11-07 A kind of internet big data analysis extracting method and system

Publications (1)

Publication Number Publication Date
CN109344333A true CN109344333A (en) 2019-02-15

Family

ID=65313821

Family Applications (1)

Application Number Title Priority Date Filing Date
CN201811315842.0A Withdrawn CN109344333A (en) 2018-11-07 2018-11-07 A kind of internet big data analysis extracting method and system

Country Status (1)

Country Link
CN (1) CN109344333A (en)

Cited By (3)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN110390204A (en) * 2019-07-25 2019-10-29 上海应用技术大学 Medical big data processing system and method
CN110765337A (en) * 2019-11-15 2020-02-07 中科院计算技术研究所大数据研究院 Service providing method based on internet big data
CN111556098A (en) * 2020-04-08 2020-08-18 深圳供电局有限公司 Artificial intelligence based analysis system and analysis method for internet of things data

Citations (4)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN106202400A (en) * 2016-07-11 2016-12-07 广东聚联电子商务股份有限公司 A kind of big data collection and analysis method complaining behavior
CN106203171A (en) * 2016-06-03 2016-12-07 中国电子科技网络信息安全有限公司 Big data platform Security Index system and method
CN106897462A (en) * 2017-03-13 2017-06-27 榆林学院 Data statistic analysis plateform system
CN108040056A (en) * 2017-12-15 2018-05-15 福州大学 Safety medical treatment big data system based on Internet of Things

Patent Citations (4)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN106203171A (en) * 2016-06-03 2016-12-07 中国电子科技网络信息安全有限公司 Big data platform Security Index system and method
CN106202400A (en) * 2016-07-11 2016-12-07 广东聚联电子商务股份有限公司 A kind of big data collection and analysis method complaining behavior
CN106897462A (en) * 2017-03-13 2017-06-27 榆林学院 Data statistic analysis plateform system
CN108040056A (en) * 2017-12-15 2018-05-15 福州大学 Safety medical treatment big data system based on Internet of Things

Cited By (5)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN110390204A (en) * 2019-07-25 2019-10-29 上海应用技术大学 Medical big data processing system and method
CN110765337A (en) * 2019-11-15 2020-02-07 中科院计算技术研究所大数据研究院 Service providing method based on internet big data
CN110765337B (en) * 2019-11-15 2021-04-06 中科院计算技术研究所大数据研究院 Service providing method based on internet big data
CN111556098A (en) * 2020-04-08 2020-08-18 深圳供电局有限公司 Artificial intelligence based analysis system and analysis method for internet of things data
CN111556098B (en) * 2020-04-08 2023-09-15 深圳供电局有限公司 Analysis system and analysis method for Internet of things data based on artificial intelligence

Similar Documents

Publication Publication Date Title
Zou et al. Encrypted traffic classification with a convolutional long short-term memory neural network
CN104506484B (en) A kind of proprietary protocol analysis and recognition methods
CN102315974B (en) Stratification characteristic analysis-based method and apparatus thereof for on-line identification for TCP, UDP flows
CN105871832A (en) Network application encrypted traffic recognition method and device based on protocol attributes
CN109344333A (en) A kind of internet big data analysis extracting method and system
CN111082997B (en) Network function arrangement method based on service identification in mobile edge computing platform
CN111343169B (en) System and method for gathering security resources and sharing information under industrial control environment
CN109299742A (en) Method, apparatus, equipment and the storage medium of automatic discovery unknown network stream
CN101764704A (en) Method for auditing internet sensitive contents and device thereof
CN112019500B (en) Encrypted traffic identification method based on deep learning and electronic device
CN110460510A (en) A kind of method, apparatus that establishing multi-conference, electronic equipment and medium
CN105471635B (en) A kind of processing method of system log, device and system
CN109359686A (en) A kind of user's portrait method and system based on Campus Network Traffic
CN109284319A (en) A kind of auditing system based on big data visualization technique
CN109542867A (en) Distribution type data collection method and device
CN107529190B (en) User data acquisition system and method
CN114996207A (en) Big data analysis method and system based on 5G cloud computing
He et al. Identification of SSH applications based on convolutional neural network
CN110708341B (en) User behavior detection method and system based on remote desktop encryption network traffic mode difference
CN112800140A (en) High-reliability data acquisition method based on block chain prediction machine
CN109995784A (en) A kind of data extraction accelerated method based on UDP
CN106973314B (en) Instruction identification method and system for network interaction
CN112988829A (en) Big data analysis processing system
CN111126762A (en) Intelligent workflow engine for electric power cloud security
Zheng et al. Identification of Malicious Encrypted Traffic Through Feature Fusion

Legal Events

Date Code Title Description
PB01 Publication
PB01 Publication
SE01 Entry into force of request for substantive examination
SE01 Entry into force of request for substantive examination
WW01 Invention patent application withdrawn after publication
WW01 Invention patent application withdrawn after publication

Application publication date: 20190215