CN112667888A - Big data processing system based on block chain - Google Patents

Big data processing system based on block chain Download PDF

Info

Publication number
CN112667888A
CN112667888A CN202011536944.2A CN202011536944A CN112667888A CN 112667888 A CN112667888 A CN 112667888A CN 202011536944 A CN202011536944 A CN 202011536944A CN 112667888 A CN112667888 A CN 112667888A
Authority
CN
China
Prior art keywords
data
module
information
secret
analysis module
Prior art date
Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
Pending
Application number
CN202011536944.2A
Other languages
Chinese (zh)
Inventor
唐金胜
何前进
Current Assignee (The listed assignees may be inaccurate. Google has not performed a legal analysis and makes no representation or warranty as to the accuracy of the list.)
Individual
Original Assignee
Individual
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
Filing date
Publication date
Application filed by Individual filed Critical Individual
Priority to CN202011536944.2A priority Critical patent/CN112667888A/en
Publication of CN112667888A publication Critical patent/CN112667888A/en
Pending legal-status Critical Current

Links

Images

Landscapes

  • Storage Device Security (AREA)

Abstract

本发明属于大数据技术领域,具体的说是一种基于区块链的大数据处理系统,包括数据采集模块、数据分析模块、数据混淆模块以及数据推送模块;所述数据采集模块不间断的收集用户观看的内容相关信息,并发送至数据混淆模块;所述数据混淆模块在接收到相关数据后,对数据进行处理,得到无指向性的偏好数据,处理完成后将数据发送至数据分析模块;所述数据分析模块在接收到无指向性数据后,对该数据进行分析,得到对应区域的用户的偏好状态信息,并发送至数据推送模块;所述数据推送模块依据用户偏好状态信息,进行针对性的推送相关信息;本发明保证大数据隐私安全,信息推广效果好。

Figure 202011536944

The invention belongs to the technical field of big data, in particular to a big data processing system based on blockchain, comprising a data collection module, a data analysis module, a data obfuscation module and a data push module; the data collection module continuously collects The relevant information of the content watched by the user is sent to the data obfuscation module; after receiving the relevant data, the data obfuscation module processes the data to obtain non-directional preference data, and sends the data to the data analysis module after the processing is completed; After receiving the non-directional data, the data analysis module analyzes the data, obtains the preference status information of the user in the corresponding area, and sends it to the data push module; The relevant information can be automatically pushed; the present invention ensures the privacy and security of the big data, and the information promotion effect is good.

Figure 202011536944

Description

Big data processing system based on block chain
Technical Field
The invention belongs to the technical field of big data, and particularly relates to a big data processing system based on a block chain.
Background
Big data (bigdata), which refers to a data set that cannot be captured, managed and processed by a conventional software tool within a certain time range, is an information asset that needs a new processing mode to have stronger decision-making power, insight discovery power and flow optimization capability to adapt to a large amount, high growth rate and diversification. With the rapid development of electronic informatization, electronic information data gradually becomes the key point of research of people, and people can not leave various data in daily life, so that big data becomes a hot spot of current research.
In addition, the conventional blockchain technology is a data technology which tends to be decentralized, non-falsifiable and data public, and once the stored big data is acquired by all nodes, the big data privacy and the big data security cannot be guaranteed. In the real application process, it is a necessary prerequisite that the data service provider guarantees the data security and privacy of the customer in the process of providing services for the customer, and if the data security cannot be guaranteed, the data service provider will not be accepted and approved by the customer.
Disclosure of Invention
In order to make up for the defects of the prior art, improve the privacy safety of big data based on a block chain and improve the information popularization effect, the invention provides a big data processing system based on the block chain.
The technical scheme adopted by the invention for solving the technical problems is as follows: the invention relates to a big data processing system based on a block chain, which comprises a data acquisition module, a data analysis module, a data confusion module and a data pushing module; the data acquisition module adopts a crawler technology to uninterruptedly collect content related information watched by a user; the data acquisition module sends the acquired data to the data confusion module; after the data confusion module receives the related data, the data is processed by adopting a black box mode according to a specific algorithm, and the relevance between a specific individual user and specific preference data in the data is removed or disturbed to obtain the non-directional preference data; approximate geographic information and approximate time information are still stored in the nondirectional data; the basic processing unit of the rough geographic information is a county-level region; the approximate time information is accurate to an hour; the data confusion module processes the acquired data and sends the acquired omnidirectional data to the data analysis module; after receiving the nondirectional data, the data analysis module analyzes the data to obtain preference state information of the user in the corresponding area; after the data analysis module obtains the user preference state information, the information is sent to a data pushing module; the data pushing module pushes relevant information in different regions at different time points according to the user preference state information, so that the popularization effect is improved;
during operation, the data acquisition module acquires user preference data, the data confusion module processes the acquired data to complete data desensitization to obtain nondirectional data, the acquired data is prevented from being associated with a specific user to cause leakage of user privacy information, and meanwhile, after the nondirectional data are obtained, the data analysis module analyzes the nondirectional data to obtain preference information of users in related regions, so that the data push module can push the data pertinently to improve the popularization effect, meanwhile, the user preference information of the related regions can be provided to corresponding cooperative units to promote the service quality of the related units in the specific regions to be improved, the retention rate of consumers is improved, the economic benefit is improved, and meanwhile, the data analysis module analyzes the preference information of the users in a specific time period to predict the psychological state of the users at a specific time point, the specific region is directionally popularized at a specific time point, and the popularization effect of the data push module is further improved.
Preferably, the data analysis module analyzes the received omnidirectional data, divides user groups according to different browsing information contents, and improves the effect of directional popularization of the data pushing module; the data analysis module analyzes the nondirectional data, and individually distinguishes parts belonging to the violation information, so that user groups, places and time which prefer to browse the violation information are obtained according to the violation information analysis, and violation information characteristic data are obtained; the data analysis module sends violation information characteristic data to the data pushing module; the data pushing module is used for increasing the quantity and frequency of pushing general laws and education information to a specific area at a specific time point according to the received violation information characteristic data;
when the system works, the nondirectional data are analyzed by the data analysis module, so that the region where the user group prefers to browse the violation information in daily life and the proportion of the violation information which the user group prefers to browse in the region can be found out, therefore, the data push module pushes the general law and the education information to the users in the region, and by increasing the quantity and frequency of pushing the general law and the education information, the law control consciousness of the user group in the region is improved in the process of subtlety, the law control and culture level construction degree of the user in the region is enhanced, the condition that the user group quality is reduced, the law control and culture level construction are reduced due to the fact that the user in the region browses too much violation information is avoided, meanwhile, the data pushing module is used for increasing the pushed general law and education information, inhibiting or eliminating potential illegal intentions of a few users, and improving the willingness and level of the users in the region to watch law and understand law.
Preferably, after receiving the omnidirectional data, the data analysis module analyzes the part of data, and performs suppression information marking on information belonging to a negative state in the part of data; the data analysis module divides user groups according to the marked depression information marks, distinguishes the user groups, places and time which prefer to browse negative information, and obtains pressure characteristic data; the data analysis module sends the pressure characteristic data to the data pushing module; the data pushing module increases the number and frequency of front information release in a specific area according to the received pressure characteristic data; the positive information includes but is not limited to positive energy videos, articles; when the data pushing module pushes the front information in a specific area, the pushing force for the whole day time period is increased relative to the historical pushing force, and the force for pushing the front information in the specific time period is increased to a peak value;
during operation, nondirectional data are analyzed through the data analysis module, regions and groups which are preferential to browse negative information are found out, then the strength of pushing positive information to the regions is increased through the data pushing module, psychological depression or depression caused by long-term preferential browse negative information of users in the regions is avoided, the psychological health level of the users in the regions is reduced, the people in the regions are not good in happiness, meanwhile, the positive information of the strength pushing is increased through the data pushing module, the potential depression state of the user groups in the regions can be relieved or eliminated to a certain extent in a subtlety mode, and most of the users in the user groups can actively, optimistically and upwards see life.
Preferably, the data analysis module analyzes the received omnidirectional data and establishes a user preference model; the data analysis module obtains predicted data of the user browsing negative information and violation information through analysis of the model; the data analysis module analyzes the received nondirectional data to obtain statistical data of the negative information and the violation information browsed by the user; the data analysis module compares the predicted data with the statistical data to obtain an absolute value of a comparison result; the absolute value of the comparison result is larger than or equal to zero, the data analysis module judges that the general law, the education information and the positive information which are pushed by the data pushing module in a pertinence manner obtain the expected effect, and the user group browsing the violation information and the negative information is reduced or unchanged; the absolute value of the comparison result is less than zero, the general law, the education information and the positive information which are pushed in a targeted manner by the data pushing module do not obtain the expected effect, and the data analysis module judges that the user group browsing the violation information and the negative information is expanded; the absolute value of the comparison result is less than zero, and the data analysis module sends a pushing force increasing instruction to the data pushing module; after the data pushing module receives the instruction, the data pushing module increases daily information pushing force;
during operation, through the user preference model that data analysis module established, and obtain the forecast data through the model, through the contrast to forecast data and statistical data, can clearly reflect the effect that relevant information of data propelling movement module pertinence propelling movement gained, guarantee that data propelling movement module is effectual to the popularization of relevant information, avoid data propelling movement module not good to relevant information's popularization effect, simultaneously, can reflect whether suitable to the propelling movement dynamics of data propelling movement module through the contrast to forecast data and statistical data, avoid data propelling movement module to carry out the in-process propelling movement dynamics undersize or too big to relevant information, lead to the propelling movement effect not good or arouse user's crowd's reaction, reduce the propelling movement effect.
Preferably, the processing system comprises a data storage module, a data encryption module and a node propagation module; the data storage module is used for storing data generated by the processing system in the working process and collected user data; the data encryption module encrypts the omnidirectional data output by the data confusion module to obtain secret data and a decryption key; after the data encryption module obtains the security level data, the security level data are sent to the node transmission module; after receiving the secret level data, the node transmission module transmits the secret level data to the block chain network in a broadcasting mode and receives stored signals returned by other nodes in the block chain network; after receiving return signals of half of nodes except the nodes in the block chain network, the node transmission module judges that the confidential data is successfully linked and stops broadcasting the confidential data; when the data encryption module obtains the security level data, a verification MD5 value of the security level data is generated at the same time, and the verification MD5 value is divided into two ends with equal length and is respectively attached to the head and the tail of the security level data; after receiving the complete security level data, other nodes in the block chain network check the security level data in the background; when the other nodes check the secret data, extracting check MD5 values attached to the head and tail of the secret data, and performing comparison MD5 value obtained by calculating the check MD5 value and the secret data; if the comparison MD5 value is consistent with the check MD5 value, other nodes judge that the received security level data is complete and correct, and store the security level data and broadcast the stored signal; if the comparison MD5 value is inconsistent with the check MD5 value, other nodes judge that the received security level data has defects and does not accord with the expected received data, and other nodes discard the received security level data, keep silent and do not broadcast the stored signals;
when the system works, collected omnidirectional data is encrypted by the data encryption module and then broadcast to the blockchain network through the node transmission module for storage, the characteristics of the blockchain network are utilized to ensure the safety, the openness and the transparency of the data and avoid the data from being attacked and falsified, meanwhile, in the process of uploading confidential data to the blockchain network, the MD5 value of the confidential data is added to the confidential data before and after the confidential data, the integrity and the reliability of the confidential data received by other nodes in the blockchain network can be ensured, the confidential data is prevented from being disturbed and attacked in the process of transmitting the confidential data in the blockchain network, the loss or falsification of the confidential data is caused, the stored confidential data in the blockchain network is deviated, and meanwhile, the verification is carried out by comparing the MD5 value, the confidential data stored in the blockchain network is ensured to be completely consistent with the data broadcast by the node transmission module, the method and the device avoid that other nodes in the block network cannot verify the secret data after the secret data is tampered and interfered and has a deviation in the transmission process, and directly store the wrong secret data, so that the block network authenticates the wrong secret data as correct data, acknowledges the correctness of the wrong secret data, and rejects the correctness of the correct secret data initially sent by the node transmission module.
Preferably, when encrypting the non-directional data, the data encryption module segments the received non-directional data according to a predetermined segmentation algorithm to obtain data blocks with the same size and arranged in sequence, and then divides and stores the data blocks obtained after segmentation into local storage data and block network storage data by using a division algorithm; the data encryption module encrypts the block network data by using a Hash encryption algorithm to obtain secret data, and the secret data is sent to the node transmission module and broadcasted to the block chain network through the node transmission module for storage; the ratio of the local storage data to the block network storage data is (8-9): 1; the data block is divided and stored by adopting a division algorithm comprising a time function, block network storage data are extracted from the data blocks which are same in size and are arranged in sequence according to the division algorithm, and the residual data blocks after extraction are stored as local storage data; the time function is the current working time point of the data encryption module; the local storage data does not comprise data blocks which are extracted and used as block network storage data; when the processing system or other users use the nondirectional data, downloading the secret data from the block chain network, decrypting the secret data through the data encryption module, reversely running the division algorithm, compounding the block network storage data and the local storage data in the secret data to obtain data blocks which are same in size and are sequentially arranged after being divided, and then reversely running the division algorithm to obtain original nondirectional data;
when the system works, the data encryption module is used for segmenting and extracting the non-directional data to obtain a relatively small number of block stored data from the non-directional data, the block stored data are uploaded to the block chain network as secret data, the data amount required to be transmitted by the node transmission module is reduced, the load of the block chain network is reduced, the block chain network is prevented from being too bulky, meanwhile, the complete non-directional data are segmented, and a part of the encrypted data are sent to the block chain network as the secret data in a broadcasting mode, so that the safety of the whole data can be ensured under the condition that a secret key is accidentally leaked, the condition that lawless persons acquire all data from the block chain network and decrypt the data through the secret key to cause data leakage can be avoided, meanwhile, the data can be further improved by segmenting the data, the data stored in the block network is guaranteed not to be tampered, the fact that the block chain selected by a processing system is relatively small in size and cannot be tampered with through a 51% attack mode, the data stored in the block chain network is tampered with the advantage of computing power, the safety and accuracy of the data are affected, when the secret level data stored in the block chain network has deviation, the data cannot be combined with the locally stored data after decryption, and the fact that the data are affected or misled by tampering or deviation data is avoided.
The invention has the following beneficial effects:
1. according to the big data processing system based on the block chain, the data analysis module is used for analyzing the collected information, so that the data push module can push the related information of a specific region and a user group in a targeted manner, the information push effect is improved, meanwhile, the data analysis module is used for establishing a user preference model, the expected data obtained by analyzing the model is compared with the statistical data, the effect of the data push module for pushing the related information is obtained, the push strength of the data push module is dynamically adjusted, and the push effect of the related information is improved.
2. The big data processing system based on the block chain divides the nondirectional data through the data encryption module, extracts part of the data from the nondirectional data to serve as secret data, broadcasts the secret data to the block chain network through the node transmission module after encryption, stores the data in the block chain network, ensures the authenticity of the data, avoids the data from being tampered, reduces the data volume uploaded to the block chain network through the division and encryption of the data, and ensures the privacy and the safety of the data.
Drawings
The invention will be further explained with reference to the drawings.
FIG. 1 is a system flow diagram of the present invention;
Detailed Description
In order to make the technical means, the creation characteristics, the achievement purposes and the effects of the invention easy to understand, the invention is further described with the specific embodiments.
As shown in fig. 1, the big data processing system based on the block chain includes a data acquisition module, a data analysis module, a data confusion module, and a data pushing module; the data acquisition module adopts a crawler technology to uninterruptedly collect content related information watched by a user; the data acquisition module sends the acquired data to the data confusion module; after the data confusion module receives the related data, the data is processed by adopting a black box mode according to a specific algorithm, and the relevance between a specific individual user and specific preference data in the data is removed or disturbed to obtain the non-directional preference data; approximate geographic information and approximate time information are still stored in the nondirectional data; the basic processing unit of the rough geographic information is a county-level region; the approximate time information is accurate to an hour; the data confusion module processes the acquired data and sends the acquired omnidirectional data to the data analysis module; after receiving the nondirectional data, the data analysis module analyzes the data to obtain preference state information of the user in the corresponding area; after the data analysis module obtains the user preference state information, the information is sent to a data pushing module; the data pushing module pushes relevant information in different regions at different time points according to the user preference state information, so that the popularization effect is improved;
during operation, the data acquisition module acquires user preference data, the data confusion module processes the acquired data to complete data desensitization to obtain nondirectional data, the acquired data is prevented from being associated with a specific user to cause leakage of user privacy information, and meanwhile, after the nondirectional data are obtained, the data analysis module analyzes the nondirectional data to obtain preference information of users in related regions, so that the data push module can push the data pertinently to improve the popularization effect, meanwhile, the user preference information of the related regions can be provided to corresponding cooperative units to promote the service quality of the related units in the specific regions to be improved, the retention rate of consumers is improved, the economic benefit is improved, and meanwhile, the data analysis module analyzes the preference information of the users in a specific time period to predict the psychological state of the users at a specific time point, the specific region is directionally popularized at a specific time point, and the popularization effect of the data push module is further improved.
As an implementation manner of the present invention, the data analysis module analyzes the received omnidirectional data, divides the user group according to the different browsing information contents, and improves the effect of the data pushing module on directional popularization; the data analysis module analyzes the nondirectional data, and individually distinguishes parts belonging to the violation information, so that user groups, places and time which prefer to browse the violation information are obtained according to the violation information analysis, and violation information characteristic data are obtained; the data analysis module sends violation information characteristic data to the data pushing module; the data pushing module is used for increasing the quantity and frequency of pushing general laws and education information to a specific area at a specific time point according to the received violation information characteristic data;
when the system works, the nondirectional data are analyzed by the data analysis module, so that the region where the user group prefers to browse the violation information in daily life and the proportion of the violation information which the user group prefers to browse in the region can be found out, therefore, the data push module pushes the general law and the education information to the users in the region, and by increasing the quantity and frequency of pushing the general law and the education information, the law control consciousness of the user group in the region is improved in the process of subtlety, the law control and culture level construction degree of the user in the region is enhanced, the condition that the user group quality is reduced, the law control and culture level construction are reduced due to the fact that the user in the region browses too much violation information is avoided, meanwhile, the data pushing module is used for increasing the pushed general law and education information, inhibiting or eliminating potential illegal intentions of a few users, and improving the willingness and level of the users in the region to watch law and understand law.
As an embodiment of the present invention, after receiving the nondirectional data, the data analysis module analyzes the partial data, and performs suppression information marking on information belonging to a negative state in the partial data; the data analysis module divides user groups according to the marked depression information marks, distinguishes the user groups, places and time which prefer to browse negative information, and obtains pressure characteristic data; the data analysis module sends the pressure characteristic data to the data pushing module; the data pushing module increases the number and frequency of front information release in a specific area according to the received pressure characteristic data; the positive information includes but is not limited to positive energy videos, articles; when the data pushing module pushes the front information in a specific area, the pushing force for the whole day time period is increased relative to the historical pushing force, and the force for pushing the front information in the specific time period is increased to a peak value;
during operation, nondirectional data are analyzed through the data analysis module, regions and groups which are preferential to browse negative information are found out, then the strength of pushing positive information to the regions is increased through the data pushing module, psychological depression or depression caused by long-term preferential browse negative information of users in the regions is avoided, the psychological health level of the users in the regions is reduced, the people in the regions are not good in happiness, meanwhile, the positive information of the strength pushing is increased through the data pushing module, the potential depression state of the user groups in the regions can be relieved or eliminated to a certain extent in a subtlety mode, and most of the users in the user groups can actively, optimistically and upwards see life.
As an embodiment of the present invention, the data analysis module analyzes the received omnidirectional data and establishes a user preference model; the data analysis module obtains predicted data of the user browsing negative information and violation information through analysis of the model; the data analysis module analyzes the received nondirectional data to obtain statistical data of the negative information and the violation information browsed by the user; the data analysis module compares the predicted data with the statistical data to obtain an absolute value of a comparison result; the absolute value of the comparison result is larger than or equal to zero, the data analysis module judges that the general law, the education information and the positive information which are pushed by the data pushing module in a pertinence manner obtain the expected effect, and the user group browsing the violation information and the negative information is reduced or unchanged; the absolute value of the comparison result is less than zero, the general law, the education information and the positive information which are pushed in a targeted manner by the data pushing module do not obtain the expected effect, and the data analysis module judges that the user group browsing the violation information and the negative information is expanded; the absolute value of the comparison result is less than zero, and the data analysis module sends a pushing force increasing instruction to the data pushing module; after the data pushing module receives the instruction, the data pushing module increases daily information pushing force;
during operation, through the user preference model that data analysis module established, and obtain the forecast data through the model, through the contrast to forecast data and statistical data, can clearly reflect the effect that relevant information of data propelling movement module pertinence propelling movement gained, guarantee that data propelling movement module is effectual to the popularization of relevant information, avoid data propelling movement module not good to relevant information's popularization effect, simultaneously, can reflect whether suitable to the propelling movement dynamics of data propelling movement module through the contrast to forecast data and statistical data, avoid data propelling movement module to carry out the in-process propelling movement dynamics undersize or too big to relevant information, lead to the propelling movement effect not good or arouse user's crowd's reaction, reduce the propelling movement effect.
As an implementation mode of the invention, the processing system comprises a data storage module, a data encryption module and a node propagation module; the data storage module is used for storing data generated by the processing system in the working process and collected user data; the data encryption module encrypts the omnidirectional data output by the data confusion module to obtain secret data and a decryption key; after the data encryption module obtains the security level data, the security level data are sent to the node transmission module; after receiving the secret level data, the node transmission module transmits the secret level data to the block chain network in a broadcasting mode and receives stored signals returned by other nodes in the block chain network; after receiving return signals of half of nodes except the nodes in the block chain network, the node transmission module judges that the confidential data is successfully linked and stops broadcasting the confidential data; when the data encryption module obtains the security level data, a verification MD5 value of the security level data is generated at the same time, and the verification MD5 value is divided into two ends with equal length and is respectively attached to the head and the tail of the security level data; after receiving the complete security level data, other nodes in the block chain network check the security level data in the background; when the other nodes check the secret data, extracting check MD5 values attached to the head and tail of the secret data, and performing comparison MD5 value obtained by calculating the check MD5 value and the secret data; if the comparison MD5 value is consistent with the check MD5 value, other nodes judge that the received security level data is complete and correct, and store the security level data and broadcast the stored signal; if the comparison MD5 value is inconsistent with the check MD5 value, other nodes judge that the received security level data has defects and does not accord with the expected received data, and other nodes discard the received security level data, keep silent and do not broadcast the stored signals;
when the system works, collected omnidirectional data is encrypted by the data encryption module and then broadcast to the blockchain network through the node transmission module for storage, the characteristics of the blockchain network are utilized to ensure the safety, the openness and the transparency of the data and avoid the data from being attacked and falsified, meanwhile, in the process of uploading confidential data to the blockchain network, the MD5 value of the confidential data is added to the confidential data before and after the confidential data, the integrity and the reliability of the confidential data received by other nodes in the blockchain network can be ensured, the confidential data is prevented from being disturbed and attacked in the process of transmitting the confidential data in the blockchain network, the loss or falsification of the confidential data is caused, the stored confidential data in the blockchain network is deviated, and meanwhile, the verification is carried out by comparing the MD5 value, the confidential data stored in the blockchain network is ensured to be completely consistent with the data broadcast by the node transmission module, the method and the device avoid that other nodes in the block network cannot verify the secret data after the secret data is tampered and interfered and has a deviation in the transmission process, and directly store the wrong secret data, so that the block network authenticates the wrong secret data as correct data, acknowledges the correctness of the wrong secret data, and rejects the correctness of the correct secret data initially sent by the node transmission module.
As an embodiment of the present invention, when encrypting the non-directional data, the data encryption module divides the received non-directional data according to a predetermined division algorithm to obtain data blocks with the same size and arranged in sequence, and then divides and stores the data blocks obtained after division into local storage data and block network storage data by using a division algorithm; the data encryption module encrypts the block network data by using a Hash encryption algorithm to obtain secret data, and the secret data is sent to the node transmission module and broadcasted to the block chain network through the node transmission module for storage; the ratio of the local storage data to the block network storage data is (8-9): 1; the data block is divided and stored by adopting a division algorithm comprising a time function, block network storage data are extracted from the data blocks which are same in size and are arranged in sequence according to the division algorithm, and the residual data blocks after extraction are stored as local storage data; the time function is the current working time point of the data encryption module; the local storage data does not comprise data blocks which are extracted and used as block network storage data; when the processing system or other users use the nondirectional data, downloading the secret data from the block chain network, decrypting the secret data through the data encryption module, reversely running the division algorithm, compounding the block network storage data and the local storage data in the secret data to obtain data blocks which are same in size and are sequentially arranged after being divided, and then reversely running the division algorithm to obtain original nondirectional data;
when the system works, the data encryption module is used for segmenting and extracting the non-directional data to obtain a relatively small number of block stored data from the non-directional data, the block stored data are uploaded to the block chain network as secret data, the data amount required to be transmitted by the node transmission module is reduced, the load of the block chain network is reduced, the block chain network is prevented from being too bulky, meanwhile, the complete non-directional data are segmented, and a part of the encrypted data are sent to the block chain network as the secret data in a broadcasting mode, so that the safety of the whole data can be ensured under the condition that a secret key is accidentally leaked, the condition that lawless persons acquire all data from the block chain network and decrypt the data through the secret key to cause data leakage can be avoided, meanwhile, the data can be further improved by segmenting the data, the data stored in the block network is guaranteed not to be tampered, the fact that the block chain selected by a processing system is relatively small in size and cannot be tampered with through a 51% attack mode, the data stored in the block chain network is tampered with the advantage of computing power, the safety and accuracy of the data are affected, when the secret level data stored in the block chain network has deviation, the data cannot be combined with the locally stored data after decryption, and the fact that the data are affected or misled by tampering or deviation data is avoided.
The specific working process is as follows:
during work, the data acquisition module is used for acquiring user preference data, the data confusion module is used for processing the acquired data, and the data analysis module is used for analyzing the data so as to obtain preference information of users in related regions, so that the data push module can conveniently push the preference information in a targeted manner; analyzing the nondirectional data through a data analysis module, finding out a region where a user group prefers to browse the violation information in daily life and the proportion of the violation information which the user group prefers to browse in the region, and pushing general law and education information to a user in the region through a data pushing module; the data analysis module analyzes the nondirectional data, finds out regions and crowds which prefer to browse negative information, and then increases the strength of pushing positive information to the regions through the data pushing module; the user preference model established by the data analysis module is used for obtaining predicted data through the model, and the effect obtained by the data pushing module in the process of pushing relevant information in a targeted manner is reflected through the comparison between the predicted data and the statistical data; the acquired omnidirectional data is encrypted by the data encryption module and then broadcast to the block chain network for storage by the node transmission module, and the safety, the openness and the transparency of the data are ensured by utilizing the characteristics of the block chain network, so that the data is prevented from being attacked and tampered; by means of the segmentation and extraction of the nondirectional data by the data encryption module, the block storage data with a relatively small number is obtained from the nondirectional data, and the block storage data is uploaded to the block chain network as secret data, so that the data volume required to be transmitted by the node transmission module is reduced.
The foregoing illustrates and describes the principles, general features, and advantages of the present invention. It will be understood by those skilled in the art that the present invention is not limited to the embodiments described above, which are described in the specification and illustrated only to illustrate the principle of the present invention, but that various changes and modifications may be made therein without departing from the spirit and scope of the present invention, which fall within the scope of the invention as claimed. The scope of the invention is defined by the appended claims and equivalents thereof.

Claims (6)

1.一种基于区块链的大数据处理系统,其特征在于:包括数据采集模块、数据分析模块、数据混淆模块以及数据推送模块;所述数据采集模块采用爬虫技术,不间断的收集用户观看的内容相关信息;所述数据采集模块将采集到的数据发送至数据混淆模块中;所述数据混淆模块在接收到相关数据后,依照特定的算法,采用黑箱模式对数据进行处理,去除或打乱数据中特定个体用户与特定偏好数据之间的关联性,得到无指向性的偏好数据;所述无指向性数据中仍旧保存有大致地理信息、大致时间信息;所述大致地理信息的基本处理单位为县级区域;所述大致时间信息精确到小时;所述数据混淆模块对采集到的数据完成处理后,将得到的无指向性数据发送至数据分析模块;所述数据分析模块在接收到无指向性数据后,对该数据进行分析,得到对应区域的用户的偏好状态信息;所述数据分析模块得到用户偏好状态信息后,将该信息发送至数据推送模块中;所述数据推送模块依据用户偏好状态信息,针对不同地区的用户在不同时间点针对性的推送相关信息,提升推广效果。1. A block chain-based big data processing system is characterized in that: comprising a data acquisition module, a data analysis module, a data obfuscation module and a data push module; the data acquisition module adopts crawler technology to continuously collect user viewing The data collection module sends the collected data to the data obfuscation module; after the data obfuscation module receives the relevant data, according to a specific algorithm, the black box mode is used to process the data, remove or delete The correlation between specific individual users and specific preference data in the random data is obtained to obtain non-directional preference data; approximate geographic information and approximate time information are still stored in the non-directional data; the basic processing of the approximate geographic information The unit is a county-level area; the approximate time information is accurate to hours; after the data obfuscation module completes the processing of the collected data, it sends the obtained non-directional data to the data analysis module; the data analysis module receives After the non-directional data, the data is analyzed to obtain the user's preference status information in the corresponding area; after the data analysis module obtains the user's preference status information, the information is sent to the data push module; the data push module is based on User preference status information, and push relevant information to users in different regions at different time points to improve the promotion effect. 2.根据权利要求1所述一种基于区块链的大数据处理系统,其特征在于:所述数据分析模块对于接收到的无指向性数据进行分析,依照浏览信息内容的不同,对用户群体进行划分,提升数据推动模块进行定向推广的效果;所述数据分析模块通过对无指向性数据进行分析,将其中属于违规信息的部分进行单独区分,进而依据违规信息分析得出偏好浏览违规信息的用户群体、地点以及时间,得到违规信息特征数据;所述数据分析模块将违规信息特征数据发送至数据推送模块;所述数据推送模块依据接收到的违规信息特征数据,在特定时间点增加向特定地区推送普法、教育信息的数量以及频次。2. A block chain-based big data processing system according to claim 1, wherein the data analysis module analyzes the received non-directional data, and analyzes the user groups according to the content of the browsing information. Divide and improve the effect of targeted promotion by the data promotion module; the data analysis module analyzes the non-directional data, separates the parts belonging to the violation information, and then analyzes the violation information to obtain preference for browsing the violation information. User group, location and time to obtain the feature data of the violation information; the data analysis module sends the feature data of the violation information to the data push module; the data push module adds the feature data of the violation information at a specific point of time to the specific data according to the received feature data of the violation information. The number and frequency of legal popularization and education information pushed by the region. 3.根据权利要求2所述一种基于区块链的大数据处理系统,其特征在于:所述数据分析模块在接收到无指向性数据后,对该部分数据进行分析,将该部分数据中属于负面状态的信息进行压抑信息标记;所述数据分析模块依据标记的压抑信息标记,对用户群体进行划分,区分出偏好浏览负面信息的用户群体、地点以及时间,得到压力特征数据;所述数据分析模块将压力特征数据发送至数据推送模块;所述数据推送模块依据接收到的压力特征数据,加大在特定地区投放正面信息的数量以及频次;所述正面信息包括但不限于正能量的视频、文章;所述数据推送模块在特定地区推送正面信息时,对全天时间段的推送力度相对于历史推送力度均加大,且在特定时间段推送正面信息的力度增加到峰值。3. The blockchain-based big data processing system according to claim 2, wherein the data analysis module analyzes the part of the data after receiving the non-directional data, and stores the part of the data in the data. The information belonging to the negative state is marked with suppression information; the data analysis module divides the user groups according to the marked suppression information marking, distinguishes the user groups, locations and times that prefer to browse negative information, and obtains stress characteristic data; the data The analysis module sends the pressure characteristic data to the data push module; the data push module increases the number and frequency of positive information in a specific area based on the received pressure characteristic data; the positive information includes but is not limited to positive energy videos , article; when the data push module pushes positive information in a specific area, the push force for the whole day time period is increased compared to the historical push force, and the push force for positive information in a specific time period increases to a peak value. 4.根据权利要求3所述一种基于区块链的大数据处理系统,其特征在于:所述数据分析模块对接收到的无指向性数据进行分析,并建立用户偏好性模型;所述数据分析模块通过对模型的分析,得到用户浏览负面信息和违规信息的预计数据;所述数据分析模块通过对接收到的无指向性数据进行分析,得到用户浏览负面信息和违规信息的统计数据;所述数据分析模块对预计数据与统计数据进行对比,得到对比结果的绝对值;所述对比结果的绝对值大于等于零,数据分析模块判定数据推送模块针对性推送的普法、教育信息和正面信息取得预期效果,浏览违规信息以及负面信息的用户群体缩小或不变;所述对比结果的绝对值小于零,数据推送模块针对性推送的普法、教育信息和正面信息未取得预期效果,数据分析模块判定浏览违规信息以及负面信息的用户群体扩大;所述对比结果的绝对值小于零,数据分析模块向数据推送模块发送推动力度增加指令;所述数据推送模块接收到指令后,数据推送模块加大日常推送信息力度。4. A blockchain-based big data processing system according to claim 3, wherein the data analysis module analyzes the received non-directional data, and establishes a user preference model; the data The analysis module obtains the expected data of user browsing negative information and violation information by analyzing the model; the data analysis module obtains the statistical data of user browsing negative information and violation information by analyzing the received non-directional data; The data analysis module compares the predicted data with the statistical data, and obtains the absolute value of the comparison result; the absolute value of the comparison result is greater than or equal to zero, and the data analysis module determines that the legal popularization, educational information and positive information targetedly pushed by the data push module are expected to be obtained. effect, the group of users who browse the illegal information and negative information is reduced or unchanged; the absolute value of the comparison result is less than zero, the legal popularization, educational information and positive information pushed by the data push module have not achieved the expected effect, and the data analysis module determines that the browsing The user group of illegal information and negative information expands; the absolute value of the comparison result is less than zero, and the data analysis module sends an instruction to increase the push force to the data push module; after the data push module receives the instruction, the data push module increases daily push Information strength. 5.根据权利要求1所述一种基于区块链的大数据处理系统,其特征在于:所述处理系统包括数据存储模块、数据加密模块、节点传播模块;所述数据存储模块用于保存处理系统在工作过程中产生的数据以及采集到的用户数据;所述数据加密模块对数据混淆模块输出的无指向性数据进行加密,得到密级数据以及解密秘钥;所述数据加密模块在得到密级数据后,将密级数据发送至节点传播模块;所述节点传播模块在接收到密级数据后,通过广播的模式,将密级数据传播到区块链网络中,并接收区块链网络中其它节点返回的已存储信号;所述节点传播模块在接收到区块链网络中除自身之外的半数节点的返回信号后,判定密级数据上链成功,停止广播密级数据;所述数据加密模块得到密级数据时,同时生成密级数据的核验MD5值,并将核验MD5值划分为等长的两端并分别附加到密级数据的头部和尾部;所述区块链网络中的其它节点在接收到完整密级数据后,在后台进行密级数据核验;所述其它节点对密级数据的核验时,提取密级数据头部与尾部附加的核验MD5值,并将该核验MD5值与密级数据经过计算得到的对比MD5值进行;所述对比MD5值与核验MD5值一致,则其它节点判定接收到的密级数据完整且正确,将密级数据保存并广播已存储信号;所述对比MD5值与核验MD5值不一致,则其它节点判定接收到的密级数据存在缺陷,与预期接收数据不符,其它节点丢弃已接收的密级数据,并保持沉默,不广播已存储信号。5. A blockchain-based big data processing system according to claim 1, characterized in that: the processing system comprises a data storage module, a data encryption module, and a node propagation module; the data storage module is used for storing and processing The data generated by the system in the working process and the collected user data; the data encryption module encrypts the non-directional data output by the data obfuscation module to obtain secret-level data and decryption key; the data encryption module obtains the secret-level data Then, the secret-level data is sent to the node dissemination module; after receiving the secret-level data, the node dissemination module transmits the secret-level data to the blockchain network through the broadcasting mode, and receives the returned information from other nodes in the blockchain network. Stored signals; after the node dissemination module receives the return signals from half of the nodes in the blockchain network except itself, it determines that the secret-level data is successfully uploaded to the chain, and stops broadcasting the secret-level data; when the data encryption module obtains the secret-level data , and generate the verification MD5 value of the secret-level data at the same time, and divide the verification MD5 value into two ends of equal length and attach them to the head and tail of the secret-level data respectively; other nodes in the blockchain network receive the complete secret-level data. Then, perform security level data verification in the background; when the other nodes verify the security level data, extract the verification MD5 value attached to the head and tail of the security level data, and compare the verification MD5 value with the calculated comparison MD5 value of the security level data. ; The comparison MD5 value is consistent with the verification MD5 value, then other nodes determine that the received secret-level data is complete and correct, and the secret-level data is saved and broadcasts the stored signal; The comparison MD5 value is inconsistent with the verification MD5 value, then other nodes determine that The received secret-level data is defective and does not match the expected received data. Other nodes discard the received secret-level data, keep silent, and do not broadcast the stored signal. 6.根据权利要求5所述一种基于区块链的大数据处理系统,其特征在于:所述数据加密模块在对无指向性数据进行加密时,按照预定的分割算法对接收到的无指向性数据进分割,得到大小相同的且顺序排列的数据块,之后,将分割后得到的数据块采用划分算法,划分并保存为本地存储数据与区块网络存储数据;所述数据加密模块对区块网络数据使用哈希加密算法进行加密,得到密级数据,并将密级数据发送至节点传播模块,通过节点传播模块广播到区块链网络中进行保存;所述本地存储数据与区块网络存储数据之间的比例为(8-9):1;所述数据块进行划分保存时采用的划分算法中包括时间函数,依据划分算法,从大小相同、顺序排列的数据块中抽取区块网络存储数据,抽取后剩余的数据块保存为本地存储数据;所述时间函数为数据加密模块当前工作时间点;所述本地存储数据中不包括已被抽取并作为区块网络存储数据的数据块;所述处理系统或其他用户使用无指向性数据时,从区块链网络中下载密级数据,经过数据加密模块进行解密,再逆向运行划分算法,将密级数据中的区块网络存储数据与本地存储数据进行复合,得到分割后的大小相同的且顺序排列的数据块,之后,逆向运行分割算法,得到原始的无指向性数据。6 . The blockchain-based big data processing system according to claim 5 , wherein the data encryption module encrypts the non-directional data according to a predetermined segmentation algorithm to the received non-directional data. 7 . Then, the data blocks obtained after the division are divided and saved as local storage data and block network storage data by using a division algorithm; the data encryption module The block network data is encrypted using a hash encryption algorithm to obtain secret-level data, and the secret-level data is sent to the node propagation module, and broadcast to the blockchain network through the node propagation module for preservation; the local storage data and the block network storage data The ratio between them is (8-9): 1; the division algorithm adopted when the data blocks are divided and saved includes a time function, and according to the division algorithm, the block network storage data is extracted from the data blocks of the same size and arranged in sequence. , the remaining data blocks after the extraction are saved as local storage data; the time function is the current working time point of the data encryption module; the local storage data does not include the data blocks that have been extracted and used as block network storage data; the described When the processing system or other users use non-directional data, the encrypted data is downloaded from the blockchain network, decrypted by the data encryption module, and then the division algorithm is run in reverse, and the block network storage data in the encrypted data is compared with the local storage data. After compounding, the divided data blocks of the same size and arranged in order are obtained. After that, the segmentation algorithm is run in reverse to obtain the original non-directional data.
CN202011536944.2A 2020-12-23 2020-12-23 Big data processing system based on block chain Pending CN112667888A (en)

Priority Applications (1)

Application Number Priority Date Filing Date Title
CN202011536944.2A CN112667888A (en) 2020-12-23 2020-12-23 Big data processing system based on block chain

Applications Claiming Priority (1)

Application Number Priority Date Filing Date Title
CN202011536944.2A CN112667888A (en) 2020-12-23 2020-12-23 Big data processing system based on block chain

Publications (1)

Publication Number Publication Date
CN112667888A true CN112667888A (en) 2021-04-16

Family

ID=75408123

Family Applications (1)

Application Number Title Priority Date Filing Date
CN202011536944.2A Pending CN112667888A (en) 2020-12-23 2020-12-23 Big data processing system based on block chain

Country Status (1)

Country Link
CN (1) CN112667888A (en)

Cited By (2)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN113569262A (en) * 2021-07-30 2021-10-29 立信(重庆)数据科技股份有限公司 Ciphertext storage method and system based on block chain
CN113674819A (en) * 2021-08-25 2021-11-19 昆明智迪教育信息咨询有限责任公司 Mental health management work monitoring and evaluating method

Citations (4)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN105357232A (en) * 2014-08-09 2016-02-24 星际空间(天津)科技发展有限公司 Location awareness-based information pushing system and method
CN105956048A (en) * 2016-04-27 2016-09-21 上海遥薇(集团)有限公司 Community service big data algorithm mining system
CN110557385A (en) * 2019-08-22 2019-12-10 西安电子科技大学 information hiding access method and system based on behavior confusion, and server
CN111953670A (en) * 2020-07-30 2020-11-17 江苏大学 Adaptive obfuscation method, system and computer storage medium based on Meek transmission plug-in

Patent Citations (4)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN105357232A (en) * 2014-08-09 2016-02-24 星际空间(天津)科技发展有限公司 Location awareness-based information pushing system and method
CN105956048A (en) * 2016-04-27 2016-09-21 上海遥薇(集团)有限公司 Community service big data algorithm mining system
CN110557385A (en) * 2019-08-22 2019-12-10 西安电子科技大学 information hiding access method and system based on behavior confusion, and server
CN111953670A (en) * 2020-07-30 2020-11-17 江苏大学 Adaptive obfuscation method, system and computer storage medium based on Meek transmission plug-in

Cited By (2)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN113569262A (en) * 2021-07-30 2021-10-29 立信(重庆)数据科技股份有限公司 Ciphertext storage method and system based on block chain
CN113674819A (en) * 2021-08-25 2021-11-19 昆明智迪教育信息咨询有限责任公司 Mental health management work monitoring and evaluating method

Similar Documents

Publication Publication Date Title
CN111756522B (en) Data processing method and system
CN110677411B (en) Data sharing method and system based on cloud storage
CN106408952A (en) Vehicle illegal behavior random photographing system and method
US20220021660A1 (en) Data privacy system
US9805216B2 (en) Privacy compliance event analysis system
US8327150B2 (en) System, method and program for managing information
CN108809990B (en) Crowdsourcing data security encryption method, server and storage medium
JP2019161643A (en) Tamper protection and video source identification for video processing pipeline
CN112667888A (en) Big data processing system based on block chain
CN111104685B (en) Dynamic updating method and device for two-dimension code
CN113452526A (en) Electronic document storage and verification method and corresponding device
CN113449048A (en) Data label distribution determining method and device, computer equipment and storage medium
CN113766085B (en) Image processing method and related device
CN111680013A (en) Data sharing method based on block chain, electronic equipment and device
CN113709513B (en) Equipment fingerprint processing method, user side, server, system and storage medium
CN112380404B (en) Data filtering method, device and system
CN119011274A (en) Protection method for network security of micro-grid system and related equipment
CN110516460B (en) Encryption security method and system for BIM data
CN118551361A (en) A method for identifying an electronic ID card and related equipment
CN108900472A (en) The transmission method and device of information
CN116232769B (en) Safe interaction method and platform
CN116049877B (en) Method, system, equipment and storage medium for identifying and desensitizing private data
CN111582954B (en) False data identification method and device
CN116401718A (en) Block chain-based data protection method and device, electronic equipment and storage medium
CN111382286B (en) Data processing method and related product

Legal Events

Date Code Title Description
PB01 Publication
PB01 Publication
SE01 Entry into force of request for substantive examination
SE01 Entry into force of request for substantive examination