CN112800473B - Data processing method based on big data safety house

Info

Publication number: CN112800473B
Application number: CN202110285868.0A
Authority: CN
Inventors: 汤文巍; 章智云
Original assignee: Vhs Shanghai Health Technology Co ltd
Current assignee: Vhs Shanghai Health Technology Co ltd
Priority date: 2021-03-17
Filing date: 2021-03-17
Publication date: 2022-01-04
Anticipated expiration: 2041-03-17
Also published as: CN112800473A

Abstract

The invention relates to a data processing method based on a big data safety house, comprising the following steps: receiving a big data access processing request and generating a unique service token corresponding to the request; initializing a corresponding execution sandbox environment according to the service token, and placing a data subset and a data processing instruction set related to the request into the execution sandbox environment; completing the big data access processing request in the execution sandbox environment, performing data security isolation processing on cross-domain data to obtain an isolation processing result, and desensitizing the isolation processing result to obtain a desensitization processing result; and releasing the execution sandbox environment, eliminating the data subset and the data processing instruction set, logging off the service token, and sending the desensitization processing result. The invention improves the security and controllability of data.

Description

Data processing method based on big data safety house

Technical Field

The invention relates to the technical field of big data processing, and in particular to a data processing method based on a big data safety house.

Background

Existing big data processing relies on distributed services built on underlying big data storage: processing logic code is sent to distributed data nodes, which perform the data operations. By executing data processing locally on multiple nodes through distributed deployment, this approach keeps the performance cost of network transmission as low as possible. However, locally executed data processing tasks lack effective security management and control, so the security of data during distributed processing cannot be guaranteed. In particular, when distributed data processing spans a wide area network, the existing mechanism cannot meet the requirements for data security and confidentiality or for tight control over the scope in which data may be used.

Disclosure of Invention

The invention aims to provide a data processing method based on a big data safety house that improves the security and controllability of data.

The technical solution adopted by the invention to solve this technical problem is a data processing method based on a big data safety house, comprising the following steps:

(1) receiving a big data access processing request and generating a service token corresponding to the big data access processing request, wherein the service token is unique;

(2) initializing a corresponding execution sandbox environment according to the service token, and placing a data subset and a data processing instruction set related to the big data access processing request into the execution sandbox environment;

(3) completing the big data access processing request in the execution sandbox environment, performing data security isolation processing on cross-domain data to obtain an isolation processing result, and performing desensitization processing on the isolation processing result to obtain a desensitization processing result;

(4) releasing the execution sandbox environment, eliminating the data subset and the data processing instruction set, logging off the service token, and sending the desensitization processing result.
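The four steps above can be sketched as a small request lifecycle. This is a minimal illustration only, not the patented implementation: the names `ACTIVE_TOKENS`, `execution_sandbox`, and `handle_request` are hypothetical, the "sandbox" is a plain in-memory dictionary rather than a real isolation boundary, and the instructions are assumed to be ordinary Python callables.

```python
import uuid
from contextlib import contextmanager

ACTIVE_TOKENS = set()  # hypothetical in-memory registry of live service tokens


@contextmanager
def execution_sandbox(token, data_subset, instructions):
    """Step (2): hold only this request's data subset and instruction set."""
    sandbox = {
        "token": token,
        "data": list(data_subset),
        "instructions": list(instructions),
    }
    try:
        yield sandbox
    finally:
        # Step (4): eliminate the data subset and instruction set on release.
        sandbox["data"].clear()
        sandbox["instructions"].clear()


def handle_request(data_subset, instructions, desensitize):
    # Step (1): generate a unique service token for this request.
    token = uuid.uuid4().hex
    ACTIVE_TOKENS.add(token)
    try:
        with execution_sandbox(token, data_subset, instructions) as sandbox:
            # Step (3): run each instruction inside the sandbox, then desensitize.
            result = list(sandbox["data"])
            for instruction in sandbox["instructions"]:
                result = instruction(result)
            safe_result = desensitize(result)
    finally:
        # Step (4): log off the service token, even if processing failed.
        ACTIVE_TOKENS.discard(token)
    return safe_result
```

Using `try`/`finally` for both the sandbox and the token means the release in step (4) also happens when an instruction raises, which matches the intent that no data subset or token outlives its request.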

Step (1) further comprises a step of confirming the identity of the initiator after the big data access processing request is received; the service token corresponding to the big data access processing request is generated only after the identity of the initiator has been verified.

Step (2) further comprises, after the data subset and the data processing instruction set related to the big data access processing request are placed in the execution sandbox environment, a step of saving to a log the extraction state, data volume, and data set range of the data subset, together with the loading state and instruction set range of the data processing instruction set.
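As an illustration of issuing a token only after the initiator's identity is confirmed, the sketch below gates token generation on a credential check. Everything here is an assumption made for the example: the patent does not specify how identity is confirmed, and `KNOWN_INITIATORS` and `TokenRegistry` are hypothetical names. A real deployment would delegate the check to an identity provider and would not store plaintext credentials.

```python
import secrets

# Hypothetical credential store, for illustration only.
KNOWN_INITIATORS = {"analyst-01": "s3cret"}


class TokenRegistry:
    """Issues unique service tokens to verified initiators and logs them off."""

    def __init__(self):
        self._live = {}  # token -> initiator

    def issue(self, initiator, credential):
        """Return a fresh token only if the initiator's identity is confirmed."""
        expected = KNOWN_INITIATORS.get(initiator)
        if expected is None or not secrets.compare_digest(expected, credential):
            raise PermissionError("initiator identity not confirmed")
        token = secrets.token_hex(16)
        while token in self._live:  # enforce uniqueness among live tokens
            token = secrets.token_hex(16)
        self._live[token] = initiator
        return token

    def is_live(self, token):
        return token in self._live

    def log_off(self, token):
        """Invalidate the token once its request has completed."""
        self._live.pop(token, None)
```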

The step (3) further comprises the step of saving the data processing state, the result data volume and the desensitization condition to a log.

The step (4) further comprises saving the processing state and the result of the big data access processing request in a log.
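The log records described above could be written as one structured entry per step. The following is a sketch under assumptions: the patent does not define a log format, so the JSON-lines layout, the logger name, and field names such as `extraction_state` and `result_volume` are hypothetical choices that merely mirror the items listed in the text.

```python
import json
import logging
from io import StringIO


def make_audit_logger(stream):
    """Build a logger that writes one line per record to the given stream."""
    logger = logging.getLogger("safety_house.audit")  # hypothetical logger name
    logger.handlers.clear()
    logger.setLevel(logging.INFO)
    logger.propagate = False
    logger.addHandler(logging.StreamHandler(stream))
    return logger


def log_step(logger, token, step, **fields):
    """Write one JSON line per step so records can be verified later."""
    logger.info(json.dumps({"token": token, "step": step, **fields}, sort_keys=True))


# Usage: one record for each of steps (2), (3), and (4).
stream = StringIO()
audit = make_audit_logger(stream)
log_step(audit, "tok-1", "load", extraction_state="ok", data_volume=2,
         dataset_range="region=A", load_state="ok", instruction_range=["count"])
log_step(audit, "tok-1", "process", processing_state="ok", result_volume=1,
         desensitized=True)
log_step(audit, "tok-1", "release", request_state="completed", result="returned")
```

Keying every record by the service token lets an auditor reconstruct the full history of a single request from the log.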

Advantageous effects

By adopting the above technical solution, the invention has the following advantages and positive effects compared with the prior art:

The invention uses active data loading to ensure that the sandbox execution environment stores only a limited, relevant data subset, which avoids leakage through cross-domain data access. It uses a unique service token to ensure that data in the sandbox execution environment is securely isolated, for example through isolated caching, isolated registers, and isolated computation, and that the processing instruction set executes securely. It uses active desensitization to ensure that the processing result is effectively desensitized after sandbox execution.

Through the technical solution of secure and trusted data loading, processing execution, and active desensitization, the invention effectively resolves the conflict between execution efficiency and data access security control in big data processing. The data safety house mechanism realized by the invention can be widely used for the secure and trusted application of large data sources, allowing public data resources that were costly to acquire to be applied efficiently and securely in commercial applications across industries, thereby improving social benefit.

Drawings

FIG. 1 is a flow chart of the present invention.

Detailed Description

The invention is further illustrated below with reference to specific examples. It should be understood that these examples are illustrative only and do not limit the scope of the invention. Further, after reading the teachings of the invention, those skilled in the art may make various changes or modifications to it, and such equivalent forms likewise fall within the scope defined by the appended claims.

The embodiment of the invention relates to a data processing method based on a big data safety house which, as shown in FIG. 1, comprises the following steps:

Step (1): authenticate the big data access processing request to confirm the identity of its initiator and the initiator's big data access authority. After authentication is completed, step (1a) records the request information and the corresponding authentication result in the log.

Step (2): for a big data access processing request that has been authenticated and confirmed valid in step (1), apply for and generate a corresponding unique service token, which is used for secure, trusted execution and data isolation in the subsequent steps. After the service token is generated, step (2a) records the big data access request in the log and retains the service token generated for the request.

Step (3): initialize a corresponding execution sandbox environment based on the service token generated in step (2), then extract the relevant data subset according to the big data access processing request accepted in step (1) and store it in the execution sandbox environment. After the execution sandbox environment has been initialized and prepared, step (3a) records the extraction state, data volume, data set range, and other relevant information of the data subset in the log for subsequent verification.

Step (4): based on the service token generated in step (2), load the relevant data processing instruction set according to the big data access processing request accepted in step (1) and store it in the execution sandbox environment. After the data processing instruction set has been loaded into the execution sandbox, step (4a) records its loading state, instruction set range, and other relevant information in the log for subsequent verification.

Step (5): based on the service token generated in step (2), start the data processing instruction set loaded in step (4) within the execution sandbox environment to process the data subset stored in step (3). This realizes data security isolation processing of cross-domain data and yields an isolation processing result, which is then desensitized. After the instruction set has executed and the desensitization processing result has been generated, step (5a) records the data processing state, result data volume, desensitization condition, and other relevant information in the log for subsequent verification.

Step (6): release the execution sandbox environment initialized in step (3), and release and eliminate the data subset stored in the sandbox in step (3) and the data processing instruction set loaded into the sandbox in step (4). Step (6) then logs off the unique service token generated in step (2) and returns the desensitization processing result generated in step (5) to the initiator of the big data access processing request. After the result is returned, step (6a) records the final processing state and result of the big data access processing request in the log for subsequent verification.
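The active loading in steps (3) and (4) and the release in step (6) might look like the sketch below. It rests on several assumptions that go beyond the text: each source row carries a hypothetical `domain` field used to pick the data subset, the instruction set is drawn from a hypothetical allow-list `INSTRUCTION_REGISTRY`, and the `Sandbox` class is an ordinary in-process object rather than an isolated runtime.

```python
from dataclasses import dataclass, field

# Hypothetical instruction registry: only these named operations may be loaded.
INSTRUCTION_REGISTRY = {
    "count": lambda rows: len(rows),
    "mean_age": lambda rows: sum(r["age"] for r in rows) / len(rows),
}


@dataclass
class Sandbox:
    token: str
    data: list = field(default_factory=list)
    instructions: dict = field(default_factory=dict)
    log: list = field(default_factory=list)


def load_subset(sandbox, source, domain, columns):
    """Step (3): copy only the requested domain's rows and requested columns."""
    sandbox.data = [
        {c: row[c] for c in columns} for row in source if row["domain"] == domain
    ]
    # Step (3a): record the data volume and data set range.
    sandbox.log.append(("3a", {"rows": len(sandbox.data), "range": (domain, tuple(columns))}))


def load_instructions(sandbox, names):
    """Step (4): load only instructions present in the registry."""
    unknown = [n for n in names if n not in INSTRUCTION_REGISTRY]
    if unknown:
        raise ValueError(f"instructions not permitted: {unknown}")
    sandbox.instructions = {n: INSTRUCTION_REGISTRY[n] for n in names}
    # Step (4a): record the instruction set range.
    sandbox.log.append(("4a", {"loaded": list(names)}))


def run(sandbox):
    """Step (5): apply each loaded instruction to the sandbox's data subset."""
    return {name: fn(sandbox.data) for name, fn in sandbox.instructions.items()}


def release(sandbox):
    """Step (6): eliminate the data subset and the instruction set."""
    sandbox.data = []
    sandbox.instructions = {}
```

Because `load_subset` copies only the selected rows and columns, data from other domains never enters the sandbox, which is one plausible reading of how active loading avoids cross-domain leakage.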

The invention adopts active data loading to ensure that the sandbox execution environment only stores limited related data subsets, avoids cross-domain data access leakage, adopts a unique service token to ensure the data security isolation and the processing instruction set execution security of the sandbox execution environment, and ensures effective desensitization of the processing result after the sandbox execution through active desensitization.
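The text does not say which desensitization technique is applied to the processing result. As one common possibility, the sketch below masks the values of fields marked sensitive before the result leaves the sandbox. The field list `SENSITIVE_FIELDS` and the functions `mask` and `desensitize` are hypothetical and chosen only for illustration.

```python
SENSITIVE_FIELDS = {"name", "phone"}  # hypothetical desensitization policy


def mask(value, keep=1):
    """Keep the first `keep` characters and replace the rest with asterisks."""
    text = str(value)
    return text[:keep] + "*" * max(len(text) - keep, 0)


def desensitize(rows, sensitive=SENSITIVE_FIELDS):
    """Return new rows with sensitive fields masked; input rows are not modified."""
    return [
        {key: (mask(val) if key in sensitive else val) for key, val in row.items()}
        for row in rows
    ]
```

Returning new rows rather than editing in place keeps the masked result independent of the sandbox's data subset, so the subset can be eliminated on release without affecting what is sent to the initiator.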

Claims

1. A data processing method based on a big data safety house, characterized by comprising the following steps:

(2) initializing a corresponding execution sandbox environment according to the service token, ensuring the data security isolation of the execution sandbox environment and the execution security of a processing instruction set, and putting a data subset and a data processing instruction set related to the big data access processing request into the execution sandbox environment;

2. The data processing method based on a big data safety house according to claim 1, wherein step (1) further comprises a step of confirming the identity of the initiator after the big data access processing request is received, and the service token corresponding to the big data access processing request is generated after the identity of the initiator has been verified.

3. The data processing method based on a big data safety house according to claim 1, wherein step (2) further comprises, after the data subset and the data processing instruction set related to the big data access processing request are placed in the execution sandbox environment, a step of saving to a log the extraction state, data volume, and data set range of the data subset, together with the loading state and instruction set range of the data processing instruction set.

4. The data processing method based on a big data safety house according to claim 1, wherein step (3) further comprises a step of saving the data processing state, the result data volume, and the desensitization condition to a log.

5. The data processing method based on a big data safety house according to claim 1, wherein step (4) further comprises saving the processing state and the result of the big data access processing request to a log.