GB2616199A - Automatically adjusting data access policies in data analytics - Google Patents

Automatically adjusting data access policies in data analytics Download PDF

Info

Publication number
GB2616199A
GB2616199A GB2308825.5A GB202308825A GB2616199A GB 2616199 A GB2616199 A GB 2616199A GB 202308825 A GB202308825 A GB 202308825A GB 2616199 A GB2616199 A GB 2616199A
Authority
GB
United Kingdom
Prior art keywords
autoencoder
autoencoder network
level
data
network
Prior art date
Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
Granted
Application number
GB2308825.5A
Other versions
GB2616199B (en
GB202308825D0 (en
Inventor
K Baughman Aaron
Kwatra Shikhar
Ekambaram Vijay
Narotambhai Marvaniya Smitkumar
Current Assignee (The listed assignees may be inaccurate. Google has not performed a legal analysis and makes no representation or warranty as to the accuracy of the list.)
International Business Machines Corp
Original Assignee
International Business Machines Corp
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
Filing date
Publication date
Application filed by International Business Machines Corp filed Critical International Business Machines Corp
Publication of GB202308825D0 publication Critical patent/GB202308825D0/en
Publication of GB2616199A publication Critical patent/GB2616199A/en
Application granted granted Critical
Publication of GB2616199B publication Critical patent/GB2616199B/en
Active legal-status Critical Current
Anticipated expiration legal-status Critical

Links

Classifications

    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06FELECTRIC DIGITAL DATA PROCESSING
    • G06F21/00Security arrangements for protecting computers, components thereof, programs or data against unauthorised activity
    • G06F21/60Protecting data
    • G06F21/62Protecting access to data via a platform, e.g. using keys or access control rules
    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06FELECTRIC DIGITAL DATA PROCESSING
    • G06F21/00Security arrangements for protecting computers, components thereof, programs or data against unauthorised activity
    • G06F21/60Protecting data
    • G06F21/604Tools and structures for managing or administering access control systems
    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06NCOMPUTING ARRANGEMENTS BASED ON SPECIFIC COMPUTATIONAL MODELS
    • G06N3/00Computing arrangements based on biological models
    • G06N3/02Neural networks
    • G06N3/04Architecture, e.g. interconnection topology
    • G06N3/045Combinations of networks
    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06NCOMPUTING ARRANGEMENTS BASED ON SPECIFIC COMPUTATIONAL MODELS
    • G06N3/00Computing arrangements based on biological models
    • G06N3/02Neural networks
    • G06N3/08Learning methods
    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06NCOMPUTING ARRANGEMENTS BASED ON SPECIFIC COMPUTATIONAL MODELS
    • G06N3/00Computing arrangements based on biological models
    • G06N3/02Neural networks
    • G06N3/08Learning methods
    • G06N3/088Non-supervised learning, e.g. competitive learning

Landscapes

  • Engineering & Computer Science (AREA)
  • Theoretical Computer Science (AREA)
  • Physics & Mathematics (AREA)
  • Health & Medical Sciences (AREA)
  • General Health & Medical Sciences (AREA)
  • Software Systems (AREA)
  • General Engineering & Computer Science (AREA)
  • General Physics & Mathematics (AREA)
  • Bioethics (AREA)
  • Computer Hardware Design (AREA)
  • Computer Security & Cryptography (AREA)
  • Life Sciences & Earth Sciences (AREA)
  • Evolutionary Computation (AREA)
  • Artificial Intelligence (AREA)
  • Biomedical Technology (AREA)
  • Biophysics (AREA)
  • Computational Linguistics (AREA)
  • Data Mining & Analysis (AREA)
  • Mathematical Physics (AREA)
  • Molecular Biology (AREA)
  • Computing Systems (AREA)
  • Automation & Control Theory (AREA)
  • Information Retrieval, Db Structures And Fs Structures Therefor (AREA)
  • Data Exchanges In Wide-Area Networks (AREA)
  • Debugging And Monitoring (AREA)

Abstract

From a first model parameter, an autoencoder network is generated. A reconstruction error for the autoencoder network is measured, the reconstruction error comprising a difference between an input to the autoencoder network and a corresponding output from the autoencoder network, the input to the autoencoder network comprising a portion of an initial set of data. The reconstruction error and a confidence score corresponding to a complexity level of the autoencoder network are aggregated into a level of difficulty score of the autoencoder network. From the level of difficulty score and an initial data access policy level corresponding to the initial set of data, a derived data access policy level corresponding to the initial data access policy level is generated, the derived data access policy level enforcing access to a transformed set of data generated by applying a transformation to the initial set of data.

Claims (20)

1. A computer-implemented method comprising: generating, from a first model parameter, an autoencoder network; measuring a reconstruction error for the autoencoder network, the reconstruction error comprising a difference between an input to the autoencoder network and a corresponding output from the autoencoder networ k, the input to the autoencoder network comprising a portion of an initial s et of data; aggregating, into a level of difficulty score of the autoencoder network, the reconstruction error and a confidence score corresponding to a comple xity level of the autoencoder network; and generating, from the level of difficulty score and an initial data access policy leve l corresponding to the initial set of data, a derived data access policy level corresponding to the initial data acce ss policy level, the derived data access policy level enforcing access to a transformed se t of data generated by applying a transformation to the initial set of dat a.
2. The computer-implemented method of claim 1, further comprising: training, using a training subset of the initial set of data, the autoencoder network.
3. The computer-implemented method of claim 2, wherein the training is performed to minimize a reconstruction error of t he autoencoder network.
4. The computer-implemented method of claim 2, wherein the training is performed to minimize a difference between an out put of an encoder portion of the autoencoder network and a transformed set of data generated by applying the transformation to the training subset.
5. The computer-implemented method of claim 1, further comprising: measuring, for the autoencoder network, the complexity level.
6. The computer-implemented method of claim 1, further comprising: generating, from the level of difficulty score, a set of model parameters, a second model parameter in the set of model parameters comprising a vari ation from the first model parameter; generating, from the set of model parameters, a set of autoencoder networks; measuring a model-specific reconstruction error of each autoencoder networ k in the set of autoencoder networks, the model-specific reconstruction error comprising a difference between a n input to an autoencoder network in the set of autoencoder networks and a corresponding output from the autoencoder network in the set of autoencod er networks, the input to the autoencoder network in the set of autoencoder networks c omprising the portion of the initial set of data; and aggregating, into a level of difficulty score of the set of autoencoder networks, the model-specific reconstruction error of each autoencoder network and a set of confidence scores, each confidence score corresponding to a complexity level of an autoencod er network in the set of autoencoder networks.
7. The computer-implemented method of claim 1, wherein the model parameter comprises a number of hidden layers in an enc oder portion of the autoencoder network and a number of hidden layers in a decoder portion of the autoencoder network.
8. The computer-implemented method of claim 1, wherein the model parameter comprises a number of dimensions in an output of an encoder portion of the autoencoder network.
9. A computer program product for automatically adjusting a data access polic y, the computer program product comprising: one or more computer readable storage media, and program instructions collectively stored on the one or more computer readable storage media, the program instructions comprising: program instructions to generate, from a first model parameter, an autoencoder network; program instructions to measure a reconstruction error for the autoencoder network, the reconstruction error comprising a difference between an input to the autoencoder network and a corresponding output from the autoencoder networ k, the input to the autoencoder network comprising a portion of an initial s et of data; program instructions to aggregate, into a level of difficulty score of the autoencoder network, the reconstruction error and a confidence score corresponding to a comple xity level of the autoencoder network; and program instructions to generate, from the level of difficulty score and an initial data access policy leve l corresponding to the initial set of data, a derived data access policy level corresponding to the initial data acce ss policy level, the derived data access policy level enforcing access to a transformed se t of data generated by applying a transformation to the initial set of dat a.
10. The computer program product of claim 9, further comprising: program instructions to train, using a training subset of the initial set of data, the autoencoder network.
11. The computer program product of claim 10, wherein the training is performed to minimize a reconstruction error of t he autoencoder network.
12. The computer program product of claim 10, wherein the training is performed to minimize a difference between an out put of an encoder portion of the autoencoder network and a transformed set of data generated by applying the transformation to the training subset.
13. The computer program product of claim 9, further comprising: program instructions to measure, for the autoencoder network, the complexity level.
14. The computer program product of claim 9, further comprising: program instructions to generate, from the level of difficulty score, a set of model parameters, a second model parameter in the set of model parameters comprising a vari ation from the first model parameter; program instructions to generate, from the set of model parameters, a set of autoencoder networks; program instructions to measure a model-specific reconstruction error of e ach autoencoder network in the set of autoencoder networks, the model-specific reconstruction error comprising a difference between a n input to an autoencoder network in the set of autoencoder networks and a corresponding output from the autoencoder network in the set of autoencod er networks, the input to the autoencoder network in the set of autoencoder networks c omprising the portion of the initial set of data; and program instructions to aggregate, into a level of difficulty score of the set of autoencoder networks, the model-specific reconstruction error of each autoencoder network and a set of confidence scores, each confidence score corresponding to a complexity level of an autoencod er network in the set of autoencoder networks.
15. The computer program product of claim 9, wherein the model parameter comprises a number of hidden layers in an enc oder portion of the autoencoder network and a number of hidden layers in a decoder portion of the autoencoder network.
16. The computer program product of claim 9, wherein the model parameter comprises a number of dimensions in an output of an encoder portion of the autoencoder network.
17. The computer program product of claim 9, wherein the stored program instructions are stored in the at least one of the one or more storage media of a local data processing system, and wherein the stored program instructions are transferred over a networ k from a remote data processing system.
18. The computer program product of claim 9, wherein the stored program instructions are stored in the at least one of the one or more storage media of a server data processing system, and wherein the stored program instructions are downloaded over a network to a remote data processing system for use in a computer readable storage device associated with the remote data processing system.
19. The computer program product of claim 9, wherein the computer program product is provided as a service in a cloud environment.
20. A computer system comprising one or more processors, one or more computer-readable memories, and one or more computer-readable storage devices, and program instructions stored on at least one of the one or more storag e devices for execution by at least one of the one or more processors via at least one of the one or more memories, the stored program instructions comprising: program instructions to generate, from a first model parameter, an autoencoder network; program instructions to measure a reconstruction error for the autoencoder network, the reconstruction error comprising a difference between an input to the autoencoder network and a corresponding output from the autoencoder networ k, the input to the autoencoder network comprising a portion of an initial s et of data; program instructions to aggregate, into a level of difficulty score of the autoencoder network, the reconstruction error and a confidence score corresponding to a comple xity level of the autoencoder network; and program instructions to generate, from the level of difficulty score and an initial data access policy leve l corresponding to the initial set of data, a derived data access policy level corresponding to the initial data acce ss policy level, the derived data access policy level enforcing access to a transformed set of data gen erated by applying a transformation to the initial set of data.
GB2308825.5A 2020-11-24 2021-10-14 Automatically adjusting data access policies in data analytics Active GB2616199B (en)

Applications Claiming Priority (2)

Application Number Priority Date Filing Date Title
US17/103,118 US20220164457A1 (en) 2020-11-24 2020-11-24 Automatically adjusting data access policies in data analytics
PCT/CN2021/123849 WO2022111112A1 (en) 2020-11-24 2021-10-14 Automatically adjusting data access policies in data analytics

Publications (3)

Publication Number Publication Date
GB202308825D0 GB202308825D0 (en) 2023-07-26
GB2616199A true GB2616199A (en) 2023-08-30
GB2616199B GB2616199B (en) 2024-03-13

Family

ID=81658354

Family Applications (1)

Application Number Title Priority Date Filing Date
GB2308825.5A Active GB2616199B (en) 2020-11-24 2021-10-14 Automatically adjusting data access policies in data analytics

Country Status (6)

Country Link
US (1) US20220164457A1 (en)
JP (1) JP2023550445A (en)
CN (1) CN116490871A (en)
DE (1) DE112021006167T5 (en)
GB (1) GB2616199B (en)
WO (1) WO2022111112A1 (en)

Families Citing this family (1)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
US11929169B2 (en) * 2022-02-09 2024-03-12 Kyndryl, Inc. Personalized sensory feedback

Citations (4)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
US20150222606A1 (en) * 2012-09-21 2015-08-06 Nokia Corporation Method and apparatus for providing access control to shared data based on trust level
CN110784316A (en) * 2019-10-29 2020-02-11 安徽大学 Adaptive strategy updating fast attribute encryption method based on strategy hiding
US10616281B1 (en) * 2017-09-14 2020-04-07 Amazon Technologies, Inc. Service-level authorization policy management
US20200213336A1 (en) * 2018-12-26 2020-07-02 International Business Machines Corporation Detecting inappropriate activity in the presence of unauthenticated API requests using artificial intelligence

Family Cites Families (7)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
US8429307B1 (en) * 2010-04-30 2013-04-23 Emc Corporation Application aware intelligent storage system
US20110321117A1 (en) * 2010-06-23 2011-12-29 Itt Manufacturing Enterprises, Inc. Policy Creation Using Dynamic Access Controls
JP2014115685A (en) * 2012-12-06 2014-06-26 Nippon Telegr & Teleph Corp <Ntt> Profile analyzing device, method and program
DE102013206291A1 (en) * 2013-04-10 2014-10-16 Robert Bosch Gmbh Method and apparatus for creating a non-parametric, data-based function model
US11481652B2 (en) * 2015-06-23 2022-10-25 Gregory Knox System and method for recommendations in ubiquituous computing environments
US10970322B2 (en) * 2018-11-26 2021-04-06 International Business Machines Corporation Training an artificial intelligence to generate an answer to a query based on an answer table pattern
US11200461B2 (en) * 2018-12-21 2021-12-14 Capital One Services, Llc Methods and arrangements to identify feature contributions to erroneous predictions

Patent Citations (4)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
US20150222606A1 (en) * 2012-09-21 2015-08-06 Nokia Corporation Method and apparatus for providing access control to shared data based on trust level
US10616281B1 (en) * 2017-09-14 2020-04-07 Amazon Technologies, Inc. Service-level authorization policy management
US20200213336A1 (en) * 2018-12-26 2020-07-02 International Business Machines Corporation Detecting inappropriate activity in the presence of unauthenticated API requests using artificial intelligence
CN110784316A (en) * 2019-10-29 2020-02-11 安徽大学 Adaptive strategy updating fast attribute encryption method based on strategy hiding

Also Published As

Publication number Publication date
CN116490871A (en) 2023-07-25
US20220164457A1 (en) 2022-05-26
JP2023550445A (en) 2023-12-01
DE112021006167T5 (en) 2023-09-07
GB2616199B (en) 2024-03-13
GB202308825D0 (en) 2023-07-26
WO2022111112A1 (en) 2022-06-02

Similar Documents

Publication Publication Date Title
US20210304071A1 (en) Systems and methods for generating machine learning applications
Christou et al. Quasi‐likelihood inference for negative binomial time series models
WO2021098281A1 (en) Project baseline data generation method and device, computer device, and computer readable storage medium
CN106779380A (en) A kind of intelligent construction safety checks evaluation system and method
CN109815344B (en) Network model training system, method, apparatus and medium based on parameter sharing
Moradi et al. Performance prediction in dynamic clouds using transfer learning
CN108390775B (en) User experience quality evaluation method and system based on SPICE
JP6200039B2 (en) Automatic traffic generation for fairing systems
GB2616199A (en) Automatically adjusting data access policies in data analytics
US20210110110A1 (en) Interleaved conversation concept flow enhancement
CN113902122A (en) Federal model collaborative training method and device, computer equipment and storage medium
Wang et al. Flint: A platform for federated learning integration
Zheng et al. [Retracted] Application of Mathematical Models in Economic Variable Input and Output Models under the Scientific Visualization
Chen et al. A forecasting system of micro-blog public opinion based on artificial neural network
WO2024065776A1 (en) Method for data processing, apparatus for data processing, electronic device, and storage medium
US20180068122A1 (en) Transmission of trustworthy data
Liu et al. [Retracted] Analysis of Efficiency of Human Resource Management Evaluation Model Based on SOM Neural Network
CN111291536A (en) Method and system for automatically generating poems
Sahoo et al. Improving effort estimation of software products by augmenting class point approach with regression analysis
Cui Intelligent analysis of exercise health big data based on deep convolutional neural network
Bai et al. The model of project risk assessment based on BP neural network algorithm
US20240193400A1 (en) Technology service management using graph neural network
CN113392862B (en) Self-healing management and control method and device for sensing data, computer equipment and storage medium
Cao et al. Using Empirical Modal Decomposition to Improve the Daily Milk Yield Prediction of Cows
Peng et al. Stock Index Prediction Method based on ARIMA-ELM Combination Model

Legal Events

Date Code Title Description
746 Register noted 'licences of right' (sect. 46/1977)

Effective date: 20240423