WO2021215710A1 - Method for preventing breach of original data for deep learning and data breach preventing device using them - Google Patents

Method for preventing breach of original data for deep learning and data breach preventing device using them Download PDF

Info

Publication number
WO2021215710A1
WO2021215710A1 PCT/KR2021/004382 KR2021004382W WO2021215710A1 WO 2021215710 A1 WO2021215710 A1 WO 2021215710A1 KR 2021004382 W KR2021004382 W KR 2021004382W WO 2021215710 A1 WO2021215710 A1 WO 2021215710A1
Authority
WO
WIPO (PCT)
Prior art keywords
data
characteristic information
spatial
loss
original
Prior art date
Application number
PCT/KR2021/004382
Other languages
French (fr)
Inventor
Sumin Lee
Original Assignee
Deeping Source Inc.
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
Filing date
Publication date
Application filed by Deeping Source Inc. filed Critical Deeping Source Inc.
Publication of WO2021215710A1 publication Critical patent/WO2021215710A1/en

Links

Images

Classifications

    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06FELECTRIC DIGITAL DATA PROCESSING
    • G06F21/00Security arrangements for protecting computers, components thereof, programs or data against unauthorised activity
    • G06F21/60Protecting data
    • G06F21/62Protecting access to data via a platform, e.g. using keys or access control rules
    • G06F21/6218Protecting access to data via a platform, e.g. using keys or access control rules to a system of files or objects, e.g. local or distributed file system or database
    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06NCOMPUTING ARRANGEMENTS BASED ON SPECIFIC COMPUTATIONAL MODELS
    • G06N3/00Computing arrangements based on biological models
    • G06N3/02Neural networks
    • G06N3/08Learning methods
    • G06N3/084Backpropagation, e.g. using gradient descent
    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06FELECTRIC DIGITAL DATA PROCESSING
    • G06F21/00Security arrangements for protecting computers, components thereof, programs or data against unauthorised activity
    • G06F21/10Protecting distributed programs or content, e.g. vending or licensing of copyrighted material ; Digital rights management [DRM]
    • G06F21/16Program or content traceability, e.g. by watermarking
    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06FELECTRIC DIGITAL DATA PROCESSING
    • G06F21/00Security arrangements for protecting computers, components thereof, programs or data against unauthorised activity
    • G06F21/60Protecting data
    • G06F21/62Protecting access to data via a platform, e.g. using keys or access control rules
    • G06F21/6209Protecting access to data via a platform, e.g. using keys or access control rules to a single file or object, e.g. in a secure envelope, encrypted and accessed using a key, or with access control rules appended to the object itself
    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06FELECTRIC DIGITAL DATA PROCESSING
    • G06F21/00Security arrangements for protecting computers, components thereof, programs or data against unauthorised activity
    • G06F21/60Protecting data
    • G06F21/62Protecting access to data via a platform, e.g. using keys or access control rules
    • G06F21/6218Protecting access to data via a platform, e.g. using keys or access control rules to a system of files or objects, e.g. local or distributed file system or database
    • G06F21/6245Protecting personal data, e.g. for financial or medical purposes
    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06FELECTRIC DIGITAL DATA PROCESSING
    • G06F21/00Security arrangements for protecting computers, components thereof, programs or data against unauthorised activity
    • G06F21/60Protecting data
    • G06F21/62Protecting access to data via a platform, e.g. using keys or access control rules
    • G06F21/6218Protecting access to data via a platform, e.g. using keys or access control rules to a system of files or objects, e.g. local or distributed file system or database
    • G06F21/6245Protecting personal data, e.g. for financial or medical purposes
    • G06F21/6254Protecting personal data, e.g. for financial or medical purposes by anonymising data, e.g. decorrelating personal data from the owner's identification
    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06NCOMPUTING ARRANGEMENTS BASED ON SPECIFIC COMPUTATIONAL MODELS
    • G06N20/00Machine learning
    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06NCOMPUTING ARRANGEMENTS BASED ON SPECIFIC COMPUTATIONAL MODELS
    • G06N20/00Machine learning
    • G06N20/10Machine learning using kernel methods, e.g. support vector machines [SVM]
    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06NCOMPUTING ARRANGEMENTS BASED ON SPECIFIC COMPUTATIONAL MODELS
    • G06N3/00Computing arrangements based on biological models
    • G06N3/02Neural networks
    • G06N3/04Architecture, e.g. interconnection topology
    • G06N3/045Combinations of networks
    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06NCOMPUTING ARRANGEMENTS BASED ON SPECIFIC COMPUTATIONAL MODELS
    • G06N3/00Computing arrangements based on biological models
    • G06N3/02Neural networks
    • G06N3/08Learning methods
    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06FELECTRIC DIGITAL DATA PROCESSING
    • G06F21/00Security arrangements for protecting computers, components thereof, programs or data against unauthorised activity
    • G06F21/10Protecting distributed programs or content, e.g. vending or licensing of copyrighted material ; Digital rights management [DRM]
    • G06F21/106Enforcing content protection by specific content processing
    • G06F21/1063Personalisation

Definitions

  • the present disclosure relates to a method for preventing breach of original data to be used for deep learning and a data breach preventing device using the same; and more particularly, to the method for converting the original data into optimal noisy data that humans can recognize but from which a learning network outputs different results, thus preventing the breach of the original data to be used for the deep learning, and the data breach preventing device using the same.
  • Big data refers to data including all of unstructured data and semi-structured data not utilized so far, like e-commerce data, metadata, web log data, radio frequency identification (RFID) data, sensor network data, social network data, data of Internet text and documents, Internet search indexing data, as well as all of structured data used by conventional enterprises or public institutions. Data as such are referred to as big data in the sense that common software tools and computer systems cannot easily handle such a huge volume of data.
  • RFID radio frequency identification
  • the third party can directly access the original data included in the data sets, the original data is at risk of illegal copying.
  • the third party can train a new deep learning network using the original data, and accordingly, there is a problem that the user's rights to the data sets are violated.
  • a method for preventing breach of original data to be used for deep learning including steps of: (a) a data breach preventing device, if the original data is acquired, performing or supporting another device to perform a process of adding noise onto the original data, to thereby generate 1-st noisy data; and (b) the data breach preventing device, (b1) while increasing an integer k from 1 to n-1 wherein n is an integer larger than 1, performing or supporting another device to perform iteration of (i) a process of inputting k-th noisy data into a learning network, to thereby allow the learning network to apply one or more learning operations to the k-th noisy data using one or more learned parameters of the learning network, and thus to output k-th characteristic information about the k-th noisy data, and (ii) a process of launching an adversarial attack on the k-th noisy data via backpropagation using at least one of (ii-1) one or more (k_1)-st losses calculated by:
  • the learning network includes a 1-st learning network to an m-th learning network respectively having one or more 1-st learned parameters to one or more m-th learned parameters wherein m is an integer greater than 0, and wherein, at the step of (b), the data breach preventing device performs or supports another device to perform a process of inputting the k-th noisy data respectively into the 1-st learning network to the m-th learning network, to thereby allow the 1-st learning network to the m-th learning network to (i) apply the learning operations with the 1-st learned parameters to the m-th learned parameters to the k-th noisy data respectively, and thus output (k_1)-st characteristic information to (k_m)-th characteristic information about the k-th noisy data, (ii) calculate the (k_1)-st losses by referring to at least one averaged value over a (k_1_1)-st loss to a (k_1_m)-th loss which are calculated by using the 1-st ground truth and each piece of
  • the data breach preventing device performs or supports another device to perform (i) a process of averaging over a (k_1_1)-st summed loss to a (k_1_m)-th summed loss which are calculated by adding a ((k-1)_1)-st loss respectively onto each of the (k_1_1)-st loss to the (k_1_m)-th loss, in the process of calculating the (k_1)-st losses, and (ii) a process of averaging over a (k_2_1)-st summed loss to a (k_2_m)-th summed loss which are calculated by adding a ((k-1)_2)-nd loss respectively onto each of the (k_2_1)-st loss to the (k_2_m)-th loss, in the process of calculating the (k_2)-nd losses.
  • the data breach preventing device performs or supports another device to perform (i) a process of comparing between elements included in each of 1-st pairs whose elements are comprised of (i-1) each piece of the (k_1)-st characteristic information to the (k_m)-th characteristic information and (i-2) each piece of 1-st original characteristic information to m-th original characteristic information respectively created by inputting the original data into the 1-st learning network to the m-th learning network, to thereby acquire a (k_1)-st condition value corresponding to the number of a part of the 1-st pairs whose elements are determined as different from each other, (ii) a process of comparing between elements included in each of 2-nd pairs whose elements are comprised of (ii-1) each of a (k_1)-st task specific output to a (k_m)-th task specific output and (ii-2) each of a 1-st original task specific output to an m-th original task specific output respectively created by referring to the 1-st original
  • the data breach preventing device performs or supports another device to perform (i) a process of applying random rotation to the k-th noisy data, to thereby generate (k_1)-st spatial data to (k_j)-th spatial data wherein j is an integer greater than 0, (ii) a process of inputting the (k_1)-st spatial data to the (k_j)-th spatial data into the learning network, to thereby allow the learning network to apply the deep-learning operation to the (k_1)-st spatial data to the (k_j)-th spatial data and thus to output (k_1)-st spatial characteristic information to (k_j)-th spatial characteristic information, (iii) a process of calculating the (k_1)-st losses by referring to at least one averaged value over a (k_1_1)-st spatial loss to a (k_1_j)-th spatial loss which are calculated by using the 1-st ground truth and each piece of the (k_1)-s
  • the data breach preventing device performs or supports another device to perform (i) a process of averaging over a (k_1_1)-st summed spatial loss to a (k_1_j)-th summed spatial loss which are calculated by adding a ((k-1)_1)-st loss respectively onto each of the (k_1_1)-st spatial loss to the (k_1_j)-th spatial loss, in the process of calculating the (k_1)-st losses, and (ii) a process of averaging over a (k_2_1)-st summed spatial loss to a (k_2_j)-th summed spatial loss which are calculated by adding a ((k-1)_2)-nd loss respectively onto each of the (k_2_1)-st spatial loss to the (k_2_j)-th spatial loss, in the process of calculating the (k_2)-nd losses.
  • the data breach preventing device performs or supports another device to perform (i) a process of comparing between elements included in each of 1-st pairs whose elements are comprised of (i-1) each piece of the (k_1)-st spatial characteristic information to the (k_j)-th spatial characteristic information and (i-2) each piece of original characteristic information created by inputting the original data into the learning network, to thereby acquire a (k_1)-st condition value corresponding to the number of a part of the 1-st pairs whose elements are determined as different from each other, (ii) a process of comparing between elements included in each of 2-nd pairs whose elements are comprised of (ii-1) each of a (k_1)-st spatial task specific output to a (k_j)-th spatial task specific output and (ii-2) an original task specific output created by referring to the original characteristic information, to thereby acquire a (k_2)-nd condition value corresponding to the number of a part of the 2-nd pairs whose elements are determined as different from
  • the data breach preventing device performs or supports another device to perform a process of randomly selecting one or more angles ⁇ ranging from - ⁇ to ⁇ and a process of rotating the k-th noisy data as corresponding to each of the angles ⁇ , in the process of creating the (k_1)-st spatial data to the (k_j)-th spatial data.
  • the data breach preventing device performs or supports another device to perform (i) a process of comparing (i-1) (k+1)-th characteristic information created by inputting the (k+1)-th noisy data into the learning network and (i-2) original characteristic information created by inputting the original data into the learning network, and if the (k+1)-th characteristic information and the original characteristic information are determined as different, a process of stopping the iteration and outputting the (k+1)-th noisy data as the watermarked data, and (ii) a process of comparing (ii-1) a (k+1)-th task specific output created by referring to the (k+1)-th characteristic information and (ii-2) an original task specific output created by referring to the original characteristic information, and if the (k+1)-th task specific output and the original task specific output are determined as different, a process of stopping the iteration and outputting the (k+1)-th noisy data as the watermarked data.
  • the data breach preventing device performs or supports another device to perform a process of adding random noise, according to a normal distribution with one of variances and one of means created by a noise function, onto the original data.
  • a data breach preventing device for preventing breach of original data to be used for deep learning, including: at least one memory that stores instructions; and at least one processor configured to execute the instructions to perform or support another device to perform: (I) if the original data is acquired, a process of adding noise onto the original data, to thereby generate 1-st noisy data, and (II) (II-1) while increasing an integer k from 1 to n-1 wherein n is an integer larger than 1, (i) a process of inputting k-th noisy data into a learning network, to thereby allow the learning network to apply one or more learning operations to the k-th noisy data using one or more learned parameters of the learning network, and thus to output k-th characteristic information about the k-th noisy data, and (ii) a process of launching an adversarial attack on the k-th noisy data via backpropagation using at least one of (ii-1) one or more (k_1)-st losses calculated by referring
  • the learning network includes a 1-st learning network to an m-th learning network respectively having one or more 1-st learned parameters to one or more m-th learned parameters wherein m is an integer greater than 0, and wherein, at the process of (II), the processor performs or supports another device to perform a process of inputting the k-th noisy data respectively into the 1-st learning network to the m-th learning network, to thereby allow the 1-st learning network to the m-th learning network to (i) apply the learning operations with the 1-st learned parameters to the m-th learned parameters to the k-th noisy data respectively, and thus output (k_1)-st characteristic information to (k_m)-th characteristic information about the k-th noisy data, (ii) calculate the (k_1)-st losses by referring to at least one averaged value over a (k_1_1)-st loss to a (k_1_m)-th loss which are calculated by using the 1-st ground truth and each piece of the (k
  • the processor performs or supports another device to perform (i) a process of averaging over a (k_1_1)-st summed loss to a (k_1_m)-th summed loss which are calculated by adding a ((k-1)_1)-st loss respectively onto each of the (k_1_1)-st loss to the (k_1_m)-th loss, in the process of calculating the (k_1)-st losses, and (ii) a process of averaging over a (k_2_1)-st summed loss to a (k_2_m)-th summed loss which are calculated by adding a ((k-1)_2)-nd loss respectively onto each of the (k_2_1)-st loss to the (k_2_m)-th loss, in the process of calculating the (k_2)-nd losses.
  • the processor performs or supports another device to perform (i) a process of comparing between elements included in each of 1-st pairs whose elements are comprised of (i-1) each piece of the (k_1)-st characteristic information to the (k_m)-th characteristic information and (i-2) each piece of 1-st original characteristic information to m-th original characteristic information respectively created by inputting the original data into the 1-st learning network to the m-th learning network, to thereby acquire a (k_1)-st condition value corresponding to the number of a part of the 1-st pairs whose elements are determined as different from each other, (ii) a process of comparing between elements included in each of 2-nd pairs whose elements are comprised of (ii-1) each of a (k_1)-st task specific output to a (k_m)-th task specific output and (ii-2) each of a 1-st original task specific output to an m-th original task specific output respectively created by referring to the 1-st original characteristic information to the
  • the processor performs or supports another device to perform (i) a process of applying random rotation to the k-th noisy data, to thereby generate (k_1)-st spatial data to (k_j)-th spatial data wherein j is an integer greater than 0, (ii) a process of inputting the (k_1)-st spatial data to the (k_j)-th spatial data into the learning network, to thereby allow the learning network to apply the deep-learning operation to the (k_1)-st spatial data to the (k_j)-th spatial data and thus to output (k_1)-st spatial characteristic information to (k_j)-th spatial characteristic information, (iii) a process of calculating the (k_1)-st losses by referring to at least one averaged value over a (k_1_1)-st spatial loss to a (k_1_j)-th spatial loss which are calculated by using the 1-st ground truth and each piece of the (k_1)-st spatial characteristic
  • the processor performs or supports another device to perform (i) a process of averaging over a (k_1_1)-st summed spatial loss to a (k_1_j)-th summed spatial loss which are calculated by adding a ((k-1)_1)-st loss respectively onto each of the (k_1_1)-st spatial loss to the (k_1_j)-th spatial loss, in the process of calculating the (k_1)-st losses, and (ii) a process of averaging over a (k_2_1)-st summed spatial loss to a (k_2_j)-th summed spatial loss which are calculated by adding a ((k-1)_2)-nd loss respectively onto each of the (k_2_1)-st spatial loss to the (k_2_j)-th spatial loss, in the process of calculating the (k_2)-nd losses.
  • the processor performs or supports another device to perform (i) a process of comparing between elements included in each of 1-st pairs whose elements are comprised of (i-1) each piece of the (k_1)-st spatial characteristic information to the (k_j)-th spatial characteristic information and (i-2) each piece of original characteristic information created by inputting the original data into the learning network, to thereby acquire a (k_1)-st condition value corresponding to the number of a part of the 1-st pairs whose elements are determined as different from each other, (ii) a process of comparing between elements included in each of 2-nd pairs whose elements are comprised of (ii-1) each of a (k_1)-st spatial task specific output to a (k_j)-th spatial task specific output and (ii-2) an original task specific output created by referring to the original characteristic information, to thereby acquire a (k_2)-nd condition value corresponding to the number of a part of the 2-nd pairs whose elements are determined as different from each other, (i)
  • the processor performs or supports another device to perform a process of randomly selecting one or more angles ⁇ ranging from - ⁇ to ⁇ and a process of rotating the k-th noisy data as corresponding to each of the angles ⁇ , in the process of creating the (k_1)-st spatial data to the (k_j)-th spatial data.
  • the processor performs or supports another device to perform (i) a process of comparing (i-1) (k+1)-th characteristic information created by inputting the (k+1)-th noisy data into the learning network and (i-2) original characteristic information created by inputting the original data into the learning network, and if the (k+1)-th characteristic information and the original characteristic information are determined as different, a process of stopping the iteration and outputting the (k+1)-th noisy data as the watermarked data, and (ii) a process of comparing (ii-1) a (k+1)-th task specific output created by referring to the (k+1)-th characteristic information and (ii-2) an original task specific output created by referring to the original characteristic information, and if the (k+1)-th task specific output and the original task specific output are determined as different, a process of stopping the iteration and outputting the (k+1)-th noisy data as the watermarked data.
  • the processor performs or supports another device to perform a process of adding random noise, according to a normal distribution with one of variances and one of means created by a noise function, onto the original data.
  • recordable media that are readable by a computer for storing a computer program to execute the method of the present disclosure are further provided.
  • the present disclosure has an effect of preventing the breach of the original data to be used for the deep learning.
  • the present disclosure has another effect of protecting the original data, by generating resultant data humans can recognize but from which the learning network outputs different results, unlike when using the original data.
  • the present disclosure has still another effect of providing the watermarked data, from which the original data cannot be recovered by using data processing technology such as image processing technology.
  • the present disclosure has still yet another effect of providing the watermarked data which appears to humans as minimally deteriorated, and degrade accuracy of the learning network having been trained with the watermarked data.
  • Fig. 1 is a drawing schematically illustrating a data breach preventing device which is used to prevent breach of original data to be used for deep learning in accordance with one example embodiment of the present disclosure.
  • Fig. 2 is a drawing schematically illustrating an example of watermarked data generated by a method for preventing the breach of the original data to be used for the deep learning in accordance with one example embodiment of the present disclosure.
  • Fig. 3 is a drawing schematically illustrating the method for preventing the breach of the original data to be used for the deep learning in accordance with one example embodiment of the present disclosure.
  • Fig. 4 is a drawing schematically illustrating the method for preventing the breach of the original data to be used for the deep learning in accordance with another example embodiment of the present disclosure.
  • Fig. 5 is a drawing schematically illustrating the method for preventing the breach of the original data to be used for the deep learning in accordance with still another example embodiment of the present disclosure.
  • Any images referred to in the present disclosure may include images related to any roads paved or unpaved, in which case the objects on the roads or near the roads may include vehicles, persons, animals, plants, buildings, flying objects like planes or drones, or any other obstacles which may appear in a road-related scene, but the scope of the present disclosure is not limited thereto.
  • said any images referred to in the present disclosure may include images not related to any roads, such as images related to alleyway, land lots, sea, lakes, rivers, mountains, forests, deserts, sky, or any indoor space
  • the objects in said any images may include vehicles, persons, animals, plants, buildings, flying objects like planes or drones, ships, amphibious planes or ships, or any other obstacles which may appear in a scene related to alleyway, land lots, sea, lakes, rivers, mountains, forests, deserts, sky, or any indoor space, but the scope of the present disclosure is not limited thereto.
  • Fig. 1 is a drawing schematically illustrating a data breach preventing device which is used to prevent breach of original data to be used for deep learning in accordance with one example embodiment of the present disclosure.
  • the data breach preventing device 100 may include a memory 101 for storing instructions to prevent the breach of the original data to be used for the deep learning, and a processor 102 for performing processes of preventing the breach of the original data to be used for the deep learning according to the instructions in the memory 101.
  • the instructions may correspond to processes of watermarking data in order to generate the watermarked data from which a learning network outputs results different from those of the original data, but from which humans can recognize the original data.
  • the data breach preventing device 100 may typically achieve a desired system performance by using combinations of at least one computing device and at least one computer software, e.g., a computer processor, a memory, a storage, an input device, an output device, or any other conventional computing components, an electronic communication device such as a router or a switch, an electronic information storage system such as a network-attached storage (NAS) device and a storage area network (SAN) as the computing device and any instructions that allow the computing device to function in a specific way as the computer software.
  • NAS network-attached storage
  • SAN storage area network
  • processors of such devices may include hardware configuration of MPU (Micro Processing Unit) or CPU (Central Processing Unit), cache memory, data bus, etc. Additionally, the computing device may further include OS and software configuration of applications that achieve specific purposes.
  • MPU Micro Processing Unit
  • CPU Central Processing Unit
  • cache memory data bus
  • computing device may further include OS and software configuration of applications that achieve specific purposes.
  • Such description of the computing device does not exclude an integrated device including any combination of a processor, a memory, a medium, or any other computing components for implementing the present disclosure.
  • the processor 102 of the data breach preventing device 100 may perform or support another device to perform a process of adding noise onto the original data, to thereby generate 1-st noisy data.
  • the processor 102 may perform or support another device to perform iteration of (i) a process of inputting k-th noisy data into a learning network, to thereby allow the learning network to apply one or more learning operations to the k-th noisy data using one or more learned parameters of the learning network, and thus to output k-th characteristic information about the k-th noisy data, (ii) a process of launching an adversarial attack on the k-th noisy data via backpropagation using at least one of (ii-1) one or more (k_1)-st losses calculated by referring to (1) the k-th characteristic information and (2) at least one 1-st ground truth corresponding to the original data, and (i
  • the data breach preventing device 100 may add noise onto the original image, to thereby generate a noisy image as shown in (B) from which humans can recognize the original image, and may launch an adversarial attack on the noisy image, to thereby generate a watermarked image as shown in (C) from which humans can recognize the original image but the learning network outputs results different from those of the original image, thus preventing the breach of the original data to be used for the deep-learning.
  • a method for preventing the breach of the original data by using the data breach preventing device 100 in accordance with one example embodiment of the present disclosure is described by referring to Fig. 3 as follows.
  • the data breach preventing device 100 may instruct a noise network 110 to generate random noise.
  • the original data 1 may be included in an original data set.
  • the noise network 110 may generate the random noise having a normal distribution N( ⁇ , ⁇ 2) with one of means and one of variances created according to a noise function.
  • the data breach preventing device 100 may add noise to the original data, to thereby generate the 1-st noisy data 10.
  • the learning network may output from the 1-st noisy data 10 same or similar results to those from the original data 10.
  • the data breach preventing device 100 may perform or support another device to perform iteration of (i) a process of inputting the k-th noisy data into the learning network 120, to thereby allow the learning network 120 to apply the learning operations to the k-th noisy data using the learned parameters of the learning network, and thus to output k-th characteristic information 20 about the k-th noisy data, (ii) a process of launching the adversarial attack on the k-th noisy data via backpropagation using at least one of (ii-1) the (k_1)-st losses 130 calculated by referring to (1) the k-th characteristic information 20 and (2) the 1-st ground truth corresponding to the original data, and (ii-2) the (k_2)-nd losses 130 calculated by referring to (1) at least one k-th task specific output created by using the k-th characteristic information and (2) the 2-nd ground truth corresponding to the original data, and thus a process of generating (k+1)-th noisy data.
  • the data breach preventing device 100 may input the 1-st noisy data 10 into the learning network 120.
  • the learning network 120 may include a machine learning network, but the scope of the present disclosure is not limited thereto, and may include any learning network capable of generating outputs by applying the learning operations using their own learned parameters to the inputted data.
  • the machine learning network may include at least one of a k-Nearest Neighbors, a Linear Regression, a Logistic Regression, a Support Vector Machine (SVM), a Decision Tree and Random Forest, a Neural Network, a Clustering, a Visualization and a Dimensionality Reduction, an Association Rule Learning, a Deep Belief Network, a Reinforcement Learning, and a Deep learning algorithm, but the machine learning network is not limited thereto and may include various learning algorithms.
  • SVM Support Vector Machine
  • the learning network 120 may apply the learning operations to the 1-st noisy data 10 using the learned parameters of the learning network 120, to thereby output the 1-st characteristic information about the 1-st noisy data 10.
  • the 1-st characteristic information may be features or logits corresponding to the 1-st noisy data.
  • the 1-st characteristic information may be feature values related to certain features in the 1-st noisy data, or the logits including values related to at least one of vectors, matrices, and coordinates related to the certain features.
  • the 1-st noisy data are facial image data
  • the result above may be classes for face recognition, facial features, e.g., laughing expressions, coordinates of facial landmark points, e.g., both end points on far sides of an eye.
  • the data breach preventing device 100 may perform or support another device to perform a process of launching the adversarial attack on the 1-st noisy data via backpropagation using at least one of (i) one or more (1_1)-st losses calculated by referring to the 1-st characteristic information and the 1-st ground truth corresponding to the original data, and (ii) one or more (1_2)-nd losses calculated by referring to (ii-1) at least one 1-st task specific output generated by using the 1-st characteristic information and (ii-2) the 2-nd ground truth corresponding to the original data, to thereby generate 2-nd noisy data, resulting in a single iteration.
  • the data breach preventing device 100 may fix and not update the learned parameters of the learning network 120, and may perform the backpropagation to maximize at least part of the (1_1)-st losses and the (1_2)-nd losses regarding the 1-st noisy data only, to thereby convert the 1-st noisy date into the 2-nd noisy data.
  • the 2-nd noisy data may be recognized as the original data by humans, but may be recognized as data just with higher similarity to the original data by the learning network 120.
  • the learning network 120 may determine the original data, the 1-st noisy data, and the 2-nd noisy data as similar to one another, but with different probabilities of being true, e.g., being the original data.
  • the original data is an image of a dog
  • the learning network 120 may recognize the dog in all of the original data, the 1-st noisy data, and the 2-nd noisy data, however, the probabilities of the recognition result as the "dog" being true may decrease by degrees like 99% for the original data, 98% for the 1-st noisy data, and 97% for the 2-nd noisy data. This may be due to a fact that the 1-st noisy data is a result of adding the noise to the original data, and that the 2-nd noisy data is a result of launching the adversarial attack on the 1-st noisy data.
  • the data breach preventing device 100 may acquire at least one loss gradient for maximizing at least part of the (1_1)-st losses and the (1_2)-nd losses, and may backpropagate the loss gradient to the original data using gradient ascent.
  • the task specific output may be an output of a task to be performed by the learning network 120, and may have various results according to the task learned by the learning network 120, such as a probability of a class for classification, coordinates resulting from regression for location detection, etc., and an activation function of an activation unit may be applied to the characteristic information outputted from the learning network 120, to thereby generate the task specific output according to the task to be performed by the learning network 120.
  • the activation functions may include a sigmoid function, a linear function, a softmax function, an rlinear function, a square function, a sqrt function, an srlinear function, an abs function, a tanh function, a brlinear function, etc., but the scope of the present disclosure is not limited thereto.
  • the data breach preventing device 100 may map the characteristic information outputted from the learning network 120 onto each of classes, to thereby generate each of one or more probabilities of the inputted data, for each of the classes.
  • each of the probabilities for each of the classes may represent each of probabilities of the characteristic information, outputted for each of the classes from the learning network 120, being true.
  • the original data are the facial image data
  • a probability of the face having a laughing expression may be outputted as 0.75
  • a probability of the face not having the laughing expression may be outputted as 0.25, and the like.
  • a softmax algorithm may be used for mapping the characteristic information outputted from the learning network 120 onto each of the classes, but the scope of the present disclosure is not limited thereto, and various algorithms may be used for mapping the characteristic information onto each of the classes.
  • the data breach preventing device 100 may input the 2-nd noisy data into the learning network 120.
  • the learning network 120 may apply the learning operations to the 2-nd noisy data using the learned parameters of the learning network 120, to thereby output the 2-nd characteristic information about the 2-nd noisy data.
  • the data breach preventing device 100 may perform a process of launching a subsequent adversarial attack on the 2-nd noisy data via backpropagation using at least part of (i) one or more (2_1)-st losses calculated by referring to the 2-nd characteristic information and the 1-st ground truth corresponding to the original data, and (ii) one or more (2_2)-nd losses calculated by referring to (ii-1) at least one 2-nd task specific output generated by using the 2-nd characteristic information and (ii-2) the 2-nd ground truth corresponding to the original data, to thereby generate 3-rd noisy data, resulting in a subsequent iteration.
  • the probability of the result of the 3-rd noisy data being true may be lower than the probability of the result of the 2-nd noisy data being true, where the result of the 3-rd noisy data is outputted from the learning network 120 by applying the learning operations to the 3-rd noisy data, and the result of the 2-nd noisy data is outputted from the learning network 120 by applying the learning operations to the 2-nd noisy data.
  • the learning network 120 may output results derived from the noisy data different to those from the original data.
  • the learning network 120 may recognize the (k+1)-th noisy data as not being “dog” due to the repeated adversarial attacks.
  • humans can still recognize the original data from the (k+1)-th noisy data.
  • the data breach preventing device 100 may perform or support another device to perform (i) a process of comparing (i-1) (k+1)-th characteristic information created by inputting the (k+1)-th noisy data into the learning network 120 and (i-2) original characteristic information created by inputting the original data 1 into the learning network 120, and if the (k+1)-th characteristic information and the original characteristic information are determined as different, a process of stopping the iteration and outputting the (k+1)-th noisy data as the watermarked data 30, and (ii) a process of comparing (ii-1) a (k+1)-th task specific output created by referring to the (k+1)-th characteristic information and (ii-2) an original task specific output created by referring to the original characteristic information, and if the (k+1)-th task specific output and the original task specific output are determined as different, a process of stopping the iteration and outputting the (k+1)-th noisy data as the watermarked data 30.
  • the data breach preventing device 100 may stop the iteration and output the noisy data at the iteration as the watermarked data, at the time when an outcome determined by the learning network 120 of the output becomes different from an identity, e.g., a GT or ground truth, of the original data, by referring to its corresponding probability.
  • an identity e.g., a GT or ground truth
  • Fig. 4 is a drawing schematically illustrating the method for preventing the breach of the original data to be used for the deep learning by launching the adversarial attack on the 1-st noisy data using multiple learning networks 120_1, 120_2, ..., and 120_m with their own learned parameters in accordance with another example embodiment of the present disclosure.
  • each of the multiple learning networks 120_1, 120_2, ..., and 120_m may have completed learning to perform tasks at least part of which may be different from one another, and m may be an integer larger than 0.
  • m may be an integer larger than 0.
  • the data breach preventing device 100 may add noise to the original data, to thereby generate the 1-st noisy data 10.
  • the data breach preventing device 100 may perform or support another device to perform iteration of (i) a process of inputting the k-th noisy data respectively into the 1-st learning network 120_1 to the m-th learning network 120_m, to thereby allow the 1-st learning network 120_1 to the m-th learning network 120_m to (i-1) apply the learning operations with one or more 1-st learned parameters to one or more m-th learned parameters to the k-th noisy data respectively, and thus to (i-2) output (k_1)-st characteristic information 20_1 to (k_m)-th characteristic information 20_m about the k-th noisy data, (ii) a process of calculating the (k_1)-st losses by referring to at least one averaged value over a (k_1_1)-st loss to a (k_1_m)-th loss which are calculated by using the 1-st ground truth corresponding to the original data and each piece of the (i) a process of inputting the k-th noisy
  • the data breach preventing device 100 may input the 1-st noisy data 10 respectively into the 1-st learning network 120_1 to the m-th learning network 120_m.
  • the 1-st learning network 120_1 to the m-th learning network 120_m may apply the learning operations, respectively using the 1-st learned parameters to the m-th learned parameters, to the 1-st noisy data 10, to thereby respectively output the (1_1)-st characteristic information to the (1_m)-th characteristic information about the 1-st noisy data 10.
  • the data breach preventing device 100 may perform or support another device to perform (i) a process of calculating one or more (1_1)-st losses by referring to at least one averaged value over a (1_1_1)-st loss to a (1_1_m)-th loss which are calculated by using the 1-st ground truth corresponding to the original data and each piece of the (1_1)-st characteristic information to the (1_m)-th characteristic information, and (ii) a process of calculating one or more (1_2)-nd losses by referring to at least one averaged value over a (1_2_1)-st loss to a (1_2_m)-th loss which are calculated by using the 2-nd ground truth corresponding to the original data and each of a (1_1)-st task specific output to a (1_m)-th task specific output respectively created by using the (1_1)-st characteristic information to the (1_m)-th characteristic information.
  • the data breach preventing device 100 may perform a process of launching the adversarial attack on the 1-st noisy data via backpropagation using at least part of the (1_1)-st losses and the (1_2)-nd losses, to thereby generate the 2-nd noisy data, resulting in a single iteration.
  • the data breach preventing device 100 may input the 2-nd noisy data respectively into the 1-st learning network 120_1 to the m-th learning network 120_m.
  • the 1-st learning network 120_1 to the m-th learning network 120_m may apply the learning operations, respectively using the 1-st learned parameters to the m-th learned parameters, to the 2-nd noisy data, to thereby respectively output the (2_1)-st characteristic information to the (2_m)-th characteristic information about the 2-nd noisy data.
  • the data breach preventing device 100 may perform or support another device to perform (i) a process of calculating one or more (2_1)-st losses by referring to at least one averaged value over a (2_1_1)-st loss to a (2_1_m)-th loss which are calculated by using (i-1) the 1-st ground truth corresponding to the original data and (i-2) each piece of the (2_1)-st characteristic information to the (2_m)-th characteristic information, and (ii) a process of calculating one or more (2_2)-nd losses by referring to at least one averaged value over a (2_2_1)-st loss to a (2_2_m)-th loss which are calculated by using (ii-1) the 2-nd ground truth corresponding to the original data and (ii-2) each of a (2_1)-st task specific output to a (2_m)-th task specific output respectively created by using the (2_1)-st characteristic information to the (2_m)-th characteristic information.
  • the data breach preventing device 100 may perform a process of launching a subsequent adversarial attack on the 2-nd noisy data via backpropagation using at least part of the (2_1)-st losses and the (2_2)-nd losses, to thereby generate the 3-rd noisy data, resulting in a subsequent iteration.
  • the 1-st learning network 120_1 to the m-th learning network 120_m may output results derived from the noisy data different from those derived from the original data.
  • the data breach preventing device 100 may perform or support another device to perform (i) a process of averaging over a (k_1_1)-st summed loss to a (k_1_m)-th summed loss which are calculated by adding a ((k-1)_1)-st loss respectively onto each of the (k_1_1)-st loss to the (k_1_m)-th loss, in the process of calculating the (k_1)-st losses, and, (ii) a process of averaging over a (k_2_1)-st summed loss to a (k_2_m)-th summed loss which are calculated by adding a ((k-1)_2)-nd loss respectively onto each of the (k_2_1)-st loss to the (k_2_m)-th loss, in the process of calculating the (k_2)-nd losses.
  • initial values of the losses at a first iteration may be zeros.
  • the data breach preventing device 100 may perform or support another device to perform (i) a process of comparing between elements included in each of 1-st pairs whose elements are comprised of (i-1) each piece of the (k_1)-st characteristic information to the (k_m)-th characteristic information and (i-2) each piece of 1-st original characteristic information to m-th original characteristic information created by inputting the original data respectively into the 1-st learning network 120_1 to the m-th learning network 120_m, to thereby acquire a (k_1)-st condition value corresponding to the number of a part of the 1-st pairs whose elements are determined as different from each other, (ii) a process of comparing between elements included in each of 2-nd pairs whose elements are comprised of (ii-1) each of a (k_1)-st task specific output to a (k_m)-th task specific output and (ii-2) each of a 1-st original task specific output to an m-th original task specific output respectively created by referring to
  • the data breach preventing device 100 may stop the iteration and output the noisy data as the watermarked data, at the time when the number of a part of outcomes determined by the 1-st learning network 120_1 to the m-th learning network 120_m being different from the identity, e.g., the GT or ground truth, of the original data becomes greater than the predetermined threshold.
  • Fig. 5 is a drawing schematically illustrating the method for preventing the breach of the original data to be used for the deep learning by converting the 1-st noisy data into the (k_1)-st spatial data to the (k_j)-th spatial data through random rotation of the 1-st noisy data in accordance with still another example embodiment of the present disclosure.
  • the part easily deducible from the explanation of Figs. 3 and 4 will be omitted.
  • the data breach preventing device 100 may add noise to the original data, to thereby generate the 1-st noisy data 10.
  • the data breach preventing device 100 may perform or support another device to perform iteration of (i) a process of applying the random rotation to the k-th noisy data, to thereby generate the (k_1)-st spatial data to the (k_j)-th spatial data, (ii) a process of inputting the (k_1)-st spatial data to the (k_j)-th spatial data into the learning network 120, to thereby allow the learning network 120 to apply the learning operations to the (k_1)-st spatial data to the (k_j)-th spatial data and thus to output (k_1)-st spatial characteristic information to (k_j)-th spatial characteristic information, (iii) a process of calculating the (k_1)-st losses by referring to at least one averaged value over a (k_1_1)-st spatial loss to a (k_1_j)-th spatial loss which are calculated by using the 1-st ground truth corresponding to the original data and each piece of the (k_1)-st spatial characteristic information
  • the data breach preventing device 100 may apply the random rotation to the 1-st noisy data, to thereby generate the (1_1)-st spatial data 10_1 to the (1_j)-th spatial data 10_j.
  • the data breach preventing device 100 may perform or support another device to perform a process of randomly selecting one or more angles ⁇ ranging from - ⁇ to ⁇ , and repeating j times a process of rotating the k-th noisy data as corresponding to each of the angles ⁇ , to thereby generate the (1_1)-st spatial data 10_1 to the (1_j)-th spatial data 10_j.
  • the data breach preventing device 100 may input the (1_1)-st spatial data 10_1 to the (1_j)-th spatial data 10_j into the learning network 120.
  • the learning network 120 may apply the learning operations respectively to the (1_1)-st spatial data 10_1 to the (1_j)-th spatial data 10_j using the learned parameters of the learning network 120, to thereby output the (1_1)-st spatial characteristic information to the (1_j)-th spatial characteristic information respectively about the (1_1)-st spatial data 10_1 to the (1_j)-th spatial data 10_j.
  • the data breach preventing device 100 may perform or support another device to perform (i) a process of calculating one or more (1_1)-st losses by referring to at least one averaged value over a (1_1_1)-st spatial loss to a (1_1_j)-th spatial loss which are calculated by using (i-1) the 1-st ground truth corresponding to the original data and (i-2) each piece of the (1_1)-st spatial characteristic information to the (1_j)-th spatial characteristic information, and (ii) a process of calculating one or more (1_2)-nd losses by referring to at least one averaged value over a (1_2_1)-st spatial loss to a (1_2_j)-th spatial loss which are calculated by using (ii-1) the 2-nd ground truth corresponding to the original data and (ii-2) each of a (1_1)-st spatial task specific output to a (1_j)-th spatial task specific output respectively created by using the (1_1)-st spatial characteristic information to the (1_j)-
  • the data breach preventing device 100 may perform a process of launching the adversarial attack on the 1-st noisy data via backpropagation using at least part of the (1_1)-st losses and the (1_2)-nd losses, to thereby generate the 2-nd noisy data, resulting in a single iteration.
  • the data breach preventing device 100 may apply the random rotation to the 2-nd noisy data, to thereby generate the (2_1)-st spatial data to the (2_j)-th spatial data.
  • the data breach preventing device 100 may input the (2_1)-st spatial data to the (2_j)-th spatial data into the learning network 120.
  • the learning network 120 may apply the learning operations respectively to the (2_1)-st spatial data to the (2_j)-th spatial data using the learned parameters of the learning network 120, to thereby output the (2_1)-st spatial characteristic information to the (2_j)-th spatial characteristic information respectively about the (2_1)-st spatial data to the (2_j)-th spatial data.
  • the data breach preventing device 100 may perform or support another device to perform (i) a process of calculating one or more (2_1)-st losses by referring to at least one averaged value over a (2_1_1)-st spatial loss to a (2_1_j)-th spatial loss which are calculated by using (i-1) the 1-st ground truth corresponding to the original data and (i-2) each piece of the (2_1)-st spatial characteristic information to the (2_j)-th spatial characteristic information, and (ii) a process of calculating one or more (2_2)-nd losses by referring to at least one averaged value over a (2_2_1)-st spatial loss to a (2_2_j)-th spatial loss which are calculated by using (ii-1) the 2-nd ground truth corresponding to the original data and (ii-2) each of a (2_1)-st spatial task specific output to a (2_j)-th spatial task specific output respectively created by using the (2_1)-st spatial characteristic information to the (2_j)--
  • the data breach preventing device 100 may perform a process of launching a subsequent adversarial attack on the 2-nd noisy data via backpropagation using at least part of the (2_1)-st losses and the (2_2)-nd losses, to thereby generate the 3-rd noisy data, resulting in a subsequent iteration.
  • the learning network may output results derived from the noisy data different to those from the original data.
  • the data breach preventing device 100 may perform or support another device to perform (i) a process of averaging over a (k_1_1)-st summed spatial loss to a (k_1_j)-th summed spatial loss which are calculated by adding the ((k-1)_1)-st loss respectively onto each of the (k_1_1)-st spatial loss to the (k_1_j)-th spatial loss, in the process of calculating the (k_1)-st losses, and (ii) a process of averaging over a (k_2_1)-st summed spatial loss to a (k_2_j)-th summed spatial loss which are calculated by adding the ((k-1)_2)-nd loss respectively onto each of the (k_2_1)-st spatial loss to the (k_2_j)-th spatial loss, in the process of calculating the (k_2)-nd losses.
  • initial values of the losses at a first iteration may be zeros.
  • the data breach preventing device 100 may perform or support another device to perform (i) a process of comparing between elements included in each of 1-st pairs whose elements are comprised of (i-1) each piece of the (k_1)-st spatial characteristic information to the (k_j)-th spatial characteristic information and (i-2) the original characteristic information created by inputting the original data into the learning network 120, to thereby acquire a (k_1)-st condition value corresponding to the number of a part of the 1-st pairs whose elements are determined as different from each other, (ii) a process of comparing between elements included in each of 2-nd pairs whose elements are comprised of (ii-1) each of a (k_1)-st spatial task specific output to a (k_j)-th spatial task specific output and (ii-2) an original task specific output created by referring to the original characteristic information, to thereby acquire a (k_2)-nd condition value corresponding to the number of a part of the 2-nd pairs whose elements are determined as different from each other
  • the data breach preventing device 100 may stop the iteration and output the noisy data as the watermarked data, at the time when the number of a part of outcomes of the (k_1)-st spatial data to the (k_j)-th spatial data being different from the identity, e.g., the GT or ground truth, of the original data becomes greater than the predetermined threshold.
  • the embodiments of the present disclosure as explained above can be implemented in a form of executable program command through a variety of computer means recordable in computer readable media.
  • the computer readable media may include solely or in combination, program commands, data files, and data structures.
  • the program commands recorded to the media may be components specially designed for the present disclosure or may be usable to those skilled in the art of computer software.
  • Computer readable media include magnetic media such as hard disk, floppy disk, and magnetic tape, optical media such as CD-ROM and DVD, magneto-optical media such as floptical disk and hardware devices such as ROM, RAM, and flash memory specially designed to store and carry out program commands.
  • Program commands may include not only a machine language code made by a complier but also a high level code that can be used by an interpreter etc., which may be executed by a computer.
  • the aforementioned hardware device can work as more than a software module to perform the action of the present disclosure and vice versa.

Abstract

A method for preventing breach of original data for deep learning is provided. The method includes steps of: a data breach preventing device (a) adding noise onto the acquired original data to generate 1-st noisy data; and (b)(b1) while increasing an integer k from 1 to an integer larger than 0, (i) inputting k-th noisy data into a learning network, to apply learning operations to the k-th noisy data using learned parameters of the learning network, and to output k-th characteristic information, and (ii) launching an adversarial attack on the k-th noisy data via backpropagation using at least one of (ii-1) (k_1)-st losses calculated using the k-th characteristic information and a 1-st ground truth, and (ii-2) (k_2)-nd losses calculated using (1) a k-th task specific output and (2) a 2-nd ground truth, and generating (k+1)-th noisy data, and (b2) as a result, generating n-th noisy data as watermarked data.

Description

METHOD FOR PREVENTING BREACH OF ORIGINAL DATA FOR DEEP LEARNING AND DATA BREACH PREVENTING DEVICE USING THEM
The present disclosure relates to a method for preventing breach of original data to be used for deep learning and a data breach preventing device using the same; and more particularly, to the method for converting the original data into optimal noisy data that humans can recognize but from which a learning network outputs different results, thus preventing the breach of the original data to be used for the deep learning, and the data breach preventing device using the same.
Big data refers to data including all of unstructured data and semi-structured data not utilized so far, like e-commerce data, metadata, web log data, radio frequency identification (RFID) data, sensor network data, social network data, data of Internet text and documents, Internet search indexing data, as well as all of structured data used by conventional enterprises or public institutions. Data as such are referred to as big data in the sense that common software tools and computer systems cannot easily handle such a huge volume of data.
And, although such big data may be insignificant by itself, it can be useful for generation of new data, judgment, or prediction in various fields through machine learning on patterns and the like.
And, in the machine learning and artificial intelligence industry, there is a need to share data sets to be used for deep learning, and accordingly, the data sets are becoming high value assets in the same industry.
However, there are various problems in sharing the data sets.
For example, when a user, who lacks resources to preview, verify, or label the data sets, delegates handling of the data sets to a third party, since the third party can directly access the original data included in the data sets, the original data is at risk of illegal copying. In addition, the third party can train a new deep learning network using the original data, and accordingly, there is a problem that the user's rights to the data sets are violated.
To prevent this, a method of converting the original data for the deep learning into naive watermarked data for sharing has been proposed.
However, such a method is mostly useless for protecting the original data because the learning network trained with the naive watermarked data shows performance almost similar to that of the learning network trained with the original data.
Also, since the naive watermark is easily erasable by image processing technology, there is a limit to protecting the original data.
It is an object of the present disclosure to solve all the aforementioned problems.
It is another object of the present disclosure to prevent breach of original data to be used for deep learning.
It is still another object of the present disclosure to protect the original data, by generating optimal noisy data humans can recognize but from which a learning network outputs different results unlike when using the original data.
It is still yet another object of the present disclosure to provide watermarked data, from which the original data cannot be recovered by using data processing technology such as image processing technology.
It is still yet another object of the present disclosure to provide the watermarked data which appears to humans as minimally deteriorated, and degrade accuracy of the learned learning network having been trained with the watermarked data.
In order to accomplish the objects above, distinctive structures of the present disclosure are described as follows.
In accordance with one aspect of the present disclosure, there is provided a method for preventing breach of original data to be used for deep learning, including steps of: (a) a data breach preventing device, if the original data is acquired, performing or supporting another device to perform a process of adding noise onto the original data, to thereby generate 1-st noisy data; and (b) the data breach preventing device, (b1) while increasing an integer k from 1 to n-1 wherein n is an integer larger than 1, performing or supporting another device to perform iteration of (i) a process of inputting k-th noisy data into a learning network, to thereby allow the learning network to apply one or more learning operations to the k-th noisy data using one or more learned parameters of the learning network, and thus to output k-th characteristic information about the k-th noisy data, and (ii) a process of launching an adversarial attack on the k-th noisy data via backpropagation using at least one of (ii-1) one or more (k_1)-st losses calculated by referring to (1) the k-th characteristic information and (2) at least one 1-st ground truth corresponding to the original data, and (ii-2) one or more (k_2)-nd losses calculated by referring to (1) at least one k-th task specific output created by using the k-th characteristic information and (2) at least one 2-nd ground truth corresponding to the original data, and thus a process of generating (k+1)-th noisy data, and (b2) as a result of the iteration, generating or supporting another device to generate n-th noisy data as watermarked data corresponding to the original data.
As one example, the learning network includes a 1-st learning network to an m-th learning network respectively having one or more 1-st learned parameters to one or more m-th learned parameters wherein m is an integer greater than 0, and wherein, at the step of (b), the data breach preventing device performs or supports another device to perform a process of inputting the k-th noisy data respectively into the 1-st learning network to the m-th learning network, to thereby allow the 1-st learning network to the m-th learning network to (i) apply the learning operations with the 1-st learned parameters to the m-th learned parameters to the k-th noisy data respectively, and thus output (k_1)-st characteristic information to (k_m)-th characteristic information about the k-th noisy data, (ii) calculate the (k_1)-st losses by referring to at least one averaged value over a (k_1_1)-st loss to a (k_1_m)-th loss which are calculated by using the 1-st ground truth and each piece of the (k_1)-st characteristic information to the (k_m)-th characteristic information, and (iii) calculate the (k_2)-nd losses by referring to at least one averaged value over a (k_2_1)-st loss to a (k_2_m)-th loss which are calculated by using the 2-nd ground truth and each of a (k_1)-st task specific output to a (k_m)-th task specific output respectively created by using the (k_1)-st characteristic information to the (k_m)-th characteristic information.
As one example, the data breach preventing device performs or supports another device to perform (i) a process of averaging over a (k_1_1)-st summed loss to a (k_1_m)-th summed loss which are calculated by adding a ((k-1)_1)-st loss respectively onto each of the (k_1_1)-st loss to the (k_1_m)-th loss, in the process of calculating the (k_1)-st losses, and (ii) a process of averaging over a (k_2_1)-st summed loss to a (k_2_m)-th summed loss which are calculated by adding a ((k-1)_2)-nd loss respectively onto each of the (k_2_1)-st loss to the (k_2_m)-th loss, in the process of calculating the (k_2)-nd losses.
As one example, the data breach preventing device performs or supports another device to perform (i) a process of comparing between elements included in each of 1-st pairs whose elements are comprised of (i-1) each piece of the (k_1)-st characteristic information to the (k_m)-th characteristic information and (i-2) each piece of 1-st original characteristic information to m-th original characteristic information respectively created by inputting the original data into the 1-st learning network to the m-th learning network, to thereby acquire a (k_1)-st condition value corresponding to the number of a part of the 1-st pairs whose elements are determined as different from each other, (ii) a process of comparing between elements included in each of 2-nd pairs whose elements are comprised of (ii-1) each of a (k_1)-st task specific output to a (k_m)-th task specific output and (ii-2) each of a 1-st original task specific output to an m-th original task specific output respectively created by referring to the 1-st original characteristic information to the m-th original characteristic information, to thereby acquire a (k_2)-nd condition value corresponding to the number of a part of the 2-nd pairs whose elements are determined as different from each other, (iii) a process of acquiring a k-th termination condition value by adding one of the (k_1)-st condition value and the (k_2)-nd condition value onto a (k-1)-th termination condition value, and (iv) after the (k+1)-th noisy data is created, a process of determining whether the k-th termination condition value is equal to or greater than a predetermined threshold, and if the k-th termination condition value is determined as equal to or greater than the predetermined threshold, a process of stopping the iteration and outputting the (k+1)-th noisy data as the watermarked data.
As one example, at the step of (b), the data breach preventing device performs or supports another device to perform (i) a process of applying random rotation to the k-th noisy data, to thereby generate (k_1)-st spatial data to (k_j)-th spatial data wherein j is an integer greater than 0, (ii) a process of inputting the (k_1)-st spatial data to the (k_j)-th spatial data into the learning network, to thereby allow the learning network to apply the deep-learning operation to the (k_1)-st spatial data to the (k_j)-th spatial data and thus to output (k_1)-st spatial characteristic information to (k_j)-th spatial characteristic information, (iii) a process of calculating the (k_1)-st losses by referring to at least one averaged value over a (k_1_1)-st spatial loss to a (k_1_j)-th spatial loss which are calculated by using the 1-st ground truth and each piece of the (k_1)-st spatial characteristic information to the (k_j)-th spatial characteristic information, and (iv) a process of calculating the (k_2)-nd losses by referring to at least one averaged value over a (k_2_1)-st spatial loss to a (k_2_j)-th spatial loss which are calculated by using the 2-nd ground truth and each of a (k_1)-st spatial task specific output to a (k_j)-th spatial task specific output respectively created by using the (k_1)-st spatial characteristic information to the (k_j)-th spatial characteristic information.
As one example, the data breach preventing device performs or supports another device to perform (i) a process of averaging over a (k_1_1)-st summed spatial loss to a (k_1_j)-th summed spatial loss which are calculated by adding a ((k-1)_1)-st loss respectively onto each of the (k_1_1)-st spatial loss to the (k_1_j)-th spatial loss, in the process of calculating the (k_1)-st losses, and (ii) a process of averaging over a (k_2_1)-st summed spatial loss to a (k_2_j)-th summed spatial loss which are calculated by adding a ((k-1)_2)-nd loss respectively onto each of the (k_2_1)-st spatial loss to the (k_2_j)-th spatial loss, in the process of calculating the (k_2)-nd losses.
As one example, the data breach preventing device performs or supports another device to perform (i) a process of comparing between elements included in each of 1-st pairs whose elements are comprised of (i-1) each piece of the (k_1)-st spatial characteristic information to the (k_j)-th spatial characteristic information and (i-2) each piece of original characteristic information created by inputting the original data into the learning network, to thereby acquire a (k_1)-st condition value corresponding to the number of a part of the 1-st pairs whose elements are determined as different from each other, (ii) a process of comparing between elements included in each of 2-nd pairs whose elements are comprised of (ii-1) each of a (k_1)-st spatial task specific output to a (k_j)-th spatial task specific output and (ii-2) an original task specific output created by referring to the original characteristic information, to thereby acquire a (k_2)-nd condition value corresponding to the number of a part of the 2-nd pairs whose elements are determined as different from each other, (iii) a process of acquiring a k-th termination condition value by adding one of the (k_1)-st condition value and the (k_2)-nd condition value onto a (k-1)-th termination condition value, and (iv) after the (k+1)-th noisy data is created, a process of determining whether the k-th termination condition value is equal to or greater than a predetermined threshold, and if the k-th termination condition value is determined as equal to or greater than the predetermined threshold, a process of stopping the iteration and outputting the (k+1)-th noisy data as the watermarked data.
As one example, the data breach preventing device performs or supports another device to perform a process of randomly selecting one or more angles θ ranging from -π to π and a process of rotating the k-th noisy data as corresponding to each of the angles θ, in the process of creating the (k_1)-st spatial data to the (k_j)-th spatial data.
As one example, at the step of (b), the data breach preventing device performs or supports another device to perform (i) a process of comparing (i-1) (k+1)-th characteristic information created by inputting the (k+1)-th noisy data into the learning network and (i-2) original characteristic information created by inputting the original data into the learning network, and if the (k+1)-th characteristic information and the original characteristic information are determined as different, a process of stopping the iteration and outputting the (k+1)-th noisy data as the watermarked data, and (ii) a process of comparing (ii-1) a (k+1)-th task specific output created by referring to the (k+1)-th characteristic information and (ii-2) an original task specific output created by referring to the original characteristic information, and if the (k+1)-th task specific output and the original task specific output are determined as different, a process of stopping the iteration and outputting the (k+1)-th noisy data as the watermarked data.
As one example, at the step of (a), the data breach preventing device performs or supports another device to perform a process of adding random noise, according to a normal distribution with one of variances and one of means created by a noise function, onto the original data.
In accordance with another aspect of the present disclosure, there is provided a data breach preventing device for preventing breach of original data to be used for deep learning, including: at least one memory that stores instructions; and at least one processor configured to execute the instructions to perform or support another device to perform: (I) if the original data is acquired, a process of adding noise onto the original data, to thereby generate 1-st noisy data, and (II) (II-1) while increasing an integer k from 1 to n-1 wherein n is an integer larger than 1, (i) a process of inputting k-th noisy data into a learning network, to thereby allow the learning network to apply one or more learning operations to the k-th noisy data using one or more learned parameters of the learning network, and thus to output k-th characteristic information about the k-th noisy data, and (ii) a process of launching an adversarial attack on the k-th noisy data via backpropagation using at least one of (ii-1) one or more (k_1)-st losses calculated by referring to (1) the k-th characteristic information and (2) at least one 1-st ground truth corresponding to the original data, and (ii-2) one or more (k_2)-nd losses calculated by referring to (1) at least one k-th task specific output created by using the k-th characteristic information and (2) at least one 2-nd ground truth corresponding to the original data, and thus a process of generating (k+1)-th noisy data, and (II-2) as a result of the iteration, a process of generating n-th noisy data as watermarked data corresponding to the original data.
As one example, the learning network includes a 1-st learning network to an m-th learning network respectively having one or more 1-st learned parameters to one or more m-th learned parameters wherein m is an integer greater than 0, and wherein, at the process of (II), the processor performs or supports another device to perform a process of inputting the k-th noisy data respectively into the 1-st learning network to the m-th learning network, to thereby allow the 1-st learning network to the m-th learning network to (i) apply the learning operations with the 1-st learned parameters to the m-th learned parameters to the k-th noisy data respectively, and thus output (k_1)-st characteristic information to (k_m)-th characteristic information about the k-th noisy data, (ii) calculate the (k_1)-st losses by referring to at least one averaged value over a (k_1_1)-st loss to a (k_1_m)-th loss which are calculated by using the 1-st ground truth and each piece of the (k_1)-st characteristic information to the (k_m)-th characteristic information, and (iii) calculate the (k_2)-nd losses by referring to at least one averaged value over a (k_2_1)-st loss to a (k_2_m)-th loss which are calculated by using the 2-nd ground truth and each of a (k_1)-st task specific output to a (k_m)-th task specific output respectively created by using the (k_1)-st characteristic information to the (k_m)-th characteristic information.
As one example, the processor performs or supports another device to perform (i) a process of averaging over a (k_1_1)-st summed loss to a (k_1_m)-th summed loss which are calculated by adding a ((k-1)_1)-st loss respectively onto each of the (k_1_1)-st loss to the (k_1_m)-th loss, in the process of calculating the (k_1)-st losses, and (ii) a process of averaging over a (k_2_1)-st summed loss to a (k_2_m)-th summed loss which are calculated by adding a ((k-1)_2)-nd loss respectively onto each of the (k_2_1)-st loss to the (k_2_m)-th loss, in the process of calculating the (k_2)-nd losses.
As one example, the processor performs or supports another device to perform (i) a process of comparing between elements included in each of 1-st pairs whose elements are comprised of (i-1) each piece of the (k_1)-st characteristic information to the (k_m)-th characteristic information and (i-2) each piece of 1-st original characteristic information to m-th original characteristic information respectively created by inputting the original data into the 1-st learning network to the m-th learning network, to thereby acquire a (k_1)-st condition value corresponding to the number of a part of the 1-st pairs whose elements are determined as different from each other, (ii) a process of comparing between elements included in each of 2-nd pairs whose elements are comprised of (ii-1) each of a (k_1)-st task specific output to a (k_m)-th task specific output and (ii-2) each of a 1-st original task specific output to an m-th original task specific output respectively created by referring to the 1-st original characteristic information to the m-th original characteristic information, to thereby acquire a (k_2)-nd condition value corresponding to the number of a part of the 2-nd pairs whose elements are determined as different from each other, (iii) a process of acquiring a k-th termination condition value by adding one of the (k_1)-st condition value and the (k_2)-nd condition value onto a (k-1)-th termination condition value, and (iv) after the (k+1)-th noisy data is created, a process of determining whether the k-th termination condition value is equal to or greater than a predetermined threshold, and if the k-th termination condition value is determined as equal to or greater than the predetermined threshold, a process of stopping the iteration and outputting the (k+1)-th noisy data as the watermarked data.
As one example, at the process of (II), the processor performs or supports another device to perform (i) a process of applying random rotation to the k-th noisy data, to thereby generate (k_1)-st spatial data to (k_j)-th spatial data wherein j is an integer greater than 0, (ii) a process of inputting the (k_1)-st spatial data to the (k_j)-th spatial data into the learning network, to thereby allow the learning network to apply the deep-learning operation to the (k_1)-st spatial data to the (k_j)-th spatial data and thus to output (k_1)-st spatial characteristic information to (k_j)-th spatial characteristic information, (iii) a process of calculating the (k_1)-st losses by referring to at least one averaged value over a (k_1_1)-st spatial loss to a (k_1_j)-th spatial loss which are calculated by using the 1-st ground truth and each piece of the (k_1)-st spatial characteristic information to the (k_j)-th spatial characteristic information, and (iv) a process of calculating the (k_2)-nd losses by referring to at least one averaged value over a (k_2_1)-st spatial loss to a (k_2_j)-th spatial loss which are calculated by using the 2-nd ground truth and each of a (k_1)-st spatial task specific output to a (k_j)-th spatial task specific output respectively created by using the (k_1)-st spatial characteristic information to the (k_j)-th spatial characteristic information.
As one example, the processor performs or supports another device to perform (i) a process of averaging over a (k_1_1)-st summed spatial loss to a (k_1_j)-th summed spatial loss which are calculated by adding a ((k-1)_1)-st loss respectively onto each of the (k_1_1)-st spatial loss to the (k_1_j)-th spatial loss, in the process of calculating the (k_1)-st losses, and (ii) a process of averaging over a (k_2_1)-st summed spatial loss to a (k_2_j)-th summed spatial loss which are calculated by adding a ((k-1)_2)-nd loss respectively onto each of the (k_2_1)-st spatial loss to the (k_2_j)-th spatial loss, in the process of calculating the (k_2)-nd losses.
As one example, the processor performs or supports another device to perform (i) a process of comparing between elements included in each of 1-st pairs whose elements are comprised of (i-1) each piece of the (k_1)-st spatial characteristic information to the (k_j)-th spatial characteristic information and (i-2) each piece of original characteristic information created by inputting the original data into the learning network, to thereby acquire a (k_1)-st condition value corresponding to the number of a part of the 1-st pairs whose elements are determined as different from each other, (ii) a process of comparing between elements included in each of 2-nd pairs whose elements are comprised of (ii-1) each of a (k_1)-st spatial task specific output to a (k_j)-th spatial task specific output and (ii-2) an original task specific output created by referring to the original characteristic information, to thereby acquire a (k_2)-nd condition value corresponding to the number of a part of the 2-nd pairs whose elements are determined as different from each other, (iii) a process of acquiring a k-th termination condition value by adding one of the (k_1)-st condition value and the (k_2)-nd condition value onto a (k-1)-th termination condition value, and (iv) after the (k+1)-th noisy data is created, a process of determining whether the k-th termination condition value is equal to or greater than a predetermined threshold, and if the k-th termination condition value is determined as equal to or greater than the predetermined threshold, a process of stopping the iteration and outputting the (k+1)-th noisy data as the watermarked data.
As one example, the processor performs or supports another device to perform a process of randomly selecting one or more angles θ ranging from -π to π and a process of rotating the k-th noisy data as corresponding to each of the angles θ, in the process of creating the (k_1)-st spatial data to the (k_j)-th spatial data.
As one example, at the process of (II), the processor performs or supports another device to perform (i) a process of comparing (i-1) (k+1)-th characteristic information created by inputting the (k+1)-th noisy data into the learning network and (i-2) original characteristic information created by inputting the original data into the learning network, and if the (k+1)-th characteristic information and the original characteristic information are determined as different, a process of stopping the iteration and outputting the (k+1)-th noisy data as the watermarked data, and (ii) a process of comparing (ii-1) a (k+1)-th task specific output created by referring to the (k+1)-th characteristic information and (ii-2) an original task specific output created by referring to the original characteristic information, and if the (k+1)-th task specific output and the original task specific output are determined as different, a process of stopping the iteration and outputting the (k+1)-th noisy data as the watermarked data.
As one example, at the process of (I), the processor performs or supports another device to perform a process of adding random noise, according to a normal distribution with one of variances and one of means created by a noise function, onto the original data.
In addition, recordable media that are readable by a computer for storing a computer program to execute the method of the present disclosure are further provided.
The present disclosure has an effect of preventing the breach of the original data to be used for the deep learning.
The present disclosure has another effect of protecting the original data, by generating resultant data humans can recognize but from which the learning network outputs different results, unlike when using the original data.
The present disclosure has still another effect of providing the watermarked data, from which the original data cannot be recovered by using data processing technology such as image processing technology.
The present disclosure has still yet another effect of providing the watermarked data which appears to humans as minimally deteriorated, and degrade accuracy of the learning network having been trained with the watermarked data.
The above and other objects and features of the present disclosure will become apparent from the following description of preferred embodiments given in conjunction with the accompanying drawings, in which:
Fig. 1 is a drawing schematically illustrating a data breach preventing device which is used to prevent breach of original data to be used for deep learning in accordance with one example embodiment of the present disclosure.
Fig. 2 is a drawing schematically illustrating an example of watermarked data generated by a method for preventing the breach of the original data to be used for the deep learning in accordance with one example embodiment of the present disclosure.
Fig. 3 is a drawing schematically illustrating the method for preventing the breach of the original data to be used for the deep learning in accordance with one example embodiment of the present disclosure.
Fig. 4 is a drawing schematically illustrating the method for preventing the breach of the original data to be used for the deep learning in accordance with another example embodiment of the present disclosure.
Fig. 5 is a drawing schematically illustrating the method for preventing the breach of the original data to be used for the deep learning in accordance with still another example embodiment of the present disclosure.
In the following detailed description, reference is made to the accompanying drawings that show, by way of illustration, specific embodiments in which the present disclosure may be practiced. These embodiments are described in sufficient detail to enable those skilled in the art to practice the present disclosure. It is to be understood that the various embodiments of the present disclosure, although different, are not necessarily mutually exclusive. For example, a particular feature, structure, or characteristic described herein may be implemented as being changed from an embodiment to other embodiments without departing from the spirit and scope of the present disclosure. In addition, it is to be understood that the position or arrangement of individual elements within each embodiment may be modified without departing from the spirit and scope of the present disclosure. The following detailed description is, therefore, not to be taken in a limiting sense, and the scope of the present disclosure is described as including the appended claims, along with the full range of equivalents to which the claims are entitled. In the drawings, like numerals refer to the same or similar components throughout the several aspects.
Any images referred to in the present disclosure may include images related to any roads paved or unpaved, in which case the objects on the roads or near the roads may include vehicles, persons, animals, plants, buildings, flying objects like planes or drones, or any other obstacles which may appear in a road-related scene, but the scope of the present disclosure is not limited thereto. As another example, said any images referred to in the present disclosure may include images not related to any roads, such as images related to alleyway, land lots, sea, lakes, rivers, mountains, forests, deserts, sky, or any indoor space, in which case the objects in said any images may include vehicles, persons, animals, plants, buildings, flying objects like planes or drones, ships, amphibious planes or ships, or any other obstacles which may appear in a scene related to alleyway, land lots, sea, lakes, rivers, mountains, forests, deserts, sky, or any indoor space, but the scope of the present disclosure is not limited thereto.
The headings and abstract of the present disclosure provided herein are for convenience only and do not limit or interpret the scope or meaning of the embodiments.
To allow those skilled in the art to carry out the present disclosure easily, the example embodiments of the present disclosure will be explained in detail as shown below by referring to attached drawings.
Fig. 1 is a drawing schematically illustrating a data breach preventing device which is used to prevent breach of original data to be used for deep learning in accordance with one example embodiment of the present disclosure.
By referring to Fig. 1, the data breach preventing device 100 may include a memory 101 for storing instructions to prevent the breach of the original data to be used for the deep learning, and a processor 102 for performing processes of preventing the breach of the original data to be used for the deep learning according to the instructions in the memory 101. As one example, the instructions may correspond to processes of watermarking data in order to generate the watermarked data from which a learning network outputs results different from those of the original data, but from which humans can recognize the original data.
Specifically, the data breach preventing device 100 may typically achieve a desired system performance by using combinations of at least one computing device and at least one computer software, e.g., a computer processor, a memory, a storage, an input device, an output device, or any other conventional computing components, an electronic communication device such as a router or a switch, an electronic information storage system such as a network-attached storage (NAS) device and a storage area network (SAN) as the computing device and any instructions that allow the computing device to function in a specific way as the computer software.
Also, the processors of such devices may include hardware configuration of MPU (Micro Processing Unit) or CPU (Central Processing Unit), cache memory, data bus, etc. Additionally, the computing device may further include OS and software configuration of applications that achieve specific purposes.
Such description of the computing device does not exclude an integrated device including any combination of a processor, a memory, a medium, or any other computing components for implementing the present disclosure.
Meanwhile, if the original data is acquired, the processor 102 of the data breach preventing device 100, according to the instructions stored in the memory 101, may perform or support another device to perform a process of adding noise onto the original data, to thereby generate 1-st noisy data. And while increasing an integer k from 1 to n-1 where n is an integer larger than 1, the processor 102 may perform or support another device to perform iteration of (i) a process of inputting k-th noisy data into a learning network, to thereby allow the learning network to apply one or more learning operations to the k-th noisy data using one or more learned parameters of the learning network, and thus to output k-th characteristic information about the k-th noisy data, (ii) a process of launching an adversarial attack on the k-th noisy data via backpropagation using at least one of (ii-1) one or more (k_1)-st losses calculated by referring to (1) the k-th characteristic information and (2) at least one 1-st ground truth corresponding to the original data, and (ii-2) one or more (k_2)-nd losses calculated by referring to (1) at least one k-th task specific output created by using the k-th characteristic information and (2) at least one 2-nd ground truth corresponding to the original data, and thus a process of generating (k+1)-th noisy data. And as a result of the iteration, the processor 102 may perform or support another device to perform a process of generating n-th noisy data as the watermarked data corresponding to the original data.
As one example, by referring to Fig. 2, if an original image as shown in (A) is inputted, the data breach preventing device 100 may add noise onto the original image, to thereby generate a noisy image as shown in (B) from which humans can recognize the original image, and may launch an adversarial attack on the noisy image, to thereby generate a watermarked image as shown in (C) from which humans can recognize the original image but the learning network outputs results different from those of the original image, thus preventing the breach of the original data to be used for the deep-learning.
A method for preventing the breach of the original data by using the data breach preventing device 100 in accordance with one example embodiment of the present disclosure is described by referring to Fig. 3 as follows.
First, if the original data 1 is acquired, the data breach preventing device 100 may instruct a noise network 110 to generate random noise. Herein, the original data 1 may be included in an original data set. And, the noise network 110 may generate the random noise having a normal distribution N(μ, σ²) with one of means and one of variances created according to a noise function.
And, the data breach preventing device 100 may add noise to the original data, to thereby generate the 1-st noisy data 10.
Herein, humans can easily recognize the original data 1 from the 1-st noisy data 10. Also, the learning network may output from the 1-st noisy data 10 same or similar results to those from the original data 10.
Next, while increasing an integer k from 1 to n-1, the data breach preventing device 100 may perform or support another device to perform iteration of (i) a process of inputting the k-th noisy data into the learning network 120, to thereby allow the learning network 120 to apply the learning operations to the k-th noisy data using the learned parameters of the learning network, and thus to output k-th characteristic information 20 about the k-th noisy data, (ii) a process of launching the adversarial attack on the k-th noisy data via backpropagation using at least one of (ii-1) the (k_1)-st losses 130 calculated by referring to (1) the k-th characteristic information 20 and (2) the 1-st ground truth corresponding to the original data, and (ii-2) the (k_2)-nd losses 130 calculated by referring to (1) at least one k-th task specific output created by using the k-th characteristic information and (2) the 2-nd ground truth corresponding to the original data, and thus a process of generating (k+1)-th noisy data. And, as a result of the iteration, the data breach preventing device 100 may generate or support another device to generate n-th noisy data as the watermarked data 30 corresponding to the original data 1. Herein, n may be an integer larger than 1.
Detailed explanation is as follows.
First, the data breach preventing device 100 may input the 1-st noisy data 10 into the learning network 120.
Herein, the learning network 120 may include a machine learning network, but the scope of the present disclosure is not limited thereto, and may include any learning network capable of generating outputs by applying the learning operations using their own learned parameters to the inputted data. And, the machine learning network may include at least one of a k-Nearest Neighbors, a Linear Regression, a Logistic Regression, a Support Vector Machine (SVM), a Decision Tree and Random Forest, a Neural Network, a Clustering, a Visualization and a Dimensionality Reduction, an Association Rule Learning, a Deep Belief Network, a Reinforcement Learning, and a Deep learning algorithm, but the machine learning network is not limited thereto and may include various learning algorithms.
Then, the learning network 120 may apply the learning operations to the 1-st noisy data 10 using the learned parameters of the learning network 120, to thereby output the 1-st characteristic information about the 1-st noisy data 10.
Herein, the 1-st characteristic information may be features or logits corresponding to the 1-st noisy data. Also, the 1-st characteristic information may be feature values related to certain features in the 1-st noisy data, or the logits including values related to at least one of vectors, matrices, and coordinates related to the certain features. For example, if the 1-st noisy data are facial image data, the result above may be classes for face recognition, facial features, e.g., laughing expressions, coordinates of facial landmark points, e.g., both end points on far sides of an eye.
Then, the data breach preventing device 100 may perform or support another device to perform a process of launching the adversarial attack on the 1-st noisy data via backpropagation using at least one of (i) one or more (1_1)-st losses calculated by referring to the 1-st characteristic information and the 1-st ground truth corresponding to the original data, and (ii) one or more (1_2)-nd losses calculated by referring to (ii-1) at least one 1-st task specific output generated by using the 1-st characteristic information and (ii-2) the 2-nd ground truth corresponding to the original data, to thereby generate 2-nd noisy data, resulting in a single iteration.
Herein, during the backpropagation using at least part of the (1_1)-st losses and the (1_2)-nd losses, the data breach preventing device 100 may fix and not update the learned parameters of the learning network 120, and may perform the backpropagation to maximize at least part of the (1_1)-st losses and the (1_2)-nd losses regarding the 1-st noisy data only, to thereby convert the 1-st noisy date into the 2-nd noisy data. Then, the 2-nd noisy data may be recognized as the original data by humans, but may be recognized as data just with higher similarity to the original data by the learning network 120. That is, the learning network 120 may determine the original data, the 1-st noisy data, and the 2-nd noisy data as similar to one another, but with different probabilities of being true, e.g., being the original data. As one example, if the original data is an image of a dog, and if the learning network 120 is trained to recognize a dog on an image, then the learning network 120 may recognize the dog in all of the original data, the 1-st noisy data, and the 2-nd noisy data, however, the probabilities of the recognition result as the "dog" being true may decrease by degrees like 99% for the original data, 98% for the 1-st noisy data, and 97% for the 2-nd noisy data. This may be due to a fact that the 1-st noisy data is a result of adding the noise to the original data, and that the 2-nd noisy data is a result of launching the adversarial attack on the 1-st noisy data.
Also, during the backpropagation using at least part of the (1_1)-st losses and the (1_2)-nd losses, the data breach preventing device 100 may acquire at least one loss gradient for maximizing at least part of the (1_1)-st losses and the (1_2)-nd losses, and may backpropagate the loss gradient to the original data using gradient ascent.
Meanwhile, the task specific output may be an output of a task to be performed by the learning network 120, and may have various results according to the task learned by the learning network 120, such as a probability of a class for classification, coordinates resulting from regression for location detection, etc., and an activation function of an activation unit may be applied to the characteristic information outputted from the learning network 120, to thereby generate the task specific output according to the task to be performed by the learning network 120. Herein, the activation functions may include a sigmoid function, a linear function, a softmax function, an rlinear function, a square function, a sqrt function, an srlinear function, an abs function, a tanh function, a brlinear function, etc., but the scope of the present disclosure is not limited thereto.
As one example, when the learning network 120 performs the task for the classification, the data breach preventing device 100 may map the characteristic information outputted from the learning network 120 onto each of classes, to thereby generate each of one or more probabilities of the inputted data, for each of the classes. Herein, each of the probabilities for each of the classes may represent each of probabilities of the characteristic information, outputted for each of the classes from the learning network 120, being true. For example, if the original data are the facial image data, a probability of the face having a laughing expression may be outputted as 0.75, and a probability of the face not having the laughing expression may be outputted as 0.25, and the like. Herein, a softmax algorithm may be used for mapping the characteristic information outputted from the learning network 120 onto each of the classes, but the scope of the present disclosure is not limited thereto, and various algorithms may be used for mapping the characteristic information onto each of the classes.
Next, the data breach preventing device 100 may input the 2-nd noisy data into the learning network 120.
Then, the learning network 120 may apply the learning operations to the 2-nd noisy data using the learned parameters of the learning network 120, to thereby output the 2-nd characteristic information about the 2-nd noisy data.
Then, the data breach preventing device 100 may perform a process of launching a subsequent adversarial attack on the 2-nd noisy data via backpropagation using at least part of (i) one or more (2_1)-st losses calculated by referring to the 2-nd characteristic information and the 1-st ground truth corresponding to the original data, and (ii) one or more (2_2)-nd losses calculated by referring to (ii-1) at least one 2-nd task specific output generated by using the 2-nd characteristic information and (ii-2) the 2-nd ground truth corresponding to the original data, to thereby generate 3-rd noisy data, resulting in a subsequent iteration.
Herein, since the 3-rd noisy data is created by the subsequent adversarial attack on the 2-nd noisy data, the probability of the result of the 3-rd noisy data being true may be lower than the probability of the result of the 2-nd noisy data being true, where the result of the 3-rd noisy data is outputted from the learning network 120 by applying the learning operations to the 3-rd noisy data, and the result of the 2-nd noisy data is outputted from the learning network 120 by applying the learning operations to the 2-nd noisy data.
Due to the repeated adversarial attacks in the iterations as such, the learning network 120 may output results derived from the noisy data different to those from the original data. As one example, on condition that the learning network 120 has been trained to recognize data as "dog" if the probability of the data being true is greater than 50%, the learning network 120 may recognize the (k+1)-th noisy data as not being "dog" due to the repeated adversarial attacks. However, humans can still recognize the original data from the (k+1)-th noisy data.
Meanwhile, the data breach preventing device 100 may perform or support another device to perform (i) a process of comparing (i-1) (k+1)-th characteristic information created by inputting the (k+1)-th noisy data into the learning network 120 and (i-2) original characteristic information created by inputting the original data 1 into the learning network 120, and if the (k+1)-th characteristic information and the original characteristic information are determined as different, a process of stopping the iteration and outputting the (k+1)-th noisy data as the watermarked data 30, and (ii) a process of comparing (ii-1) a (k+1)-th task specific output created by referring to the (k+1)-th characteristic information and (ii-2) an original task specific output created by referring to the original characteristic information, and if the (k+1)-th task specific output and the original task specific output are determined as different, a process of stopping the iteration and outputting the (k+1)-th noisy data as the watermarked data 30.
That is, while repeating the adversarial attack in the course of the iterations, since a probability of an output being true decreases, the data breach preventing device 100 may stop the iteration and output the noisy data at the iteration as the watermarked data, at the time when an outcome determined by the learning network 120 of the output becomes different from an identity, e.g., a GT or ground truth, of the original data, by referring to its corresponding probability.
Fig. 4 is a drawing schematically illustrating the method for preventing the breach of the original data to be used for the deep learning by launching the adversarial attack on the 1-st noisy data using multiple learning networks 120_1, 120_2, …, and 120_m with their own learned parameters in accordance with another example embodiment of the present disclosure. Herein, each of the multiple learning networks 120_1, 120_2, …, and 120_m may have completed learning to perform tasks at least part of which may be different from one another, and m may be an integer larger than 0. In the description below, the part easily deducible from the explanation of Fig. 3 will be omitted.
First, if the original data 1 is acquired, the data breach preventing device 100 may add noise to the original data, to thereby generate the 1-st noisy data 10.
Next, while increasing an integer k from 1 to n-1, the data breach preventing device 100 may perform or support another device to perform iteration of (i) a process of inputting the k-th noisy data respectively into the 1-st learning network 120_1 to the m-th learning network 120_m, to thereby allow the 1-st learning network 120_1 to the m-th learning network 120_m to (i-1) apply the learning operations with one or more 1-st learned parameters to one or more m-th learned parameters to the k-th noisy data respectively, and thus to (i-2) output (k_1)-st characteristic information 20_1 to (k_m)-th characteristic information 20_m about the k-th noisy data, (ii) a process of calculating the (k_1)-st losses by referring to at least one averaged value over a (k_1_1)-st loss to a (k_1_m)-th loss which are calculated by using the 1-st ground truth corresponding to the original data and each piece of the (k_1)-st characteristic information 20_1 to the (k_m)-th characteristic information 20_m, (iii) a process of calculating the (k_2)-nd losses by referring to at least one averaged value over a (k_2_1)-st loss to a (k_2_m)-th loss which are calculated by using the 2-nd ground truth corresponding to the original data and each of a (k_1)-st task specific output to a (k_m)-th task specific output respectively created by using the (k_1)-st characteristic information to the (k_m)-th characteristic information, and (iv) a process of launching the adversarial attack on the k-th noisy data via backpropagation using at least part of the (k_1)-st losses and the (k_2)-nd losses 130, and thus a process of generating (k+1)-th noisy data. And, as a result of the iteration, the data breach preventing device 100 may generate or support another device to generate n-th noisy data as the watermarked data 30 corresponding to the original data 1. Herein, m may be an integer larger than 0.
Detailed explanation is as follows.
First, the data breach preventing device 100 may input the 1-st noisy data 10 respectively into the 1-st learning network 120_1 to the m-th learning network 120_m.
Then, the 1-st learning network 120_1 to the m-th learning network 120_m may apply the learning operations, respectively using the 1-st learned parameters to the m-th learned parameters, to the 1-st noisy data 10, to thereby respectively output the (1_1)-st characteristic information to the (1_m)-th characteristic information about the 1-st noisy data 10.
Then, the data breach preventing device 100 may perform or support another device to perform (i) a process of calculating one or more (1_1)-st losses by referring to at least one averaged value over a (1_1_1)-st loss to a (1_1_m)-th loss which are calculated by using the 1-st ground truth corresponding to the original data and each piece of the (1_1)-st characteristic information to the (1_m)-th characteristic information, and (ii) a process of calculating one or more (1_2)-nd losses by referring to at least one averaged value over a (1_2_1)-st loss to a (1_2_m)-th loss which are calculated by using the 2-nd ground truth corresponding to the original data and each of a (1_1)-st task specific output to a (1_m)-th task specific output respectively created by using the (1_1)-st characteristic information to the (1_m)-th characteristic information.
And, the data breach preventing device 100 may perform a process of launching the adversarial attack on the 1-st noisy data via backpropagation using at least part of the (1_1)-st losses and the (1_2)-nd losses, to thereby generate the 2-nd noisy data, resulting in a single iteration.
Next, the data breach preventing device 100 may input the 2-nd noisy data respectively into the 1-st learning network 120_1 to the m-th learning network 120_m.
Then, the 1-st learning network 120_1 to the m-th learning network 120_m may apply the learning operations, respectively using the 1-st learned parameters to the m-th learned parameters, to the 2-nd noisy data, to thereby respectively output the (2_1)-st characteristic information to the (2_m)-th characteristic information about the 2-nd noisy data.
Then, the data breach preventing device 100 may perform or support another device to perform (i) a process of calculating one or more (2_1)-st losses by referring to at least one averaged value over a (2_1_1)-st loss to a (2_1_m)-th loss which are calculated by using (i-1) the 1-st ground truth corresponding to the original data and (i-2) each piece of the (2_1)-st characteristic information to the (2_m)-th characteristic information, and (ii) a process of calculating one or more (2_2)-nd losses by referring to at least one averaged value over a (2_2_1)-st loss to a (2_2_m)-th loss which are calculated by using (ii-1) the 2-nd ground truth corresponding to the original data and (ii-2) each of a (2_1)-st task specific output to a (2_m)-th task specific output respectively created by using the (2_1)-st characteristic information to the (2_m)-th characteristic information.
And, the data breach preventing device 100 may perform a process of launching a subsequent adversarial attack on the 2-nd noisy data via backpropagation using at least part of the (2_1)-st losses and the (2_2)-nd losses, to thereby generate the 3-rd noisy data, resulting in a subsequent iteration.
Due to the repeated adversarial attacks in the iterations as such, the 1-st learning network 120_1 to the m-th learning network 120_m may output results derived from the noisy data different from those derived from the original data.
Meanwhile, the data breach preventing device 100 may perform or support another device to perform (i) a process of averaging over a (k_1_1)-st summed loss to a (k_1_m)-th summed loss which are calculated by adding a ((k-1)_1)-st loss respectively onto each of the (k_1_1)-st loss to the (k_1_m)-th loss, in the process of calculating the (k_1)-st losses, and, (ii) a process of averaging over a (k_2_1)-st summed loss to a (k_2_m)-th summed loss which are calculated by adding a ((k-1)_2)-nd loss respectively onto each of the (k_2_1)-st loss to the (k_2_m)-th loss, in the process of calculating the (k_2)-nd losses. Herein, initial values of the losses at a first iteration may be zeros.
Also, the data breach preventing device 100 may perform or support another device to perform (i) a process of comparing between elements included in each of 1-st pairs whose elements are comprised of (i-1) each piece of the (k_1)-st characteristic information to the (k_m)-th characteristic information and (i-2) each piece of 1-st original characteristic information to m-th original characteristic information created by inputting the original data respectively into the 1-st learning network 120_1 to the m-th learning network 120_m, to thereby acquire a (k_1)-st condition value corresponding to the number of a part of the 1-st pairs whose elements are determined as different from each other, (ii) a process of comparing between elements included in each of 2-nd pairs whose elements are comprised of (ii-1) each of a (k_1)-st task specific output to a (k_m)-th task specific output and (ii-2) each of a 1-st original task specific output to an m-th original task specific output respectively created by referring to the 1-st original characteristic information to the m-th original characteristic information, to thereby acquire a (k_2)-nd condition value corresponding to the number of a part of the 2-nd pairs whose elements are determined as different from each other, (iii) a process of acquiring a k-th termination condition value by adding one of the (k_1)-st condition value and the (k_2)-nd condition value onto a (k-1)-th termination condition value, and (iv) after the (k+1)-th noisy data is created, a process of determining whether the k-th termination condition value is equal to or greater than a predetermined threshold, and if the k-th termination condition value is determined as equal to or greater than the predetermined threshold, a process of stopping the iteration and outputting the (k+1)-th noisy data as the watermarked data. Herein, initial values of the termination condition values at a first iteration may be zeros.
That is, while repeating the adversarial attack in the course of the iterations, since each of the probabilities of each of the outputs being true decreases, the data breach preventing device 100 may stop the iteration and output the noisy data as the watermarked data, at the time when the number of a part of outcomes determined by the 1-st learning network 120_1 to the m-th learning network 120_m being different from the identity, e.g., the GT or ground truth, of the original data becomes greater than the predetermined threshold.
Fig. 5 is a drawing schematically illustrating the method for preventing the breach of the original data to be used for the deep learning by converting the 1-st noisy data into the (k_1)-st spatial data to the (k_j)-th spatial data through random rotation of the 1-st noisy data in accordance with still another example embodiment of the present disclosure. In the description below, the part easily deducible from the explanation of Figs. 3 and 4 will be omitted.
First, if the original data 1 is acquired, the data breach preventing device 100 may add noise to the original data, to thereby generate the 1-st noisy data 10.
Next, the data breach preventing device 100 may perform or support another device to perform iteration of (i) a process of applying the random rotation to the k-th noisy data, to thereby generate the (k_1)-st spatial data to the (k_j)-th spatial data, (ii) a process of inputting the (k_1)-st spatial data to the (k_j)-th spatial data into the learning network 120, to thereby allow the learning network 120 to apply the learning operations to the (k_1)-st spatial data to the (k_j)-th spatial data and thus to output (k_1)-st spatial characteristic information to (k_j)-th spatial characteristic information, (iii) a process of calculating the (k_1)-st losses by referring to at least one averaged value over a (k_1_1)-st spatial loss to a (k_1_j)-th spatial loss which are calculated by using the 1-st ground truth corresponding to the original data and each piece of the (k_1)-st spatial characteristic information to the (k_j)-th spatial characteristic information and, (iv) a process of calculating the (k_2)-nd losses by referring to at least one averaged value over a (k_2_1)-st spatial loss to a (k_2_j)-th spatial loss which are calculated by using the 2-nd ground truth corresponding to the original data and each of a (k_1)-st spatial task specific output to a (k_j)-th spatial task specific output respectively created by using the (k_1)-st spatial characteristic information to the (k_j)-th spatial characteristic information, and (v) a process of launching the adversarial attack on the k-th noisy data via backpropagation using at least part of the (k_1)-st losses and the (k_2)-nd losses 130, and thus a process of generating (k+1)-th noisy data. And, as a result of the iteration, the data breach preventing device 100 may generate or support another device to generate n-th noisy data as the watermarked data 30 corresponding to the original data 1. Herein, j may be an integer larger than 0.
Detailed explanation is as follows.
First, the data breach preventing device 100 may apply the random rotation to the 1-st noisy data, to thereby generate the (1_1)-st spatial data 10_1 to the (1_j)-th spatial data 10_j.
Herein, the data breach preventing device 100 may perform or support another device to perform a process of randomly selecting one or more angles θ ranging from -π to π, and repeating j times a process of rotating the k-th noisy data as corresponding to each of the angles θ, to thereby generate the (1_1)-st spatial data 10_1 to the (1_j)-th spatial data 10_j.
And, the data breach preventing device 100 may input the (1_1)-st spatial data 10_1 to the (1_j)-th spatial data 10_j into the learning network 120.
Then, the learning network 120 may apply the learning operations respectively to the (1_1)-st spatial data 10_1 to the (1_j)-th spatial data 10_j using the learned parameters of the learning network 120, to thereby output the (1_1)-st spatial characteristic information to the (1_j)-th spatial characteristic information respectively about the (1_1)-st spatial data 10_1 to the (1_j)-th spatial data 10_j.
Then, the data breach preventing device 100 may perform or support another device to perform (i) a process of calculating one or more (1_1)-st losses by referring to at least one averaged value over a (1_1_1)-st spatial loss to a (1_1_j)-th spatial loss which are calculated by using (i-1) the 1-st ground truth corresponding to the original data and (i-2) each piece of the (1_1)-st spatial characteristic information to the (1_j)-th spatial characteristic information, and (ii) a process of calculating one or more (1_2)-nd losses by referring to at least one averaged value over a (1_2_1)-st spatial loss to a (1_2_j)-th spatial loss which are calculated by using (ii-1) the 2-nd ground truth corresponding to the original data and (ii-2) each of a (1_1)-st spatial task specific output to a (1_j)-th spatial task specific output respectively created by using the (1_1)-st spatial characteristic information to the (1_j)-th spatial characteristic information.
And, the data breach preventing device 100 may perform a process of launching the adversarial attack on the 1-st noisy data via backpropagation using at least part of the (1_1)-st losses and the (1_2)-nd losses, to thereby generate the 2-nd noisy data, resulting in a single iteration.
Next, the data breach preventing device 100 may apply the random rotation to the 2-nd noisy data, to thereby generate the (2_1)-st spatial data to the (2_j)-th spatial data.
And, the data breach preventing device 100 may input the (2_1)-st spatial data to the (2_j)-th spatial data into the learning network 120.
Then, the learning network 120 may apply the learning operations respectively to the (2_1)-st spatial data to the (2_j)-th spatial data using the learned parameters of the learning network 120, to thereby output the (2_1)-st spatial characteristic information to the (2_j)-th spatial characteristic information respectively about the (2_1)-st spatial data to the (2_j)-th spatial data.
Then, the data breach preventing device 100 may perform or support another device to perform (i) a process of calculating one or more (2_1)-st losses by referring to at least one averaged value over a (2_1_1)-st spatial loss to a (2_1_j)-th spatial loss which are calculated by using (i-1) the 1-st ground truth corresponding to the original data and (i-2) each piece of the (2_1)-st spatial characteristic information to the (2_j)-th spatial characteristic information, and (ii) a process of calculating one or more (2_2)-nd losses by referring to at least one averaged value over a (2_2_1)-st spatial loss to a (2_2_j)-th spatial loss which are calculated by using (ii-1) the 2-nd ground truth corresponding to the original data and (ii-2) each of a (2_1)-st spatial task specific output to a (2_j)-th spatial task specific output respectively created by using the (2_1)-st spatial characteristic information to the (2_j)-th spatial characteristic information.
And, the data breach preventing device 100 may perform a process of launching a subsequent adversarial attack on the 2-nd noisy data via backpropagation using at least part of the (2_1)-st losses and the (2_2)-nd losses, to thereby generate the 3-rd noisy data, resulting in a subsequent iteration.
Due to the repeated adversarial attacks in the iterations as such, the learning network may output results derived from the noisy data different to those from the original data.
Meanwhile, the data breach preventing device 100 may perform or support another device to perform (i) a process of averaging over a (k_1_1)-st summed spatial loss to a (k_1_j)-th summed spatial loss which are calculated by adding the ((k-1)_1)-st loss respectively onto each of the (k_1_1)-st spatial loss to the (k_1_j)-th spatial loss, in the process of calculating the (k_1)-st losses, and (ii) a process of averaging over a (k_2_1)-st summed spatial loss to a (k_2_j)-th summed spatial loss which are calculated by adding the ((k-1)_2)-nd loss respectively onto each of the (k_2_1)-st spatial loss to the (k_2_j)-th spatial loss, in the process of calculating the (k_2)-nd losses. Herein, initial values of the losses at a first iteration may be zeros.
Also, the data breach preventing device 100 may perform or support another device to perform (i) a process of comparing between elements included in each of 1-st pairs whose elements are comprised of (i-1) each piece of the (k_1)-st spatial characteristic information to the (k_j)-th spatial characteristic information and (i-2) the original characteristic information created by inputting the original data into the learning network 120, to thereby acquire a (k_1)-st condition value corresponding to the number of a part of the 1-st pairs whose elements are determined as different from each other, (ii) a process of comparing between elements included in each of 2-nd pairs whose elements are comprised of (ii-1) each of a (k_1)-st spatial task specific output to a (k_j)-th spatial task specific output and (ii-2) an original task specific output created by referring to the original characteristic information, to thereby acquire a (k_2)-nd condition value corresponding to the number of a part of the 2-nd pairs whose elements are determined as different from each other, (iii) a process of acquiring a k-th termination condition value by adding one of the (k_1)-st condition value and the (k_2)-nd condition value onto a (k-1)-th termination condition value, and (iv) after the (k+1)-th noisy data is created, a process of determining whether the k-th termination condition value is equal to or greater than a predetermined threshold, and if the k-th termination condition value is determined as equal to or greater than the predetermined threshold, a process of stopping the iteration and outputting the (k+1)-th noisy data as the watermarked data.
That is, while repeating the adversarial attack in the course of the iterations, since the probability of the output being true decreases, the data breach preventing device 100 may stop the iteration and output the noisy data as the watermarked data, at the time when the number of a part of outcomes of the (k_1)-st spatial data to the (k_j)-th spatial data being different from the identity, e.g., the GT or ground truth, of the original data becomes greater than the predetermined threshold.
The embodiments of the present disclosure as explained above can be implemented in a form of executable program command through a variety of computer means recordable in computer readable media. The computer readable media may include solely or in combination, program commands, data files, and data structures. The program commands recorded to the media may be components specially designed for the present disclosure or may be usable to those skilled in the art of computer software. Computer readable media include magnetic media such as hard disk, floppy disk, and magnetic tape, optical media such as CD-ROM and DVD, magneto-optical media such as floptical disk and hardware devices such as ROM, RAM, and flash memory specially designed to store and carry out program commands. Program commands may include not only a machine language code made by a complier but also a high level code that can be used by an interpreter etc., which may be executed by a computer. The aforementioned hardware device can work as more than a software module to perform the action of the present disclosure and vice versa.
As seen above, the present disclosure has been explained by specific matters such as detailed components, limited embodiments, and drawings. They have been provided only to help more general understanding of the present disclosure. It, however, will be understood by those skilled in the art that various changes and modification may be made from the description without departing from the spirit and scope of the disclosure as defined in the following claims.
Accordingly, the thought of the present disclosure must not be confined to the explained embodiments, and the following patent claims as well as everything including variations equal or equivalent to the patent claims pertain to the category of the thought of the present disclosure.

Claims (20)

  1. A method for preventing breach of original data to be used for deep learning, comprising steps of:
    (a) a data breach preventing device, if the original data is acquired, performing or supporting another device to perform a process of adding noise onto the original data, to thereby generate 1-st noisy data; and
    (b) the data breach preventing device, (b1) while increasing an integer k from 1 to n-1 wherein n is an integer larger than 1, performing or supporting another device to perform iteration of (i) a process of inputting k-th noisy data into a learning network, to thereby allow the learning network to apply one or more learning operations to the k-th noisy data using one or more learned parameters of the learning network, and thus to output k-th characteristic information about the k-th noisy data, and (ii) a process of launching an adversarial attack on the k-th noisy data via backpropagation using at least one of (ii-1) one or more (k_1)-st losses calculated by referring to (1) the k-th characteristic information and (2) at least one 1-st ground truth corresponding to the original data, and (ii-2) one or more (k_2)-nd losses calculated by referring to (1) at least one k-th task specific output created by using the k-th characteristic information and (2) at least one 2-nd ground truth corresponding to the original data, and thus a process of generating (k+1)-th noisy data, and (b2) as a result of the iteration, generating or supporting another device to generate n-th noisy data as watermarked data corresponding to the original data.
  2. The method of Claim 1, wherein the learning network includes a 1-st learning network to an m-th learning network respectively having one or more 1-st learned parameters to one or more m-th learned parameters wherein m is an integer greater than 0, and
    wherein, at the step of (b), the data breach preventing device performs or supports another device to perform a process of inputting the k-th noisy data respectively into the 1-st learning network to the m-th learning network, to thereby allow the 1-st learning network to the m-th learning network to (i) apply the learning operations with the 1-st learned parameters to the m-th learned parameters to the k-th noisy data respectively, and thus output (k_1)-st characteristic information to (k_m)-th characteristic information about the k-th noisy data, (ii) calculate the (k_1)-st losses by referring to at least one averaged value over a (k_1_1)-st loss to a (k_1_m)-th loss which are calculated by using the 1-st ground truth and each piece of the (k_1)-st characteristic information to the (k_m)-th characteristic information, and (iii) calculate the (k_2)-nd losses by referring to at least one averaged value over a (k_2_1)-st loss to a (k_2_m)-th loss which are calculated by using the 2-nd ground truth and each of a (k_1)-st task specific output to a (k_m)-th task specific output respectively created by using the (k_1)-st characteristic information to the (k_m)-th characteristic information.
  3. The method of Claim 2, wherein the data breach preventing device performs or supports another device to perform (i) a process of averaging over a (k_1_1)-st summed loss to a (k_1_m)-th summed loss which are calculated by adding a ((k-1)_1)-st loss respectively onto each of the (k_1_1)-st loss to the (k_1_m)-th loss, in the process of calculating the (k_1)-st losses, and
    (ii) a process of averaging over a (k_2_1)-st summed loss to a (k_2_m)-th summed loss which are calculated by adding a ((k-1)_2)-nd loss respectively onto each of the (k_2_1)-st loss to the (k_2_m)-th loss, in the process of calculating the (k_2)-nd losses.
  4. The method of Claim 2, wherein the data breach preventing device performs or supports another device to perform (i) a process of comparing between elements included in each of 1-st pairs whose elements are comprised of (i-1) each piece of the (k_1)-st characteristic information to the (k_m)-th characteristic information and (i-2) each piece of 1-st original characteristic information to m-th original characteristic information respectively created by inputting the original data into the 1-st learning network to the m-th learning network, to thereby acquire a (k_1)-st condition value corresponding to the number of a part of the 1-st pairs whose elements are determined as different from each other, (ii) a process of comparing between elements included in each of 2-nd pairs whose elements are comprised of (ii-1) each of a (k_1)-st task specific output to a (k_m)-th task specific output and (ii-2) each of a 1-st original task specific output to an m-th original task specific output respectively created by referring to the 1-st original characteristic information to the m-th original characteristic information, to thereby acquire a (k_2)-nd condition value corresponding to the number of a part of the 2-nd pairs whose elements are determined as different from each other, (iii) a process of acquiring a k-th termination condition value by adding one of the (k_1)-st condition value and the (k_2)-nd condition value onto a (k-1)-th termination condition value, and (iv) after the (k+1)-th noisy data is created, a process of determining whether the k-th termination condition value is equal to or greater than a predetermined threshold, and if the k-th termination condition value is determined as equal to or greater than the predetermined threshold, a process of stopping the iteration and outputting the (k+1)-th noisy data as the watermarked data.
  5. The method of Claim 1, wherein, at the step of (b), the data breach preventing device performs or supports another device to perform (i) a process of applying random rotation to the k-th noisy data, to thereby generate (k_1)-st spatial data to (k_j)-th spatial data wherein j is an integer greater than 0, (ii) a process of inputting the (k_1)-st spatial data to the (k_j)-th spatial data into the learning network, to thereby allow the learning network to apply the deep-learning operation to the (k_1)-st spatial data to the (k_j)-th spatial data and thus to output (k_1)-st spatial characteristic information to (k_j)-th spatial characteristic information, (iii) a process of calculating the (k_1)-st losses by referring to at least one averaged value over a (k_1_1)-st spatial loss to a (k_1_j)-th spatial loss which are calculated by using the 1-st ground truth and each piece of the (k_1)-st spatial characteristic information to the (k_j)-th spatial characteristic information, and (iv) a process of calculating the (k_2)-nd losses by referring to at least one averaged value over a (k_2_1)-st spatial loss to a (k_2_j)-th spatial loss which are calculated by using the 2-nd ground truth and each of a (k_1)-st spatial task specific output to a (k_j)-th spatial task specific output respectively created by using the (k_1)-st spatial characteristic information to the (k_j)-th spatial characteristic information.
  6. The method of Claim 5, wherein the data breach preventing device performs or supports another device to perform (i) a process of averaging over a (k_1_1)-st summed spatial loss to a (k_1_j)-th summed spatial loss which are calculated by adding a ((k-1)_1)-st loss respectively onto each of the (k_1_1)-st spatial loss to the (k_1_j)-th spatial loss, in the process of calculating the (k_1)-st losses, and
    (ii) a process of averaging over a (k_2_1)-st summed spatial loss to a (k_2_j)-th summed spatial loss which are calculated by adding a ((k-1)_2)-nd loss respectively onto each of the (k_2_1)-st spatial loss to the (k_2_j)-th spatial loss, in the process of calculating the (k_2)-nd losses.
  7. The method of Claim 5, wherein the data breach preventing device performs or supports another device to perform (i) a process of comparing between elements included in each of 1-st pairs whose elements are comprised of (i-1) each piece of the (k_1)-st spatial characteristic information to the (k_j)-th spatial characteristic information and (i-2) each piece of original characteristic information created by inputting the original data into the learning network, to thereby acquire a (k_1)-st condition value corresponding to the number of a part of the 1-st pairs whose elements are determined as different from each other, (ii) a process of comparing between elements included in each of 2-nd pairs whose elements are comprised of (ii-1) each of a (k_1)-st spatial task specific output to a (k_j)-th spatial task specific output and (ii-2) an original task specific output created by referring to the original characteristic information, to thereby acquire a (k_2)-nd condition value corresponding to the number of a part of the 2-nd pairs whose elements are determined as different from each other, (iii) a process of acquiring a k-th termination condition value by adding one of the (k_1)-st condition value and the (k_2)-nd condition value onto a (k-1)-th termination condition value, and (iv) after the (k+1)-th noisy data is created, a process of determining whether the k-th termination condition value is equal to or greater than a predetermined threshold, and if the k-th termination condition value is determined as equal to or greater than the predetermined threshold, a process of stopping the iteration and outputting the (k+1)-th noisy data as the watermarked data.
  8. The method of Claim 5, wherein the data breach preventing device performs or supports another device to perform a process of randomly selecting one or more angles θ ranging from -π to π and a process of rotating the k-th noisy data as corresponding to each of the angles θ, in the process of creating the (k_1)-st spatial data to the (k_j)-th spatial data.
  9. The method of Claim 1, wherein, at the step of (b), the data breach preventing device performs or supports another device to perform (i) a process of comparing (i-1) (k+1)-th characteristic information created by inputting the (k+1)-th noisy data into the learning network and (i-2) original characteristic information created by inputting the original data into the learning network, and if the (k+1)-th characteristic information and the original characteristic information are determined as different, a process of stopping the iteration and outputting the (k+1)-th noisy data as the watermarked data, and (ii) a process of comparing (ii-1) a (k+1)-th task specific output created by referring to the (k+1)-th characteristic information and (ii-2) an original task specific output created by referring to the original characteristic information, and if the (k+1)-th task specific output and the original task specific output are determined as different, a process of stopping the iteration and outputting the (k+1)-th noisy data as the watermarked data.
  10. The method of Claim 1, wherein, at the step of (a), the data breach preventing device performs or supports another device to perform a process of adding random noise, according to a normal distribution with one of variances and one of means created by a noise function, onto the original data.
  11. A data breach preventing device for preventing breach of original data to be used for deep learning, comprising:
    at least one memory that stores instructions; and
    at least one processor configured to execute the instructions to perform or support another device to perform: (I) if the original data is acquired, a process of adding noise onto the original data, to thereby generate 1-st noisy data, and (II) (II-1) while increasing an integer k from 1 to n-1 wherein n is an integer larger than 1, (i) a process of inputting k-th noisy data into a learning network, to thereby allow the learning network to apply one or more learning operations to the k-th noisy data using one or more learned parameters of the learning network, and thus to output k-th characteristic information about the k-th noisy data, and (ii) a process of launching an adversarial attack on the k-th noisy data via backpropagation using at least one of (ii-1) one or more (k_1)-st losses calculated by referring to (1) the k-th characteristic information and (2) at least one 1-st ground truth corresponding to the original data, and (ii-2) one or more (k_2)-nd losses calculated by referring to (1) at least one k-th task specific output created by using the k-th characteristic information and (2) at least one 2-nd ground truth corresponding to the original data, and thus a process of generating (k+1)-th noisy data, and (II-2) as a result of the iteration, a process of generating n-th noisy data as watermarked data corresponding to the original data.
  12. The data breach preventing device of Claim 11, wherein the learning network includes a 1-st learning network to an m-th learning network respectively having one or more 1-st learned parameters to one or more m-th learned parameters wherein m is an integer greater than 0, and
    wherein, at the process of (II), the processor performs or supports another device to perform a process of inputting the k-th noisy data respectively into the 1-st learning network to the m-th learning network, to thereby allow the 1-st learning network to the m-th learning network to (i) apply the learning operations with the 1-st learned parameters to the m-th learned parameters to the k-th noisy data respectively, and thus output (k_1)-st characteristic information to (k_m)-th characteristic information about the k-th noisy data, (ii) calculate the (k_1)-st losses by referring to at least one averaged value over a (k_1_1)-st loss to a (k_1_m)-th loss which are calculated by using the 1-st ground truth and each piece of the (k_1)-st characteristic information to the (k_m)-th characteristic information, and (iii) calculate the (k_2)-nd losses by referring to at least one averaged value over a (k_2_1)-st loss to a (k_2_m)-th loss which are calculated by using the 2-nd ground truth and each of a (k_1)-st task specific output to a (k_m)-th task specific output respectively created by using the (k_1)-st characteristic information to the (k_m)-th characteristic information.
  13. The data breach preventing device of Claim 12, wherein the processor performs or supports another device to perform (i) a process of averaging over a (k_1_1)-st summed loss to a (k_1_m)-th summed loss which are calculated by adding a ((k-1)_1)-st loss respectively onto each of the (k_1_1)-st loss to the (k_1_m)-th loss, in the process of calculating the (k_1)-st losses, and
    (ii) a process of averaging over a (k_2_1)-st summed loss to a (k_2_m)-th summed loss which are calculated by adding a ((k-1)_2)-nd loss respectively onto each of the (k_2_1)-st loss to the (k_2_m)-th loss, in the process of calculating the (k_2)-nd losses.
  14. The data breach preventing device of Claim 12, wherein the processor performs or supports another device to perform (i) a process of comparing between elements included in each of 1-st pairs whose elements are comprised of (i-1) each piece of the (k_1)-st characteristic information to the (k_m)-th characteristic information and (i-2) each piece of 1-st original characteristic information to m-th original characteristic information respectively created by inputting the original data into the 1-st learning network to the m-th learning network, to thereby acquire a (k_1)-st condition value corresponding to the number of a part of the 1-st pairs whose elements are determined as different from each other, (ii) a process of comparing between elements included in each of 2-nd pairs whose elements are comprised of (ii-1) each of a (k_1)-st task specific output to a (k_m)-th task specific output and (ii-2) each of a 1-st original task specific output to an m-th original task specific output respectively created by referring to the 1-st original characteristic information to the m-th original characteristic information, to thereby acquire a (k_2)-nd condition value corresponding to the number of a part of the 2-nd pairs whose elements are determined as different from each other, (iii) a process of acquiring a k-th termination condition value by adding one of the (k_1)-st condition value and the (k_2)-nd condition value onto a (k-1)-th termination condition value, and (iv) after the (k+1)-th noisy data is created, a process of determining whether the k-th termination condition value is equal to or greater than a predetermined threshold, and if the k-th termination condition value is determined as equal to or greater than the predetermined threshold, a process of stopping the iteration and outputting the (k+1)-th noisy data as the watermarked data.
  15. The data breach preventing device of Claim 11, wherein, at the process of (II), the processor performs or supports another device to perform (i) a process of applying random rotation to the k-th noisy data, to thereby generate (k_1)-st spatial data to (k_j)-th spatial data wherein j is an integer greater than 0, (ii) a process of inputting the (k_1)-st spatial data to the (k_j)-th spatial data into the learning network, to thereby allow the learning network to apply the deep-learning operation to the (k_1)-st spatial data to the (k_j)-th spatial data and thus to output (k_1)-st spatial characteristic information to (k_j)-th spatial characteristic information, (iii) a process of calculating the (k_1)-st losses by referring to at least one averaged value over a (k_1_1)-st spatial loss to a (k_1_j)-th spatial loss which are calculated by using the 1-st ground truth and each piece of the (k_1)-st spatial characteristic information to the (k_j)-th spatial characteristic information, and (iv) a process of calculating the (k_2)-nd losses by referring to at least one averaged value over a (k_2_1)-st spatial loss to a (k_2_j)-th spatial loss which are calculated by using the 2-nd ground truth and each of a (k_1)-st spatial task specific output to a (k_j)-th spatial task specific output respectively created by using the (k_1)-st spatial characteristic information to the (k_j)-th spatial characteristic information.
  16. The data breach preventing device of Claim 15, wherein the processor performs or supports another device to perform (i) a process of averaging over a (k_1_1)-st summed spatial loss to a (k_1_j)-th summed spatial loss which are calculated by adding a ((k-1)_1)-st loss respectively onto each of the (k_1_1)-st spatial loss to the (k_1_j)-th spatial loss, in the process of calculating the (k_1)-st losses, and
    (ii) a process of averaging over a (k_2_1)-st summed spatial loss to a (k_2_j)-th summed spatial loss which are calculated by adding a ((k-1)_2)-nd loss respectively onto each of the (k_2_1)-st spatial loss to the (k_2_j)-th spatial loss, in the process of calculating the (k_2)-nd losses.
  17. The data breach preventing device of Claim 15, wherein the processor performs or supports another device to perform (i) a process of comparing between elements included in each of 1-st pairs whose elements are comprised of (i-1) each piece of the (k_1)-st spatial characteristic information to the (k_j)-th spatial characteristic information and (i-2) each piece of original characteristic information created by inputting the original data into the learning network, to thereby acquire a (k_1)-st condition value corresponding to the number of a part of the 1-st pairs whose elements are determined as different from each other, (ii) a process of comparing between elements included in each of 2-nd pairs whose elements are comprised of (ii-1) each of a (k_1)-st spatial task specific output to a (k_j)-th spatial task specific output and (ii-2) an original task specific output created by referring to the original characteristic information, to thereby acquire a (k_2)-nd condition value corresponding to the number of a part of the 2-nd pairs whose elements are determined as different from each other, (iii) a process of acquiring a k-th termination condition value by adding one of the (k_1)-st condition value and the (k_2)-nd condition value onto a (k-1)-th termination condition value, and (iv) after the (k+1)-th noisy data is created, a process of determining whether the k-th termination condition value is equal to or greater than a predetermined threshold, and if the k-th termination condition value is determined as equal to or greater than the predetermined threshold, a process of stopping the iteration and outputting the (k+1)-th noisy data as the watermarked data.
  18. The data breach preventing device of Claim 15, wherein the processor performs or supports another device to perform a process of randomly selecting one or more angles θ ranging from -π to π and a process of rotating the k-th noisy data as corresponding to each of the angles θ, in the process of creating the (k_1)-st spatial data to the (k_j)-th spatial data.
  19. The data breach preventing device of Claim 11, wherein, at the process of (II), the processor performs or supports another device to perform (i) a process of comparing (i-1) (k+1)-th characteristic information created by inputting the (k+1)-th noisy data into the learning network and (i-2) original characteristic information created by inputting the original data into the learning network, and if the (k+1)-th characteristic information and the original characteristic information are determined as different, a process of stopping the iteration and outputting the (k+1)-th noisy data as the watermarked data, and (ii) a process of comparing (ii-1) a (k+1)-th task specific output created by referring to the (k+1)-th characteristic information and (ii-2) an original task specific output created by referring to the original characteristic information, and if the (k+1)-th task specific output and the original task specific output are determined as different, a process of stopping the iteration and outputting the (k+1)-th noisy data as the watermarked data.
  20. The data breach preventing device of Claim 11, wherein, at the process of (I), the processor performs or supports another device to perform a process of adding random noise, according to a normal distribution with one of variances and one of means created by a noise function, onto the original data.
PCT/KR2021/004382 2020-04-24 2021-04-07 Method for preventing breach of original data for deep learning and data breach preventing device using them WO2021215710A1 (en)

Applications Claiming Priority (2)

Application Number Priority Date Filing Date Title
US16/858,562 2020-04-24
US16/858,562 US10956598B1 (en) 2020-04-24 2020-04-24 Method for preventing breach of original data for deep learning and data breach preventing device using them

Publications (1)

Publication Number Publication Date
WO2021215710A1 true WO2021215710A1 (en) 2021-10-28

Family

ID=73834287

Family Applications (1)

Application Number Title Priority Date Filing Date
PCT/KR2021/004382 WO2021215710A1 (en) 2020-04-24 2021-04-07 Method for preventing breach of original data for deep learning and data breach preventing device using them

Country Status (3)

Country Link
US (1) US10956598B1 (en)
EP (1) EP3901836A1 (en)
WO (1) WO2021215710A1 (en)

Families Citing this family (3)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
US10956598B1 (en) * 2020-04-24 2021-03-23 Deeping Source Inc. Method for preventing breach of original data for deep learning and data breach preventing device using them
US11308359B1 (en) * 2021-10-27 2022-04-19 Deeping Source Inc. Methods for training universal discriminator capable of determining degrees of de-identification for images and obfuscation network capable of obfuscating images and training devices using the same
US11886955B2 (en) 2022-02-16 2024-01-30 Protopia AI, Inc. Self-supervised data obfuscation in foundation models

Citations (6)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
US6122403A (en) * 1995-07-27 2000-09-19 Digimarc Corporation Computer system linked by using information in data objects
US20190019311A1 (en) * 2017-07-14 2019-01-17 Microsoft Technology Licensing, Llc Fully convolutional color constancy with confidence weighted pooling
US20190042888A1 (en) * 2017-08-02 2019-02-07 Preferred Networks, Inc. Training method, training apparatus, region classifier, and non-transitory computer readable medium
US20190370440A1 (en) * 2018-06-04 2019-12-05 International Business Machines Corporation Protecting deep learning models using watermarking
US20200034520A1 (en) * 2018-07-26 2020-01-30 Deeping Source Inc. Method for training and testing obfuscation network capable of processing data to be concealed for privacy, and training device and testing device using the same
US10956598B1 (en) * 2020-04-24 2021-03-23 Deeping Source Inc. Method for preventing breach of original data for deep learning and data breach preventing device using them

Family Cites Families (2)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
US8819172B2 (en) * 2010-11-04 2014-08-26 Digimarc Corporation Smartphone-based methods and systems
US10621379B1 (en) * 2019-10-24 2020-04-14 Deeping Source Inc. Method for training and testing adaption network corresponding to obfuscation network capable of processing data to be concealed for privacy, and training device and testing device using the same

Patent Citations (6)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
US6122403A (en) * 1995-07-27 2000-09-19 Digimarc Corporation Computer system linked by using information in data objects
US20190019311A1 (en) * 2017-07-14 2019-01-17 Microsoft Technology Licensing, Llc Fully convolutional color constancy with confidence weighted pooling
US20190042888A1 (en) * 2017-08-02 2019-02-07 Preferred Networks, Inc. Training method, training apparatus, region classifier, and non-transitory computer readable medium
US20190370440A1 (en) * 2018-06-04 2019-12-05 International Business Machines Corporation Protecting deep learning models using watermarking
US20200034520A1 (en) * 2018-07-26 2020-01-30 Deeping Source Inc. Method for training and testing obfuscation network capable of processing data to be concealed for privacy, and training device and testing device using the same
US10956598B1 (en) * 2020-04-24 2021-03-23 Deeping Source Inc. Method for preventing breach of original data for deep learning and data breach preventing device using them

Also Published As

Publication number Publication date
US10956598B1 (en) 2021-03-23
EP3901836A1 (en) 2021-10-27

Similar Documents

Publication Publication Date Title
WO2021215710A1 (en) Method for preventing breach of original data for deep learning and data breach preventing device using them
WO2020022703A1 (en) Method for concealing data and data obfuscation device using the same
WO2022065817A1 (en) Methods for training and testing obfuscation network capable of performing distinct concealing processes for distinct regions of original image and learning and testing devices using the same
WO2020032420A1 (en) Method for training and testing data embedding network to generate marked data by integrating original data with mark data, and training device and testing device using the same
WO2022086146A1 (en) Method for training and testing obfuscation network capable of obfuscating data for privacy, and training device and testing device using the same
WO2022086147A1 (en) Method for training and testing user learning network to be used for recognizing obfuscated data created by obfuscating original data to protect personal information and user learning device and testing device using the same
WO2021261720A1 (en) Method for training obfuscation network which conceals original data to be used for machine learning and training surrogate network which uses obfuscated data generated by obfuscation network and method for testing trained obfuscation network and learning device and testing device using the same
WO2022086145A1 (en) Method for training and testing obfuscation network capable of processing data to be obfuscated for privacy, and training device and testing device using the same
WO2021221254A1 (en) Method for performing continual learning on classifier in client capable of classifying images by using continual learning server and continual learning server using the same
WO2023027340A1 (en) Method for training and testing obfuscation network capable of obfuscating data to protect personal information, and learning device and testing device using the same
WO2022124701A1 (en) Method for producing labeled image from original image while preventing private information leakage of original image and server using the same
WO2023096445A1 (en) Method for generating obfuscated image to be used in training learning network and labeling device using the same
WO2020246655A1 (en) Situation recognition method and device for implementing same
WO2022114392A1 (en) Feature selection-based mobile malicious code classification method, and recording medium and device for performing same
Cappelli et al. Adversarial robustness by design through analog computing and synthetic gradients
US20230306107A1 (en) A Method of Training a Submodule and Preventing Capture of an AI Module
US20210224688A1 (en) Method of training a module and method of preventing capture of an ai module
WO2023096444A1 (en) Learning method and learning device for training obfuscation network capable of obfuscating original data for privacy and testing method and testing device using the same
US20230050484A1 (en) Method of Training a Module and Method of Preventing Capture of an AI Module
EP4229554A1 (en) Systems and methods for providing a systemic error in artificial intelligence algorithms
WO2023204449A1 (en) Learning method and learning device for training obfuscation network capable of obfuscating original data for privacy to achieve information restriction obfuscation and testing method and testing device using the same
WO2021107231A1 (en) Sentence encoding method and device using hierarchical word information
WO2020175729A1 (en) Apparatus and method for detecting facial feature point by using gaussian feature point map and regression scheme
EP4007979A1 (en) A method to prevent capturing of models in an artificial intelligence based system
WO2023249184A1 (en) Adversarial training system and adversarial training method

Legal Events

Date Code Title Description
121 Ep: the epo has been informed by wipo that ep was designated in this application

Ref document number: 21792622

Country of ref document: EP

Kind code of ref document: A1

NENP Non-entry into the national phase

Ref country code: DE

32PN Ep: public notification in the ep bulletin as address of the adressee cannot be established

Free format text: NOTING OF LOSS OF RIGHTS PURSUANT TO RULE 112(1) EPC (EPO FORM 1205A DATED 23/03/2023)

122 Ep: pct application non-entry in european phase

Ref document number: 21792622

Country of ref document: EP

Kind code of ref document: A1