IL285479B2

IL285479B2 - System and method for using a user-action log to learn to classify encrypted traffic

Info

Publication number: IL285479B2
Application number: IL285479A
Authority: IL
Original assignee: Cognyte Tech Israel Ltd
Priority date: 2021-08-09
Filing date: 2021-08-09
Publication date: 2023-08-01
Also published as: IL285479B1; IL285479A

Description

1011-1143.1 SYSTEM AND METHOD FOR USING A USER-ACTION LOG TO LEARN TO CLASSIFY ENCRYPTED TRAFFIC FIELD OF THE DISCLOSURE The present disclosure is related to the monitoring of encrypted communication over communication networks, and to the application of machine-learning techniques to facilitate such monitoring.

BACKGROUND OF THE DISCLOSURE Many applications, such as Gmail, Facebook, Twitter, and Instagram, use an encrypted protocol, such as the Secure Sockets Layer (SSL) protocol or the Transport Layer Security (TLS) protocol. An application that uses an encrypted protocol generates encrypted traffic, upon a user using the application to perform a user action.

In some cases, marketing personnel may wish to learn more about a user’s online activities, in order to provide the user with relevant marketing material that is tailored to the user's behavioral and demographic profile. However, if the user’s traffic is mostly encrypted, it may be difficult to learn anything about the user’s online activities.

Conti, Mauro, et al. "Can't you hear me knocking: Identification of user actions on Android apps via traffic analysis," Proceedings of the 5th ACM Conference on Data and Application Security and Privacy, ACM, 2015, describes an investigation as to which extent it is feasible to identify the specific actions that a user is performing on mobile apps, by eavesdropping on their encrypted network traffic.

Saltaformaggio, Brendan, et al. "Eavesdropping on finegrained user activities within smartphone apps over encrypted network traffic," Proc. USENIX Workshop on Offensive Technologies, 2016, demonstrates that a passive eavesdropper is capable of 1011-1143.1 identifying fine-grained user activities within the wireless network traffic generated by apps. The paper presents a technique, called NetScope, that is based on the intuition that the highly specific implementation of each app leaves a fingerprint on its traffic behavior (e.g., transfer rates, packet exchanges, and data movement). By learning the subtle traffic behavioral differences between activities (e.g., "browsing" versus "chatting" in a dating app), NetScope is able to perform robust inference of users’ activities, for both Android and iOS devices, based solely on inspecting IP headers.

Grolman, Edita, et al., "Transfer Learning for User Action Identification in Mobile Apps via Encrypted Traffic Analysis," IEEE Intelligent Systems (2018), describes an approach for inferring user actions performed in mobile apps by analyzing the resulting encrypted network traffic. The approach generalizes across different app versions, mobile operating systems, and device models, collectively referred to as configurations. The different configurations are treated as a case for transfer learning, and the co-training method is adapted to support the transfer learning process. The approach leverages a small number of labeled instances of encrypted traffic from a source configuration, in order to construct a classifier capable of identifying a user’s actions in a different (target) configuration which is completely unlabeled.

Hanneke, Steve, et al., Iterative Labeling for SemiSupervised Learning, University of Illinois, 2004 proposes a unified perspective of a large family of semi-supervised learning algorithms, which select and label unlabeled data in an iterative process.

SUMMARY OF THE DISCLOSURE There is provided, in accordance with some embodiments of the present disclosure ,a system that includes a communication interface and a processor. The processor is configured to obtain 2 1011-1143.1 a user-action log that specifies (i) a series of actions, of respective action types, performed using an application, and (ii) respective action times at which the actions were performed. The processor is further configured to, using the communication interface, obtain a network-traffic report that specifies properties of a plurality of packets that were exchanged, while the series of actions were performed, between the application and a server for the application, the properties including respective receipt times at which the packets were received while en route between the application and the server. The processor is further configured to, based on the receipt times, define multiple nonoverlapping blocks of consecutive ones of the packets. The processor is further configured to identify a correspondence between the actions and respective corresponding ones of the blocks, by correlating between the action times and the receipt times, and, based on the identified correspondence, train a classifier to associate other blocks of packets with respective ones of the action types based on the properties of the other blocks.

In some embodiments, the processor is configured to identify the correspondence and train the classifier by iteratively (i) using the classifier to select additional ones of the corresponding blocks, and augmenting a training set with the additional corresponding blocks, and (ii) using the augmented training set, retraining the classifier.

In some embodiments, the processor is configured to select the additional ones of the corresponding blocks by, for each action in a subset of the actions that do not yet belong to the training set:identifying one or more candidate blocks whose respective earliest receipt times correspond to the action time of the action, andusing the classifier to select one of the candidate blocks as the block that corresponds to the action. 1011-1143.1 In some embodiments, the processor is configured to identify the candidate blocks by:defining a window of time that includes the action time of the action, andidentifying the candidate blocks in response to the candidate blocks beginning in the window of time.

In some embodiments, the processor is configured to use the classifier to select one of the candidate blocks by:using the classifier, computing respective levels of confidence for the candidate blocks being associated with the action type of the action, andselecting the candidate block whose level of confidence is highest, relative to the other candidate blocks.

In some embodiments, the processor is configured to select the candidate block whose level of confidence is highest provided that the highest level of confidence is greater than a level-of- confidence threshold, and the processor is further configured to iteratively lower the level-of-confidence threshold when iteratively augmenting the training set.

In some embodiments, the processor is further configured to add the other candidate blocks, with respective labels indicating that the other candidate blocks do not correspond to any of the actions, to the training set.

In some embodiments, the processor is further configured to cause the user actions to be performed automatically.

In some embodiments, content of the packets is encrypted, and the properties of the packets do not include any of the encrypted content.

In some embodiments, the processor is further configured to, prior to identifying the correspondence between the actions and the respective corresponding ones of the blocks, inflate the action times.

In some embodiments, the processor is configured to inflate 4 1011-1143.1 the action times by, for each unique action type:computing, for a subgroup of the actions that are of the unique action type, respective estimated communication delays, by, for each action in the subgroup:identifying a block whose earliest receipt time follows the action time of the action and is closest to the action time of the action, relative to the other blocks, andcomputing the estimated communication delay for the action, by subtracting the action time of the action from the earliest receipt time of the identified block,computing a median of the estimated communication delays, and adding the median to the respective action times of the subgroup.

In some embodiments, the processor is further configured to: repeatedly define the blocks based on different respective sets of packet-aggregation rules, such that multiple classifiers are trained for the different respective sets of packetaggregation rules, andselect a best-performing one of the multiple classifiers for use.

There is further provided, in accordance with some embodiments of the present disclosure, a method that includes obtaining a user-action log that specifies (i) a series of actions, of respective action types, performed using an application, and (ii) respective action times at which the actions were performed. The method further includes obtaining a network-traffic report that specifies properties of a plurality of packets that were exchanged, while the series of actions were performed, between the application and a server for the application, the properties including respective receipt times at which the packets were received while en route between the application and the server. The method further includes, based on the receipt times, defining multiple non-overlapping blocks of consecutive ones of the packets. The method further includes identifying a correspondence 5 1011-1143.1 between the actions and respective corresponding ones of the blocks, by correlating between the action times and the receipt times, and, based on the identified correspondence, training a classifier to associate other blocks of packets with respective ones of the action types based on the properties of the other blocks.

The present disclosure will be more fully understood from the following detailed description of embodiments thereof, taken together with the drawings, in which: BRIEF DESCRIPTION OF THE DRAWINGS Fig. 1 is a schematic illustration of a system for training a classifier to classify encrypted network traffic, in accordance with some embodiments of the present disclosure; Fig. 2 is a schematic illustration of an example networktraffic report, in accordance with some embodiments of the present disclosure; Fig. 3 is a flow diagram for a method for preprocessing a user-action log, in accordance with some embodiments of the present disclosure; Fig. 4 is a flow diagram for a method for training multiple classifiers, in accordance with some embodiments of the present disclosure; Fig. 5 is a flow diagram for a method for training a classifier, in accordance with some embodiments of the present disclosure; and Fig. 6 pictorially illustrates various aspects of training a classifier, in accordance with some embodiments of the present disclosure. 1011-1143.1

Claims

1.,479/

2.CLAIMS 1. A system, comprising: a communication interface; and a processor, configured to: obtain a user-action log that specifies (i) a series of actions, of respective action types, performed using an application, and (ii) respective action times at which the actions were performed, using the communication interface, obtain a network-traffic report that specifies properties of a plurality of packets that were exchanged, while the series of actions were performed, between the application and a server for the application, the properties including respective receipt times at which the packets were received while en route between the application and the server, based on the receipt times, define multiple non-overlapping blocks of consecutive ones of the packets, inflate the action times, by, for each unique action type, computing, for a subgroup of the actions that are of the unique action type, respective estimated communication delays, by, for each action in the subgroup: identifying a block whose earliest receipt time follows the action time of the action and is closest to the action time of the action, relative to the other blocks, and computing the estimated communication delay for the action, by subtracting the action time of the action from the earliest receipt time of the identified block, computing a median of the estimated communication delays, and 285,479/ adding the median to the respective action times of the subgroup; identify a correspondence between the actions and respective corresponding ones of the blocks, by correlating between the action times and the receipt times, and based on the identified correspondence, train a classifier to associate other blocks of packets with respective ones of the action types based on the properties of the other blocks. 2. The system according to claim 1, wherein the processor is configured to identify the correspondence and train the classifier by iteratively (i) using the classifier to select additional ones of the corresponding blocks by, for each action in a subset of the actions that do not yet belong to the training set; identifying one or more candidate blocks whose respective earliest receipt times correspond to the action time of the action, and using the classifier to select one of the candidate blocks as the block that corresponds to the action, and (ii) augmenting a training set with the additional corresponding blocks, and (iii) using the augmented training set, retraining the classifier.

3. The system according to claim 2, wherein the processor is configured to identify the candidate blocks by: defining a window of time that includes the action time of the action, and identifying the candidate blocks in response to the candidate blocks beginning in the window of time. 285,479/

4. The system according to claim 2, wherein the processor is configured to use the classifier to select one of the candidate blocks by: using the classifier, computing respective levels of confidence for the candidate blocks being associated with the action type of the action, and selecting the candidate block whose level of confidence is highest, relative to the other candidate blocks.

5. The system according to claim 4, wherein the processor is configured to select the candidate block whose level of confidence is highest provided that the highest level of confidence is greater than a level-of-confidence threshold, and wherein the processor is further configured to iteratively lower the level-of-confidence threshold when iteratively augmenting the training set.

6. The system according to claim 4, wherein the processor is further configured to add the other candidate blocks as no-action blocks, with respective labels indicating that the other candidate blocks do not correspond to any of the actions, to the training set.

7. The system according to claim 1, wherein the processor is further configured to: repeatedly define the blocks based on different respective sets of packet-aggregation rules, such that multiple classifiers are trained for the different respective sets of packet-aggregation rules, and select a best-performing one of the multiple classifiers for use. 285,479/

8. A method, comprising: obtaining a user-action log that specifies (i) a series of actions, of respective action types, performed using an application, and (ii) respective action times at which the actions were performed; obtaining a network-traffic report that specifies properties of a plurality of packets that were exchanged, while the series of actions were performed, between the application and a server for the application, the properties including respective receipt times at which the packets were received while en route between the application and the server; based on the receipt times, defining multiple non-overlapping blocks of consecutive ones of the packets; inflating the action times by computing, for a subgroup of the actions that are of the unique action type, respective estimated communication delays, by, for each action in the subgroup: identifying a block whose earliest receipt time follows the action time of the action and is closest to the action time of the action, relative to the other blocks, and computing the estimated communication delay for the action, by subtracting the action time of the action from the earliest receipt time of the identified block; computing a median of the estimated communication delays; and adding the median to the respective action times of the subgroup; 285,479/ identifying a correspondence between the actions and respective corresponding ones of the blocks, by correlating between the action times and the receipt times; and based on the identified correspondence, training a classifier to associate other blocks of packets with respective ones of the action types based on the properties of the other blocks.

9. The method according to claim 8, wherein identifying the correspondence and training the classifier comprises iteratively (i) using the classifier to select additional ones of the corresponding blocks by, for each action in a subset of the actions that do not yet belong to the training set; identifying one or more candidate blocks whose respective earliest receipt times correspond to the action time of the action; and using the classifier to select one of the candidate blocks as the block that corresponds to the action; (ii) augmenting a training set with the additional corresponding blocks, and (iii) using the augmented training set, retraining the classifier.

10. The method according to claim 9, wherein identifying the candidate blocks comprises: defining a window of time that includes the action time of the action; and identifying the candidate blocks in response to the candidate blocks beginning in the window of time.

11. The method according to claim 9, wherein using the classifier to select one of the candidate blocks comprises: 285,479/ using the classifier, computing respective levels of confidence for the candidate blocks being associated with the action type of the action, and selecting the candidate block whose level of confidence is highest, relative to the other candidate blocks.

12. The method according to claim 11, wherein selecting the candidate block whose level of confidence is highest comprises selecting the block whose level of confidence is highest provided that the highest level of confidence is greater than a level-of-confidence threshold, and wherein iteratively augmenting the training set further comprises iteratively lowering the level-of-confidence threshold.

13. The method according to claim 11, wherein iteratively augmenting the training set further comprises adding the other candidate blocks as no-action blocks, with respective labels indicating that the other candidate blocks do not correspond to any of the actions, to the training set.

14. The method according to claim 8, further comprising: repeatedly defining the blocks based on different respective sets of packet-aggregation rules, such that multiple classifiers are trained for the different respective sets of packet-aggregation rules; and selecting a best-performing one of the multiple classifiers for use.