US10846573B2 - Detecting, redacting, and scoring confidential information in images - Google Patents
Detecting, redacting, and scoring confidential information in images Download PDFInfo
- Publication number
- US10846573B2 US10846573B2 US16/050,219 US201816050219A US10846573B2 US 10846573 B2 US10846573 B2 US 10846573B2 US 201816050219 A US201816050219 A US 201816050219A US 10846573 B2 US10846573 B2 US 10846573B2
- Authority
- US
- United States
- Prior art keywords
- bitmap image
- grams
- text
- bitmap
- server system
- Prior art date
- Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
- Active
Links
Images
Classifications
-
- G—PHYSICS
- G06—COMPUTING OR CALCULATING; COUNTING
- G06F—ELECTRIC DIGITAL DATA PROCESSING
- G06F21/00—Security arrangements for protecting computers, components thereof, programs or data against unauthorised activity
- G06F21/60—Protecting data
- G06F21/62—Protecting access to data via a platform, e.g. using keys or access control rules
- G06F21/6218—Protecting access to data via a platform, e.g. using keys or access control rules to a system of files or objects, e.g. local or distributed file system or database
- G06F21/6245—Protecting personal data, e.g. for financial or medical purposes
-
- G06K9/726—
-
- G—PHYSICS
- G06—COMPUTING OR CALCULATING; COUNTING
- G06F—ELECTRIC DIGITAL DATA PROCESSING
- G06F18/00—Pattern recognition
- G06F18/20—Analysing
- G06F18/21—Design or setup of recognition systems or techniques; Extraction of features in feature space; Blind source separation
- G06F18/211—Selection of the most significant subset of features
- G06F18/2113—Selection of the most significant subset of features by ranking or filtering the set of features, e.g. using a measure of variance or of feature cross-correlation
-
- G—PHYSICS
- G06—COMPUTING OR CALCULATING; COUNTING
- G06F—ELECTRIC DIGITAL DATA PROCESSING
- G06F18/00—Pattern recognition
- G06F18/20—Analysing
- G06F18/21—Design or setup of recognition systems or techniques; Extraction of features in feature space; Blind source separation
- G06F18/217—Validation; Performance evaluation; Active pattern learning techniques
-
- G—PHYSICS
- G06—COMPUTING OR CALCULATING; COUNTING
- G06F—ELECTRIC DIGITAL DATA PROCESSING
- G06F18/00—Pattern recognition
- G06F18/20—Analysing
- G06F18/24—Classification techniques
- G06F18/241—Classification techniques relating to the classification model, e.g. parametric or non-parametric approaches
- G06F18/2413—Classification techniques relating to the classification model, e.g. parametric or non-parametric approaches based on distances to training or reference patterns
-
- G—PHYSICS
- G06—COMPUTING OR CALCULATING; COUNTING
- G06F—ELECTRIC DIGITAL DATA PROCESSING
- G06F18/00—Pattern recognition
- G06F18/20—Analysing
- G06F18/24—Classification techniques
- G06F18/243—Classification techniques relating to the number of classes
- G06F18/24323—Tree-organised classifiers
-
- G—PHYSICS
- G06—COMPUTING OR CALCULATING; COUNTING
- G06F—ELECTRIC DIGITAL DATA PROCESSING
- G06F21/00—Security arrangements for protecting computers, components thereof, programs or data against unauthorised activity
- G06F21/60—Protecting data
- G06F21/62—Protecting access to data via a platform, e.g. using keys or access control rules
-
- G06K9/00979—
-
- G06K9/344—
-
- G06K9/623—
-
- G06K9/6262—
-
- G06K9/627—
-
- G06K9/6282—
-
- G—PHYSICS
- G06—COMPUTING OR CALCULATING; COUNTING
- G06V—IMAGE OR VIDEO RECOGNITION OR UNDERSTANDING
- G06V10/00—Arrangements for image or video recognition or understanding
- G06V10/70—Arrangements for image or video recognition or understanding using pattern recognition or machine learning
- G06V10/764—Arrangements for image or video recognition or understanding using pattern recognition or machine learning using classification, e.g. of video objects
-
- G—PHYSICS
- G06—COMPUTING OR CALCULATING; COUNTING
- G06V—IMAGE OR VIDEO RECOGNITION OR UNDERSTANDING
- G06V10/00—Arrangements for image or video recognition or understanding
- G06V10/94—Hardware or software architectures specially adapted for image or video understanding
- G06V10/95—Hardware or software architectures specially adapted for image or video understanding structured as a network, e.g. client-server architectures
-
- G—PHYSICS
- G06—COMPUTING OR CALCULATING; COUNTING
- G06V—IMAGE OR VIDEO RECOGNITION OR UNDERSTANDING
- G06V30/00—Character recognition; Recognising digital ink; Document-oriented image-based pattern recognition
- G06V30/10—Character recognition
- G06V30/14—Image acquisition
- G06V30/148—Segmentation of character regions
- G06V30/153—Segmentation of character regions using recognition of characters or words
-
- G—PHYSICS
- G06—COMPUTING OR CALCULATING; COUNTING
- G06V—IMAGE OR VIDEO RECOGNITION OR UNDERSTANDING
- G06V30/00—Character recognition; Recognising digital ink; Document-oriented image-based pattern recognition
- G06V30/10—Character recognition
- G06V30/26—Techniques for post-processing, e.g. correcting the recognition result
- G06V30/262—Techniques for post-processing, e.g. correcting the recognition result using context analysis, e.g. lexical, syntactic or semantic context
- G06V30/274—Syntactic or semantic context, e.g. balancing
-
- H—ELECTRICITY
- H04—ELECTRIC COMMUNICATION TECHNIQUE
- H04L—TRANSMISSION OF DIGITAL INFORMATION, e.g. TELEGRAPHIC COMMUNICATION
- H04L63/00—Network architectures or network communication protocols for network security
- H04L63/10—Network architectures or network communication protocols for network security for controlling access to devices or network resources
- H04L63/102—Entity profiles
-
- G06K2209/01—
-
- G—PHYSICS
- G06—COMPUTING OR CALCULATING; COUNTING
- G06T—IMAGE DATA PROCESSING OR GENERATION, IN GENERAL
- G06T11/00—2D [Two Dimensional] image generation
- G06T11/001—Texturing; Colouring; Generation of texture or colour
-
- G06T5/002—
-
- G—PHYSICS
- G06—COMPUTING OR CALCULATING; COUNTING
- G06T—IMAGE DATA PROCESSING OR GENERATION, IN GENERAL
- G06T5/00—Image enhancement or restoration
- G06T5/70—Denoising; Smoothing
-
- G—PHYSICS
- G06—COMPUTING OR CALCULATING; COUNTING
- G06V—IMAGE OR VIDEO RECOGNITION OR UNDERSTANDING
- G06V30/00—Character recognition; Recognising digital ink; Document-oriented image-based pattern recognition
- G06V30/10—Character recognition
Definitions
- the present disclosure relates generally to distributed computing and, more specifically, to detecting, redacting, and scoring confidential information in images.
- Screen-content sharing applications take a variety of forms. In some cases, users may share their desktop on a video chatting application with another user. In other cases, a user may grab screenshots of their screen display and upload those screenshots for sharing with others. Some applications support window-specific screenshots of just the interface of that application. In some cases, the screenshot is of the entire display or a subset thereof, for example, a given application being displayed. In some cases, the screen-content sharing applications offer screen-content sharing as a more tangential feature to a larger suite of tools, such as task management applications, chat applications, productivity explications, and the like.
- Some aspects include a process, including receiving, with one or more processors, a screen capture event from an operating system of a first client computing device of a first user, the screen capture event including, or being associated with, a bitmap image of at least part of a display of the first computing device; causing, with one or more processors, optical character recognition (OCRing) of text in the bitmap image and obtaining, as a result of the OCRing, an OCR record with text appearing in the bitmap image and indicating locations of characters of the text in the image with coordinates of pixels in the bitmap image, the text comprising a plurality of n-grams; scoring, with one or more processors, each of the n-grams based on whether the respective n-grams match any of a plurality of patterns; classifying, with one or more processors, each of the n-grams based on the scoring into two or more categories, the two or more categories including a category for confidential information; for each of the n-grams classified in the category for confidential information
- Some aspects include a process, including: receiving, with one or more processors of a server system implementing a confidential-information redaction service, via a network, one or more bitmap images of a display presented by graphical operating system on a computing device distinct from the server system, at least some of the bitmap images displaying at least part of a user interface of an application executed within the graphical operating system; detecting and localizing, with one or more processors of the server system, confidential information depicted in the one or more bitmap images to produce a set of areas to be redacted; modifying, with one or more processors of the server system, pixels in each of the set of areas to be redacted to redact the confidential information and form modified versions of the one or more bitmap images; and storing, with one or more processors of the server system, the modified versions of the one or more bitmap images in memory or outputting, with one or more processors of the server system, the modified versions of the one or more bitmap images to another network-accessible service.
- Some aspects include a tangible, non-transitory, machine-readable medium storing instructions that when executed by a data processing apparatus cause the data processing apparatus to perform operations including the above-mentioned process.
- Some aspects include a system, including: one or more processors; and memory storing instructions that when executed by the processors cause the processors to effectuate operations of the above-mentioned process.
- FIG. 1 is a block diagram showing a logical and physical architecture of an example of a screen-content sharing application in accordance with some embodiments of the present techniques
- FIG. 2 is a flowchart showing an example of a process to redact confidential information in shared image content in accordance with some embodiments of the present techniques
- FIGS. 3 and 4 are examples of screenshots before and after redacting in accordance with some embodiments of the present techniques
- FIG. 5 is a block diagram showing a logical and physical architecture of an example of another screen-content sharing application configured to redact confidential information in video in accordance with some embodiments of the present techniques
- FIG. 6 is a flowchart of an example of a process to redact confidential information in video shared by a screen-content sharing application in accordance with some embodiments of the present techniques.
- FIG. 7 is a block logical and physical architecture diagram of an example of a computing device by which the present techniques may be implemented.
- FIG. 1 through four depict examples of aspects of a set of techniques by which confidential and other types of information in bitmap images shared with a content-sharing application may be detected, redacted, or scored for risk in accordance with some embodiments. To this end, some embodiments:
- FIGS. 5 and 6 depict aspects of another set of complementary techniques by which video may be processed to similar ends, in some cases drawing upon the techniques of FIGS. 1 through 4 , but with a significantly lower computational load than na ⁇ ve application of these techniques to video would be expected to generate. To this end, some embodiments perform the following operations:
- Some embodiments may implement techniques to expedite the processing of video frames, for instance, by selectively processing a subset of the video frames exhibiting greater than a threshold amount of entropy relative to the preceding frame or frames, and in some cases selectively processing subsets of video frames by filtering out those portions of images predicted to be unlikely to have text containing confidential information (e.g., toolbars or whitespace).
- a threshold amount of entropy relative to the preceding frame or frames
- selectively processing subsets of video frames by filtering out those portions of images predicted to be unlikely to have text containing confidential information (e.g., toolbars or whitespace).
- the computing environment 10 may include a plurality of user computing devices 14 , a screen-content sharing application 12 , one or more networks 16 by which these components communicate (such as the Internet), and an OCR service 18 configured to perform optical character recognition on bitmap images sent to the OCR service 18 .
- Three user computing devices 14 are shown, but commercial embodiments are expected to include substantially more, such as more than 10,000, more than 100,000, or more than 1 million different user computing devices 14 , in some cases geographically distributed over an area larger than 1000 km 2 , 10,000 km 2 , North America, or the world.
- each of the user computing devices 14 may be used by a different user having a different user account in the screen-content sharing application 12 . Or some users may access the same account with different devices. In some cases, subsets of the user devices may operate on secure local area networks through which they access the application 12 or the Internet.
- the user computing devices 14 may be various kinds of computing devices, such as desktop computers, laptop computers, set-top boxes, tablet computers, smart phones, in-automotive computing devices, in-store kiosks, and the like.
- the user computing devices 14 may each execute an instance of an operating system 20 and a screen-sharing client application 22 , along with various applications 24 , which may execute in the operating system 20 .
- the applications 24 present a user interface on a display of the user computing device, e.g., in a “window,” and the screen-sharing application 22 may capture screenshots or screen casts video of the state of that display including that user interface for sharing via the server-side screen-content sharing application 12 with other user computing devices 14 .
- Collectively the components 22 and 12 form a distributed screen-content sharing application, but this term is used to refer to the distributed application and the server side and client side components interchangeably unless indicated otherwise.
- the applications 24 may be various types of applications, including web browsers, email clients, productivity applications, and the like.
- the screen-sharing application 22 may be a native application installed on the user computing device 14 , for instance, during part of a registration process with the server-side screen-content sharing application 12 by which user credentials are selected or otherwise assigned and stored in memory of the screen-sharing application 22 in a user account. Some embodiments may include in communications between the components 12 and 14 identifiers of such an account so that shared content or access requests may be organized by account.
- the client-side screen-sharing application 22 registers with the operating system 20 to receive various types of events, such as file creation events or other invents indicating that a screen capture or screen cast video display of the display presented by the operating system 20 is occurring. Some embodiments may execute an event handler that responds to these events to effectuate the functionality described herein. In some cases, these events may include a reference to or copy of a bitmap image or sequence of bitmap images in a video that may be accessed by the screen-sharing application 22 for purposes of subsequent operations described below. In some cases, the screen capture screen cast is caused by the screen-sharing application 22 , for instance, responsive to user input to the application requesting the sharing or casting.
- events such as file creation events or other invents indicating that a screen capture or screen cast video display of the display presented by the operating system 20 is occurring.
- Some embodiments may execute an event handler that responds to these events to effectuate the functionality described herein. In some cases, these events may include a reference to or copy of a bitmap image or
- the screen-sharing application 22 queries a window manager of the operating system 20 responsive to such an event to obtain bounding boxes of windows of the various applications 24 and associates with those bounding boxes identifiers of the application. In some embodiments, this information may be reported along with the captured image or video to facilitate subsequent classification of content in the bounding boxes, using the application identifier as an additional feature for pattern matching.
- captured images or video may be redacted to form versions without confidential information.
- Forming a new version, modifying, redacting, or otherwise obfuscating may be performed by modifying values in memory encoding an existing version, for instance, changing bytes in a data structure in memory at addresses storing the existing version or by creating a new copy with the changed values.
- Reference to a “bitmap image” or “frame” singular includes reference to the various versions along a processing pipeline, including where the different versions exist as different copies with different processing or other transformations applied thereto or where a single copy has a subset of its values changed through such processing or transformation.
- Techniques related to video processing are described in greater detail with reference to FIGS. 5 and 6 . To illustrate these and other techniques, FIGS. 1 through 4 are described with reference to redaction of images, and those techniques may be applied to individual frames as described below with reference to FIGS. 5 and 6 .
- the image may be sent to the screen-content sharing application 12 via the Internet 16 along with an account identifier by the screen-sharing application 22 for redaction and sharing.
- any subset of the presently described steps of the redaction process may be offloaded to the screen-sharing application 22 for client-side processing to keep confidential information within a network, or some embodiments may execute a on-premises instance of the screen-content sharing application or a set of services related to redaction and classification and risk monitoring on-premises on remote hardware that is distinct from that hosting the screen-content sharing application 12 and the user computing devices 14 .
- a single instance of the various modules are shown, but embodiments are consistent with scalable architectures in which multiple instances of each module may be instantiated, for instance, behind load balancers in implementations designed to dynamically scale the number of instances responsive to computing load to concurrently process sessions.
- content may be offloaded to a content delivery network in accordance with the techniques described herein. Sending instructions to retrieve content from a content delivery network is an example of sending content as that and related phrase and related terms are used herein.
- Some embodiments include software as a service implementations in which different entities, such as different enterprises, have different tenant accounts hosted by the same computing instances providing the server-side screen-content sharing application 12 .
- the server-side application may be implemented as a distributed application, for instance, a micro services application, in which different virtual machines or containers instantiate the different illustrated modules.
- Some embodiments may implement the illustrated functionality with a serverless architecture, for instance, in which the different modules are exposed as lambda functions.
- the web server 26 and the API server 28 are nonblocking servers configured to service a relatively high volume of traffic, such as more than one session per second, 10 sessions per second, 100 sessions per second, or 1000 sessions per second or operating concurrently.
- the web server 26 may host dynamic webpages by which screen captures are viewed, shared, modified, commented on, or otherwise interacted with on user computing devices in web browsers.
- the API server 28 may interact with the screen-sharing application 22 to upload screen captures or video or otherwise expose functionality of the server side screen-content sharing application via an API.
- the API server 28 may expose the functionality described herein independent of the client application 22 .
- images (such as screen shots or frames of video) sent between client applications and various network accessible services (like chat services, social networking services, document repositories, email, etc.) may be routed through the application 12 .
- images may be intercepted (e.g., by a firewall or browser extension) or client applications may be configured to provide images that the user requests to be uploaded to the API server 28 .
- the API server may, in turn, cause the control module 30 to effectuate a redaction like those described herein.
- the user repository 32 may store user account records.
- each user account record may include a user identifier, user credentials by which a user is authenticated (such as a salted cryptographic hash of a user password), user account configuration settings, identifiers of current sessions under the user account, and references to records in the other repositories, such as teams including the user, tenants including those teams or tenants for which the user is an employee, content uploaded by the user, or content to which the user has access.
- uploaded content may be organized into collections of such content, for instance, collections having thumbnails of such content arrayed visually in a grid, and users may select those thumbnails to access the content that corresponds to the thumbnail.
- these collections may be referred to as “boards,” and various users may each have a plurality of boards.
- users may upload content to the boards, share the boards, delete the boards, or otherwise allocate access collectively to content in a board to other users.
- teams may have boards or tenants may have boards as well, and users with access to all content corresponding to these entities may have access to those boards and content therein.
- the tenant records may further include policies that specify who can share information with who and which information is to be redacted or otherwise obfuscated from shared content.
- these policies may map various sets of patterns to various teams or users, or the same set of patterns may be applied to all users of a tenant account.
- the patterns may indicate which subsets of shared content is to be redacted or otherwise obfuscated.
- the indication is a white list indication in which content that matches the pattern is not redacted.
- the indication is a blacklist indication in which content that matches the pattern is redacted.
- Patterns may match to various types of content.
- the patterns matched to non-text images, like faces, images of objects, images of rooms, images of maps, images of schematics, images of CAD files, and the like.
- the patterns include object detection and localization models, such convolution neural networks trained on labeled training sets including examples of images with labels identifying objects to be detected.
- such models may be trained by executing a stochastic gradient descent on the training set, or subset thereof.
- patterns may match exactly to one and only one string or some patterns may matched to a class of strings that are specified by the pattern. For instance, some patterns may require certain characters or tokens while allowing other characters or tokens to vary, for instance, by specifying wildcard characters or tokens. Some embodiments may filter out or allow terms having less than a threshold term frequency inverse document frequency score, such as stop words like “the”, “a,” “and,” and the like. Various types of TF-IDF scores may be used, including Okapi BM25. In some embodiments, patterns may match to strings within a threshold edit distance of a specified string or class of strings, like a Levenshtein edit distance. In some embodiments, patterns may be specified by regular expressions or natural language search operators.
- LDA latent Dirichlet allocation
- Other examples may implement latent Dirichlet allocation (LDA) to train a model to classify units of text by topic, and the patterns may be implemented by models configured to score and classify text according to the strength with which it exhibits various topics.
- LDA latent Dirichlet allocation
- Other examples may implement sentiment analysis models, named entity detection models, proper noun detection models, or various forms of information extraction models to form the patterns.
- Some embodiments may include a user interface by which users may select redacted text to indicate that they believe the text was improperly redacted or select unredacted text to indicate that they believe the text was improperly unredacted. Some embodiments may aggregate these reports, for instance, by calculating an aggregate amount of false positive reactions for each pattern and ranking the results in a report to an administrator to guide subsequent editing of patterns. Further, some embodiments may log and present reports of false negatives to guide subsequent editing in addition of patterns.
- some embodiments may report screenshots with bounding boxes of windows and identifiers of applications in those windows queried from an operating system, and some embodiments may apply patterns that specify these applications, for instance, as criteria in other patterns or as standalone patterns themselves.
- some embodiments may include a pattern that designates text matching a regular expression and appearing within a bounding box of a window for Microsoft ExcelTM, or some embodiments may designate for redaction the entire content of any window corresponding to a user interface of a computer aided design (CAD) application.
- CAD computer aided design
- Some embodiments may further include a content repository 36 that may include instances of uploaded screenshots or screen casts videos. In some embodiments, this content may be organized, as described above, in various boards associated with the content in the content repository 36 . In some embodiments, individual instances of continent 10 to may be stored in multiple associated versions in the content repository 36 , for instance, in unredacted and redacted form, or with varying amounts of redaction in different versions corresponding to different levels of access afforded different permission designations for different user accounts. For example, some embodiments may redact information of type one for a first group of users and information of types one and two for a second group of users, while redacting no information for a third type of users.
- some embodiments may organize patterns in data structures designed to afford relatively fast computation. For instance, keywords may be arranged in a hash table, prefix tree, or bloom filter to afford relatively fast matching relative to more na ⁇ ve approaches.
- Some embodiments may further include a team repository 38 including team records that have identifiers of each user on a team, roles and permissions of users on a team, an identifier of a tenant account associated with the team, and one or more boards or other instances of content accessible to the team.
- a team repository 38 including team records that have identifiers of each user on a team, roles and permissions of users on a team, an identifier of a tenant account associated with the team, and one or more boards or other instances of content accessible to the team.
- various types of access described herein may be allocated at the level of the individual, the team, the tenant, and such access may be granted with relatively high granularity to individual units of content, boards, or collections of boards.
- Some embodiments may further include a text scoring module 40 .
- a text scoring module 40 may parse from that OCR record units of text in text format determined by OCRing to appear in the image.
- the text format may group the depicted text by line without explicitly indicating whether the text is arranged in a single column, multiple columns, with inserts, or the like, other than to indicate bounding boxes of each unit of text and pixel coordinates of the image.
- the image sent to be OCRed may be sent with instructions that indicate pixel coordinates of a subset of the image to OCR, for example, to expedite the OCRing operation. Or some embodiments may trim a subset of the image to be sent to be OCRed to reduce bandwidth costs in accordance with the techniques described below by which subsets of images are filtered to identify the subsets likely to have relevant text.
- the classification is a binary classification as confidential or nonconfidential. In some embodiments, the classification is into a ranking of different confidentiality levels. In some cases, the classification is to into a hierarchical taxonomy or other ontology, for instance with classifications indicating that text is confidential and is only to be viewed by employees above a threshold rank, or is confidential and only to be viewed by users on the same team or on the same tenant account or within the same network. In some embodiments, the classifications indicate that the text is indicative of a particular state of mind, such as a disgruntled employee, happy customer, angry vendor, or the like. Some embodiments may apply different transformations to image pixel values based upon these different classifications, for example, highlighting text corresponding to a particular state of mind, while redacting other text classified as confidential.
- a given token may be subject to multiple classifications in virtue of being part of different n-grams classified in different ways. Some embodiments may select a classification that affords a highest amount of confidential information protection from among these different classifications for the given token.
- Some embodiments may infer a background color based on a median pixel value for the image, for instance, a median red, blue, and green sub-pixel value, and then some embodiments may select an opposing color from a color space mapping, for instance selecting a white redaction color for text on a black background, or selecting purple redaction for an orange background. Or some embodiments may select as the redaction color the color of text. For example, some embodiments may determine that color corresponds to text in a bounding box by determining a histogram of pixel colors in the bounding box and selecting a peak in the histogram that is different from pixel colors corresponding to a perimeter of the bounding box (e.g., an average pixel value at the perimeter).
- Some embodiments may modifying pixel values with other types of transformations. For example, some embodiments may apply a blurring convolution to pixels in the bounding box, such as a Gaussian blur convolution. For example, for each pixel, some embodiments may calculate the average of pixels within five pixels in each direction in image space and set the value of that central pixel to be equal to the average.
- a blurring convolution to pixels in the bounding box, such as a Gaussian blur convolution.
- some embodiments may calculate the average of pixels within five pixels in each direction in image space and set the value of that central pixel to be equal to the average.
- Some embodiments may facilitate sharing by associating the set of versions of the bitmap image with a relatively short uniform resource locator with the URL generator 46 .
- Some embodiments may determine a URL of the collection of versions, such as a URL with less than eight characters, less than 16 characters, or less than 32 characters.
- To generate the URL some embodiments may calculate a hash digest of the bitmap image, such as the unredacted version or the redacted version, and include this digest in the URL.
- Some embodiments may provide the URL to the user computing device 14 uploaded the image and present the URL in association with the image in a webpage provided by the web server 26 , and users may share the image by providing the URL to other users, for instance, by texting, email, or the like.
- the URLs may expire assets over some duration of time.
- a timestamp may be associated with URL by the application 12 , and some embodiments may deindex the URL to the content referenced by the URL in response to determining that a threshold duration of time has elapsed or a step threshold number of access requests have been received.
- scoring of risk associated with sharing instances may depend upon a role assigned to a sharing person in a tenant account, for instance, a public relations or marketing person may be expected to share more prolifically than an engineer, and the above-described sharing instance weightings may be adjusted based upon weights assigned to these roles, for instance in tenant policies, as weighted sums.
- some embodiments may modulate access control with access control module 50 . In some embodiments, this may include responding to the above-described risk related alerts to depermission users automatically. Some embodiments may determine whether a request for a content item at a given URL should be serviced with a more heavily redacted version, a less heavily redacted version, or an unredacted version of a content item based upon permissions associated with a request or permissions associated with a sharer that created the content item at the given URL.
- Some embodiments may deindex URLs from content items responsive to determining that the URL was created more than a threshold duration of time in the past or that the content item at the URL has been access more than a threshold amount of times or the access requests indicate greater than a threshold level of risk. Some embodiments may selectively provide access based upon roles and permissions. For example, some embodiments may automatically provide access to unredacted versions of content items to members on the same team or to users in the same tenant organization or to users on the same or a specified local area network or virtual private network, while blocking access or providing access to redacted versions to users requesting content that do not satisfy these criteria. Some embodiments may rate limit access requests associated with individual users or computing devices, for instance, by maintaining a count of a number of access request received over a trailing duration of time and blocking or throttling responses by inserting delays when the count exceeds a threshold.
- FIG. 2 shows an example of a process 60 that may be implemented in the computing environment 10 described above, which is not limited to that implementation, and which is not to suggest that any other description herein is limiting.
- program code by which the functionality of process 60 and the other functionality described herein is implemented may be stored on a tangible, non-transitory, machine-readable medium, such that when that program code is executed by one or more processors, the described functionality is effectuated.
- the term “medium,” singular, is used herein to refer to either a singular instance or media or multiple instances of media storing different subsets of the program code, for instance, in memory of different computing devices. So reference to singular “medium” should be read as encompassing instances in which different subsets of the described instructions are executed on different computing devices.
- Some embodiments may receive a bitmap image corresponding to a screen capture event on a first device, as illustrated by block 62 .
- this operation may be performed by the screen-sharing application 22 , or a bitmap image may also be characterized as having been received by the screen-content sharing application 12 upon the screen-sharing application 22 sending that bitmap image to the screen-content sharing application 12 .
- the bitmap image is received in compressed encoded format or uncompressed format.
- the bitmap image is received with metadata identifying a user account sharing the bitmap image corresponding to records in the illustrated repositories of FIG. 1 .
- the bitmap image is received with metadata obtained from the operating system indicating bounding boxes of different windows and identifiers of applications depicted in those windows.
- the above-describe patterns may include operators that specify these applications, for example, designating some patterns as only applying to certain subsets of applications or individual applications installed on user devices.
- some embodiments may receive a text document with results of the OCRing, the text document including a plurality of entries, each entry including text depicted in a subset of the document, like a line of text, and a bounding box with pixel coordinates indicating where in the bitmap image that text is depicted.
- this text document may be in a serialized hierarchical data format, for instance, as a collection of lists and dictionaries, like in a JSON document. It should be emphasized that OCRing is one example of a variety of image detection and localization techniques by which some embodiments may determine which areas of an image depict confidential information of various types.
- Some embodiments may apply computer vision techniques to detect and localize confidential information (e.g., Haar edge detection, Canny edge detection, blob detection, color/intensity histogram analysis, and the like, along with various spacial and color thresholding on detected features).
- Some embodiments may apply object detection and localization algorithms based on machine learning techniques, like convolutional neural networks trained to detect and localize confidential information. In some cases, these techniques may be pipelined, e.g., with neural networks responding to featured detected with computer vision techniques or by OCRing.
- Some embodiments may then access a collection of patterns in a policy in a tenant account associated with the user from which the bitmap image is received and determine whether there are more patterns in the set to process, as indicate by block 66 . Upon determining that there are more patterns to process, some embodiments may select a next pattern in the set, as indicated by block 68 . Some embodiments may then determine whether there are more n-grams in the OCR results to compare to the selected pattern, as indicated by block 70 . Upon determining that there are more n-grams process, some embodiments may select a next n-gram, as indicated by block 72 .
- this may include incrementing forward one token and selecting the next n-gram of some threshold size and repeating this process iteratively for increasing sizes, for instance, ranging from one up through five tokens in series. Some embodiments may then determine whether the selected n-gram matches the pattern, as indicated by block 74 . Upon determining that the pattern does not match, some embodiments may return to block 70 to determine whether there are more n-grams to compare with the pattern. Upon determining that there are no more n-grams to compare with the pattern, some embodiments may return to block 66 to determine whether there are more patterns to process.
- Some embodiments may then modify pixels in the bounding box to obfuscate the pattern matching n-gram, as indicated by block 78 .
- Obfuscation may take any of the above-described forms, and as noted above, modification may include changing values in an existing copy or forming a new copy without modifying an original copy.
- Upon modifying the pixels some embodiments may return to block 70 . Or some embodiments may execute these operations in a different order, for instance, identifying all pattern matches before determining bounding boxes and modifying pixels, which is not to suggest that any other describe sequence or described feature herein is limiting.
- some embodiments may proceed to store the modified version of the bitmap image, as indicated by block 79 , for instance in the above-describe content repository 36 .
- some embodiments may store multiple versions of the bitmap image, some modified with different amounts of redaction, and associated with an identifier associated with the collection of different versions.
- Some embodiments may proceed to update a risk score based on the pattern matches, as indicated by block 80 .
- Some embodiments may update risk scores as a batch process, for instance, daily or in response to events, like an individual sharing event.
- Some embodiments may update risk scores by calculating aggregate values, like measures of central tendency, such as means, medians, or modes, over collections of risk scores from individual content items that are shared or individual instances in which content items are shared, for instance, over a trailing duration of time, like over a day, week, or month, in some cases down-weighting older risk scores in the aggregate, for instance with a half-life weighting that decreases the effect of events over time.
- Such aggregate risk scores may be calculated for users, teams, types of classified content, tenants, client-side applications in which the content is depicted in screenshots or videos, or the like.
- Results may be depicted in dashboards like those described above, for instance in rankings, timeseries, list of those exceeding some threshold, or the like, and results may be acted upon by logging or alerting instances in which thresholds are exceeded or automatically depermission users.
- Some embodiments may determine a URL for the screen capture to be shared, as indicated by block 82 , and provide that URL to the first computing device, as indicated by block 84 .
- the user of the first device may then provide that URL to other users, which may request their browser to navigate to the URL to retrieve the shared content or redacted version thereof from a remote server.
- some embodiments may subsequently receive a request for content at the URL from a second computing device, as indicated by block 86 .
- Some embodiments may determine whether the requester is authorized, as indicated by block 88 , for instance whether they have supplied the appropriate credentials, are associated with the appropriate role, are on the appropriate network, or are on the appropriate team or tenant account, and in response to determining the user is not authorized, deny access, as indicated by block 90 . Alternatively, some embodiments may then determine whether the user has permission to access the shared content in unredacted form, for instance, based on one or more of these criteria as well, as indicated by block 92 . Upon determining that the user does not have permission to access the content in unredacted form, some embodiments may send the modified version of the bitmap image to the second computing device, as indicated by block 94 .
- some embodiments may send the received version of the bitmap image, as indicated by block 96 , an operation that may include sending a version subject to some transformation that leaves the text unredacted, for instance, a version that is compressed with a different compression format from that in which it is received may still serve as sending the received version.
- FIG. 3 shows an example of a screen captured bitmap image 100 having regions 102 with depicted text expressed as pixel values.
- bitmap image as illustrated, include substantially more.
- the bitmap image, as received by the sharing application is a collection of pixel coordinates and pixel intensities and does not directly explicitly indicate to a computer the depicted text, the font of the depicted text, or the any particular subset of the image depicts text, despite the appearance of such text being self-evident to a human viewer.
- FIG. 4 shows a modified version 104 of the screenshot in which some of the regions 106 have been modified to redact portions of the text while other regions 102 remain unredacted
- Video may present particular challenges due to scaling issues that arise from the number of images presented in the frames of video and the leakage of information across sequential frames that, in some cases may not be present in any one frame.
- some embodiments may implement a server-side screen-content sharing application 120 shown in FIG. 5 .
- the computing environment 10 of FIG. 5 may be like that of FIG. 1 , with like element numbers depicting components with the same or similar features.
- some embodiments may add to the screen-content sharing application an inter-frame entropy scoring module 122 , a frame filter 124 , a video segmenter 126 , and access control module and a reverse selector module 128 . These components may be controlled by the control module 30 , which in some cases may cause the components of the application 122 cooperate to effectuate a process described below with reference to FIG. 6 .
- the application 120 may be configured to classify information appearing in video frames, such as text information from OCRing video frames, and manipulate pixel values in the video frames based on the classification, for instance, to redact or otherwise obfuscate text classified as confidential.
- policies may be downloaded to client applications 22 , such as policy having patterns by which text is classified. In some cases, policies may be pushed to the client application 22 or pulled by the client application 22 .
- users may screen cast video with their client-side screen-sharing application 22 .
- Screen casting video may show the client device's display as it evolves over time, while the user interacts with various user interfaces displayed thereon.
- screen cast video may further include audio, such as audio generated by the computer presenting the display or audio supplied by the user via a microphone, for example, to explain their actions in the user interfaces in a tutorial.
- the video may be shared in real time, for example, in video chat where the video is streamed and displayed on a recipients computing device within less than one second, such as less than 200 ms, of when the displayed events occur on the client computing device screen casting the video.
- the video may be recorded for sharing at a later time, for instance, for being stored in one of the above-described boards and shared among members of a team, a tenant account, or outside of tenant accounts, for example, with the general public.
- the above-described access control techniques may be applied to video content in a manner similar or identical to that described with reference to screen capture images.
- risk monitoring techniques may be applied to video content in a manner that is similar or identical to that described with reference to screen capture images.
- the inter-frame entropy scoring module 122 may be configured to receive a screen cast video.
- the entire screen cast video may be received concurrently, for instance, a stored screen cast video uploaded for sharing in non-real-time use cases.
- the screen cast video may be received as a stream, for instance, in video chat applications, where some segments of the video are received before the video in its entirety is received and in some cases before the video in its entirety is generated.
- screen cast video may be received in a compressed encoding format, for instance, after compressing the video frames with the video codec. Examples include MPEG-4, AV1, VP 8, VP 9, and the like.
- the video codec may designate different frames as i-frames, p-frames, or b-frames, depending upon amounts of change in information depicted between frames.
- the video codec may compress individual frames by segmenting the individual frames into square blocks, such as an eight pixel by eight pixel blocks or larger.
- those blocks may be transformed, such as into a discrete cosine transform matrix or an asymmetric discrete sine transform matrix, in which spatial frequency components of variation in pixel intensity are represented in various matrix values.
- Some embodiments may threshold these matrices according to a quantization matrix to set a subset of the values to zero.
- Some embodiments may then compress the resulting matrix values, for instance, with entropy coding compression algorithm, like Huffman coding, arithmetic coding, or the like.
- the compressed encoding may further include movement vectors of various blocks or macro blocks indicating changes in position of blocks between frames, for instance, relative to an i-frame that proceeds a given p or b frame.
- the compression is lossy or in some embodiments the compression is lossless.
- the compressed video is sent from the client screen sharing application 22 to the server-side screen-content sharing application 120 , or some embodiments may apply the below-described filtering techniques on the client side and only send a subset of the frames or subset of individual frames to the server-side application 120 for processing.
- the operating system may be queried by the screen-sharing application for a current state of bounding boxes of applications and identifiers of those applications. Some embodiments may associate a stream of window bounding box data with the video for purposes of classification using techniques like those described above.
- the client application 22 may periodically or responsive to some event (like one signaling a screen capture) query the operating system with, for example, an EnumWindows command to obtain a list of windows (and in some cases, identifiers of applications) and then iterate through members of that list and call a GetWindowRect function to obtain bounding boxes in screen coordinates of displayed windows.
- This information may be reported, in some case, with an offset to coordinate origins of a screen capture, e.g., where the screen capture is a subset of the screen and screen coordinates have a different origin from image pixel coordinates, so that embodiments may translate between the coordinate systems and overlay the bounding boxes of identified windows on the screen shot to add an additional channel of information about the displayed information.
- Some embodiments may decode the compressed video with a suitable video decoder and access the individual frames a video, which may each be a bitmap image having an associated sequence identifier in the sequence of frames of the video.
- Some embodiments of the scoring module 122 may determine an inter-frame entropy score that indicates an amount of change in information presented in a given frame relative to one or more previous frames.
- the entropy score may be an aggregate value based upon a pixel-by-pixel subtraction of one frame from a preceeding frame, such as a root-mean-square (RMS) value of pixel deltas.
- RMS root-mean-square
- Some embodiments may determine the difference as a perceived difference with metrics described in a paper titled “Measuring perceived color difference using YIQ NTSC transmission color space in mobile applications” by Kotsarenko et al., Programa Terms Matematica y Software (2010), Vol. 2. No 2. ISSN: 2007-3283, the contents of which are hereby incorporated by reference.
- some of these differences may be discarded as being caused by various artifacts that are not indicative of changes in client-side program state, such as differences due to antialiasing.
- some embodiments may apply the techniques in a paper titled “Anit-alias Pixel Intensity Slope Detector” Vysniauskas, Elektronika it Elektrotechnika, October 2009, the contents of which are hereby incorporated by reference. Lossy compression may similarly give rise to some differences.
- Some embodiments may down weight differences, for instance, by multiplying by a weight coefficient between zero and one with differences in blocks including a mouse pointer, for example, which may cause lossy compression artifacts to vary between frames in local blocks if the mouse pointer moves but nothing else on the screen changes.
- pixel values and frames may be sampled, for instance some embodiments may determine pixel-by-pixel differences only for every even row or column (or both), of pixel values. Or some embodiments may sample more sparsely, for instance, sampling every 10th pixel horizontally in every 10th row to determine deltas.
- Some embodiments may assign entropy scores to entire frames, or some embodiments may also assign scores to subsets of frames, for instance some embodiments may determine a convex hull or bounding box of pixel values having greater than a threshold delta between consecutive frames and designate only those pixels within these bounding areas as warranting subsequent processing to redact subsets of frames. Or some embodiments designate areas in bounding boxes of windows in which an aggregate amount of difference exceeds some threshold.
- the video compression encoding process may be leveraged to expedite identification of frames having a relatively high entropy relative to an earlier frame.
- Some embodiments may designate i-frames labeled in the compression encoding as having a binary entropy score of one and non-i-frames having a binary entropy score of zero.
- some embodiments may infer inter-frame entropy based upon an amount of data encoding a non-i-frame relative to a previous i-frame or a previous non-a frame.
- the inter-frame entropy score is based upon differences between a given frame and multiple preceeding frames or a non-sequentially adjacent proceeding frame. For example, some embodiments may determine a difference between a given frame and a frame two, four, six, or eight frames earlier to avoid scenarios by which information gradually creeps onto a display, for instance with a slowly moved window that is gradually moved from a peripheral offscreen position into a portion of the display screen where the window is visible.
- Some embodiments may determine an aggregate score based upon both these earlier frames and an immediately preceding frame, such as a score of zero if neither of these types of earlier frames produces an aggregate measure of difference greater than a threshold and a score of one if either one of these produces an aggregate measure of difference greater than one. Or some embodiments may calculate an aggregate score relative to multiple preceding frames based, for example, on a weighted sum with a half-life weighting assigned to differences of each of a number of proceeding frames.
- Some embodiments may leverage compression encoding to facilitate concurrent operations. For example, some embodiments may segment a video by i-frame and then concurrent currently processed video frames in different threads or processes with the techniques described herein on each of the different segments starting with a distinct i-frame. Or some embodiments may concurrently determine inter-frame entropy scores on each of multiple frames, for instance, in some cases with map reduce operations in which a mapping function determines pixel differences and a reducing function determines an aggregate measure, for example with a Hadoop implementation.
- Some embodiments may then filter the frames with a frame filter 124 to select a subset of the frames to be OCRed or otherwise subject to more intensive and computationally expensive analysis.
- the frame filter may filter the frames based upon the inter-frame entropy scores, for instance, selecting those frames having greater than a threshold score (or less than a threshold score if signs are reversed) for processing.
- some embodiments may select a subset of the frames in the video having a relatively large amount of entropy, or presented information, relative to earlier frames in the sequence.
- the frame filter may select subsets of individual frames to be OCRed or otherwise subject to more intensive and computationally expensive analysis. Some embodiments may designate subsets of frames corresponding to bounding boxes of applications having particular identifiers to be subject to this analysis or to be excluded from this analysis.
- an API of the OCR service may include an input by which pixel coordinates may be designated for analysis and OCRing.
- images, like frames, submitting to be OCRed may be modified to effectuate faster transmission and OCRing, for instance, by setting pixel values to clear or white in regions that are not to be OCRed, thereby enhancing compression of the image for transmission and potentially expediting the OCR operations by reducing the pixel count involving more intensive analysis.
- some embodiments may break up a frame into multiple images from subsets of the frame and submit those different images to be OCRed.
- some embodiments may maintain a mapping of coordinate spaces between an origin of the segmented images and an origin of the original frame, and these mappings may be accessed to translate bounding box positions from OCR records back into a coordinate space of the original frame.
- the compression encoding may be leveraged to identify subsets of a frame that potentially have text information. For example, some embodiments may designate regions of an image with transform matrix blocks having amplitudes of less than a threshold value for frequency components with greater than a threshold value as containing information unlikely to encode text. For instance, white space transform matrices often have values of zero for all frequency components other than the DC component.
- some embodiments may interrogate both the compressed format and the un-compressed bitmap images produced by a video decoder to expedite computation.
- Some embodiments may then designate subsequent frames or portions thereof as being represented by frames or portions thereof that pass the frame filter 124 with the video segmentor 126 .
- some of embodiments may associate each selected frame with a duration of time, which in some cases may be represented by a number of subsequent frames and need not be encoded in units of time, and that duration of time may expressly or implicitly identify a set of subsequent frames in the video that will receive the same or similar treatment as the selected frame or subset thereof.
- the video segmentor 126 may, for each selected frame, form a segment that includes each frame until the next selected frame. That is, segments may be delimited by selected frames.
- some embodiments may form a video segment corresponding to, for example an application window, and there may be multiple overlapping segments in time corresponding to different subsets of the area of the display, with pixels in those different areas, along those different segments, being subject to similar or the same processing, based upon an initially selected frame or region thereof.
- some embodiments may designate the rest of the display as being part of one region and one segment through a relatively long duration of time, while the smaller window in that one corner may be designated as another region that may include multiple segments over that region and the duration of time.
- Some embodiments may then submit each of the initially selected frames or regions of frames to the above-describe processing by which bitmap images are selectively redacted or otherwise obfuscated or modified. For instance, upon a user launching a new application window, a new video segment may be started by the video segmentor 126 responsive to the frame filter 124 selecting that initial presentation of information that may be relatively different from what was on the display before. In some cases, that initial presentation of information may then be subject to redaction as a bitmap image with the techniques described above.
- the same or similar modifications applied to the initial frame or segment of a frame of a segment may be applied to each of the subsequent frames in a segment.
- those same pixel addresses may be set to black in each of the subsequent frames in a video segment determined by the video segmentor 126 .
- Similar operations may be applied in implementations in which frames are processed as distinct regions, for instance, by window bounding boxes both in time and display space, whereby pixel modifications are carried forward within video segments to portions of subsequent frames, in some cases, with a given frame having different regions in different overlapping segments that start and stop at different times.
- operations may be expedited by manipulating the compressed encoded version of video.
- some embodiments may manipulate the amplitude components of transform matrices, for example, setting all values to zero except the DC component, which may be set to a value that causes the block to depict black or white.
- some embodiments may avoid the relatively computationally expensive operation of fully re-compressing the video. This operation is consistent with reference to manipulating pixel values in a bitmap image.
- some embodiments may directly manipulate pixel values in a bitmap image in uncompressed form and then recompress the video for distribution, which is not to imply that other features may not be varied.
- a sequence of video frames may leak information in a way that does not match to a pattern, for instance, a video sequence in which a user types in confidential information that does not match to a pattern until a final character is entered.
- Another example includes when a user types a password into an input box that automatically displays each typed character and then replaces it with a placeholder, like an asterix.
- the video sequence may reveal the password, even if the password does not pattern match in a final frame of the sequence because of the replacements with the placeholders.
- some embodiments may include a reverse frame selector 128 configured to add earlier frames to a video segment and apply pixel modifications, like redaction to those earlier frames in a manner similar or identical to how those manipulations are applied to the frames subsequent to the frame selected by the frame filter 124 .
- the reverse selector 128 may select earlier frames in which (e.g. in response to determining that) a prefix of text classified as confidential is present within a bounding box that overlaps a bounding box of the text classified as confidential.
- the prefix being typed may be redacted in some use cases in the earlier frames based upon the confidential information being detected in the later frame, even if that prefix does not match a specified pattern.
- Some embodiments may similarly be configured to detect placeholder characters with a pattern that matches to those placeholder characters in a password input.
- a pattern may specify the text of the word password be within some threshold distance on a display of these placeholder characters in OCR text of a selected frame.
- Some embodiments may then apply the above-described time-reversed redaction technique, for instance, over an entire bounding box region of the fully entered password, in some cases expanding this bounding box over some threshold margin, to obfuscate individual type characters of a password.
- FIG. 6 shows an example of a process 150 that may be executed by the above-described system of FIG. 5 , though which is not limited to that implementation.
- the arrangement of operations may be modified in ways like that described above with reference to FIG. 2 , which is not to suggest that any other description is limiting.
- the process 150 includes obtaining screen cast video, as indicated by block 152 , and determining whether there are more frames of the video to process, as indicated by block 154 . Upon determining that there are more frames to process, some embodiments may select a next frame, as indicated by block 156 , and score the inter-frame entropy of that frame relative to earlier frames, as indicated by block 158 . Some embodiments may determine whether that score exceeds a threshold, as indicated by block 160 . Upon determining that the score does exceeds a threshold, some embodiments may designate frames since a previously selected frame as part of the same video segment beginning with that previously selected frame, as indicated by block 162 .
- Some embodiments may filter regions of the first frame of the designated segment to OCR, in some cases, as indicated by block 164 .
- filtering of those regions may include segmenting the frames into different images, each corresponding to different windows or subsets thereof.
- filtering regions may include classifying portions of the frame as indicating information unlikely to be confidential, like toolbars.
- Some embodiments may detect toolbars with a convolutional neural network trained on labeled examples of display screens, and those trained models may detect and localize, for instance, with bounding boxes, toolbars to be excluded from OCR processing.
- some embodiments may detect those toolbars with unsupervised image processing machine learning models, for example trained on earlier video in which the toolbars exhibit features that appear relatively frequently compared to other content. The unsupervised model may be trained to detect those features that appear more frequently and based on those features detect and localize toolbars to designate areas to be excluded from OCRing.
- Some embodiments may OCR the first frame of a segment, or portion thereof, as indicated by block 166 . In some cases, this may include causing the first frame to be OCRed by a third party service.
- Some embodiments may then score text in the OCR record produced by OCRing, indicated by block 168 , for instance, with the techniques described above with reference to FIGS. 1 through 4 . Some embodiments may then classify text in the OCR record, for instance, with the techniques described above with reference to FIGS. 1 through 4 , as indicated by block 170 . Some embodiments may determine whether earlier frames are implicated, as indicated by block 172 , for instance, with the above-described reverse selector 128 of FIG. 5 . Upon determining that the earlier frames are implicated, some embodiments may prepend the earlier frames to the segment and designate a new frame as the first frame of that segment.
- some frames may end up in multiple segments and thus may be manipulated with different types of redaction due to processing of those different segments.
- some embodiments may lock frames being processed, for example, with a mutex or spin lock, or some embodiments may generate a set of masks to be applied to frames once all operations are complete, thereby redacting the joint set of areas redacted in the masks.
- some embodiments may modify pixel values in bounding boxes of text classified as confidential to redact that text, as indicated by block 176 . Or some embodiments may apply any of the above-described modifications based upon different types of classified text. Some embodiments may proceed to determine whether there are more frames in the segment the process, as indicated by block 178 . Upon determining that there are more frames in the segment to process, some embodiments may select a next frame, as indicated by block 180 , and return to block 176 to continue modifying pixel values in the bounding boxes of the text classified as confidential in the selected frame by which the segment is defined.
- some embodiments may append the modified segment to the redacted version of the video, as indicated by block 182 .
- reference to the “video” includes reference to a single copy that is modified or reference to a collection of copies with different versions having different modifications, or reference to any one of those versions.
- some embodiments may execute concurrent operations in which additional segments are initiated, returning to block 154 to determine whether there are more frames to process.
- some embodiments may store the redacted version of the screen cast video in memory, as indicated by block once 184 , for instance in the above-describe content repository 36 .
- multiple versions may be associated with one another with different amounts of redaction applied to different versions (in some cases all associated with the same generated URL) and with different permissioning, and risk scores may be generated in association with those operations.
- Some embodiments may selectively share the redacted or unredacted versions of the screen cast video, as indicated by block 186 , for example, implementing the above-describe techniques by which authorization and permission are determined or access is otherwise selectively granted, for instance, with different users being exposed to different versions of the video.
- a user interface may be presented in real-time showing a person conducting a screen cast which portions of their video are redacted, so they can see what the person with whom there sharing their screen sees.
- some embodiments may overlay redacted regions with a different type of pixel modification that designates those regions but still renders them viewable to the person conducting the screen cast.
- some embodiments change a tint or highlighting of those regions to indicate what is being modified, so that the person conducting the screen cast can understand what the recipient is seeing.
- some embodiments may include a user input by which the person conducting the screen cast can toggle a view to show redacted in unredacted versions of video or screen capture images, so they can understand what they are sharing.
- techniques like those described above may also be applied to audio.
- some embodiments may submit the audio of a screen cast to a speech-to-text conversion service and apply the above-describe patterns to the text.
- Some embodiments may change the audio wave form, for example, suppressing it or silencing it during durations of time corresponding to text that matches a pattern.
- the above-described techniques are not limited to screen-sharing use cases. Techniques like those applied above may also be applied to other forms of video, audio, images, or documents. For example, the above-describe techniques may be applied to images embedded in emails or other documents, for instance, in an email filter, intrusion detection system, or network firewall. Similarly, the techniques described above may be applied to image content generally in chat messages, Internet traffic, or the like.
- FIG. 7 is a diagram that illustrates an exemplary computing system 1000 in accordance with embodiments of the present technique.
- Various portions of systems and methods described herein may include or be executed on one or more computer systems similar to computing system 1000 . Further, processes and modules described herein may be executed by one or more processing systems similar to that of computing system 1000 .
- Computing system 1000 may include one or more processors (e.g., processors 1010 a - 1010 n ) coupled to system memory 1020 , an input/output I/O device interface 1030 , and a network interface 1040 via an input/output (I/O) interface 1050 .
- a processor may include a single processor or a plurality of processors (e.g., distributed processors).
- a processor may be any suitable processor capable of executing or otherwise performing instructions.
- a processor may include a central processing unit (CPU) that carries out program instructions to perform the arithmetical, logical, and input/output operations of computing system 1000 .
- CPU central processing unit
- a processor may execute code (e.g., processor firmware, a protocol stack, a database management system, an operating system, or a combination thereof) that creates an execution environment for program instructions.
- a processor may include a programmable processor.
- a processor may include general or special purpose microprocessors.
- a processor may receive instructions and data from a memory (e.g., system memory 1020 ).
- Computing system 1000 may be a uni-processor system including one processor (e.g., processor 1010 a ), or a multi-processor system including any number of suitable processors (e.g., 1010 a - 1010 n ). Multiple processors may be employed to provide for parallel or sequential execution of one or more portions of the techniques described herein.
- I/O device interface 1030 may provide an interface for connection of one or more I/O devices 1060 to computer system 1000 .
- I/O devices may include devices that receive input (e.g., from a user) or output information (e.g., to a user).
- I/O devices 1060 may include, for example, graphical user interface presented on displays (e.g., a cathode ray tube (CRT) or liquid crystal display (LCD) monitor), pointing devices (e.g., a computer mouse or trackball), keyboards, keypads, touchpads, scanning devices, voice recognition devices, gesture recognition devices, printers, audio speakers, microphones, cameras, or the like.
- I/O devices 1060 may be connected to computer system 1000 through a wired or wireless connection.
- I/O devices 1060 may be connected to computer system 1000 from a remote location.
- I/O devices 1060 located on remote computer system for example, may be connected to computer system 1000 via a network and network interface 1040 .
- System memory 1020 may be configured to store program instructions 1100 or data 1110 .
- Program instructions 1100 may be executable by a processor (e.g., one or more of processors 1010 a - 1010 n ) to implement one or more embodiments of the present techniques.
- Instructions 1100 may include modules of computer program instructions for implementing one or more techniques described herein with regard to various processing modules.
- Program instructions may include a computer program (which in certain forms is known as a program, software, software application, script, or code).
- a computer program may be written in a programming language, including compiled or interpreted languages, or declarative or procedural languages.
- a computer program may include a unit suitable for use in a computing environment, including as a stand-alone program, a module, a component, or a subroutine.
- a computer program may or may not correspond to a file in a file system.
- a program may be stored in a portion of a file that holds other programs or data (e.g., one or more scripts stored in a markup language document), in a single file dedicated to the program in question, or in multiple coordinated files (e.g., files that store one or more modules, sub programs, or portions of code).
- a computer program may be deployed to be executed on one or more computer processors located locally at one site or distributed across multiple remote sites and interconnected by a communication network.
- System memory 1020 may include a tangible program carrier having program instructions stored thereon.
- a tangible program carrier may include a non-transitory computer readable storage medium.
- a non-transitory computer readable storage medium may include a machine readable storage device, a machine readable storage substrate, a memory device, or any combination thereof.
- Non-transitory computer readable storage medium may include non-volatile memory (e.g., flash memory, ROM, PROM, EPROM, EEPROM memory), volatile memory (e.g., random access memory (RAM), static random access memory (SRAM), synchronous dynamic RAM (SDRAM)), bulk storage memory (e.g., CD-ROM and/or DVD-ROM, hard-drives), or the like.
- non-volatile memory e.g., flash memory, ROM, PROM, EPROM, EEPROM memory
- volatile memory e.g., random access memory (RAM), static random access memory (SRAM), synchronous dynamic RAM (SDRAM)
- bulk storage memory e.g.
- System memory 1020 may include a non-transitory computer readable storage medium that may have program instructions stored thereon that are executable by a computer processor (e.g., one or more of processors 1010 a - 1010 n ) to cause the subject matter and the functional operations described herein.
- a memory e.g., system memory 1020
- Instructions or other program code to provide the functionality described herein may be stored on a tangible, non-transitory computer readable media. In some cases, the entire set of instructions may be stored concurrently on the media, or in some cases, different parts of the instructions may be stored on the same media at different times.
- I/O interface 1050 may be configured to coordinate I/O traffic between processors 1010 a - 1010 n , system memory 1020 , network interface 1040 , I/O devices 1060 , and/or other peripheral devices. I/O interface 1050 may perform protocol, timing, or other data transformations to convert data signals from one component (e.g., system memory 1020 ) into a format suitable for use by another component (e.g., processors 1010 a - 1010 n ). I/O interface 1050 may include support for devices attached through various types of peripheral buses, such as a variant of the Peripheral Component Interconnect (PCI) bus standard or the Universal Serial Bus (USB) standard.
- PCI Peripheral Component Interconnect
- USB Universal Serial Bus
- Embodiments of the techniques described herein may be implemented using a single instance of computer system 1000 or multiple computer systems 1000 configured to host different portions or instances of embodiments. Multiple computer systems 1000 may provide for parallel or sequential processing/execution of one or more portions of the techniques described herein.
- Computer system 1000 is merely illustrative and is not intended to limit the scope of the techniques described herein.
- Computer system 1000 may include any combination of devices or software that may perform or otherwise provide for the performance of the techniques described herein.
- computer system 1000 may include or be a combination of a cloud-computing system, a data center, a server rack, a server, a virtual server, a desktop computer, a laptop computer, a tablet computer, a server device, a client device, a mobile telephone, a personal digital assistant (PDA), a mobile audio or video player, a game console, a vehicle-mounted computer, or a Global Positioning System (GPS), or the like.
- PDA personal digital assistant
- GPS Global Positioning System
- Computer system 1000 may also be connected to other devices that are not illustrated, or may operate as a stand-alone system.
- the functionality provided by the illustrated components may in some embodiments be combined in fewer components or distributed in additional components.
- the functionality of some of the illustrated components may not be provided or other additional functionality may be available.
- instructions stored on a computer-accessible medium separate from computer system 1000 may be transmitted to computer system 1000 via transmission media or signals such as electrical, electromagnetic, or digital signals, conveyed via a communication medium such as a network or a wireless link.
- Various embodiments may further include receiving, sending, or storing instructions or data implemented in accordance with the foregoing description upon a computer-accessible medium. Accordingly, the present techniques may be practiced with other computer system configurations.
- illustrated components are depicted as discrete functional blocks, but embodiments are not limited to systems in which the functionality described herein is organized as illustrated.
- the functionality provided by each of the components may be provided by software or hardware modules that are differently organized than is presently depicted, for example such software or hardware may be intermingled, conjoined, replicated, broken up, distributed (e.g. within a data center or geographically), or otherwise differently organized.
- the functionality described herein may be provided by one or more processors of one or more computers executing code stored on a tangible, non-transitory, machine readable medium.
- third party content delivery networks may host some or all of the information conveyed over networks, in which case, to the extent information (e.g., content) is said to be supplied or otherwise provided, the information may provided by sending instructions to retrieve that information from a content delivery network.
- the word “may” is used in a permissive sense (i.e., meaning having the potential to), rather than the mandatory sense (i.e., meaning must).
- the words “include”, “including”, and “includes” and the like mean including, but not limited to.
- the singular forms “a,” “an,” and “the” include plural referents unless the content explicitly indicates otherwise.
- Statements in which a plurality of attributes or functions are mapped to a plurality of objects encompasses both all such attributes or functions being mapped to all such objects and subsets of the attributes or functions being mapped to subsets of the attributes or functions (e.g., both all processors each performing steps A-D, and a case in which processor 1 performs step A, processor 2 performs step B and part of step C, and processor 3 performs part of step C and step D), unless otherwise indicated.
- statements that one value or action is “based on” another condition or value encompass both instances in which the condition or value is the sole factor and instances in which the condition or value is one factor among a plurality of factors.
- statements that “each” instance of some collection have some property should not be read to exclude cases where some otherwise identical or similar members of a larger collection do not have the property, i.e., each does not necessarily mean each and every.
- a tangible, non-transitory, machine-readable medium storing instructions that when executed by one or more processors effectuate operations comprising: receiving, with one or more processors, a screen capture event from an operating system of a first client computing device of a first user, the screen capture event including, or being associated with, a bitmap image of at least part of a display of the first computing device; causing, with one or more processors, optical character recognition (OCRing) of text in the bitmap image and obtaining, as a result of the OCRing, an OCR record with text appearing in the bitmap image and indicating locations of characters of the text in the image with coordinates of pixels in the bitmap image, the text comprising a plurality of n-grams; scoring, with one or more processors, each of the n-grams based on whether the respective n-grams match any of a plurality of patterns; classifying, with one or more processors, each of the n-grams based on the scoring into two or more categories, the two or more categories
- the bitmap image is sent by the first client computing device to the remote server system before the scoring; the remote server system causes the OCRing, scores each of the n-grams, classifies each of the n-grams, obfuscates respective n-grams in the bitmap image, and stores the modified version of the bitmap image; at least some of the patterns specify personally identifiable information; and at least some of the patterns specify proper nouns.
- the operations comprise: determining a risk score at least in part by normalizing an amount of pattern matches to text depicted in the bitmap image. 4.
- a screen-capture sharing client application executing on the first client computing device receives the bitmap image, causes the OCR, scores each of the n-grams, classifies the n-grams, obfuscates the respective n-grams, and provides the modified version of the bitmap image to the remote server system without providing the captured version of the bitmap image to the remote server system, such that the remote server system does not have access to obfuscated n-grams in un-obfuscated form.
- the URI is 16 characters or less; and the remote server system is configured to stop providing access to the modified version of the bitmap image after more than a threshold duration of time elapses from a reference event.
- the remote server system is configured to arrange the modified bitmap image in a collection of boards of an account of the first user, each of the boards having a plurality of bitmap images of screen captures, at least some of which depicting obfuscated region that displayed pattern matching text in a captured version of the respective image.
- the remote server system is configured to selectively grant access to the bitmap image or the modified version of the bitmap image to each of a plurality of members of a team that includes the first user based on determinations that requests for the bitmap image are from members of the team.
- the remote servers system is configured to determine whether requests are from members of the team and, in response, selectively grant access to the bitmap image to members of the team and grant access to the modified version of the bitmap image to other users who are not members of the team. 10.
- the remote server system is configured to selectively grant access to the bitmap image in response to determining that a requestor is authenticated; and the remote server system is configured to selectively grant access to the modified version of the bitmap image in response to determining that a requestor is not authenticated.
- a plurality of scores are determined for each n-gram based on one or more trained natural language processing models trained on past examples of confidential information and non-confidential information; each score indicates whether the respective n-gram matches to a respective one of the plurality of patterns; and n-grams are classified as in the category for confidential information in response to determining that any scores for the respective n-gram match to a respective one of the plurality of patterns.
- 12. The medium of any one of embodiments 1-11 wherein: at least some of the patterns match to n-grams within a threshold edit distance of a specified pattern.
- the medium of any one of embodiments 1-12 wherein: at least some of the patterns are matched, at least in part, by a prefix trie, a bloom filter, or a hash table; the OCR record comprises more than 500 delimited tokens; the plurality of patterns include some patterns matching to n-grams ranging in length up to n of 5; and the plurality of patterns comprises more than 1,000 different patterns. 14.
- the remote server system hosts accounts for a plurality of tenants; each of at least some of the tenants has a plurality of teams; each of at least some of the teams has a plurality of users; each of at least some of the users or teams have a plurality of boards; each of at least some of the boards have a plurality of screenshots shared by client applications executing on computing devices of the plurality of users with other users on the same team; and each of at least some of the tenants has a respective policy specifying a respective set of patterns by which n-grams are scored, at least one policy specifying the plurality of patterns. 15.
- classifying comprises classifying n-grams into a confidentiality hierarchy having three or more levels; and a first class of users are granted access, by the remote server system, to a first version of the bitmap image with n-grams classified in all three levels obfuscated; a second class of users are granted access, by the remote server system, to a second version of the bitmap image with n-grams classified in two of the three levels obfuscated and n-grams classified in one of the three levels obfuscated; and a third class of users are granted access, by the remote server system, to a third version of the bitmap image with n-grams classified in one of the three levels obfuscated and n-grams classified in two of the three levels obfuscated.
- the medium of any one of embodiments 1-16 comprising; logging classifications of n-grams in a plurality of screenshots taken by a plurality of users; determining risk scores based on the logged classifications; and causing the risk scores to be presented in a user interface.
- the risk scores include an aggregate team risk score based on logged classifications for a plurality of users in a team; and matches to different patterns are weighted differently based on respective types of confidential information in the risk scores based on the logged classifications or aggregate team risk score.
- modifying pixel values comprises applying a blurring convolution to pixels. 20.
- determining coordinates of pixels in the bitmap image corresponding to the respective n-gram comprises: receiving the record in a hierarchical data serialization format; parsing, from the record, a description of text detected by the OCRing and vertices of bounding polygons of respective units of text, the vertices being expressed in pixel coordinates; and designating pixels within at least some bounding polygons as corresponding to the respective n-gram.
- a tangible, non-transitory, machine-readable medium storing instructions that when executed by one or more processors effectuate operations comprising: receiving, with one or more processors of a server system implementing a confidential-information redaction service, via a network, one or more bitmap images of a display presented by graphical operating system on a computing device distinct from the server system, at least some of the bitmap images displaying at least part of a user interface of an application executed within the graphical operating system; detecting and localizing, with one or more processors of the server system, confidential information depicted in the one or more bitmap images to produce a set of areas to be redacted; modifying, with one or more processors of the server system, pixels in each of the set of areas to be redacted to redact the confidential information and form modified versions of the one or more bitmap images; and storing, with one or more processors of the server system, the modified versions of the one or more bitmap images in memory or outputting, with one or more processors of the server system, the modified versions of the one or
- detecting and localizing confidential information comprises: causing at least some of the one or more bitmap images to undergo optical character recognition; causing a machine learning model to detect and localize confidential information in at least some of the one or more bitmap images; or causing a computer vision algorithm to detect and localize confidential information in at least some of the one or more bitmap images.
- detecting and localizing confidential information comprises: causing at least some of the one or more bitmap images to undergo optical character recognition; causing a machine learning model to detect and localize confidential information in at least some of the one or more bitmap images; and causing a computer vision algorithm to detect and localize confidential information in at least some of the one or more bitmap images.
- the medium of claim 21 wherein: the one or more bitmap images are received via an application-program interface request initiated responsive to a request to upload the one or more bitmap images to a third party network-accessible service; and the operations comprise providing the modified versions of the one or more bitmap images in memory to the third party network-accessible service without revealing the confidential information to the third party network-accessible service.
- the one or more bitmap images are frames of a screen cast video from the computing device; or the one or more bitmap images are screen shots from the computing device. 26.
- a method comprising: the operations of any one of embodiments 1-25.
- a system comprising: one or more processors; and memory storing instructions that when executed by the processors cause the processors to effectuate operations comprising: the operations of any one of embodiments 1-25.
Landscapes
- Engineering & Computer Science (AREA)
- Theoretical Computer Science (AREA)
- Physics & Mathematics (AREA)
- General Physics & Mathematics (AREA)
- General Engineering & Computer Science (AREA)
- Data Mining & Analysis (AREA)
- Computer Vision & Pattern Recognition (AREA)
- General Health & Medical Sciences (AREA)
- Health & Medical Sciences (AREA)
- Evolutionary Computation (AREA)
- Artificial Intelligence (AREA)
- Software Systems (AREA)
- Computer Hardware Design (AREA)
- Multimedia (AREA)
- Computer Security & Cryptography (AREA)
- Bioethics (AREA)
- Bioinformatics & Computational Biology (AREA)
- Life Sciences & Earth Sciences (AREA)
- Bioinformatics & Cheminformatics (AREA)
- Evolutionary Biology (AREA)
- Medical Informatics (AREA)
- Computing Systems (AREA)
- Databases & Information Systems (AREA)
- Computational Linguistics (AREA)
- Computer Networks & Wireless Communication (AREA)
- Signal Processing (AREA)
- Information Transfer Between Computers (AREA)
Abstract
Description
-
- 1. Run the bitmap image through an optical character recognition (OCR) algorithm to extract text and word bounding boxes;
- 2. Analyze the string results by:
- a. Using pattern matching detect emails, credit card numbers, social security numbers, phone number, addresses, and the like, and
- b. Using natural language processing, detect proper nouns;
- 3. Apply a redaction filter to the regions of the image where embodiments detect a positive match; and
- 4. Calculate a numerical risk score for the image, in some cases normalizing the score based on a NumberOfMatches/NumberOfWords or based on a SUM(BoundaryBoxOfMatches)/image width*height.
-
- 1. Break apart video into frames;
- 2. Build an array of significantly unique frames by detecting changes in (algorithmically inferred) perceived color differences and applying anti-alias detection to ignore superficial differences, e.g., by filtering out differences of less than 80% of the previous image to avoid reacting to differences like a mouse moving across the screen. Each unique image may then be associated with a time-range for redaction purposes.
- 3. Run each significantly unique frame through the above-noted analysis and redaction algorithm;
- 4. Calculate a risk score as the sum of each frame score; and
- 5. For each detected bit of confidential information (e.g., personally identifiable information, insider information, information subject to HIPPA, etc.), apply a redaction filter to the video for the associated time frame.
2. The medium of embodiment 1, wherein: the bitmap image is sent by the first client computing device to the remote server system before the scoring; the remote server system causes the OCRing, scores each of the n-grams, classifies each of the n-grams, obfuscates respective n-grams in the bitmap image, and stores the modified version of the bitmap image; at least some of the patterns specify personally identifiable information; and at least some of the patterns specify proper nouns.
3. The medium of any one of embodiments 1-2, wherein the operations comprise: determining a risk score at least in part by normalizing an amount of pattern matches to text depicted in the bitmap image.
4. The medium of embodiment 3, wherein normalizing is based on: a ratio of an amount of text depicted in the bitmap image to pattern matching text; or a ratio of a total area of the bitmap image to cumulative area of bounding boxes of pattern matching text.
5. The medium of any one of embodiments 1-4, wherein: a screen-capture sharing client application executing on the first client computing device receives the bitmap image, causes the OCR, scores each of the n-grams, classifies the n-grams, obfuscates the respective n-grams, and provides the modified version of the bitmap image to the remote server system without providing the captured version of the bitmap image to the remote server system, such that the remote server system does not have access to obfuscated n-grams in un-obfuscated form.
6. The medium of any one of embodiments 1-5, wherein: the URI is 16 characters or less; and the remote server system is configured to stop providing access to the modified version of the bitmap image after more than a threshold duration of time elapses from a reference event.
7. The medium of any one of embodiments 1-6, wherein: the remote server system is configured to arrange the modified bitmap image in a collection of boards of an account of the first user, each of the boards having a plurality of bitmap images of screen captures, at least some of which depicting obfuscated region that displayed pattern matching text in a captured version of the respective image.
8. The medium of any one of embodiments 1-7, wherein: the remote server system is configured to selectively grant access to the bitmap image or the modified version of the bitmap image to each of a plurality of members of a team that includes the first user based on determinations that requests for the bitmap image are from members of the team.
9. The medium of embodiment 8, wherein: the remote servers system is configured to determine whether requests are from members of the team and, in response, selectively grant access to the bitmap image to members of the team and grant access to the modified version of the bitmap image to other users who are not members of the team.
10. The medium of any one of embodiments 1-9, wherein: the remote server system is configured to selectively grant access to the bitmap image in response to determining that a requestor is authenticated; and the remote server system is configured to selectively grant access to the modified version of the bitmap image in response to determining that a requestor is not authenticated.
11. The medium of any one of embodiments 1-10, wherein: a plurality of scores are determined for each n-gram based on one or more trained natural language processing models trained on past examples of confidential information and non-confidential information; each score indicates whether the respective n-gram matches to a respective one of the plurality of patterns; and n-grams are classified as in the category for confidential information in response to determining that any scores for the respective n-gram match to a respective one of the plurality of patterns.
12. The medium of any one of embodiments 1-11, wherein: at least some of the patterns match to n-grams within a threshold edit distance of a specified pattern.
13. The medium of any one of embodiments 1-12, wherein: at least some of the patterns are matched, at least in part, by a prefix trie, a bloom filter, or a hash table; the OCR record comprises more than 500 delimited tokens; the plurality of patterns include some patterns matching to n-grams ranging in length up to n of 5; and the plurality of patterns comprises more than 1,000 different patterns.
14. The medium of any one of embodiments 1-13, wherein, in program state: the remote server system hosts accounts for a plurality of tenants; each of at least some of the tenants has a plurality of teams; each of at least some of the teams has a plurality of users; each of at least some of the users or teams have a plurality of boards; each of at least some of the boards have a plurality of screenshots shared by client applications executing on computing devices of the plurality of users with other users on the same team; and each of at least some of the tenants has a respective policy specifying a respective set of patterns by which n-grams are scored, at least one policy specifying the plurality of patterns.
15. The medium of any one of embodiments 1-14, wherein: sentences or paragraphs are collectively scored based on similarity scores from latent semantic analysis comparisons with labeled bodies of text having labels corresponding to at least some of the two or more categories after filtering n-grams based on term-frequency inverse document frequency scores.
16. The medium of any one of embodiments 1-15, wherein: classifying comprises classifying n-grams into a confidentiality hierarchy having three or more levels; and a first class of users are granted access, by the remote server system, to a first version of the bitmap image with n-grams classified in all three levels obfuscated; a second class of users are granted access, by the remote server system, to a second version of the bitmap image with n-grams classified in two of the three levels obfuscated and n-grams classified in one of the three levels obfuscated; and a third class of users are granted access, by the remote server system, to a third version of the bitmap image with n-grams classified in one of the three levels obfuscated and n-grams classified in two of the three levels obfuscated.
17. The medium of any one of embodiments 1-16, comprising; logging classifications of n-grams in a plurality of screenshots taken by a plurality of users; determining risk scores based on the logged classifications; and causing the risk scores to be presented in a user interface.
18. The medium of embodiment 17, wherein: the risk scores include an aggregate team risk score based on logged classifications for a plurality of users in a team; and matches to different patterns are weighted differently based on respective types of confidential information in the risk scores based on the logged classifications or aggregate team risk score.
19. The medium of any one of embodiments 1-18, wherein: modifying pixel values comprises applying a blurring convolution to pixels.
20. The medium of any one of embodiments 1-19, wherein determining coordinates of pixels in the bitmap image corresponding to the respective n-gram comprises: receiving the record in a hierarchical data serialization format; parsing, from the record, a description of text detected by the OCRing and vertices of bounding polygons of respective units of text, the vertices being expressed in pixel coordinates; and designating pixels within at least some bounding polygons as corresponding to the respective n-gram.
21. A tangible, non-transitory, machine-readable medium storing instructions that when executed by one or more processors effectuate operations comprising: receiving, with one or more processors of a server system implementing a confidential-information redaction service, via a network, one or more bitmap images of a display presented by graphical operating system on a computing device distinct from the server system, at least some of the bitmap images displaying at least part of a user interface of an application executed within the graphical operating system; detecting and localizing, with one or more processors of the server system, confidential information depicted in the one or more bitmap images to produce a set of areas to be redacted; modifying, with one or more processors of the server system, pixels in each of the set of areas to be redacted to redact the confidential information and form modified versions of the one or more bitmap images; and storing, with one or more processors of the server system, the modified versions of the one or more bitmap images in memory or outputting, with one or more processors of the server system, the modified versions of the one or more bitmap images to another network-accessible service.
22. The medium of claim 21, wherein detecting and localizing confidential information comprises: causing at least some of the one or more bitmap images to undergo optical character recognition; causing a machine learning model to detect and localize confidential information in at least some of the one or more bitmap images; or causing a computer vision algorithm to detect and localize confidential information in at least some of the one or more bitmap images.
23. The medium of claim 21, wherein detecting and localizing confidential information comprises: causing at least some of the one or more bitmap images to undergo optical character recognition; causing a machine learning model to detect and localize confidential information in at least some of the one or more bitmap images; and causing a computer vision algorithm to detect and localize confidential information in at least some of the one or more bitmap images.
24. The medium of claim 21, wherein: the one or more bitmap images are received via an application-program interface request initiated responsive to a request to upload the one or more bitmap images to a third party network-accessible service; and the operations comprise providing the modified versions of the one or more bitmap images in memory to the third party network-accessible service without revealing the confidential information to the third party network-accessible service.
25. The medium of claim 21, wherein: the one or more bitmap images are frames of a screen cast video from the computing device; or the one or more bitmap images are screen shots from the computing device.
26. A method, comprising: the operations of any one of embodiments 1-25.
27. A system, comprising: one or more processors; and memory storing instructions that when executed by the processors cause the processors to effectuate operations comprising: the operations of any one of embodiments 1-25.
Claims (27)
Priority Applications (1)
| Application Number | Priority Date | Filing Date | Title |
|---|---|---|---|
| US16/050,219 US10846573B2 (en) | 2018-07-31 | 2018-07-31 | Detecting, redacting, and scoring confidential information in images |
Applications Claiming Priority (1)
| Application Number | Priority Date | Filing Date | Title |
|---|---|---|---|
| US16/050,219 US10846573B2 (en) | 2018-07-31 | 2018-07-31 | Detecting, redacting, and scoring confidential information in images |
Publications (2)
| Publication Number | Publication Date |
|---|---|
| US20200042837A1 US20200042837A1 (en) | 2020-02-06 |
| US10846573B2 true US10846573B2 (en) | 2020-11-24 |
Family
ID=69228125
Family Applications (1)
| Application Number | Title | Priority Date | Filing Date |
|---|---|---|---|
| US16/050,219 Active US10846573B2 (en) | 2018-07-31 | 2018-07-31 | Detecting, redacting, and scoring confidential information in images |
Country Status (1)
| Country | Link |
|---|---|
| US (1) | US10846573B2 (en) |
Cited By (5)
| Publication number | Priority date | Publication date | Assignee | Title |
|---|---|---|---|---|
| US11237854B2 (en) | 2018-10-22 | 2022-02-01 | Citrix Systems, Inc. | Providing a virtual desktop within a computing environment |
| US11531703B2 (en) * | 2019-06-28 | 2022-12-20 | Capital One Services, Llc | Determining data categorizations based on an ontology and a machine-learning model |
| US20230367904A1 (en) * | 2022-05-11 | 2023-11-16 | Sick Ag | Method and apparatus for the anonymized image acquisition in an industrial plant |
| US20240095447A1 (en) * | 2022-06-22 | 2024-03-21 | Nvidia Corporation | Neural network-based language restriction |
| US11949840B1 (en) * | 2022-11-11 | 2024-04-02 | Kyocera Document Solutions Inc. | Redacting confidential information in a document and reversal thereof |
Families Citing this family (58)
| Publication number | Priority date | Publication date | Assignee | Title |
|---|---|---|---|---|
| US10755090B2 (en) * | 2018-03-16 | 2020-08-25 | Open Text Corporation | On-device partial recognition systems and methods |
| US10742725B2 (en) * | 2018-05-04 | 2020-08-11 | Citrix Systems, Inc. | Detection and repainting of semi-transparent overlays |
| US11030349B2 (en) * | 2018-10-26 | 2021-06-08 | International Business Machines Corporation | Secure data display |
| US11093620B2 (en) | 2018-11-02 | 2021-08-17 | ThreatConnect, Inc. | Ahead of time application launching for cybersecurity threat intelligence of network security events |
| US11317005B2 (en) * | 2018-12-11 | 2022-04-26 | GlassBox Ltd. | System and method for determining compression rates for images comprising text |
| US11663405B2 (en) * | 2018-12-13 | 2023-05-30 | Microsoft Technology Licensing, Llc | Machine learning applications for temporally-related events |
| US11489818B2 (en) * | 2019-03-26 | 2022-11-01 | International Business Machines Corporation | Dynamically redacting confidential information |
| US11126746B2 (en) * | 2019-03-28 | 2021-09-21 | The Toronto-Dominion Bank | Dynamic security controls for data sharing between systems |
| CN111800671B (en) * | 2019-04-08 | 2022-08-12 | 百度时代网络技术(北京)有限公司 | Method and apparatus for aligning paragraphs and video |
| US11949677B2 (en) * | 2019-04-23 | 2024-04-02 | Microsoft Technology Licensing, Llc | Resource access based on audio signal |
| EP3984207A1 (en) * | 2019-06-12 | 2022-04-20 | Koninklijke Philips N.V. | Dynamically modifying functionality of a real-time communications session |
| US11620570B2 (en) * | 2019-06-24 | 2023-04-04 | Kyndkyl, Inc. | Self-learning ontology-based cognitive assignment engine |
| US11281794B2 (en) * | 2019-09-26 | 2022-03-22 | Microsoft Technology Licensing, Llc | Fine grained access control on procedural language for databases based on accessed resources |
| US11755775B2 (en) * | 2019-11-07 | 2023-09-12 | International Business Machines Corporation | Upload management |
| US11321956B1 (en) * | 2019-12-03 | 2022-05-03 | Ciitizen, Llc | Sectionizing documents based on visual and language models |
| US11263388B2 (en) * | 2020-02-17 | 2022-03-01 | Wipro Limited | Method and system for dynamically generating summarised content for visual and contextual text data |
| US11863573B2 (en) | 2020-03-06 | 2024-01-02 | ThreatConnect, Inc. | Custom triggers for a network security event for cybersecurity threat intelligence |
| US11711372B2 (en) * | 2020-03-16 | 2023-07-25 | AVAST Software s.r.o. | Network resource privacy negotiation system and method |
| US11468142B1 (en) | 2020-03-21 | 2022-10-11 | Menlo Security, Inc. | Managing content uploads |
| CN111899184B (en) * | 2020-03-31 | 2023-11-28 | 珠海市杰理科技股份有限公司 | Image defect repair, neural network training methods, devices, equipment and systems |
| CN111652272B (en) * | 2020-04-27 | 2024-05-28 | 中国平安财产保险股份有限公司 | Image processing method and device, computer equipment and storage medium |
| US11216621B2 (en) * | 2020-04-29 | 2022-01-04 | Vannevar Labs, Inc. | Foreign language machine translation of documents in a variety of formats |
| US11411918B2 (en) * | 2020-05-26 | 2022-08-09 | Microsoft Technology Licensing, Llc | User interface for web server risk awareness |
| US12149516B2 (en) * | 2020-06-02 | 2024-11-19 | Flex Integration, LLC | System and methods for tokenized hierarchical secured asset distribution |
| US10949961B1 (en) * | 2020-06-03 | 2021-03-16 | Netskope, Inc. | Detecting screenshot images for protecting against loss of sensitive screenshot-borne data |
| US11144669B1 (en) * | 2020-06-11 | 2021-10-12 | Cognitive Ops Inc. | Machine learning methods and systems for protection and redaction of privacy information |
| US11822697B2 (en) * | 2020-08-17 | 2023-11-21 | Paypal, Inc. | Dynamic pixel display in electronic communications to enhance data security |
| US11663842B2 (en) * | 2020-11-05 | 2023-05-30 | Jpmorgan Chase Bank, N.A. | Method and system for tabular information extraction |
| US11436357B2 (en) * | 2020-11-30 | 2022-09-06 | Lenovo (Singapore) Pte. Ltd. | Censored aspects in shared content |
| US11822599B2 (en) * | 2020-12-16 | 2023-11-21 | International Business Machines Corporation | Visualization resonance for collaborative discourse |
| US11790110B2 (en) * | 2021-02-09 | 2023-10-17 | Nice Ltd. | System and method for preventing sensitive information from being recorded |
| US12107879B2 (en) * | 2021-02-17 | 2024-10-01 | Microsoft Technology Licensing, Llc | Determining data risk and managing permissions in computing environments |
| US12047355B2 (en) * | 2021-03-08 | 2024-07-23 | Adobe Inc. | Machine learning techniques for mitigating aggregate exposure of identifying information |
| US11550934B2 (en) * | 2021-03-16 | 2023-01-10 | Check Point Software Technologies, Ltd. | Systems and methods for the efficient detection of improperly redacted electronic documents |
| US11496738B1 (en) * | 2021-03-24 | 2022-11-08 | Amazon Technologies, Inc. | Optimized reduced bitrate encoding for titles and credits in video content |
| US11354485B1 (en) * | 2021-05-13 | 2022-06-07 | iCIMS, Inc. | Machine learning based classification and annotation of paragraph of resume document images based on visual properties of the resume document images, and methods and apparatus for the same |
| US11706381B2 (en) * | 2021-05-24 | 2023-07-18 | Getac Technology Corporation | Selective obfuscation of objects in media content |
| JP7301905B2 (en) * | 2021-05-24 | 2023-07-03 | 株式会社日立製作所 | Integrated system and method |
| GB2609708B (en) * | 2021-05-25 | 2023-10-25 | Samsung Electronics Co Ltd | Method and apparatus for video recognition |
| CN113221846B (en) * | 2021-06-07 | 2025-05-30 | 北京百度网讯科技有限公司 | Image recognition method, device, equipment, storage medium and program product |
| US12229246B2 (en) | 2021-06-25 | 2025-02-18 | ThreatConnect, Inc. | Browser extension for cybersecurity threat intelligence and response |
| US11985144B2 (en) * | 2021-06-25 | 2024-05-14 | ThreatConnect, Inc. | Browser extension for cybersecurity threat intelligence and response |
| US11475158B1 (en) | 2021-07-26 | 2022-10-18 | Netskope, Inc. | Customized deep learning classifier for detecting organization sensitive data in images on premises |
| KR102634376B1 (en) * | 2021-07-30 | 2024-02-05 | 주식회사 카카오 | Method for providing subscription service, system, user device, and application implementing the method |
| US20230081934A1 (en) * | 2021-09-15 | 2023-03-16 | Shimadzu Corporation | Management device for material testing machine, management system for material testing machine, and management method for material testing machine |
| US12164676B2 (en) * | 2021-09-22 | 2024-12-10 | Ridgeline, Inc. | Enabling an action based on a permission identifier for real-time identity resolution in a distributed system |
| US11989310B2 (en) | 2021-12-14 | 2024-05-21 | Royal Bank Of Canada | Method and system for facilitating identification of electronic data exfiltration |
| US12400032B2 (en) * | 2022-01-26 | 2025-08-26 | Material Security Inc. | One-shot challenge to search and access unredacted vaulted electronic communications |
| JP2023161622A (en) * | 2022-04-26 | 2023-11-08 | 株式会社野村総合研究所 | Audit apparatus and audit method |
| US12230245B2 (en) * | 2022-08-31 | 2025-02-18 | Nvidia Corporation | Text normalization and inverse text normalization using weighted finite-state transducers and neural language models |
| US12373559B1 (en) | 2022-10-03 | 2025-07-29 | Menlo Security, Inc. | Secure archive explorer |
| US20240119695A1 (en) * | 2022-10-05 | 2024-04-11 | Microsoft Technology Licensing, Llc | Generation of emphasis image with emphasis boundary |
| US12124608B2 (en) * | 2022-10-27 | 2024-10-22 | Nice Ltd. | Computerized-method and computerized-system for sensitive data redaction from screenshots |
| US12289430B2 (en) * | 2023-01-03 | 2025-04-29 | Capital One Services, Llc | Instant optical character recognition during upload process |
| US20240241500A1 (en) * | 2023-01-12 | 2024-07-18 | Sap Se | Dynamic data masking for graphical and textual content in robotic process automation |
| US12013945B1 (en) * | 2023-10-27 | 2024-06-18 | Morgan Stanley Services Group Inc. | Fraudulent overlay detection in electronic documents |
| US12105844B1 (en) * | 2024-03-29 | 2024-10-01 | HiddenLayer, Inc. | Selective redaction of personally identifiable information in generative artificial intelligence model outputs |
| CN120071858B (en) * | 2025-04-30 | 2025-08-05 | 北京天河地塬安防技术服务有限公司 | Anti-shooting screen display control method and system based on dynamic grating interference |
Citations (13)
| Publication number | Priority date | Publication date | Assignee | Title |
|---|---|---|---|---|
| US20070030528A1 (en) | 2005-07-29 | 2007-02-08 | Cataphora, Inc. | Method and apparatus to provide a unified redaction system |
| US20080175377A1 (en) | 2007-01-22 | 2008-07-24 | Global Crypto Systems | Methods and Systems for Digital Authentication Using Digitally Signed Images |
| US20080204788A1 (en) | 2004-10-14 | 2008-08-28 | Onstream Systems Limited | Process for Electronic Document Redaction |
| US20100097662A1 (en) | 2008-10-20 | 2010-04-22 | John Eric Churilla | System and method for scanning and processing printed media |
| US20100131551A1 (en) * | 2008-11-19 | 2010-05-27 | Theladders.Com, Inc. | System and method for managing confidential information |
| US20100322373A1 (en) | 2009-01-14 | 2010-12-23 | John Eric Churilla | System and method for scanning and processing printed media |
| US20120265762A1 (en) | 2010-10-06 | 2012-10-18 | Planet Data Solutions | System and method for indexing electronic discovery data |
| US20140123237A1 (en) | 2012-10-25 | 2014-05-01 | Edward J. Gaudet | Secure content sharing |
| US20140281871A1 (en) | 2013-03-15 | 2014-09-18 | Meditory Llc | Method for mapping form fields from an image containing text |
| US20140278366A1 (en) | 2013-03-12 | 2014-09-18 | Toytalk, Inc. | Feature extraction for anonymized speech recognition |
| US20150220626A1 (en) * | 2014-01-31 | 2015-08-06 | Verint Systems Ltd. | Automated Removal of Private Information |
| US20170124347A1 (en) * | 2015-11-04 | 2017-05-04 | Ricoh Company, Ltd. | Information processing apparatus, information processing method, and recording medium |
| US20180275751A1 (en) * | 2017-03-21 | 2018-09-27 | Microsoft Technology Licensing, Llc | Index, search, and retrieval of user-interface content |
-
2018
- 2018-07-31 US US16/050,219 patent/US10846573B2/en active Active
Patent Citations (16)
| Publication number | Priority date | Publication date | Assignee | Title |
|---|---|---|---|---|
| US20080204788A1 (en) | 2004-10-14 | 2008-08-28 | Onstream Systems Limited | Process for Electronic Document Redaction |
| US20070030528A1 (en) | 2005-07-29 | 2007-02-08 | Cataphora, Inc. | Method and apparatus to provide a unified redaction system |
| US20080175377A1 (en) | 2007-01-22 | 2008-07-24 | Global Crypto Systems | Methods and Systems for Digital Authentication Using Digitally Signed Images |
| US20100097662A1 (en) | 2008-10-20 | 2010-04-22 | John Eric Churilla | System and method for scanning and processing printed media |
| US20100131551A1 (en) * | 2008-11-19 | 2010-05-27 | Theladders.Com, Inc. | System and method for managing confidential information |
| US20100322373A1 (en) | 2009-01-14 | 2010-12-23 | John Eric Churilla | System and method for scanning and processing printed media |
| US20120265762A1 (en) | 2010-10-06 | 2012-10-18 | Planet Data Solutions | System and method for indexing electronic discovery data |
| US20150055867A1 (en) | 2010-10-06 | 2015-02-26 | Planet Data Solutions | System and method for indexing electronic discovery data |
| US20170308528A1 (en) | 2010-10-06 | 2017-10-26 | Planet Data Solutions | System and method for indexing electronic discovery data |
| US20170126652A1 (en) | 2012-10-25 | 2017-05-04 | Edward J. Gaudet | Secure content sharing |
| US20140123237A1 (en) | 2012-10-25 | 2014-05-01 | Edward J. Gaudet | Secure content sharing |
| US20140278366A1 (en) | 2013-03-12 | 2014-09-18 | Toytalk, Inc. | Feature extraction for anonymized speech recognition |
| US20140281871A1 (en) | 2013-03-15 | 2014-09-18 | Meditory Llc | Method for mapping form fields from an image containing text |
| US20150220626A1 (en) * | 2014-01-31 | 2015-08-06 | Verint Systems Ltd. | Automated Removal of Private Information |
| US20170124347A1 (en) * | 2015-11-04 | 2017-05-04 | Ricoh Company, Ltd. | Information processing apparatus, information processing method, and recording medium |
| US20180275751A1 (en) * | 2017-03-21 | 2018-09-27 | Microsoft Technology Licensing, Llc | Index, search, and retrieval of user-interface content |
Non-Patent Citations (4)
| Title |
|---|
| Kotsarenko, "Measuring perceived color difference using YIQ NTSC transmission color space in mobile applications", 2010, http://www.progmat.uaem.mx:8080/artVol2Num2/Articulo3Vol2Num2.pdf, pp. 1 to 17. |
| Related U.S. Appl. No. 16/050,071, filed Jul. 31, 2018, pp. 1 to 65. |
| Shagan et al., "Video redaction: a survey and comparison of enabling technologies", Jul. 20, 2017, https://www.spiedigitallibrary.org/journals/journal-of-electronic-imaging/volume-26/issue-05/051406/Video-redaction-a-survey-and-comparison-of-enabling-technologies/10.1117/11EI.26.5.051406.full?SSO=1, pp. 1 to. |
| Vysniauskas, "Anti-aliased Pixel and Intensity Slope Detector", Oct. 2009, https://www.researchgate.net/publication/234126755_Anti-aliased_Pixel_and_Intensity_Slope_Detector, pp. 1 to 16. |
Cited By (6)
| Publication number | Priority date | Publication date | Assignee | Title |
|---|---|---|---|---|
| US11237854B2 (en) | 2018-10-22 | 2022-02-01 | Citrix Systems, Inc. | Providing a virtual desktop within a computing environment |
| US11531703B2 (en) * | 2019-06-28 | 2022-12-20 | Capital One Services, Llc | Determining data categorizations based on an ontology and a machine-learning model |
| US12056188B2 (en) | 2019-06-28 | 2024-08-06 | Capital One Services, Llc | Determining data categorizations based on an ontology and a machine-learning model |
| US20230367904A1 (en) * | 2022-05-11 | 2023-11-16 | Sick Ag | Method and apparatus for the anonymized image acquisition in an industrial plant |
| US20240095447A1 (en) * | 2022-06-22 | 2024-03-21 | Nvidia Corporation | Neural network-based language restriction |
| US11949840B1 (en) * | 2022-11-11 | 2024-04-02 | Kyocera Document Solutions Inc. | Redacting confidential information in a document and reversal thereof |
Also Published As
| Publication number | Publication date |
|---|---|
| US20200042837A1 (en) | 2020-02-06 |
Similar Documents
| Publication | Publication Date | Title |
|---|---|---|
| US10846573B2 (en) | Detecting, redacting, and scoring confidential information in images | |
| US10347293B1 (en) | Detecting, redacting, and scoring confidential information in video | |
| US11797634B2 (en) | System and method for providing a content item based on computer vision processing of images | |
| Cao et al. | Coverless information hiding based on the generation of anime characters | |
| US11195099B2 (en) | Detecting content items in violation of an online system policy using semantic vectors | |
| CN114579876B (en) | False information detection method, device, equipment and medium | |
| von Pape et al. | Privacy by disaster? Press coverage of privacy and digital technology | |
| Sebyakin et al. | Spatio-temporal deepfake detection with deep neural networks | |
| EP4377907A1 (en) | Customized deep learning classifier for detecting organization sensitive data in images on premises | |
| JP2022003517A (en) | Detection of image-derived identification documents to protect sensitive information | |
| CN117690002A (en) | Information interaction methods, devices, electronic equipment and storage media | |
| US9853982B2 (en) | Image-based group profiles | |
| Fadhil et al. | Enhancing data security using Laplacian of Gaussian and Chacha20 encryption algorithm | |
| Cheung et al. | Evaluating the privacy risk of user-shared images | |
| Roberts et al. | Image2struct: Benchmarking structure extraction for vision-language models | |
| US10891463B2 (en) | Signature match system and method | |
| Chang et al. | A framework for detecting fake news by identifying fake text messages and forgery images | |
| US20250173438A1 (en) | System and method for detecting prompt injection attacks to large language models | |
| Samal et al. | ASYv3: Attention‐enabled pooling embedded Swin transformer‐based YOLOv3 for obscenity detection | |
| US20220377108A1 (en) | Method and device for clustering phishing web resources based on visual content image | |
| US10938881B2 (en) | Data engagement for online content and social networks | |
| Hodges et al. | Forensic analysis of memetic image propagation: introducing the SMOC BRISQUEt method | |
| CN116432210A (en) | File management method and system based on security protection | |
| Li et al. | Research on night tourism recommendation based on Intelligent Image Processing Technology | |
| US12001486B2 (en) | Identifying reference data in a source data set |
Legal Events
| Date | Code | Title | Description |
|---|---|---|---|
| FEPP | Fee payment procedure |
Free format text: ENTITY STATUS SET TO UNDISCOUNTED (ORIGINAL EVENT CODE: BIG.); ENTITY STATUS OF PATENT OWNER: SMALL ENTITY |
|
| FEPP | Fee payment procedure |
Free format text: ENTITY STATUS SET TO SMALL (ORIGINAL EVENT CODE: SMAL); ENTITY STATUS OF PATENT OWNER: SMALL ENTITY |
|
| AS | Assignment |
Owner name: DROPLR, INC., OREGON Free format text: ASSIGNMENT OF ASSIGNORS INTEREST;ASSIGNORS:SKINNER, GRAY;NUNNINK, LEVI;REEL/FRAME:047673/0187 Effective date: 20180730 |
|
| STPP | Information on status: patent application and granting procedure in general |
Free format text: EX PARTE QUAYLE ACTION MAILED |
|
| AS | Assignment |
Owner name: TRIANGLE DIGITAL VENTURES II, LLC, COLORADO Free format text: ASSET PURCHASE AGREEMENT;ASSIGNOR:DROPLR, INC.;REEL/FRAME:052049/0736 Effective date: 20190708 |
|
| STPP | Information on status: patent application and granting procedure in general |
Free format text: RESPONSE TO EX PARTE QUAYLE ACTION ENTERED AND FORWARDED TO EXAMINER |
|
| STPP | Information on status: patent application and granting procedure in general |
Free format text: NOTICE OF ALLOWANCE MAILED -- APPLICATION RECEIVED IN OFFICE OF PUBLICATIONS |
|
| STCB | Information on status: application discontinuation |
Free format text: ABANDONED -- FAILURE TO PAY ISSUE FEE |
|
| STPP | Information on status: patent application and granting procedure in general |
Free format text: PUBLICATIONS -- ISSUE FEE PAYMENT VERIFIED |
|
| STCF | Information on status: patent grant |
Free format text: PATENTED CASE |
|
| AS | Assignment |
Owner name: COVIDEO LLC, COLORADO Free format text: ASSIGNMENT OF ASSIGNORS INTEREST;ASSIGNOR:TRIANGLE DIGITAL VENTURES II, LLC;REEL/FRAME:057488/0901 Effective date: 20210830 |
|
| AS | Assignment |
Owner name: STERLING NATIONAL BANK, NEW YORK Free format text: SECURITY INTEREST;ASSIGNOR:COVIDEO LLC;REEL/FRAME:060662/0465 Effective date: 20211112 |
|
| MAFP | Maintenance fee payment |
Free format text: PAYMENT OF MAINTENANCE FEE, 4TH YR, SMALL ENTITY (ORIGINAL EVENT CODE: M2551); ENTITY STATUS OF PATENT OWNER: SMALL ENTITY Year of fee payment: 4 |