US20230153437A1 - Proactive browser content analysis - Google Patents
Proactive browser content analysis Download PDFInfo
- Publication number
- US20230153437A1 US20230153437A1 US18/158,218 US202318158218A US2023153437A1 US 20230153437 A1 US20230153437 A1 US 20230153437A1 US 202318158218 A US202318158218 A US 202318158218A US 2023153437 A1 US2023153437 A1 US 2023153437A1
- Authority
- US
- United States
- Prior art keywords
- content
- malware
- browser
- browser engine
- engine
- Prior art date
- Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
- Pending
Links
- 238000004458 analytical method Methods 0.000 title abstract description 36
- 239000011814 protection agent Substances 0.000 claims abstract description 27
- 238000000034 method Methods 0.000 claims description 37
- 239000003795 chemical substances by application Substances 0.000 claims description 5
- 230000001172 regenerating effect Effects 0.000 claims 1
- 238000012545 processing Methods 0.000 abstract description 7
- 230000004044 response Effects 0.000 abstract description 7
- 230000008569 process Effects 0.000 description 23
- 238000013459 approach Methods 0.000 description 13
- 230000004048 modification Effects 0.000 description 10
- 238000012986 modification Methods 0.000 description 10
- 230000006870 function Effects 0.000 description 6
- 238000013515 script Methods 0.000 description 6
- 230000008901 benefit Effects 0.000 description 5
- 230000006837 decompression Effects 0.000 description 4
- 238000010586 diagram Methods 0.000 description 4
- 238000009877 rendering Methods 0.000 description 4
- 230000006872 improvement Effects 0.000 description 3
- 238000012795 verification Methods 0.000 description 3
- 230000008520 organization Effects 0.000 description 2
- VYZAMTAEIAYCRO-UHFFFAOYSA-N Chromium Chemical compound [Cr] VYZAMTAEIAYCRO-UHFFFAOYSA-N 0.000 description 1
- 241000700605 Viruses Species 0.000 description 1
- 238000013473 artificial intelligence Methods 0.000 description 1
- 238000013528 artificial neural network Methods 0.000 description 1
- 230000006399 behavior Effects 0.000 description 1
- 230000009286 beneficial effect Effects 0.000 description 1
- 238000004422 calculation algorithm Methods 0.000 description 1
- 238000004140 cleaning Methods 0.000 description 1
- 230000006835 compression Effects 0.000 description 1
- 238000007906 compression Methods 0.000 description 1
- 230000001419 dependent effect Effects 0.000 description 1
- 238000013461 design Methods 0.000 description 1
- 238000000605 extraction Methods 0.000 description 1
- 238000010191 image analysis Methods 0.000 description 1
- 230000003993 interaction Effects 0.000 description 1
- 230000002452 interceptive effect Effects 0.000 description 1
- 230000002093 peripheral effect Effects 0.000 description 1
- 238000011012 sanitization Methods 0.000 description 1
Images
Classifications
-
- G—PHYSICS
- G06—COMPUTING; CALCULATING OR COUNTING
- G06F—ELECTRIC DIGITAL DATA PROCESSING
- G06F21/00—Security arrangements for protecting computers, components thereof, programs or data against unauthorised activity
- G06F21/50—Monitoring users, programs or devices to maintain the integrity of platforms, e.g. of processors, firmware or operating systems
- G06F21/55—Detecting local intrusion or implementing counter-measures
- G06F21/56—Computer malware detection or handling, e.g. anti-virus arrangements
- G06F21/566—Dynamic detection, i.e. detection performed at run-time, e.g. emulation, suspicious activities
Definitions
- An exemplary aspect of the present invention generally relates to computer system management.
- an exemplary aspect relates to systems and methods for controlling pestware or malware or other undesirable or unwanted applications and/or instructions.
- Malware Personal computers and business computers are continually attacked by viruses, trojans, spyware, adware, etc., collectively referred to as “malware” or “pestware.” These types of programs generally act to gather information about a person or organization--often without the person or organization's knowledge. Some pestware is highly malicious. Other pestware is non-malicious but may cause issues with privacy or system performance. And yet other pestware is actually beneficial or wanted by the user.
- Pestware is sometimes not characterized as “pestware” or “spyware.” But, unless specified otherwise, “pestware” or “malware” as used herein refers to any program that is malicious in some way and/or collects and/or reports information about a person or an organization and any “watcher processes” related to the pestware or malware.
- a protection module operates to analyze threats, at the protocol level (e.g., at the HTML level), by intercepting all requests that a browser engine resident in a computing device sends and receives, and the protection agent completes the requests without the help of the browser engine.
- the protocol level e.g., at the HTML level
- the protection module analyzes and/or modifies the completed data before the browser engine has access to it, to, for example, display it.
- the protection module After performing all of its processing, removing, and/or adding any code as needed, the protection module provides the HTML content to the browser engine, and the browser engine receives responses from the protection agent as if it was speaking to an actual web server, when in fact, browser engine is speaking to an analysis engine of the protection module.
- protection module This allows the protection module to have control over what a browser engine “sees,” providing means to remove any exploits, malware, and other threats dynamically. This also enables the protection module to add content into the browser stream at the HTML level, before receipt by the browser.
- search engine results e.g., results provided by Google®
- Yahoo®, and Bing® are annotated/updated/amended by the protection module—within the HTML code—to denote if a particular website is legitimate or malicious. For example, a legitimate link in the search results may be depicted in connection with a green check mark and a suspect link may be depicted with a red cross. (Of course other indicators could also be used that identify to a user whether or not a link is “good,” “bad,” or “unknown.)
- the protocol-level analysis approach may also be used in connection with anti-phishing and URL analysis among other types of analysis.
- the differences between the disclosed protocol-level analysis approach compared to other prior anti-malware approaches are significant.
- the data e.g., a web page of search results
- an analysis engine of the protection module which has control over every element of a web page before the web page is operated on by the browser engine.
- This is in contrast to prior approaches that just make high-level modifications to the content after the content has been rendered and displayed through a Browser Helper Object.
- the protection module When the protection module receives content from a web server, the protection module then, if necessary, decrypts and decompresses the web content and then assembles the requested web page (e.g., in a decrypted and decompressed HTML format that the web page existed in at the remote server). The protection module then analyzes the web page to determine whether the web page includes links that may lead to sites hosting malware or whether the web page itself includes malware. The analysis of the assembled web page may include communicating with a remote security center so that a malware management analysis may be performed to analyze one or more portions of the content of the assembled web page and/or the protection module itself may perform analysis of content of the assembled webpage. The analyzed webpage can then be forwarded to the web browser for display to a user.
- FIG. 1 illustrates an exemplary embodiment of a computing environment according to an exemplary embodiment.
- FIG. 2 illustrates an exemplary embodiment of systems and operations at the remote computer.
- FIG. 3 is a flowchart illustrating exemplary process flow at the remote computer.
- FIG. 4 illustrates an exemplary embodiment of one of the computers in FIG. 1 .
- FIG. 1 it is a block diagram depicting an environment in which several embodiments of the invention may be implemented.
- a security center 102 also referred to herein as a “central” or “base” computer
- remote user 104 operating a remote computer 105 a malware source 106
- a web server 108 a web server 108
- networks e.g., the Internet and/or local or wide area networks
- security center 102 each of these logically represents a potentially unlimited number of persons, entities and/or computers or computing resources.
- the remote user 104 may be an individual or a business enterprise that operates the remote computer 105 , which may each be a personal computer, a server of any type, a PDA, mobile phone, tablet, netbook, an interactive television, or any other device capable of loading and operating computer objects.
- the malware source 106 generally represents a source of malware that ends up or is strategically placed at the web server 108 , which may or may not be suspected of hosting malware.
- the malware source 106 may generate a malware object in a variety of forms including in a scripting language such as ECMAscript-based scripting languages (e.g., JavaScript or Adobe Flash), but the malware source may generate other types of objects such as computer files, part of a file or a sub-program, an instruction(s), macro, web page or any other piece of code to be operated by or on the computer, or any other event whether executed, emulated, simulated or interpreted.
- ECMAscript-based scripting languages e.g., JavaScript or Adobe Flash
- the security center 102 is disposed and configured to be accessible to the user 104 so that, as discussed further herein, the security center 102 may facilitate the management of malware on the remote computer 104 .
- the security center 102 operates according to a Software as a Service (SaaS) business model to generally provide Web security services “in the cloud.”
- SaaS Software as a Service
- the exemplary security center 102 includes a malware management portion 112 that is coupled to a data store 114 .
- the security center 102 may also include components that provide other services (e.g., internet policy enforcement, in/outbound content control, application control, compliance-related services, etc.).
- the security center 102 is generally configured to obtain information about malware threats and to be a resource for the remote computer 105 to enable the remote computer to manage malware threats more effectively and efficiently. It should be noted that that the malware management component 112 and data store 114 are presented for convenience as single entities, but the security center 102 can be scaled and comprised of multiple geographically distributed computers and servers, etc., and the data store can be made up multiple databases and storage distributed around this central system and/or be located in a cloud-type environment.
- the malware management 112 component of the security center 102 may maintain the data store 114 as a community database that is populated, over time, with information relating to each object run on all of the connected remote computers as disclosed in US A 2007/0016953, published 18 Jan. 2007, entitled “METHODS AND APPARATUS FOR DEALING WITH MALWARE,” the entire contents of which are hereby incorporated herein by reference.
- data representative of each malware object may take the form of a so-called signature or key relating to and/or identifying the object, its attributes and/or behavior(s).
- the protection agent 116 in this embodiment operates to analyze threats, at the protocol level (e.g., at the HTML level), by intercepting all requests that the browser engine 118 sends and receives, and the protection agent 116 completes the requests without the help of the browser engine 118 . And then the protection agent 116 analyzes and/or modifies the completed data before the browser engine 118 has access to it. After performing all of its processing, removing, and/or adding any code as needed, the protection agent feeds the HTML content back to the browser engine 118 , and the browser engine 118 receives responses from the protection agent 116 as if it was “speaking” to an actual web server (e.g., web server 108 ) when in fact, it is speaking to an analysis engine of the protection agent 116 .
- the protocol level e.g., at the HTML level
- protection agent 116 This allows the protection agent 116 to have full control over what the browser engine 118 “sees,” providing means to remove any exploits, malware, and other threats dynamically. This also enables the protection agent 116 to add content into the browser stream at the HTML level. Stated another way, the protection agent 116 caches web content requested by a browser, analyzes and/or modifies the retrieved web content, and provides a clean or sanitized version of the web content, free of malware, to the browser.
- search engine results are annotated by the protection agent 116 —within the HTML code—to denote if a particular website is legitimate or malicious. For example, a legitimate link in the search results may be depicted in connection with a green check mark and a suspect link may be depicted with a red cross.
- the protocol-level analysis approach may also be used in connection with anti-phishing and URL analysis among other types of analysis.
- the differences between the protocol-level analysis approach disclosed herein as compared to other prior anti-malware approaches are significant.
- the data e.g., a web page of search results
- an analysis engine of the protection agent 116 which has full control over every element of the web page before the page is operated on by the browser engine 118 .
- This is in contrast to prior approaches that just make high-level modifications to the content after it has been displayed through a Browser Helper Object.
- the present protocol-level approach there is virtually no performance overhead, and in many cases, there is actually a performance improvement when performing the browser content analysis.
- the protection agent 116 When the protection agent 116 receives content from the web server 108 , the protection agent 116 then, if necessary, decrypts and decompresses the web content and then assembles the requested web page (e.g., in a decrypted and decompressed HTML format that the web page existed in at the remote server 108 ). The protection agent 116 then analyzes the web page to determine whether the web page includes links that may lead to sites hosting malware or whether the web page itself includes malware. The analysis of the assembled web page may include communicating with the security center 102 so that the malware management component 112 may analyze one or more portions of the content of the assembled web page and/or the protection agent 116 itself may perform analysis of content of the assembled webpage.
- the cleaning process that the protection agent 116 carries out takes place through a highly optimized routine written in, for example, raw C with an inline assembler reducing processing effort within the browser engine 118 itself by decrypting/decompressing/de-encoding any of the content outside of the browser engine 118 .
- Having full control at this level also means that complex inferential algorithms can be applied to the browser content as a whole, taking into account any external script/image links, to build an in-memory picture of the final content before it is rendered to the user by the browser on a display (not shown).
- This general operation can be extended to remove/modify any form of content, whether illicit images, ads, fake password request forms, malicious exploits, cross site scripting attacks (XSS), etc.
- FIG. 2 shown is a block diagram depicting exemplary functional components and operations that reside and take place on the remote computer 105 at the location of the remote user 104 .
- the browser-related processes and protection-agent-related processes are separated so that the interaction between the two types of processes may be more clearly understood.
- FIG. 3 depicts exemplary aspects of the process flow that occurs in connection with the components depicted in FIG. 2 . Depicted next to each of the blocks in FIG. 3 is either a “K” or a “U,” which indicate that the operation is implemented at the kernel or user-level, respectively. It should be recognized, however, that the operations described with reference to FIGS.
- FIGS. 2 and 3 may be implemented with some kernel-mode operations being implemented at the user-level and vice versa.
- the illustrated arrangement of components in FIGS. 2 and 3 is logical, and is not meant to be an actual hardware diagram. Thus, many of the components can be combined and/or further separated in an actual implementation.
- the browser processes in this embodiment include agent processes 220 that are installed on the remote computer 105 in connection with the protection agent 116 so that functions of the protection agent 116 are integrated with a typical browser engine 218 .
- the agent processes 220 of the browser processes are implemented by additional code that is wrapped around a typical browser engine 218 to intercept what is requested and received.
- these agent processes 220 enable all content that is requested by an application 222 (e.g., a browser) and received by the browser engine 218 to be intercepted.
- a connection request 360 is initiated as a POST/GET request 362 to a website (e.g., hosted by the webserver 108 ) and the analysis engine 224 looks at the context of the request to assess whether the request is the first request in a session (Block 364 ), and if the request is the first request, a determination is made as to whether the request is associated with known, malicious content (Block 366 ), and if so, the request is blocked (Blocks 368 , 370 ).
- the analysis engine 224 accesses the security center 102 via the Internet and the security center 102 is utilized to facilitate whether the request is a request for known malicious content (e.g., the URL of the request may be compared to a black list of URLs). But the analysis engine 224 may also include some malware checking functionality locally. As shown, if the request is not blocked, the request may be pre-processed by the content acquisition component 226 (Block 372 ) (e.g., to set aside sufficient memory in RAM in anticipation of the content from the website being received) before the request is sent to the destination website 108 .
- the content acquisition component 226 Block 372
- the content acquisition component 226 when the first response is received by the content acquisition component 226 (Block 374 ), if the response is not complete, the response is stored in memory (Block 376 ), and the next request is sent to the destination website 108 (Block 378 ). In this way, the content acquisition component 226 continues to obtain content from the website 108 (Blocks 374 , 376 , 378 ) until the web page is complete, and the complete page is held in memory by the content acquisition component 226 .
- the initial request 360 by the application e.g., browser
- the content acquisition component 226 iteratively sends requests and receives content (Blocks 374 , 376 , 378 ) from the website 108 until the requested content has been completely received.
- This is very different from the ordinary operation of the browser engine 218 , which would (if unaltered by the protection agent 116 ) obtain the content from the webserver 108 itself by way of a series of GET requests.
- the content is passed along to the service process component 228 via the user process component 230 , and the data is then preprocessed (Block 380 ), if necessary, to decrypt, remove chunks, and decompress the gathered content.
- Block 380 the data is then preprocessed.
- a web page When a web page is received from the webserver 108 , it may be compressed, chunked encoded, and encrypted, and decompressing, decoding and unencrypting the content enables a complete picture of the webpage looked like before if any of the these forms of obfuscation were applied to the content by the webserver 108 .
- One of ordinary skill in the art will appreciate that various layers of encoding and compression may be applied to content, but for simplicity, these known details are not included herein for clarity.
- the browser page may be analyzed by the protocol-level analysis/modification component 382 depicted in FIG. 3 outside of the context of the browser engine 218 before the browser engine 218 has operated on the webpage. For example, all the links in the webpage, all the pictures in the webpage, and any scripts in the webpage may be analyzed at a low level of granularity. And code may be changed, enhanced, and removed according to default modes and/or user-configurable mode of operation. As one particular example, if a malicious script is found, it may be commented out before the content is handed to the browser engine.
- the protocol-level analysis/modification component 382 modifies and/or annotates the content—as HTML within the content—to provide the user with textual, audible and/or graphical indicators of risk associated with the content.
- a green check may be added within the HTML code next to a result that is a low risk link
- a red X may be added within the HTML code next to a result the is a high risk link.
- an image tag and/or text may be added within the HTML to indicate a risk of being exposed to malware.
- making modifications to the content (e.g., to include risk indicators) at the HTML level avoids having to address the differences that different browsers introduce into the presentation of rendered content.
- This approach of modifying content at the HTML level is very different than the prior approaches of rendering (e.g., using a Browser Helper Object) annotations on top of a page that has already been parsed and rendered by a browser engine.
- This prior approach is problematic because it allows the browser engine to potentially execute malicious scripts or perform malicious actions while it is parsing and rendering the code.
- the annotations are added after rending, the annotation process must account for the rendering differences (e.g., differences in how and where content is displayed) that different browsers (e.g., Firefox, Safari, Chrome, Internet Explorer, etc.) exhibit.
- search results annotation is one application for the protocol-level (e.g., HTML level) handling of content
- protocol-level e.g., HTML level
- anti-malware-related applications of the protocol-level handling of content.
- One application for example is an anti-phishing application, which may automatically modify HTML data so that user need not type in a password and expose the password to a keylogger object.
- the service process 228 is utilized to initiate parsing of the HTML content and extraction of any scripts 230 in the content before building a tree representation of the content 234 , which is then asynchronously analyzed 236 .
- the analysis may include URL analysis 238 , IP analysis 240 , image analysis (e.g., to analyze phishing threat) 242 , and script/HTML analysis 244 .
- the results of the analysis are then aggregated by the result aggregator 246 so that cloud verification 248 may be performed on the single aggregated collection of data by the base computer 112 of the security center 102 to determine malware vulnerabilities.
- the verification may be effectuated locally or in combination with the security center 102 .
- the results of the cloud verification are provided to the final content packager 250 of the service process 228 , and then in some modes of operation, annotations are performed 252 on the packaged HTML code before being distributed 254 back to the data wrapper component 256 , which sits on top of the response receive component of the browser engine 218 and performs final context modifications 258 to ensure content is properly displayed before being passed to the browser engine 218 .
- final content modifications are performed at 258 before the final content 260 is provided to the browser engine 218 for parsing and rendering of the HTML content before being displayed by the application 222 .
- a loop (Blocks 384 , 386 , 364 ) that depicts the cleaned and/or annotated content being fed back to the browser engine 218 .
- the pointer is set to the next byte to be read (Block 384 ), and once all the content is on hand, the browser engine requests content (Block 386 ), and the cleaned and/or annotated content is fed to the browser engine 218 in the manner the browser engine 218 would have requested and received the content.
- the browser engine 218 is obtaining the content from the webserver 108 directly.
- the protection agent 116 operates as an emulated server in memory to provide the clean content in the way the web server 108 would have provided the content to the browser engine 118 , 218 if the browser received the content form the web server 108 directly.
- the clean/modified content is not encoded or encrypted once it is cleaned.
- the packet headers e.g., length headers
- the decryption and decompression is carried out with code implemented in raw C (and most browsers are written in a high level language), so the decryption and decompression are actually carried out faster than an ordinary browser would do so.
- the decryption and decoding is generally carried out after the request is complete, so the decryption and decompression is more efficient than handling the decryption and decompression over several passes.
- N processing components 140 described with reference to FIG. 1 are depicted as N processors 440 that are coupled to a bus 460 , and also coupled to the bus 460 are a memory 438 (corresponding to memory 138 ), storage medium 412 (corresponding to the storage medium 112 ), a keyboard/pointing device 462 , a display/graphics adapter 464 , and a network interface 466 .
- a display 468 is coupled to the display/graphics adapter 464 .
- the storage medium 412 may be any device capable of holding substantial amounts of data, such as a hard drive, flash memory, or some other form of fixed or removable storage device. And the storage medium 412 in this embodiment stores processor-readable code with instructions to effectuate the functions described herein (e.g., the functions of the components in FIG. 1 depicted in the user 102 and kernel 104 environments).
- the processors 440 generally function to execute code and process other information that resides in memory and may be any specific or general-purpose processor such as an INTEL x86 or POWERPC-compatible central processing unit (CPU), and each may include one or multiple (e.g., four) cores.
- the memory 438 may include several gigabytes of random access memory, but this is merely exemplary and other memory types and sizes may be utilized.
- an operating system e.g., LINUX® or WINDOWS®
- LINUX® or WINDOWS® may also reside in the storage medium 412 and memory 438 and function (e.g., when executed by one or more of the processors 440 ) to enable the components to operate as described with reference to FIG. 1 .
- FIG. 4 depicts only an exemplary embodiment, and the processes presented herein are not inherently related to any particular computing device or other apparatus.
- Various general purpose systems may be used with programs in accordance with the teachings herein, or it may prove convenient to construct a more specialized apparatus to perform the desired method.
- embodiments of the present invention are not described with reference to any particular programming language. It will be appreciated that a variety of programming languages may be used to implement the teachings of the invention as described herein.
- operations, capabilities, and features described herein may be implemented with any combination embodied in firmware, software, application-specific integrated circuits (ASICs), and/or programmable logic devices.
- ASICs application-specific integrated circuits
- a computing device such as a personal computer, server, dedicated computing device, distributed processing system, in a cloud, or the like, or a separately programmed general purpose computer.
- the systems and methods of this invention can be implemented on a special purpose computer, a programmed microprocessor or microcontroller and peripheral integrated circuit element(s), an ASIC or other integrated circuit, a digital signal processor, a hard-wired electronic or logic circuit such as a discrete element circuit, a programmable logic device such as a PLD, PLA, FPGA, PAL, or the like, in fuzzy logic, artificial intelligence and/or neural networks.
- any device(s) or module which can be any combination of hardware and/or software, capable of implementing a state machine that is in turn capable of implementing the processes described herein can be used to implement this invention.
- the disclosed methods may readily implemented in software using, for example, object or object-oriented software development environments that provide portable source code that can be used on a variety of computer or workstation and/or server platforms.
- the software can be stored on a non-transitory computer-readable medium, with the software including one or more processor executable instructions.
- the disclosed system and methodology may also be implemented partially or fully in hardware using standard logic circuits or, for example, a VLSI design. Whether software or hardware is used to implement the systems in accordance with this invention is dependent on the speed and/or efficiency requirements of the system, the particular function, and the particular software or hardware systems or microprocessor or microcomputer systems being utilized.
- the various components of the system can be located at distant portions of a distributed network, such as a communications network and/or the Internet and/or within a dedicated network.
- a distributed network such as a communications network and/or the Internet and/or within a dedicated network.
- the various components can be combined into one or more devices or collocated on a particular node of a distributed network and/or in a cloud.
- the components can be arranged at any location within a distributed network without affecting the operation of the system.
- links connecting elements can be wired or wireless links, or a combination thereof, or any known or later developed element(s) that is capable of supplying and/or communicating data to and from the elements.
- the present disclosure in various aspects, embodiments, and/or configurations, includes components, methods, processes, systems and/or apparatus substantially as depicted and described herein, including various aspects, embodiments, configurations embodiments, subcombinations, and/or subsets thereof.
- the present disclosure in various aspects, embodiments, and/or configurations, includes providing devices and processes in the absence of items not depicted and/or described herein or in various aspects, embodiments, and/or configurations hereof, including in the absence of such items as may have been used in previous devices or processes, e.g., for improving performance, achieving ease and ⁇ or reducing cost of implementation.
Landscapes
- Engineering & Computer Science (AREA)
- Computer Security & Cryptography (AREA)
- Computer Hardware Design (AREA)
- General Engineering & Computer Science (AREA)
- Software Systems (AREA)
- Theoretical Computer Science (AREA)
- Health & Medical Sciences (AREA)
- General Health & Medical Sciences (AREA)
- Virology (AREA)
- Physics & Mathematics (AREA)
- General Physics & Mathematics (AREA)
- Information Transfer Between Computers (AREA)
Abstract
Description
- This application is a continuation of, and claims a benefit of priority under 35 U.S.C. § 120 from U.S. patent application Ser. No. 17/221,028, filed Apr. 2, 2021, entitled “PROACTIVE BROWSER CONTENT ANALYSIS,” which is a continuation of, and claims a benefit of priority under 35 U.S.C. § 120 from U.S. patent application Ser. No. 16/036,022, filed Jul. 16, 2018, entitled “PROACTIVE BROWSER CONTENT ANALYSIS,” issued as U.S. Pat. No. 11,281,777, which is a continuation of, and claims a benefit of priority under 35 U.S.C. § 120 from U.S. patent application Ser. No. 13/633,956, filed Oct. 3, 2012, entitled “PROACTIVE BROWSER CONTENT ANALYSIS,” issued as U.S. Pat. No. 10,025,928, which claims the benefit of, and priority under 35 U.S.C. § 119(e) to U.S. Provisional Application No. 61/542,693, filed Oct. 3, 2011, entitled “PROACTIVE BROWSER CONTENT ANALYSIS,” the disclosures of which are hereby incorporated by reference herein in their entireties.
- An exemplary aspect of the present invention generally relates to computer system management. In particular, but not by way of limitation, an exemplary aspect relates to systems and methods for controlling pestware or malware or other undesirable or unwanted applications and/or instructions.
- Personal computers and business computers are continually attacked by viruses, trojans, spyware, adware, etc., collectively referred to as “malware” or “pestware.” These types of programs generally act to gather information about a person or organization--often without the person or organization's knowledge. Some pestware is highly malicious. Other pestware is non-malicious but may cause issues with privacy or system performance. And yet other pestware is actually beneficial or wanted by the user. Wanted pestware is sometimes not characterized as “pestware” or “spyware.” But, unless specified otherwise, “pestware” or “malware” as used herein refers to any program that is malicious in some way and/or collects and/or reports information about a person or an organization and any “watcher processes” related to the pestware or malware.
- In accordance with an exemplary aspect, a protection module operates to analyze threats, at the protocol level (e.g., at the HTML level), by intercepting all requests that a browser engine resident in a computing device sends and receives, and the protection agent completes the requests without the help of the browser engine.
- And then the protection module analyzes and/or modifies the completed data before the browser engine has access to it, to, for example, display it. After performing all of its processing, removing, and/or adding any code as needed, the protection module provides the HTML content to the browser engine, and the browser engine receives responses from the protection agent as if it was speaking to an actual web server, when in fact, browser engine is speaking to an analysis engine of the protection module.
- This allows the protection module to have control over what a browser engine “sees,” providing means to remove any exploits, malware, and other threats dynamically. This also enables the protection module to add content into the browser stream at the HTML level, before receipt by the browser.
- In some exemplary implementations, search engine results (e.g., results provided by Google®,
- Yahoo®, and Bing®) are annotated/updated/amended by the protection module—within the HTML code—to denote if a particular website is legitimate or malicious. For example, a legitimate link in the search results may be depicted in connection with a green check mark and a suspect link may be depicted with a red cross. (Of course other indicators could also be used that identify to a user whether or not a link is “good,” “bad,” or “unknown.) In addition to search result annotation, the protocol-level analysis approach may also be used in connection with anti-phishing and URL analysis among other types of analysis.
- The differences between the disclosed protocol-level analysis approach compared to other prior anti-malware approaches are significant. In the context of search result annotation for example, the data (e.g., a web page of search results) is first analyzed and modified by an analysis engine of the protection module, which has control over every element of a web page before the web page is operated on by the browser engine. This is in contrast to prior approaches that just make high-level modifications to the content after the content has been rendered and displayed through a Browser Helper Object. With an exemplary aspect of the present protocol-level approach, there is virtually no performance overhead, and in many cases, there is actually a performance improvement when performing the browser content analysis.
- When the protection module receives content from a web server, the protection module then, if necessary, decrypts and decompresses the web content and then assembles the requested web page (e.g., in a decrypted and decompressed HTML format that the web page existed in at the remote server). The protection module then analyzes the web page to determine whether the web page includes links that may lead to sites hosting malware or whether the web page itself includes malware. The analysis of the assembled web page may include communicating with a remote security center so that a malware management analysis may be performed to analyze one or more portions of the content of the assembled web page and/or the protection module itself may perform analysis of content of the assembled webpage. The analyzed webpage can then be forwarded to the web browser for display to a user.
- The preceding is a simplified summary of the disclosure to provide an understanding of some aspects of the disclosure. This summary is neither an extensive nor exhaustive overview of the disclosure and its various aspects, embodiments, and/or configurations. It is intended neither to identify key or critical elements of the disclosure nor to delineate the scope of the disclosure but to present selected concepts of the disclosure in a simplified form as an introduction to the more detailed description presented below. As will be appreciated, other aspects, embodiments, and/or configurations of the disclosure are possible utilizing, alone or in combination, one or more of the features set forth above or described in detail below.
-
FIG. 1 illustrates an exemplary embodiment of a computing environment according to an exemplary embodiment. -
FIG. 2 illustrates an exemplary embodiment of systems and operations at the remote computer. -
FIG. 3 is a flowchart illustrating exemplary process flow at the remote computer. -
FIG. 4 illustrates an exemplary embodiment of one of the computers inFIG. 1 . - Referring now to the drawings, where like or similar elements are designated with identical reference numerals throughout the several views, and referring in particular to
FIG. 1 , it is a block diagram depicting an environment in which several embodiments of the invention may be implemented. As shown, a security center 102 (also referred to herein as a “central” or “base” computer),remote user 104 operating aremote computer 105, amalware source 106, and aweb server 108 are all communicatively coupled through one or more networks (e.g., the Internet and/or local or wide area networks) 110 andlinks 5. Although only oneremote user 104,remote computer 105,malware source 106,web server 108, andsecurity center 102 are depicted, each of these logically represents a potentially unlimited number of persons, entities and/or computers or computing resources. - The
remote user 104 may be an individual or a business enterprise that operates theremote computer 105, which may each be a personal computer, a server of any type, a PDA, mobile phone, tablet, netbook, an interactive television, or any other device capable of loading and operating computer objects. - In the depicted environment, the
malware source 106 generally represents a source of malware that ends up or is strategically placed at theweb server 108, which may or may not be suspected of hosting malware. For example, themalware source 106 may generate a malware object in a variety of forms including in a scripting language such as ECMAscript-based scripting languages (e.g., JavaScript or Adobe Flash), but the malware source may generate other types of objects such as computer files, part of a file or a sub-program, an instruction(s), macro, web page or any other piece of code to be operated by or on the computer, or any other event whether executed, emulated, simulated or interpreted. - As depicted, the
security center 102 is disposed and configured to be accessible to theuser 104 so that, as discussed further herein, thesecurity center 102 may facilitate the management of malware on theremote computer 104. In many implementations, thesecurity center 102 operates according to a Software as a Service (SaaS) business model to generally provide Web security services “in the cloud.” - As depicted in
FIG. 1 , theexemplary security center 102 includes amalware management portion 112 that is coupled to adata store 114. Although not depicted, thesecurity center 102 may also include components that provide other services (e.g., internet policy enforcement, in/outbound content control, application control, compliance-related services, etc.). - The
security center 102 is generally configured to obtain information about malware threats and to be a resource for theremote computer 105 to enable the remote computer to manage malware threats more effectively and efficiently. It should be noted that that themalware management component 112 anddata store 114 are presented for convenience as single entities, but thesecurity center 102 can be scaled and comprised of multiple geographically distributed computers and servers, etc., and the data store can be made up multiple databases and storage distributed around this central system and/or be located in a cloud-type environment. - Although not required, the
malware management 112 component of thesecurity center 102 may maintain thedata store 114 as a community database that is populated, over time, with information relating to each object run on all of the connected remote computers as disclosed in US A 2007/0016953, published 18 Jan. 2007, entitled “METHODS AND APPARATUS FOR DEALING WITH MALWARE,” the entire contents of which are hereby incorporated herein by reference. As discussed in the above-identified application, data representative of each malware object may take the form of a so-called signature or key relating to and/or identifying the object, its attributes and/or behavior(s). - In operation, the
protection agent 116 in this embodiment operates to analyze threats, at the protocol level (e.g., at the HTML level), by intercepting all requests that thebrowser engine 118 sends and receives, and theprotection agent 116 completes the requests without the help of thebrowser engine 118. And then theprotection agent 116 analyzes and/or modifies the completed data before thebrowser engine 118 has access to it. After performing all of its processing, removing, and/or adding any code as needed, the protection agent feeds the HTML content back to thebrowser engine 118, and thebrowser engine 118 receives responses from theprotection agent 116 as if it was “speaking” to an actual web server (e.g., web server 108) when in fact, it is speaking to an analysis engine of theprotection agent 116. This allows theprotection agent 116 to have full control over what thebrowser engine 118 “sees,” providing means to remove any exploits, malware, and other threats dynamically. This also enables theprotection agent 116 to add content into the browser stream at the HTML level. Stated another way, theprotection agent 116 caches web content requested by a browser, analyzes and/or modifies the retrieved web content, and provides a clean or sanitized version of the web content, free of malware, to the browser. - In some optional implementations for example, search engine results (e.g., results provided by Google, Yahoo, and Bing) are annotated by the
protection agent 116—within the HTML code—to denote if a particular website is legitimate or malicious. For example, a legitimate link in the search results may be depicted in connection with a green check mark and a suspect link may be depicted with a red cross. In addition to search result annotation, the protocol-level analysis approach may also be used in connection with anti-phishing and URL analysis among other types of analysis. - The differences between the protocol-level analysis approach disclosed herein as compared to other prior anti-malware approaches are significant. In the context of search result annotation for example, the data (e.g., a web page of search results) is first analyzed and modified by an analysis engine of the
protection agent 116, which has full control over every element of the web page before the page is operated on by thebrowser engine 118. This is in contrast to prior approaches that just make high-level modifications to the content after it has been displayed through a Browser Helper Object. With the present protocol-level approach, there is virtually no performance overhead, and in many cases, there is actually a performance improvement when performing the browser content analysis. - When the
protection agent 116 receives content from theweb server 108, theprotection agent 116 then, if necessary, decrypts and decompresses the web content and then assembles the requested web page (e.g., in a decrypted and decompressed HTML format that the web page existed in at the remote server 108). Theprotection agent 116 then analyzes the web page to determine whether the web page includes links that may lead to sites hosting malware or whether the web page itself includes malware. The analysis of the assembled web page may include communicating with thesecurity center 102 so that themalware management component 112 may analyze one or more portions of the content of the assembled web page and/or theprotection agent 116 itself may perform analysis of content of the assembled webpage. - In many embodiments, the cleaning process that the
protection agent 116 carries out takes place through a highly optimized routine written in, for example, raw C with an inline assembler reducing processing effort within thebrowser engine 118 itself by decrypting/decompressing/de-encoding any of the content outside of thebrowser engine 118. Having full control at this level also means that complex inferential algorithms can be applied to the browser content as a whole, taking into account any external script/image links, to build an in-memory picture of the final content before it is rendered to the user by the browser on a display (not shown). This general operation can be extended to remove/modify any form of content, whether illicit images, ads, fake password request forms, malicious exploits, cross site scripting attacks (XSS), etc. - Referring to
FIG. 2 , shown is a block diagram depicting exemplary functional components and operations that reside and take place on theremote computer 105 at the location of theremote user 104. In the depiction ofFIG. 2 , the browser-related processes and protection-agent-related processes are separated so that the interaction between the two types of processes may be more clearly understood. While referring toFIG. 2 , simultaneous reference is also made toFIG. 3 , which depicts exemplary aspects of the process flow that occurs in connection with the components depicted inFIG. 2 . Depicted next to each of the blocks inFIG. 3 is either a “K” or a “U,” which indicate that the operation is implemented at the kernel or user-level, respectively. It should be recognized, however, that the operations described with reference toFIGS. 2 and 3 may be implemented with some kernel-mode operations being implemented at the user-level and vice versa. Moreover, the illustrated arrangement of components inFIGS. 2 and 3 is logical, and is not meant to be an actual hardware diagram. Thus, many of the components can be combined and/or further separated in an actual implementation. - As shown, the browser processes in this embodiment include agent processes 220 that are installed on the
remote computer 105 in connection with theprotection agent 116 so that functions of theprotection agent 116 are integrated with atypical browser engine 218. In other words, the agent processes 220 of the browser processes are implemented by additional code that is wrapped around atypical browser engine 218 to intercept what is requested and received. As discussed further herein, these agent processes 220 enable all content that is requested by an application 222 (e.g., a browser) and received by thebrowser engine 218 to be intercepted. - More specifically as shown, when a user initiates a request via the application 222 (e.g., web browser or other application that requests web content), a
connection request 360 is initiated as a POST/GET request 362 to a website (e.g., hosted by the webserver 108) and theanalysis engine 224 looks at the context of the request to assess whether the request is the first request in a session (Block 364), and if the request is the first request, a determination is made as to whether the request is associated with known, malicious content (Block 366), and if so, the request is blocked (Blocks 368, 370). - In some embodiments, the
analysis engine 224 accesses thesecurity center 102 via the Internet and thesecurity center 102 is utilized to facilitate whether the request is a request for known malicious content (e.g., the URL of the request may be compared to a black list of URLs). But theanalysis engine 224 may also include some malware checking functionality locally. As shown, if the request is not blocked, the request may be pre-processed by the content acquisition component 226 (Block 372) (e.g., to set aside sufficient memory in RAM in anticipation of the content from the website being received) before the request is sent to thedestination website 108. - And as shown, when the first response is received by the content acquisition component 226 (Block 374), if the response is not complete, the response is stored in memory (Block 376), and the next request is sent to the destination website 108 (Block 378). In this way, the
content acquisition component 226 continues to obtain content from the website 108 (Blocks content acquisition component 226. - Thus, in short, the
initial request 360 by the application (e.g., browser) is intercepted and if the request does not appear to be a request for malicious content, thecontent acquisition component 226 iteratively sends requests and receives content (Blocks website 108 until the requested content has been completely received. This is very different from the ordinary operation of thebrowser engine 218, which would (if unaltered by the protection agent 116) obtain the content from thewebserver 108 itself by way of a series of GET requests. - As shown in
FIG. 2 , once the web content is completely gathered by thecontent acquisition component 226, the content is passed along to theservice process component 228 via theuser process component 230, and the data is then preprocessed (Block 380), if necessary, to decrypt, remove chunks, and decompress the gathered content. When a web page is received from thewebserver 108, it may be compressed, chunked encoded, and encrypted, and decompressing, decoding and unencrypting the content enables a complete picture of the webpage looked like before if any of the these forms of obfuscation were applied to the content by thewebserver 108. One of ordinary skill in the art will appreciate that various layers of encoding and compression may be applied to content, but for simplicity, these known details are not included herein for clarity. - And after the data is decrypted and decompressed, the data is in an HTML format, so at this point, the protection has the content in the same HTML format that the
website 108 had the content in. And at this point thebrowser engine 218 is unaware that the requested content (e.g., an entire webpage) has been received. As a consequence, the browser page may be analyzed by the protocol-level analysis/modification component 382 depicted inFIG. 3 outside of the context of thebrowser engine 218 before thebrowser engine 218 has operated on the webpage. For example, all the links in the webpage, all the pictures in the webpage, and any scripts in the webpage may be analyzed at a low level of granularity. And code may be changed, enhanced, and removed according to default modes and/or user-configurable mode of operation. As one particular example, if a malicious script is found, it may be commented out before the content is handed to the browser engine. - In addition, in many modes of operation the protocol-level analysis/
modification component 382 modifies and/or annotates the content—as HTML within the content—to provide the user with textual, audible and/or graphical indicators of risk associated with the content. In the context of a webpage that include search results from a user's search query, for example, a green check may be added within the HTML code next to a result that is a low risk link, and a red X may be added within the HTML code next to a result the is a high risk link. For example, an image tag and/or text may be added within the HTML to indicate a risk of being exposed to malware. Beneficially, making modifications to the content (e.g., to include risk indicators) at the HTML level avoids having to address the differences that different browsers introduce into the presentation of rendered content. - This approach of modifying content at the HTML level (e.g., to add annotations) is very different than the prior approaches of rendering (e.g., using a Browser Helper Object) annotations on top of a page that has already been parsed and rendered by a browser engine. This prior approach is problematic because it allows the browser engine to potentially execute malicious scripts or perform malicious actions while it is parsing and rendering the code. And in addition, because the annotations are added after rending, the annotation process must account for the rendering differences (e.g., differences in how and where content is displayed) that different browsers (e.g., Firefox, Safari, Chrome, Internet Explorer, etc.) exhibit.
- It should be recognized that although search results annotation is one application for the protocol-level (e.g., HTML level) handling of content, it is certainly contemplated that there are other anti-malware-related applications of the protocol-level handling of content. One application for example, is an anti-phishing application, which may automatically modify HTML data so that user need not type in a password and expose the password to a keylogger object.
- As shown in
FIG. 2 , more details of the protocol-level analysis/modification component 382 are depicted. As shown, once a complete collection of content is received, theservice process 228 is utilized to initiate parsing of the HTML content and extraction of anyscripts 230 in the content before building a tree representation of thecontent 234, which is then asynchronously analyzed 236. As shown inFIG. 2 for example, the analysis may includeURL analysis 238,IP analysis 240, image analysis (e.g., to analyze phishing threat) 242, and script/HTML analysis 244. The results of the analysis are then aggregated by theresult aggregator 246 so thatcloud verification 248 may be performed on the single aggregated collection of data by thebase computer 112 of thesecurity center 102 to determine malware vulnerabilities. - But in optional embodiments, the verification may be effectuated locally or in combination with the
security center 102. As shown, the results of the cloud verification are provided to thefinal content packager 250 of theservice process 228, and then in some modes of operation, annotations are performed 252 on the packaged HTML code before being distributed 254 back to thedata wrapper component 256, which sits on top of the response receive component of thebrowser engine 218 and performsfinal context modifications 258 to ensure content is properly displayed before being passed to thebrowser engine 218. As shown, final content modifications are performed at 258 before thefinal content 260 is provided to thebrowser engine 218 for parsing and rendering of the HTML content before being displayed by theapplication 222. - Referring again to
FIG. 3 , shown is a loop (Blocks browser engine 218. As shown, as the browser engine requests more content, the pointer is set to the next byte to be read (Block 384), and once all the content is on hand, the browser engine requests content (Block 386), and the cleaned and/or annotated content is fed to thebrowser engine 218 in the manner thebrowser engine 218 would have requested and received the content. - In other words, from the browser engine's 218 perspective, the
browser engine 218 is obtaining the content from thewebserver 108 directly. In other words, theprotection agent 116 operates as an emulated server in memory to provide the clean content in the way theweb server 108 would have provided the content to thebrowser engine web server 108 directly. - In the exemplary embodiment however, for operational speed, the clean/modified content is not encoded or encrypted once it is cleaned. As a consequence, the packet headers (e.g., length headers) are modified to reflect that the content being provided to the
browser engine - Referring next to
FIG. 4 , shown is a block diagram depicting hardware components in an exemplary embodiment of the protected computer described with reference toFIG. 1 . As shown, the N processing components 140 described with reference toFIG. 1 are depicted asN processors 440 that are coupled to abus 460, and also coupled to thebus 460 are a memory 438 (corresponding to memory 138), storage medium 412 (corresponding to the storage medium 112), a keyboard/pointing device 462, a display/graphics adapter 464, and anetwork interface 466. In addition, adisplay 468 is coupled to the display/graphics adapter 464. - The
storage medium 412 may be any device capable of holding substantial amounts of data, such as a hard drive, flash memory, or some other form of fixed or removable storage device. And thestorage medium 412 in this embodiment stores processor-readable code with instructions to effectuate the functions described herein (e.g., the functions of the components inFIG. 1 depicted in theuser 102 andkernel 104 environments). Theprocessors 440 generally function to execute code and process other information that resides in memory and may be any specific or general-purpose processor such as an INTEL x86 or POWERPC-compatible central processing unit (CPU), and each may include one or multiple (e.g., four) cores. The memory 438 may include several gigabytes of random access memory, but this is merely exemplary and other memory types and sizes may be utilized. As one of ordinarily skill will appreciate, an operating system (e.g., LINUX® or WINDOWS®) may also reside in thestorage medium 412 and memory 438 and function (e.g., when executed by one or more of the processors 440) to enable the components to operate as described with reference toFIG. 1 . - As one of ordinary skill in the art in light of this disclosure will appreciate,
FIG. 4 depicts only an exemplary embodiment, and the processes presented herein are not inherently related to any particular computing device or other apparatus. Various general purpose systems may be used with programs in accordance with the teachings herein, or it may prove convenient to construct a more specialized apparatus to perform the desired method. In addition, embodiments of the present invention are not described with reference to any particular programming language. It will be appreciated that a variety of programming languages may be used to implement the teachings of the invention as described herein. In addition, it should be understood that operations, capabilities, and features described herein may be implemented with any combination embodied in firmware, software, application-specific integrated circuits (ASICs), and/or programmable logic devices. - It is to be appreciated that a lesser or more equipped computer system than the example described above may be desirable for certain implementations. Therefore, the configuration of the system illustrated in the figure can vary from implementation to implementation depending upon numerous factors, such as its intended use, price constraints, performance requirements, storage requirements, technological improvements, and/or other circumstances, or the like.
- It should also be noted that while the embodiments and methods described herein may be performed and used with a computer similar to the one described herein, other embodiments and variations can be used with computer that vary from the described example. Therefore, nothing disclosed herein concerning the configuration of the illustrated computer should be construed as limiting the disclosure to a particular embodiment wherein the recited operations are performed by a specific combination of hardware components.
- The various embodiments and variations thereof illustrated in the accompanying Figures and/or in the totality of this document are merely exemplary and are not meant to limit the scope of the invention. It is to be appreciated that numerous variations of the invention have been contemplated as would be obvious to one of ordinary skill in the art with the benefit of this disclosure. Additionally, while certain features may be categorized under one or more headings to assist with readability, it is to be appreciated that the feature(s) described under a particular heading may be used in associating with other portions of the specification and/or feature(s) described herein. Similarly, while certain embodiments are discussed in relation to specific languages, it is to be appreciated that the techniques disclosed herein can be used with any software language(s).
- While the above described methodology has been discussed in relation to a particular sequence of events, it should be appreciated that minor changes to this sequence can occur without materially effecting the operation of the invention.
- The above-described system and methodology, as has been indicated herein, can be implemented on a computing device, such as a personal computer, server, dedicated computing device, distributed processing system, in a cloud, or the like, or a separately programmed general purpose computer. Additionally, the systems and methods of this invention can be implemented on a special purpose computer, a programmed microprocessor or microcontroller and peripheral integrated circuit element(s), an ASIC or other integrated circuit, a digital signal processor, a hard-wired electronic or logic circuit such as a discrete element circuit, a programmable logic device such as a PLD, PLA, FPGA, PAL, or the like, in fuzzy logic, artificial intelligence and/or neural networks. In general, any device(s) or module, which can be any combination of hardware and/or software, capable of implementing a state machine that is in turn capable of implementing the processes described herein can be used to implement this invention.
- Furthermore, the disclosed methods may readily implemented in software using, for example, object or object-oriented software development environments that provide portable source code that can be used on a variety of computer or workstation and/or server platforms. The software can be stored on a non-transitory computer-readable medium, with the software including one or more processor executable instructions. The disclosed system and methodology may also be implemented partially or fully in hardware using standard logic circuits or, for example, a VLSI design. Whether software or hardware is used to implement the systems in accordance with this invention is dependent on the speed and/or efficiency requirements of the system, the particular function, and the particular software or hardware systems or microprocessor or microcomputer systems being utilized. The systems and methods illustrated herein can be readily implemented in hardware and/or software using any suitable systems, means, structures, devices and/or the functionality stored on an appropriate information storage medium, by those of ordinary skill in the applicable art from the functional description provided herein and with a basic general knowledge of the computer and software arts.
- While the embodiments illustrated herein may show some of the various components collocated, it is to be appreciated that the various components of the system can be located at distant portions of a distributed network, such as a communications network and/or the Internet and/or within a dedicated network. Thus, it should be appreciated that the various components can be combined into one or more devices or collocated on a particular node of a distributed network and/or in a cloud. As will be appreciated from the description, and for reasons of computational efficiency, the components can be arranged at any location within a distributed network without affecting the operation of the system.
- Furthermore, it should be appreciated that various links connecting elements can be wired or wireless links, or a combination thereof, or any known or later developed element(s) that is capable of supplying and/or communicating data to and from the elements.
- The present disclosure, in various aspects, embodiments, and/or configurations, includes components, methods, processes, systems and/or apparatus substantially as depicted and described herein, including various aspects, embodiments, configurations embodiments, subcombinations, and/or subsets thereof. Those of skill in the art will understand how to make and use the disclosed aspects, embodiments, and/or configurations after understanding the present disclosure. The present disclosure, in various aspects, embodiments, and/or configurations, includes providing devices and processes in the absence of items not depicted and/or described herein or in various aspects, embodiments, and/or configurations hereof, including in the absence of such items as may have been used in previous devices or processes, e.g., for improving performance, achieving ease and\or reducing cost of implementation.
- The foregoing discussion has been presented for purposes of illustration and description. The foregoing is not intended to limit the disclosure to the form or forms disclosed herein. In the foregoing Detailed Description for example, various features of the disclosure are grouped together in one or more aspects, embodiments, and/or configurations for the purpose of streamlining the disclosure. The features of the aspects, embodiments, and/or configurations of the disclosure may be combined in alternate aspects, embodiments, and/or configurations other than those discussed above. This method of disclosure is not to be interpreted as reflecting an intention that the claims require more features than are expressly recited in each claim. Rather, as the following claims reflect, inventive aspects lie in less than all features of a single foregoing disclosed aspect, embodiment, and/or configuration. Thus, the following claims are hereby incorporated into this Detailed Description, with each claim standing on its own as a separate exemplary, and separately claimable, embodiment of the disclosure.
- While exemplary aspects have been described in conjunction with a number of embodiments, it is evident that many alternatives, modifications and variations would be or are apparent to those of ordinary skill in the applicable arts. Accordingly, this disclosure is intended to embrace all such alternatives, modifications, equivalents and variations that are within the spirit and scope of this disclosure.
Claims (1)
Priority Applications (1)
Application Number | Priority Date | Filing Date | Title |
---|---|---|---|
US18/158,218 US20230153437A1 (en) | 2011-10-03 | 2023-01-23 | Proactive browser content analysis |
Applications Claiming Priority (5)
Application Number | Priority Date | Filing Date | Title |
---|---|---|---|
US201161542693P | 2011-10-03 | 2011-10-03 | |
US13/633,956 US10025928B2 (en) | 2011-10-03 | 2012-10-03 | Proactive browser content analysis |
US16/036,022 US11281777B2 (en) | 2011-10-03 | 2018-07-16 | Proactive browser content analysis |
US17/221,028 US11593484B2 (en) | 2011-10-03 | 2021-04-02 | Proactive browser content analysis |
US18/158,218 US20230153437A1 (en) | 2011-10-03 | 2023-01-23 | Proactive browser content analysis |
Related Parent Applications (1)
Application Number | Title | Priority Date | Filing Date |
---|---|---|---|
US17/221,028 Continuation US11593484B2 (en) | 2011-10-03 | 2021-04-02 | Proactive browser content analysis |
Publications (1)
Publication Number | Publication Date |
---|---|
US20230153437A1 true US20230153437A1 (en) | 2023-05-18 |
Family
ID=47993971
Family Applications (4)
Application Number | Title | Priority Date | Filing Date |
---|---|---|---|
US13/633,956 Active US10025928B2 (en) | 2011-10-03 | 2012-10-03 | Proactive browser content analysis |
US16/036,022 Active US11281777B2 (en) | 2011-10-03 | 2018-07-16 | Proactive browser content analysis |
US17/221,028 Active 2032-10-15 US11593484B2 (en) | 2011-10-03 | 2021-04-02 | Proactive browser content analysis |
US18/158,218 Pending US20230153437A1 (en) | 2011-10-03 | 2023-01-23 | Proactive browser content analysis |
Family Applications Before (3)
Application Number | Title | Priority Date | Filing Date |
---|---|---|---|
US13/633,956 Active US10025928B2 (en) | 2011-10-03 | 2012-10-03 | Proactive browser content analysis |
US16/036,022 Active US11281777B2 (en) | 2011-10-03 | 2018-07-16 | Proactive browser content analysis |
US17/221,028 Active 2032-10-15 US11593484B2 (en) | 2011-10-03 | 2021-04-02 | Proactive browser content analysis |
Country Status (1)
Country | Link |
---|---|
US (4) | US10025928B2 (en) |
Families Citing this family (19)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
US10025928B2 (en) | 2011-10-03 | 2018-07-17 | Webroot Inc. | Proactive browser content analysis |
US9734332B2 (en) | 2014-03-17 | 2017-08-15 | Proofpoint, Inc. | Behavior profiling for malware detection |
US9710648B2 (en) | 2014-08-11 | 2017-07-18 | Sentinel Labs Israel Ltd. | Method of malware detection and system thereof |
US11507663B2 (en) | 2014-08-11 | 2022-11-22 | Sentinel Labs Israel Ltd. | Method of remediating operations performed by a program and system thereof |
WO2016127233A1 (en) * | 2015-02-10 | 2016-08-18 | Gas Informatica Ltda | Assistive technology for anti-malware software |
RU2622626C2 (en) * | 2015-09-30 | 2017-06-16 | Акционерное общество "Лаборатория Касперского" | System and method for detecting phishing scripts |
US11695800B2 (en) | 2016-12-19 | 2023-07-04 | SentinelOne, Inc. | Deceiving attackers accessing network data |
US11616812B2 (en) | 2016-12-19 | 2023-03-28 | Attivo Networks Inc. | Deceiving attackers accessing active directory data |
US10462171B2 (en) | 2017-08-08 | 2019-10-29 | Sentinel Labs Israel Ltd. | Methods, systems, and devices for dynamically modeling and grouping endpoints for edge networking |
US11470115B2 (en) | 2018-02-09 | 2022-10-11 | Attivo Networks, Inc. | Implementing decoys in a network environment |
US11470113B1 (en) * | 2018-02-15 | 2022-10-11 | Comodo Security Solutions, Inc. | Method to eliminate data theft through a phishing website |
US10803188B1 (en) * | 2018-06-25 | 2020-10-13 | NortonLifeLock, Inc. | Systems and methods for preventing sensitive data sharing |
US10521583B1 (en) * | 2018-10-25 | 2019-12-31 | BitSight Technologies, Inc. | Systems and methods for remote detection of software through browser webinjects |
US12074887B1 (en) * | 2018-12-21 | 2024-08-27 | Musarubra Us Llc | System and method for selectively processing content after identification and removal of malicious content |
WO2020236981A1 (en) | 2019-05-20 | 2020-11-26 | Sentinel Labs Israel Ltd. | Systems and methods for executable code detection, automatic feature extraction and position independent code detection |
US11082437B2 (en) * | 2019-12-17 | 2021-08-03 | Paypal, Inc. | Network resources attack detection |
US11579857B2 (en) | 2020-12-16 | 2023-02-14 | Sentinel Labs Israel Ltd. | Systems, methods and devices for device fingerprinting and automatic deployment of software in a computing network using a peer-to-peer approach |
US11716310B2 (en) * | 2020-12-31 | 2023-08-01 | Proofpoint, Inc. | Systems and methods for in-process URL condemnation |
US11899782B1 (en) | 2021-07-13 | 2024-02-13 | SentinelOne, Inc. | Preserving DLL hooks |
Citations (24)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
US20060259544A1 (en) * | 2005-04-28 | 2006-11-16 | Zubenko Igor V | Client-side Java content transformation |
US20070016949A1 (en) * | 2005-07-15 | 2007-01-18 | Microsoft Corporation | Browser Protection Module |
US20080083012A1 (en) * | 2006-06-26 | 2008-04-03 | Dachuan Yu | Program instrumentation method and apparatus for constraining the behavior of embedded script in documents |
US20080104661A1 (en) * | 2006-10-27 | 2008-05-01 | Joseph Levin | Managing Policy Settings for Remote Clients |
US20090077670A1 (en) * | 2002-02-05 | 2009-03-19 | Max Schireson | E-commerce store management user interface for performing Web site updates |
US20090193497A1 (en) * | 2008-01-25 | 2009-07-30 | Haruka Kikuchi | Method and apparatus for constructing security policies for web content instrumentation against browser-based attacks |
US20090300111A1 (en) * | 2001-04-09 | 2009-12-03 | Aol Llc, A Delaware Limited Liability Company | Server-based browser system |
US7647417B1 (en) * | 2006-03-15 | 2010-01-12 | Netapp, Inc. | Object cacheability with ICAP |
US7861120B2 (en) * | 2007-12-14 | 2010-12-28 | Sap Ag | Method and apparatus for runtime error handling |
US20110197272A1 (en) * | 2010-02-09 | 2011-08-11 | Webroot Software, Inc. | Low-Latency Detection of Scripting-Language-Based Exploits |
US20120240183A1 (en) * | 2011-03-18 | 2012-09-20 | Amit Sinha | Cloud based mobile device security and policy enforcement |
US20130080611A1 (en) * | 2011-09-22 | 2013-03-28 | Blue Coat Systems Inc. | Managing Network Content |
US8839404B2 (en) * | 2011-05-26 | 2014-09-16 | Blue Coat Systems, Inc. | System and method for building intelligent and distributed L2-L7 unified threat management infrastructure for IPv4 and IPv6 environments |
US8850584B2 (en) * | 2010-02-08 | 2014-09-30 | Mcafee, Inc. | Systems and methods for malware detection |
US8898776B2 (en) * | 2010-12-28 | 2014-11-25 | Microsoft Corporation | Automatic context-sensitive sanitization |
US9098719B2 (en) * | 2011-02-03 | 2015-08-04 | Apple Inc. | Securing unrusted content for collaborative documents |
US20150319136A1 (en) * | 2011-05-24 | 2015-11-05 | Palo Alto Networks, Inc. | Malware analysis system |
US9392002B2 (en) * | 2002-01-31 | 2016-07-12 | Nokia Technologies Oy | System and method of providing virus protection at a gateway |
US9906549B2 (en) * | 2007-09-06 | 2018-02-27 | Microsoft Technology Licensing, Llc | Proxy engine for custom handling of web content |
US10019570B2 (en) * | 2007-06-14 | 2018-07-10 | Microsoft Technology Licensing, Llc | Protection and communication abstractions for web browsers |
US10178115B2 (en) * | 2004-06-18 | 2019-01-08 | Fortinet, Inc. | Systems and methods for categorizing network traffic content |
US10320809B1 (en) * | 2015-10-30 | 2019-06-11 | Cyberinc Corporation | Decoupling rendering engine from web browser for security |
US10601775B1 (en) * | 2011-02-01 | 2020-03-24 | Palo Alto Networks, Inc. | Blocking download of content |
US11537472B1 (en) * | 2021-10-14 | 2022-12-27 | Vast Data Ltd. | Striping based on failure domains rules |
Family Cites Families (28)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
US6029245A (en) * | 1997-03-25 | 2000-02-22 | International Business Machines Corporation | Dynamic assignment of security parameters to web pages |
US6594697B1 (en) * | 1999-05-20 | 2003-07-15 | Microsoft Corporation | Client system having error page analysis and replacement capabilities |
US20040015725A1 (en) * | 2000-08-07 | 2004-01-22 | Dan Boneh | Client-side inspection and processing of secure content |
US6898619B1 (en) * | 2000-12-08 | 2005-05-24 | Sun Microsystmes, Inc. | System and method for dynamically disabling resubmission of HTTP requests |
US20030051142A1 (en) * | 2001-05-16 | 2003-03-13 | Hidalgo Lluis Mora | Firewalls for providing security in HTTP networks and applications |
US7200599B2 (en) * | 2001-06-21 | 2007-04-03 | Microsoft Corporation | Automated generator of input-validation filters |
US20040073811A1 (en) * | 2002-10-15 | 2004-04-15 | Aleksey Sanin | Web service security filter |
US20040260754A1 (en) * | 2003-06-20 | 2004-12-23 | Erik Olson | Systems and methods for mitigating cross-site scripting |
US8244910B2 (en) * | 2004-07-14 | 2012-08-14 | Ebay Inc. | Method and system to modify function calls from within content published by a trusted web site |
US7621613B2 (en) | 2005-11-17 | 2009-11-24 | Brother Kogyo Kabushiki Kaisha | Ink-jet recording apparatus and recording method for realizing satisfactory recording even when ink temperature is suddenly changed |
US7849507B1 (en) * | 2006-04-29 | 2010-12-07 | Ironport Systems, Inc. | Apparatus for filtering server responses |
US7934253B2 (en) * | 2006-07-20 | 2011-04-26 | Trustwave Holdings, Inc. | System and method of securing web applications across an enterprise |
US20080034424A1 (en) * | 2006-07-20 | 2008-02-07 | Kevin Overcash | System and method of preventing web applications threats |
US20090019525A1 (en) * | 2007-07-13 | 2009-01-15 | Dachuan Yu | Domain-specific language abstractions for secure server-side scripting |
US20090070873A1 (en) | 2007-09-11 | 2009-03-12 | Yahoo! Inc. | Safe web based interactions |
US8302080B2 (en) * | 2007-11-08 | 2012-10-30 | Ntt Docomo, Inc. | Automated test input generation for web applications |
US8464318B1 (en) * | 2008-11-24 | 2013-06-11 | Renen Hallak | System and method for protecting web clients and web-based applications |
US8935773B2 (en) * | 2009-04-09 | 2015-01-13 | George Mason Research Foundation, Inc. | Malware detector |
US9154364B1 (en) | 2009-04-25 | 2015-10-06 | Dasient, Inc. | Monitoring for problems and detecting malware |
US8307436B2 (en) * | 2009-06-15 | 2012-11-06 | The United States Of America As Represented By The Secretary Of The Air Force | Transformative rendering of internet resources |
US8856869B1 (en) * | 2009-06-22 | 2014-10-07 | NexWavSec Software Inc. | Enforcement of same origin policy for sensitive data |
US20110219446A1 (en) * | 2010-03-05 | 2011-09-08 | Jeffrey Ichnowski | Input parameter filtering for web application security |
US8875285B2 (en) * | 2010-03-24 | 2014-10-28 | Microsoft Corporation | Executable code validation in a web browser |
US9009330B2 (en) * | 2010-04-01 | 2015-04-14 | Cloudflare, Inc. | Internet-based proxy service to limit internet visitor connection speed |
US20120047581A1 (en) | 2010-08-12 | 2012-02-23 | Anirban Banerjee | Event-driven auto-restoration of websites |
US20130007870A1 (en) | 2011-06-28 | 2013-01-03 | The Go Daddy Group, Inc. | Systems for bi-directional network traffic malware detection and removal |
KR101086451B1 (en) * | 2011-08-30 | 2011-11-25 | 한국전자통신연구원 | Apparatus and method for defending a modulation of the client screen |
US10025928B2 (en) | 2011-10-03 | 2018-07-17 | Webroot Inc. | Proactive browser content analysis |
-
2012
- 2012-10-03 US US13/633,956 patent/US10025928B2/en active Active
-
2018
- 2018-07-16 US US16/036,022 patent/US11281777B2/en active Active
-
2021
- 2021-04-02 US US17/221,028 patent/US11593484B2/en active Active
-
2023
- 2023-01-23 US US18/158,218 patent/US20230153437A1/en active Pending
Patent Citations (26)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
US20090300111A1 (en) * | 2001-04-09 | 2009-12-03 | Aol Llc, A Delaware Limited Liability Company | Server-based browser system |
US9392002B2 (en) * | 2002-01-31 | 2016-07-12 | Nokia Technologies Oy | System and method of providing virus protection at a gateway |
US20090077670A1 (en) * | 2002-02-05 | 2009-03-19 | Max Schireson | E-commerce store management user interface for performing Web site updates |
US10178115B2 (en) * | 2004-06-18 | 2019-01-08 | Fortinet, Inc. | Systems and methods for categorizing network traffic content |
US20060259544A1 (en) * | 2005-04-28 | 2006-11-16 | Zubenko Igor V | Client-side Java content transformation |
US20070016949A1 (en) * | 2005-07-15 | 2007-01-18 | Microsoft Corporation | Browser Protection Module |
US7647417B1 (en) * | 2006-03-15 | 2010-01-12 | Netapp, Inc. | Object cacheability with ICAP |
US20080083012A1 (en) * | 2006-06-26 | 2008-04-03 | Dachuan Yu | Program instrumentation method and apparatus for constraining the behavior of embedded script in documents |
US20080104661A1 (en) * | 2006-10-27 | 2008-05-01 | Joseph Levin | Managing Policy Settings for Remote Clients |
US10019570B2 (en) * | 2007-06-14 | 2018-07-10 | Microsoft Technology Licensing, Llc | Protection and communication abstractions for web browsers |
US9906549B2 (en) * | 2007-09-06 | 2018-02-27 | Microsoft Technology Licensing, Llc | Proxy engine for custom handling of web content |
US7861120B2 (en) * | 2007-12-14 | 2010-12-28 | Sap Ag | Method and apparatus for runtime error handling |
US20090193497A1 (en) * | 2008-01-25 | 2009-07-30 | Haruka Kikuchi | Method and apparatus for constructing security policies for web content instrumentation against browser-based attacks |
US9686288B2 (en) * | 2008-01-25 | 2017-06-20 | Ntt Docomo, Inc. | Method and apparatus for constructing security policies for web content instrumentation against browser-based attacks |
US8850584B2 (en) * | 2010-02-08 | 2014-09-30 | Mcafee, Inc. | Systems and methods for malware detection |
US20110197272A1 (en) * | 2010-02-09 | 2011-08-11 | Webroot Software, Inc. | Low-Latency Detection of Scripting-Language-Based Exploits |
US8898776B2 (en) * | 2010-12-28 | 2014-11-25 | Microsoft Corporation | Automatic context-sensitive sanitization |
US10601775B1 (en) * | 2011-02-01 | 2020-03-24 | Palo Alto Networks, Inc. | Blocking download of content |
US9098719B2 (en) * | 2011-02-03 | 2015-08-04 | Apple Inc. | Securing unrusted content for collaborative documents |
US20120240183A1 (en) * | 2011-03-18 | 2012-09-20 | Amit Sinha | Cloud based mobile device security and policy enforcement |
US20150319136A1 (en) * | 2011-05-24 | 2015-11-05 | Palo Alto Networks, Inc. | Malware analysis system |
US8839404B2 (en) * | 2011-05-26 | 2014-09-16 | Blue Coat Systems, Inc. | System and method for building intelligent and distributed L2-L7 unified threat management infrastructure for IPv4 and IPv6 environments |
US8843608B2 (en) * | 2011-09-22 | 2014-09-23 | Blue Coat Systems, Inc. | Methods and systems for caching popular network content |
US20130080611A1 (en) * | 2011-09-22 | 2013-03-28 | Blue Coat Systems Inc. | Managing Network Content |
US10320809B1 (en) * | 2015-10-30 | 2019-06-11 | Cyberinc Corporation | Decoupling rendering engine from web browser for security |
US11537472B1 (en) * | 2021-10-14 | 2022-12-27 | Vast Data Ltd. | Striping based on failure domains rules |
Non-Patent Citations (4)
Title |
---|
Charles Reis, John Dunagan, Helen J. Wang, Opher Dubrovsky, and Saher Esmeir. 2007. BrowserShield: Vulnerability-driven filtering of dynamic HTML. ACM Trans. Web 1, 3 (September 2007), 11–es. https://doi.org/10.1145/1281480.1281481 (Year: 2007) * |
Jil Verdol, "Automating Content Security Policy Generation", Thesis, Pennsylvania State University, 8/2011, pg. 1-47. (Year: 2011) * |
Moshchuk, A., Bragin, T., Deville, D., Gribble, S.D., & Levy, H.M. (2007). SpyProxy: Execution-based Detection of Malicious Web Content. USENIX Security Symposium. (Year: 2007) * |
Reis, Charles. 2009. Web browsers as operating systems: Supporting robust and secure web programs. Ph.D. diss., University of Washington, https://www.proquest.com/dissertations-theses/web-browsers-as-operating-systems-supporting/docview/305018008/se-2 (accessed October 20, 2023). (Year: 2009) * |
Also Published As
Publication number | Publication date |
---|---|
US11281777B2 (en) | 2022-03-22 |
US10025928B2 (en) | 2018-07-17 |
US11593484B2 (en) | 2023-02-28 |
US20190171817A1 (en) | 2019-06-06 |
US20210224389A1 (en) | 2021-07-22 |
US20130086681A1 (en) | 2013-04-04 |
Similar Documents
Publication | Publication Date | Title |
---|---|---|
US11593484B2 (en) | Proactive browser content analysis | |
US10642600B2 (en) | Cloud suffix proxy and a method thereof | |
US10834082B2 (en) | Client/server security by executing instructions and rendering client application instructions | |
US10164993B2 (en) | Distributed split browser content inspection and analysis | |
US9811676B1 (en) | Systems and methods for securely providing information external to documents | |
US8732304B2 (en) | Method and system for ensuring authenticity of IP data served by a service provider | |
US8176556B1 (en) | Methods and systems for tracing web-based attacks | |
US11973780B2 (en) | Deobfuscating and decloaking web-based malware with abstract execution | |
US20140283078A1 (en) | Scanning and filtering of hosted content | |
US11303693B2 (en) | Firewall multi-level security dynamic host-based sandbox generation for embedded URL links | |
CN111163094B (en) | Network attack detection method, network attack detection device, electronic device, and medium | |
EP3069251A1 (en) | A cloud suffix proxy and methods thereof | |
WO2015109912A1 (en) | Buffer overflow attack detection device and method and security protection system | |
JP2023535770A (en) | Systems and methods for enhancing user privacy | |
TWI470468B (en) | System and method for detecting web malicious programs and behaviors | |
US9154520B1 (en) | Systems and methods for notifying users of endpoint devices about blocked downloads | |
Kerschbaumer et al. | Towards precise and efficient information flow control in web browsers | |
US20190334930A1 (en) | Mobile device and method for isolating and protecting a computer, networks, and devices from viruses and cyber attacks | |
Pearce | Development and evaluation of a secure web gateway with messaging functionality: utilizing existing ICAP and open-source tools to notify and protect end users from Internet security threats. | |
Kerschbaumer et al. | Towards Precise and Efficient Information Flow Control in Web Browsers (Short Paper) | |
WO2016186817A1 (en) | Client/server security by an intermediary executing instructions received from a server and rendering client application instructions |
Legal Events
Date | Code | Title | Description |
---|---|---|---|
AS | Assignment |
Owner name: WEBROOT INC., COLORADO Free format text: ASSIGNMENT OF ASSIGNORS INTEREST;ASSIGNORS:JAROCH, JOE;MCCLOY, HARRY MURPHEY, III;ADAMS, ROBERT EDWARD;SIGNING DATES FROM 20121002 TO 20140926;REEL/FRAME:062679/0272 |
|
STPP | Information on status: patent application and granting procedure in general |
Free format text: NON FINAL ACTION MAILED |
|
AS | Assignment |
Owner name: WEBROOT LLC, COLORADO Free format text: CERTIFICATE OF CONVERSION;ASSIGNOR:WEBROOT INC.;REEL/FRAME:064176/0622 Effective date: 20220930 Owner name: CARBONITE, LLC, MASSACHUSETTS Free format text: ASSIGNMENT OF ASSIGNORS INTEREST;ASSIGNOR:WEBROOT LLC;REEL/FRAME:064167/0129 Effective date: 20221001 |
|
AS | Assignment |
Owner name: OPEN TEXT INC., CALIFORNIA Free format text: ASSIGNMENT AND ASSUMPTION AGREEMENT;ASSIGNOR:CARBONITE, LLC;REEL/FRAME:064351/0178 Effective date: 20221001 |
|
STPP | Information on status: patent application and granting procedure in general |
Free format text: RESPONSE TO NON-FINAL OFFICE ACTION ENTERED AND FORWARDED TO EXAMINER |
|
STPP | Information on status: patent application and granting procedure in general |
Free format text: FINAL REJECTION MAILED |
|
STPP | Information on status: patent application and granting procedure in general |
Free format text: DOCKETED NEW CASE - READY FOR EXAMINATION |
|
STPP | Information on status: patent application and granting procedure in general |
Free format text: NON FINAL ACTION MAILED |
|
STPP | Information on status: patent application and granting procedure in general |
Free format text: RESPONSE TO NON-FINAL OFFICE ACTION ENTERED AND FORWARDED TO EXAMINER |
|
STPP | Information on status: patent application and granting procedure in general |
Free format text: FINAL REJECTION MAILED |