WO2007044709A2 - Electronic discovery system and method - Google Patents

Electronic discovery system and method Download PDF

Info

Publication number
WO2007044709A2
WO2007044709A2 PCT/US2006/039527 US2006039527W WO2007044709A2 WO 2007044709 A2 WO2007044709 A2 WO 2007044709A2 US 2006039527 W US2006039527 W US 2006039527W WO 2007044709 A2 WO2007044709 A2 WO 2007044709A2
Authority
WO
WIPO (PCT)
Prior art keywords
investigation
file
files
unique identifier
machine
Prior art date
Application number
PCT/US2006/039527
Other languages
French (fr)
Other versions
WO2007044709A3 (en
Inventor
Shawn Mccreight
Jon Stewart
Brent Botta
Original Assignee
Guidance Software, Inc.
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
Filing date
Publication date
Application filed by Guidance Software, Inc. filed Critical Guidance Software, Inc.
Priority to EP06816612A priority Critical patent/EP1934840A4/en
Publication of WO2007044709A2 publication Critical patent/WO2007044709A2/en
Publication of WO2007044709A3 publication Critical patent/WO2007044709A3/en

Links

Classifications

    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06QINFORMATION AND COMMUNICATION TECHNOLOGY [ICT] SPECIALLY ADAPTED FOR ADMINISTRATIVE, COMMERCIAL, FINANCIAL, MANAGERIAL OR SUPERVISORY PURPOSES; SYSTEMS OR METHODS SPECIALLY ADAPTED FOR ADMINISTRATIVE, COMMERCIAL, FINANCIAL, MANAGERIAL OR SUPERVISORY PURPOSES, NOT OTHERWISE PROVIDED FOR
    • G06Q10/00Administration; Management
    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06FELECTRIC DIGITAL DATA PROCESSING
    • G06F16/00Information retrieval; Database structures therefor; File system structures therefor
    • G06F16/90Details of database functions independent of the retrieved data types
    • G06F16/95Retrieval from the web
    • G06F16/951Indexing; Web crawling techniques
    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06FELECTRIC DIGITAL DATA PROCESSING
    • G06F21/00Security arrangements for protecting computers, components thereof, programs or data against unauthorised activity
    • G06F21/50Monitoring users, programs or devices to maintain the integrity of platforms, e.g. of processors, firmware or operating systems
    • G06F21/57Certifying or maintaining trusted computer platforms, e.g. secure boots or power-downs, version controls, system software checks, secure updates or assessing vulnerabilities
    • G06F21/577Assessing vulnerabilities and evaluating computer system security
    • HELECTRICITY
    • H04ELECTRIC COMMUNICATION TECHNIQUE
    • H04LTRANSMISSION OF DIGITAL INFORMATION, e.g. TELEGRAPHIC COMMUNICATION
    • H04L63/00Network architectures or network communication protocols for network security
    • H04L63/12Applying verification of the received information

Definitions

  • This invention relates generally to computer investigation systems, and more specifically, to a system and method for conducting investigations of computer devices in a forensically defensible manner.
  • U.S. Patent No. 6,792,545 assigned to the Assignee of the present application, discloses a system and method for performing secure investigations of networked devices over a computer network.
  • This patent discloses that investigations of computer media may be conducted by obtaining raw sectors of the computer media and making full bit copies of such media.
  • the present invention is directed to a system and method for conducting investigations of one or more target machines in a data communications network via an examining machine.
  • the examining machine defines a set of investigation criteria and automatically generates a unique identifier for the set of investigation criteria.
  • the examining machine then automatically investigates the one or more target machines based on the set of investigation criteria and outputs results of the investigation.
  • the results of the investigation are preserved in an evidence data store.
  • the unique identifier is also stored in the evidence data store in association with the preserved evidence data, allowing the evidence data to be traced to the investigation subject.
  • the evidence data store is a container file storing metadata of files that are responsive to the investigation criteria and copies of the responsive files.
  • the unique identifier is randomly generated by the examining machine.
  • the investigation criteria includes one or more filter conditions for identifying one or more files based on the filter conditions.
  • the filter conditions may specify file metadata.
  • the examining computer retrieves filesystem data of the one or more target machines, compares the filesystem data with the specified file metadata, and identifies the one or more files based on the comparison.
  • the investigation criteria includes one or more keywords for identifying files in the one or more target machines containing the one or more keywords.
  • the investigation criteria identifies a specific source file for identifying a file in the one or more target machines matching the source file.
  • the examining machine identifies lengths of the source file and a file in a particular target machine that is being considered for a match and determines whether the lengths are equal. If the lengths are equal, the target machine is invoked to compute a digital signature value of the file being considered for a match.
  • the digital signature values of the source file and the file being considered for a match are retrieved, and a determination is made as to whether the digital signature values are equal. If the digital signature values are equal, the file being considered for a match is identified as a matching file.
  • a post-processing module receives an examiner report including the unique identifier for the set of investigation criteria used for the investigating.
  • the module compares the unique identifier in the examiner report with the unique identifier in the evidence data store. A verification message is then output based on the comparison.
  • the unique identifier is retrieved from the evidence data store, and the set of investigation criteria is reconstructed based on the unique identifier.
  • FIG. 1 is a block diagram of an exemplary computer investigation system according to one embodiment of the invention.
  • FIGS. 2A-2B are flow diagrams of a process executed by an eDiscovery module for conducting an investigation of one or more target machines according to one embodiment of the invention
  • FIG. 3 is a photograph of a GUI screen for viewing details of an investigation subject according to one embodiment of the invention.
  • FIGS. 4A-4D are photographs of GUI screens for setting filter conditions according to one embodiment of the invention.
  • FIGS. 5A-5D are photographs of GUI screens for setting search keywords according to one embodiment of the invention.
  • FIG. 6 is a photograph of a GUI screen for specifying source files to be searched for a match according to one embodiment of the invention
  • FIG. 7 is a photograph of a GUI screen for viewing and/or editing an investigation subject after it has been created according to one embodiment of the invention
  • FIG. 8 is GUI screen for applying an investigation subject to an investigation of one or more target machines according to one embodiment of the invention.
  • FIG. 9 is a layout diagram of metadata preserved in a logical evidence file according to one embodiment of the invention.
  • FIG. 10 is a flow diagram of a process for verifying evidence produced in response to a discovery order according to one embodiment of the invention.
  • FIG. 1 IA is a flow diagram of a process for establishing secure communication between an examining machine and a secure server according to one embodiment of the invention.
  • FIG. 1 IB is a flow diagram of a process for establishing a secure communication between a secure server and a servlet according to one embodiment of the invention.
  • IP intellectual property
  • the former employer raises the issue with the company and a large investigation is performed in order for the large company to avoid a costly lawsuit and/or criminal charges.
  • the former employer sues the employee and/or company, and the investigation is required to comply with a discovery order.
  • the investigation may require thousands of computer workstations to be searched, and any potentially responsive documents to the search to be collected and preserved in a forensically sound manner.
  • the workstations are part of a controlled network, they are distributed globally, and access control permissions allow users to save documents on storage media directly attached to the workstations (e.g. internal hard drives). There is no centralized, global index of all documents across all computers. What information exists on these workstations is largely unknown.
  • the maximum amount of storage on the workstations may average 40GB or more, and each one may contain upwards of 100,000 files.
  • the vast scale of the investigation often conflicts with the tight deadlines demanded by the competitor's counsel and the prevailing law enforcement officials.
  • embodiments of the present invention are directed to such a computer investigation system.
  • the investigation may be conducted in response to an electronic discovery order issued by a court, or as part of a routine or non-routine investigation conducted by an organization.
  • the investigation according to one embodiment of the invention entails electronically identifying, collecting, and preserving evidence that is responsive to a set of investigation criteria. In order to be legally defensible, it is desirable for the investigation criteria to be reasonable, testable, and consistently applied across identified target machines. [0031] To help achieve this, the computer investigation system and method according to the described embodiments allow a complicated set of investigation criteria to be defined in an organized and structured manner. The set of investigation criteria is then associated with an investigation subject that is identified by a subject name and a global unique identifier (GUID). As the investigation subject is applied to the various files, the responsive files are stamped with the GUID and preserved in a container file referred to as a logical evidence file (LEF).
  • GUID global unique identifier
  • FIG. 1 is a block diagram of an exemplary computer investigation system 101 according to one embodiment of the invention.
  • the computer investigation system 101 includes various network devices coupled to a data communications network 103 over data communication links 105.
  • the data communications network 103 may be a computer network, such as, for example, a public Internet, a private wide area network (WAN), a local area network (LAN), or other wired or wireless network environment conventional in the art.
  • the network devices may include a vendor computer 107, a secure server 111, an examining machine 115, one or more target machines 117, and a keymaster computer 113.
  • the data communication link 105 may be any network link conventional in the art, such as, for example, an Ethernet coupling.
  • a vendor having access to the vendor computer 107 provides the organization with a computer investigation software 109 which enables the organization to effectively perform forensic investigations, respond to network safety alerts, and conduct network audits over the data communications network 103.
  • the computer investigation software 109 may also allow other investigations of networked devices in addition to forensic investigations as evident to those of skill in the art.
  • the investigation software is installed in a local memory of the secure server 111 allocated to the organization.
  • the computer investigation software 109 provides computer program instructions which, when executed by one or more processors resident in the secure server 111, cause the secure server to broker safe communication between the examining machine 115 and the target machines 117.
  • the computer investigation software further facilitates the administration of users, logs transactions conducted via the server, and controls access rights to the system.
  • the examining machine 115 (which may also be referred to as the client) allows an authorized examiner 119 to conduct searches of the target machines 117 and their associated secondary storage devices 104.
  • the examining machine 115 includes a client software 116 which includes the functionality and interoperability for remotely accessing the secure server 111 and corresponding target machines 117.
  • the client software 116 may be implemented as computer program instructions stored in memory and executed by one or more processors resident in the examining machine 115.
  • the client software 116 includes an electronic discovery (eDiscovery) module 254 and associated graphics user interface (GUI).
  • the eDiscovery module 254 is accessed by an examiner for defining a set of investigation criteria and initiating investigation of the target machines 117.
  • Each target machine 117 is exemplarily the subject of a computer investigation conducted by the examining machine 115.
  • Each target machine 117 is coupled to one or more secondary storage devices 104 over an input/output connection 114.
  • the storage devices include any nonvolatile storage media such as, for example, hard disks, diskettes, Zip drives, redundant array of independent disks (RAID) systems, holographic storage devices, and the like.
  • a servlet 118 installed on a particular target machine 117 responds to commands provided by the examining machine 115 to remotely discover, preview, and acquire dynamic and/or static data, and transmit the acquired data to the examining machine via the secure communication path created between the target machine and the examining machine.
  • the servlet may be implemented as any software module stored in memory that is executed by one or more processors resident in the target machine 117, and is not limited to applets in a web browser environment.
  • the computer investigation system 101 illustrated in FIG. 1 further includes an examiner device which allows the examiner direct or remote access to the examining machine 115 in any manner conventional in the art.
  • the examiner device 119 may be an input and/or output device coupled to the examining machine 115, such as, for example, a keyboard and/or monitor.
  • the examiner device 119 may alternatively be a personal computer or laptop communicating with the examining device over a wired or wireless communication mechanism.
  • the examiner is a trusted individual who safely stores in the examining machine 115, one or more encryption keys used for authenticating to the secure server 111 and conducting the secure investigation of the target machines 117, as is described in more detail in the above-referenced U.S. Patent No.
  • FIGS. 2A-2B are flow diagrams of a process executed by the eDiscovery module
  • the eDiscovery module identifies one or more filter conditions for the investigation based on input provided by the examiner.
  • the filter conditions may be based on file metadata fields that are typically maintained by a computer filesystem.
  • the metadata fields may specify, for a particular file, a name, file directory structure, various timestamps (e.g. created, accessed, modified, and written timestamps), user access permissions, and the like.
  • Filtering by common filesystem metadata fields allows a quick and efficient reduction of the number of documents that need to be considered.
  • the filter conditions may be set to exclude certain directories from the search, or specify particular folders or directories to limit the collection of files to the designated folders or directories.
  • the filter conditions may also be set based on different file extensions.
  • Exemplary file extensions include a .doc, .xls, .pps, .ppt. or .mdb extension for Microsoft
  • Office files a .pst or .nsf extension for e-mail archives, and .wav, jpg, or .avi extension for respectively audio, graphic, or video file.
  • the file extensions included in the filter conditions may be set as inclusive or exclusive extensions.
  • An “inclusive extension” targets any file that has an extension in the approved set list.
  • An “exclusive extension” targets all data except for files in the extension list.
  • the exclusive method may be used to filter out standard program files with no investigative value.
  • the filter conditions may further be set to specify particular ranges of dates and/or times to search in the file timestamp data. For example, documents that have not changed since the start date of an employee in question may be excluded.
  • filter conditions may be set to indicate particular file names, active or deleted file status, security properties, and the like.
  • the various filter conditions may be combined to form complex boolean expressions via the GUI provided by the eDiscovery module 254.
  • the eDiscovery module identifies one or more keywords for the investigation based on input provided by the examiner. Such keywords help narrow the scope of the data collection by causing examination of the content of potentially responsive files for the existence of the keywords before such files are collected.
  • Various options may be set along with the keywords, such as, for example, a case sensitive search option, language option, and the like.
  • the eDiscovery module identifies one or more source files based on input provided by the examiner. This is useful when the investigation entails the search of the one or more source files, and the examiner has electronic copies of the source files to find an exact match.
  • the set of investigation criteria identified by the eDiscovery module 254 may include any combination of filter conditions, keywords, and/or source files specified in steps 202-206. Thus, not all three criteria need to be present in order to define an investigation subject. Similarly, the three types of criteria may be combined in any manner to create as complicated of a set of investigation criteria as desired for a particular investigation.
  • the eDiscovery module 254 allows the defined investigation criteria to be grouped together and organized under an investigation subject, simplifying the organization, access, and application of the investigation criteria.
  • the eDiscovery module 254 generates a name for the investigation subject. The name is selected by the examiner and may be descriptive of the subject matter of the investigation.
  • the eDiscovery module 254 generates a GUID for the investigation subject.
  • the GUID is a number randomly generated by the eDiscovery module.
  • the eDiscovery module 254 associates the name, GUID, and the identified set of investigation criteria to the investigation subject, and stores it in the memory.
  • a GUID is regenerated if any of the investigation criteria options are changed.
  • the GUID is not mathematically related to the investigation criteria. Therefore, the GUID- Subject relationship tracks identity rather than equality. No two subjects are designed to have the same GUID, even if they happen to have the same settings.
  • the GUID provides a mechanism for tracking the creation and application of the investigation subject to the set of universe of files that need to be considered.
  • the eDiscovery module 254 determines, in step 212, whether an investigation is to be conducted. In this regard, the eDiscovery module 254 determines whether a particular investigation subject has been selected and a command was provided to initiate the investigation.
  • the eDiscovery module 254 generates, in step 214, a target list of the target machines 117 that are to be scanned for the investigation.
  • the target list may be a list of network addresses, machine names, and/or address ranges provided by the examiner.
  • the eDiscovery module 256 establishes a secure communication with the secure server 111 which then brokers safe communication between the examining machine 115 and the target machines in the target list as is described in further detail below with respect to FIGS. 1 IA-I IB.
  • the secure server 111 allows the examining machine 115 to be concurrently connected with a plurality of the identified target machines.
  • the eDiscovery module 256 verifies, upon connection, that the machine name that was used matches the actual name that is stored in the target machine. If the machine name values do not match, the target machine is not scanned as it is not in the target list.
  • the examining machine 115 maintains a database with information on the target machines that have been scanned and information related to the devices associated with each target machine. After each device of a target machine is scanned, it is marked as complete in the database. Once all the devices in the target machine have been scanned, then the machine itself is marked as complete in the database.
  • step 218 the eDiscovery module 256 applies the investigation subject to the files in the verified target machines. If the application of the investigation subject results in a file being identified as being responsive, it copied back to the examining machine.
  • Performing a collection of a single target computer may take several hours, due to the target computer's own processing characteristics (e.g. processor speed, amount of random access memory), the bandwidth and latency characteristics of the network connection 105, and the amount of data that is identified as responsive.
  • processing characteristics e.g. processor speed, amount of random access memory
  • the bandwidth and latency characteristics of the network connection 105 e.g. bandwidth and latency characteristics of the network connection 105
  • the amount of data that is identified as responsive On large collections, it is desirable to process several target computers simultaneously. However, each target computer should preferably be scanned only once, and only one connection should be made to the target system at any given time. Additionally, for very large collections, the amount of work done by the eDiscovery module 254 and the client software 116, and the amount of data sent over the network 105 to the examiner computer 115 may exceed the computational resources of a single examiner computer 115.
  • the eDiscovery module 254 in order to scale to large collections, is configured to be run in parallel, on several examiner machines 115, against the set of target machines 117.
  • a simple database such as the one mentioned above for keeping status information, is used to maintain information about the target machines.
  • Each parallel instance is configured to connect to every target machine. When the connection is made, the instance queries the simple database for any information related to the target machine.
  • One of the fields associated with the target machine information denotes whether the target is currently being processed by an eDiscovery module instance.
  • the eDiscovey module takes advantage of this limited atomic property of the database to synchronize access to target machines.
  • the eDiscovery module instance 254 connects to the machine and determines that the target 115 is not currently being scanned, it generates a temporary globally unique identifier (GUID) and then creates a new record in the database containing the GUID and examining machine name. This record is automatically assigned an auto-increment value by the database.
  • the eDiscovery module 254 then issues a new query to the database to retrieve any records related to that particular machine name.
  • GUID globally unique identifier
  • any other instances of the eDiscovery module are trying to gain access to the target at the same time, they will have performed the same operation, and therefore multiple records will have been created by the eDiscovery instances.
  • the instance then examines each record returned. According to one embodiment, only if its own GUID matches the record with the lowest auto-increment value does the eDiscovery module instance go forward with processing the target machine (and in so doing, updating the machine information in the database related to marking the machine as being in progress). If an eDiscovery module instance determines that the record with its GUID was not the first record related to the target machine, then it does try to process that particular target machine and continues its attempt to process other target machines. In this manner, the eDiscovery Suite module is able to synchronize access to target machines while utilizing almost any database system, even ones with degraded transactional abilities.
  • the multiple instances of the eDiscovery module 254 may be scanning the set of target machines 115
  • consideration is be given to the distribution of the target machines on the network 105 in order to maximize the throughput of the entire collection.
  • Some target machines may exist on a segment of the network 105 with restricted bandwidth such that all targets on the segment may not be able to be scanned simultaneously without creating a network bottleneck. In practice this is quite common with collections at large organizations which have many network segments distributed geographically across the world.
  • the set of target machines is often provided to the eDiscovery module in non-random order. It is very common for the list to be ordered by organizational department, which has a high degree of correlation to geographic and network distribution, or explicitly by geographic location. If multiple instances of the eDiscovery module are used, then each would proceed through the list in order and many machines on the same network segment would be processed simultaneously, thus straining network traffic on particular segments. As network traffic becomes constrained, the total collection is delayed. [0068] To avoid this situation, information' concerning the network distribution of the target machines could be provided as input to the eDiscovery module and that information could be taken into consideration as the eDiscovery module instances connected to the targets.
  • FIG. 2B is a more detailed flow diagram of step 220 of FIG. 2A of applying the investigation subject according to one embodiment of the invention.
  • the eDiscovery module 256 determines whether the selected investigation subject includes one or more filter conditions. If the answer is YES, the eDiscovery module 256 identifies, in step 222, the files that satisfy the filter conditions. In this regard, the eDiscovery module obtains from each of the identified target machines a copy of their full filesystem metadata and compares the filter conditions against the obtained metadata.
  • the eDiscovery module 256 determines whether the selected investigation subject includes any search keywords. If the answer is NO, the metadata of the files identified in step 222 is collected and preserved in an evidence file associated with the investigation subject along with the investigation GUID. If the content of the identified files is to also be collected, acquisition commands are transmitted to the corresponding servlets for acquiring the indicated files. Each acquired file is stamped with the GUID by associating the GUID to the acquired file. The files may be acquired, for example, in the manner described in the above-referenced U.S. Patent Application Ser. No. 11/315,761. The acquired files are then stored in the evidence file associated with the investigation subject.
  • the eDiscovery module 254 transmits commands to one or more servlets associated with the files identified in step 222, for collecting and preserving from the identified files, the files and/or their metadata that also contain the set of keywords.
  • a multiple number of the identified servlets may be concurrently invoked for performing searches of the identified files for the specified keyword set.
  • the servlet conducts a non- indexed search of the specified files.
  • the search makes use of optimized algorithms for determining responsiveness to the keywords during a single pass through the files.
  • information about search hits is retained and returned to the eDiscovery module 254.
  • the servlet returns for each hit, a file offset, in bytes, that denotes the start of the hit, the length of the hit in bytes, and the keyword that generated the hit. The returned data may then be used to acquire a copy of the corresponding file.
  • the eDiscovery module 254 collects the metadata of the files that have resulted in a hit in the evidence file associated with the investigation subject along with the investigation GUID. The examining machine then transmits a command to acquire the content of such files such content is to be preserved. The acquired files are stamped with the GUID and stored in the evidence file associated with the investigation subject. [0076] In step 228, the eDiscovery module 254 determines whether the investigation subject includes a source file to be searched for a match. If the answer is YES, the eDiscovery module proceeds to collect and preserve, in step 230, the files matching the source file and/or their metadata.
  • the eDiscovery module 254 compares the size of the source file against the size of the files to be investigated.
  • the file size information is maintained by the filesystem obtained from the target devices. If the size of a given file does not match the file size of the source file, then a determination is made that the files are different.
  • the digital signatures such as, for example, hash values
  • the eDiscovery module 254 commands the servlet associated with the candidate file to compute a hash value of the candidate file. If a hash value of the source file has not yet been computed, the eDiscovery module proceeds to do this as well.
  • the eDisclovery module 256 compares the returned hash value against the hash value of the source file. If there is a match, the files are the same, and the metadata of the matching file is collected and preserved in the evidence file associated with the investigation subject along with the investigation GUID. If the matching file is also to be acquired, the eDiscovery module 254 transmits a command to the servlet to acquire the matching file. The acquired file is then stamped with the GUID and stored in the evidence file associated with the investigation subject.
  • the eDiscovery module 254 takes a timestamp of the date and time in which the metadata and/or files were acquired, and stores this timestamp information with the corresponding metadata.
  • the eDiscovery module 254 also stores information of the investigation subject applied to the investigation in the evidence file. In this manner, the files that have been preserved during the investigation may be easily and reliably associated with the investigation subject. This provides complete tracking of why each file was collected.
  • the eDiscovery module may optionally create a log file for each target machine listing the full path (directory structure and file name) of every file considered, whether the file was identified or rejected based on the investigation subject. Information about the date and time of when the determination was made is also included into the log file. In this way, the eDiscovery module 256 may provide a detailed accounting of all files that were or were not responsive, aiding the defensibility of a particular investigation.
  • the eDiscovery module 256 further allows for on-the-ffy expansion of compressed container files (e.g. zip archives) in the midst of a current scan of the target machine. In this manner, the investigation subject may be applied against the individual files within the container file. Otherwise, the entire container would have to be collected, even if it contained irrelevant data, in order to avoid risking defensibility of the investigation.
  • the eDiscovery module provides an option as to whether to preserve entire container file if it contains a responsive file, or only the particular responsive files within the container file.
  • the eDiscovery module 256 constantly monitors its connection to the target machines.
  • connection If the connection is disrupted, then the incomplete results are destroyed and the target machine is kept marked as incomplete.
  • target machine is re-scanned from scratch. In this way, the collection for each crizt machine is performed in one continuous period of time, eliminating any questions about the defensibility of the individual collection having occurred over several discontinuous time ranges.
  • the GUID may be retrieved from the evidence file and used to reconstruct the investigation subject including the set of investigation criteria.
  • the eDiscovery module 254 allows for searches to be done based on subject name and/or GUID.
  • the reconstructed investigation criteria may then be displayed on a display.
  • GUID associated with each file that is collected may be automatically compared to the GUID in an examiner report to verify that the investigation criteria was consistently applied to the collected files, and to verify that indeed the correct files have been produced in court or to an opposing party.
  • FIG. 3 is a photograph of a GUI screen for viewing details of an investigation subject according to one embodiment of the invention.
  • the investigation subject includes a name 300 and a GUID 302.
  • the GUID 302 is automatically generated upon an indication that a new investigation subject is to be generated, or upon edits of the investigation criteria of a previously generated investigation subject.
  • the examiner starts by selecting a condition tab 304 followed by a keyword search tab 306 and a matching files tab 308.
  • the investigation criteria may also be defined in any other order as will be appreciated by a person of skill in the art.
  • FIGS. 4A-4D are photographs of the GUI screens for setting filter conditions upon selection of the conditions tab 304 according to one embodiment of the invention.
  • a name 320 is selected for the filter conditions that are to be defined as is illustrated in FIG. 4A.
  • the examiner then sets various filter properties and the values of those properties as is illustrated in FIGS. 4B-4C.
  • FIG. 4B a file extension property is set for filtering based on the identified file extensions.
  • a full path property is set for filtering based on the identified logical path of the files.
  • FIG. 4D illustrates the combining of various filter conditions via a boolean expression such as "and" or "or” to create a set of complex filter conditions.
  • the illustrated filter conditions specify the following criteria:
  • FIGS. 5A-5D are photographs of the GUI screens for setting search keywords according to one embodiment of the invention.
  • a keyword list is initially generated and saved in a keyword file as illustrated in FIGS. 5A-5B.
  • the keywords that are generated are saved in an "eDiscovery" file.
  • the keyword file may then be used for different investigation subjects.
  • FIGS. 5C-5D illustrate the use of a generated keyword list for specifying a keyword search for a particular investigation subject.
  • a name 322 assigned to the filter conditions is highlighted as is illustrated in FIG. 5C, and the keyword search tab 306 selected for selecting the appropriate keyword file.
  • FIG. 5D illustrates the selection of the "eDiscovery" keyword file, and the selection of particular keywords within the file for the current investigation subject.
  • the matching files tab 308 is selected and an add files command 324 given as is illustrated in FIG. 6, to browse to a folder containing a list of possible source files.
  • the files within the selected folders are then displayed in a matching files window 323.
  • the specific files to be searched for a match are then selected, and an OK button 326 selected to save the selected source files for the particular investigation subject.
  • FIG. 7 is a photograph of a GUI screen for viewing and/or editing an investigation subject after it has been created according to one embodiment of the invention.
  • the subject name assigned to the investigation subject appears in a name field 600, and the GUID assigned to the investigation subject appears in a GUID field 602.
  • Area 604 identifies the investigation subject as being locked or unlocked.
  • a locked state indicates that the investigation subject may not be modified.
  • the subject is locked the first time the investigation subject is selected and a command given to run an investigation based on the subject.
  • the subject is locked even before a connection is made to a first target computer.
  • An unlocked state indicates that the investigation subject may be modified. According to one embodiment of the invention, if any of the investigation criteria options are modified, a new GUID is generated and associated with the investigation subject.
  • a criteria area 608 identifies one or more investigation criteria sets defined for the investigation subject. According to one embodiment of the invention, more than one investigation criteria set may be defined for a particular investigation subject.
  • a criteria name identified in a criteria name field 606 identifies each investigation criteria set. In the illustrated embodiment, a single criteria set is defined. Selection of a particular investigation criteria set in the criteria name field 606 causes display of various filter conditions 610 and keyword options 612 defined for the criteria set. In this manner the investigation criteria defined for a particular investigation subject may be reconstructed and/or verified in an reliable and easy manner.
  • FIG. 8 is GUI screen for applying an investigation subject to an investigation of one or more target machines according to one embodiment of the invention.
  • an investigation subject with the name "ACME Corporation” is selected, and a command to initiate the investigation is provided via selection of a next button 350. Additional GUI screens (not shown) are then displayed for allowing the examiner to select the target machines to be investigated, and to connect to the secure server 111.
  • the results of the investigation are preserved in an evidence file referred to as a logical evidence file (LEF).
  • the LEF is a container file for storing the metadata of the collected files as well as the content of those files.
  • the LEF presents the metadata of the preserved files in such a manner as to allow for the recreation of how the files existed on the original device.
  • the LEF includes various files, including a metadata file for each preserved file, and files containing the content of each the preserved file.
  • the metadata file for each preserved file is kept hidden and not presented as a contained file.
  • the metadata is extracted by the client software 116 and presented as fields associated with the contained file to recreate the file in its original context on the target device.
  • the LEF may also take the form of a folder or other evidence data store as will be appreciated by a person of skill in the art.
  • FIG. 9 is a layout diagram of metadata preserved in an LEF according to one embodiment of the invention.
  • the metadata includes, for each file that is responsive to the investigation subject, a file name 500, file type 502, and MAC (modified, accessed, created) dates 504.
  • the file's hash value 506, full directly structure 508, and ownership and access permission information 510 are also preserved in the LEF.
  • the LEF further stores the investigation subject including a subject name 516, GUID 512, and the date and time 514 in which the metadata information was acquired. In this manner, the file corresponding to the preserved metadata becomes associated with the investigation subject stored in the LEF.
  • FIG. 10 is a flow diagram of a process for verifying evidence produced in response to a discovery order according to one embodiment of the invention.
  • the process is described as a software process implemented by a post-processing module in a third party computer, a person of skill in the art should recognize that the process may also be implemented by the eDiscovery module 254 or another module in the examining machine
  • step 400 the post-processing module receives an examiner report with information on the investigation that was conducted responsive to the discovery order.
  • the examiner report includes the GUID of the investigation subject that was applied for the investigation.
  • the post-processing module also receives the LEF containing the metadata and the files that have been preserved as a result of the investigation.
  • step 402 the post-processing module retrieves the GUID from the examiner report.
  • step 404 the post-processing module retrieves the GUID from the LEF.
  • step 406 the post-processing module compares the GUID in the examiner report to the GUID in the LEF.
  • step 408 the post-processing module determines whether the GUIDs match. If the answer is YES, the post-processing module outputs a verification message in step 410. If the answer is NO, the post-processing module outputs, in step 412, a message that the investigation could not be verified.
  • the post-processing module may also recompute the hash value of the stored file contents to compare them against the stored hash value at the time of collection, in order to detect corruption in the LEF.
  • FIG. 1 IA is a flow diagram of a process for establishing secure communication between the examining machine 115 and the secure server 111 according to one embodiment of the invention.
  • the client software 116 or reconnect module 254 (hereinafter collectively referred to as the client software) resident in the examining machine 115 is invoked for establishing the secure communication.
  • the client software in step 900, generates an examiner's random number "Erand" and includes it into a packet along with the examiner's user name.
  • the client software signs the packet with a user authentication private key as is understood by those of skill in the art.
  • step 904 the client software encrypts the signed packet with the secure server's public key according to conventional mechanisms, and transmits the encrypted, signed packet to the secure server 111 in step 906.
  • the secure server 111 receives the packet and invokes its computer investigation software 109 to decrypt the packet using the server's private key.
  • the software 109 retrieves the examiner's user name from the packet and searches the server's database for a match.
  • the matched name in the server's database includes a public user authentication key which is used in step 912 to verify the user's signature on the packet according to conventional mechanisms. If the signature is not verified, as determined in step 914, the client software cannot be authenticated and a connection between the client software and the secure server is denied in step 916.
  • the client software may be authenticated, and the computer investigation software 109 stores the examiner's random number in step 918.
  • the processor generates its own server random number "Srand” and a server-to- examiner session encryption key "SEkey” to be used to encrypt future communications between the server and the examiner.
  • SEkey server-to- examiner session encryption key
  • These values, as well as the original examiner's random number are signed with the server's private key in step 922, encrypted with the user's public key in step 924, and transmitted to the client software in step 926.
  • the client software 116 receives the packet from the secure server and decrypts it using the user's private key.
  • step 930 the client software verifies the server's signature with the server's public key according to conventional mechanisms.
  • step 932 a determination is made as to whether the signature may be verified. If the answer is YES, the server is authenticated, and the client software verifies the examiner's random number that is transmitted by the server to confirm that it is, in fact, the same number that was sent to the server. If the number may be confirmed, as is determined in step 934, the examiner creates another packet to send back to the server 111. This packet includes the server random number which is encrypted, in step 936, with the server-to-examiner session key. The encrypted packet is then transmitted to the server.
  • step 938 the server's computer investigation software 109 decrypts the packet containing the server random number with the server-to-examiner session key. If the received server random number is the same number originally generated and sent to the client software as is determined in step 940, the number is confirmed, and a secure connection is established in step 942.
  • the process for establishing a secure connection between the client software and the secure server 111 is described in more detail in the above-referenced U.S. Patent No. 6,792,545.
  • FIG. 1 IB is a flow diagram of a process for establishing a secure communication between the secure server 111 and the servlet 118 according to one embodiment of the invention. A number of such secure communications may be established concurrently. The communication is established in response to request by the examining machine to investigate the target machine running the servlet.
  • step 1000 the server's computer investigation software 109 generates a second server random number "Srand2," and signs the packet with the server's private key in step 1002.
  • step 1004 the software 109 transmits the signed packet to the servlet.
  • the servlet receives the packet signed with the second server random number, and in step 1006, verifies the signature with the server's public key. If the signature cannot be verified, as is determined in step 1008, a safe connection between the secure server 111 and the servlet 118 is denied in step 1010.
  • the servlet If, however, the server's signature is verified, the servlet generates a servlet-to- server session encryption key in step 1012 and inserts it into a packet in step 1014 along with the second server random number. The servlet encrypts the packet in step 1016 with the server's public key, and transmits the packet to the server 111.
  • step 1018 the server's computer investigation software 109 receives the encrypted packet and decrypts it with the server's private key.
  • the processor further confirms in step 1020, whether the second server random number is the same number that was originally sent to the servlet. If the answer is YES, the processor generates a server-to-servlet session encryption key in step 1022, and encrypts the server-to-servlet session encryption key with the servlet-to-server session encryption key in step 1024.
  • step 1026 the encrypted packet is transmitted to the servlet.
  • step 1028 the servlet decrypts the packet with the servlet-to-server session key, and stores the server-to-servlet session key in step 1030.
  • step 1031 a secure connection is established, and all subsequent data exchanges between the server and the servlet are encrypted using the server-to-servlet session key.
  • the establishment of a secure connection between the secure server 111 and the servlet 118 is described in more detail in the above- referenced U.S. Patent No. 6,792,545.
  • the server 111 may communicate directly in effectuating searches of dynamic and/or static data stored in the target devices.

Landscapes

  • Engineering & Computer Science (AREA)
  • Theoretical Computer Science (AREA)
  • General Engineering & Computer Science (AREA)
  • Computer Security & Cryptography (AREA)
  • Physics & Mathematics (AREA)
  • Computer Hardware Design (AREA)
  • General Physics & Mathematics (AREA)
  • Databases & Information Systems (AREA)
  • Software Systems (AREA)
  • Business, Economics & Management (AREA)
  • Strategic Management (AREA)
  • General Business, Economics & Management (AREA)
  • Tourism & Hospitality (AREA)
  • Entrepreneurship & Innovation (AREA)
  • Human Resources & Organizations (AREA)
  • Quality & Reliability (AREA)
  • Computing Systems (AREA)
  • Economics (AREA)
  • Operations Research (AREA)
  • Marketing (AREA)
  • Data Mining & Analysis (AREA)
  • Information Retrieval, Db Structures And Fs Structures Therefor (AREA)
  • Automatic Analysis And Handling Materials Therefor (AREA)

Abstract

A computer investigation system and method that conducts electronic discovery of desired files across a live network in a forensically sound manner. The investigation entails an examining machine electronically identifying, collecting, and preserving evidence from target machines that is responsive to a set of investigation criteria. The set of investigation criteria is associated with an investigation subject that is identified by a global unique identifier (GUID). As the investigation subject is applied to the various files, the responsive files are stamped with the GUID and preserved in a container file referred to as a logical evidence file (LEF). The GUID allows the results of an investigation to be easily and reliably traced to the particular investigation subject that was applied.

Description

ELECTRONIC DISCOVERY SYSTEM AND METHOD
FIELD OF THE INVENTION
[0001] This invention relates generally to computer investigation systems, and more specifically, to a system and method for conducting investigations of computer devices in a forensically defensible manner.
BACKGROUND OF THE INVENTION
[0002] U.S. Patent No. 6,792,545, assigned to the Assignee of the present application, discloses a system and method for performing secure investigations of networked devices over a computer network. This patent discloses that investigations of computer media may be conducted by obtaining raw sectors of the computer media and making full bit copies of such media. However, it may be desirable to obtain and preserve only particular files from the computer media, and to do so in a forensically sound manner.
SUMMARY OF THE INVENTION
[0003] The present invention is directed to a system and method for conducting investigations of one or more target machines in a data communications network via an examining machine. The examining machine defines a set of investigation criteria and automatically generates a unique identifier for the set of investigation criteria. The examining machine then automatically investigates the one or more target machines based on the set of investigation criteria and outputs results of the investigation. The results of the investigation are preserved in an evidence data store. The unique identifier is also stored in the evidence data store in association with the preserved evidence data, allowing the evidence data to be traced to the investigation subject.
[0004] According to one embodiment of the invention, the evidence data store is a container file storing metadata of files that are responsive to the investigation criteria and copies of the responsive files.
[0005] According to one embodiment of the invention, the unique identifier is randomly generated by the examining machine.
[0006] According to one embodiment of the invention, the investigation criteria includes one or more filter conditions for identifying one or more files based on the filter conditions.
The filter conditions may specify file metadata. The examining computer retrieves filesystem data of the one or more target machines, compares the filesystem data with the specified file metadata, and identifies the one or more files based on the comparison.
[0007] According to one embodiment of the invention, the investigation criteria includes one or more keywords for identifying files in the one or more target machines containing the one or more keywords. [0008] According to one embodiment of the invention, the investigation criteria identifies a specific source file for identifying a file in the one or more target machines matching the source file. In order to find a machine file, the examining machine identifies lengths of the source file and a file in a particular target machine that is being considered for a match and determines whether the lengths are equal. If the lengths are equal, the target machine is invoked to compute a digital signature value of the file being considered for a match. The digital signature values of the source file and the file being considered for a match are retrieved, and a determination is made as to whether the digital signature values are equal. If the digital signature values are equal, the file being considered for a match is identified as a matching file.
[0009] According to one embodiment of the invention, a post-processing module receives an examiner report including the unique identifier for the set of investigation criteria used for the investigating. The module compares the unique identifier in the examiner report with the unique identifier in the evidence data store. A verification message is then output based on the comparison.
[0010] According to one embodiment of the invention, the unique identifier is retrieved from the evidence data store, and the set of investigation criteria is reconstructed based on the unique identifier.
[0011] These and other features, aspects and advantages of the present invention will be more fully understood when considered with respect to the following detailed description, appended claims, and accompanying drawings. Of course, the actual scope of the invention is defined by the appended claims.
BRIEF DESCRIPTION OF THE DRAWINGS
[0012] FIG. 1 is a block diagram of an exemplary computer investigation system according to one embodiment of the invention;
[0013] FIGS. 2A-2B are flow diagrams of a process executed by an eDiscovery module for conducting an investigation of one or more target machines according to one embodiment of the invention;
[0014] FIG. 3 is a photograph of a GUI screen for viewing details of an investigation subject according to one embodiment of the invention;
[0015] FIGS. 4A-4D are photographs of GUI screens for setting filter conditions according to one embodiment of the invention;
[0016] FIGS. 5A-5D are photographs of GUI screens for setting search keywords according to one embodiment of the invention;
[0017] FIG. 6 is a photograph of a GUI screen for specifying source files to be searched for a match according to one embodiment of the invention; [0018] FIG. 7 is a photograph of a GUI screen for viewing and/or editing an investigation subject after it has been created according to one embodiment of the invention;
[0019] FIG. 8 is GUI screen for applying an investigation subject to an investigation of one or more target machines according to one embodiment of the invention;
[0020] FIG. 9 is a layout diagram of metadata preserved in a logical evidence file according to one embodiment of the invention;
[0021] FIG. 10 is a flow diagram of a process for verifying evidence produced in response to a discovery order according to one embodiment of the invention;
[0022] FIG. 1 IA is a flow diagram of a process for establishing secure communication between an examining machine and a secure server according to one embodiment of the invention; and
[0023] FIG. 1 IB is a flow diagram of a process for establishing a secure communication between a secure server and a servlet according to one embodiment of the invention.
DETAILED DESCRIPTION
[0024] The desirability of obtaining and preserving only particular files from a computer media, and doing so in a forensically sound manner should be evident when considering the following scenario.
[0025] Assume that an employee of a large company has illicitly contaminated his assigned computer workstation with documents containing intellectual property (IP) from his former employer. It is unknown whether the employee distributed the IP to any other employees, and, if so, to what extent.
[0026] The former employer raises the issue with the company and a large investigation is performed in order for the large company to avoid a costly lawsuit and/or criminal charges. Alternatively, the former employer sues the employee and/or company, and the investigation is required to comply with a discovery order. In either scenario, the investigation may require thousands of computer workstations to be searched, and any potentially responsive documents to the search to be collected and preserved in a forensically sound manner. [0027] Although the workstations are part of a controlled network, they are distributed globally, and access control permissions allow users to save documents on storage media directly attached to the workstations (e.g. internal hard drives). There is no centralized, global index of all documents across all computers. What information exists on these workstations is largely unknown. The maximum amount of storage on the workstations may average 40GB or more, and each one may contain upwards of 100,000 files. The vast scale of the investigation often conflicts with the tight deadlines demanded by the competitor's counsel and the prevailing law enforcement officials. [0028] Accordingly, it is desirable for a computer investigation system and method that conducts electronic discovery of desired files across a live network, in an efficient and legally defensible manner.
[0029] In general terms, embodiments of the present invention are directed to such a computer investigation system. The investigation may be conducted in response to an electronic discovery order issued by a court, or as part of a routine or non-routine investigation conducted by an organization.
[0030] The investigation according to one embodiment of the invention entails electronically identifying, collecting, and preserving evidence that is responsive to a set of investigation criteria. In order to be legally defensible, it is desirable for the investigation criteria to be reasonable, testable, and consistently applied across identified target machines. [0031] To help achieve this, the computer investigation system and method according to the described embodiments allow a complicated set of investigation criteria to be defined in an organized and structured manner. The set of investigation criteria is then associated with an investigation subject that is identified by a subject name and a global unique identifier (GUID). As the investigation subject is applied to the various files, the responsive files are stamped with the GUID and preserved in a container file referred to as a logical evidence file (LEF). Alternatively, the container file is stamped with the GUID, allowing an automatic association between the GUID and each file preserved in the LEF. In this manner, the responsive files may be permanently associated with the GUID, allowing the results of the investigation to be easily and reliably traced to the particular investigation subject that was applied. The tight coupling of the investigation subject to the investigation criteria and to the evidence files help ensure a legally defensible and forensically sound investigation. [0032] FIG. 1 is a block diagram of an exemplary computer investigation system 101 according to one embodiment of the invention. The computer investigation system 101 includes various network devices coupled to a data communications network 103 over data communication links 105. The data communications network 103 may be a computer network, such as, for example, a public Internet, a private wide area network (WAN), a local area network (LAN), or other wired or wireless network environment conventional in the art. The network devices may include a vendor computer 107, a secure server 111, an examining machine 115, one or more target machines 117, and a keymaster computer 113. The data communication link 105 may be any network link conventional in the art, such as, for example, an Ethernet coupling.
[0033] A vendor having access to the vendor computer 107 provides the organization with a computer investigation software 109 which enables the organization to effectively perform forensic investigations, respond to network safety alerts, and conduct network audits over the data communications network 103. The computer investigation software 109 may also allow other investigations of networked devices in addition to forensic investigations as evident to those of skill in the art.
[0034] The investigation software is installed in a local memory of the secure server 111 allocated to the organization. According to one embodiment of the invention, the computer investigation software 109 provides computer program instructions which, when executed by one or more processors resident in the secure server 111, cause the secure server to broker safe communication between the examining machine 115 and the target machines 117. The computer investigation software further facilitates the administration of users, logs transactions conducted via the server, and controls access rights to the system. [0035] The examining machine 115 (which may also be referred to as the client) allows an authorized examiner 119 to conduct searches of the target machines 117 and their associated secondary storage devices 104. In this regard, the examining machine 115 includes a client software 116 which includes the functionality and interoperability for remotely accessing the secure server 111 and corresponding target machines 117. The client software 116 may be implemented as computer program instructions stored in memory and executed by one or more processors resident in the examining machine 115. [0036] According to one embodiment of the invention, the client software 116 includes an electronic discovery (eDiscovery) module 254 and associated graphics user interface (GUI). The eDiscovery module 254 is accessed by an examiner for defining a set of investigation criteria and initiating investigation of the target machines 117. [0037] Each target machine 117 is exemplarily the subject of a computer investigation conducted by the examining machine 115. Each target machine 117 is coupled to one or more secondary storage devices 104 over an input/output connection 114. The storage devices include any nonvolatile storage media such as, for example, hard disks, diskettes, Zip drives, redundant array of independent disks (RAID) systems, holographic storage devices, and the like.
[0038] According to one embodiment, a servlet 118 installed on a particular target machine 117 responds to commands provided by the examining machine 115 to remotely discover, preview, and acquire dynamic and/or static data, and transmit the acquired data to the examining machine via the secure communication path created between the target machine and the examining machine. The servlet may be implemented as any software module stored in memory that is executed by one or more processors resident in the target machine 117, and is not limited to applets in a web browser environment. [0039] The computer investigation system 101 illustrated in FIG. 1 further includes an examiner device which allows the examiner direct or remote access to the examining machine 115 in any manner conventional in the art. The examiner device 119 may be an input and/or output device coupled to the examining machine 115, such as, for example, a keyboard and/or monitor. The examiner device 119 may alternatively be a personal computer or laptop communicating with the examining device over a wired or wireless communication mechanism.
[0040] According to one embodiment of the invention, the examiner is a trusted individual who safely stores in the examining machine 115, one or more encryption keys used for authenticating to the secure server 111 and conducting the secure investigation of the target machines 117, as is described in more detail in the above-referenced U.S. Patent No.
6,792,545.
[0041] FIGS. 2A-2B are flow diagrams of a process executed by the eDiscovery module
254 for conducting an investigation of one or more target machines according to one embodiment of the invention. The process may be executed in the order indicated in the flow diagram, or in any other order appreciated by a person of skill in the art.
[0042] In step 202, the eDiscovery module identifies one or more filter conditions for the investigation based on input provided by the examiner. The filter conditions may be based on file metadata fields that are typically maintained by a computer filesystem. For example, the metadata fields may specify, for a particular file, a name, file directory structure, various timestamps (e.g. created, accessed, modified, and written timestamps), user access permissions, and the like.
[0043] Filtering by common filesystem metadata fields allows a quick and efficient reduction of the number of documents that need to be considered. For example, the filter conditions may be set to exclude certain directories from the search, or specify particular folders or directories to limit the collection of files to the designated folders or directories.
[0044] The filter conditions may also be set based on different file extensions.
Exemplary file extensions include a .doc, .xls, .pps, .ppt. or .mdb extension for Microsoft
Office files, a .pst or .nsf extension for e-mail archives, and .wav, jpg, or .avi extension for respectively audio, graphic, or video file.
[0045] The file extensions included in the filter conditions may be set as inclusive or exclusive extensions. An "inclusive extension" targets any file that has an extension in the approved set list. An "exclusive extension" targets all data except for files in the extension list. The exclusive method may be used to filter out standard program files with no investigative value.
[0046] The filter conditions may further be set to specify particular ranges of dates and/or times to search in the file timestamp data. For example, documents that have not changed since the start date of an employee in question may be excluded.
[0047] Other filter conditions may be set to indicate particular file names, active or deleted file status, security properties, and the like. According to one embodiment of the invention, the various filter conditions may be combined to form complex boolean expressions via the GUI provided by the eDiscovery module 254. [0048] In step 204, the eDiscovery module identifies one or more keywords for the investigation based on input provided by the examiner. Such keywords help narrow the scope of the data collection by causing examination of the content of potentially responsive files for the existence of the keywords before such files are collected. Various options may be set along with the keywords, such as, for example, a case sensitive search option, language option, and the like.
[0049] In step 206, the eDiscovery module identifies one or more source files based on input provided by the examiner. This is useful when the investigation entails the search of the one or more source files, and the examiner has electronic copies of the source files to find an exact match.
[0050] A person of skill in the art should recognize that the set of investigation criteria identified by the eDiscovery module 254 may include any combination of filter conditions, keywords, and/or source files specified in steps 202-206. Thus, not all three criteria need to be present in order to define an investigation subject. Similarly, the three types of criteria may be combined in any manner to create as complicated of a set of investigation criteria as desired for a particular investigation.
[0051] According to one embodiment of the invention, the eDiscovery module 254 allows the defined investigation criteria to be grouped together and organized under an investigation subject, simplifying the organization, access, and application of the investigation criteria. In this regard, in step 208, the eDiscovery module 254 generates a name for the investigation subject. The name is selected by the examiner and may be descriptive of the subject matter of the investigation.
[0052] In step 210, the eDiscovery module 254 generates a GUID for the investigation subject. According to one embodiment of the invention, the GUID is a number randomly generated by the eDiscovery module.
[0053] In step 211, the eDiscovery module 254 associates the name, GUID, and the identified set of investigation criteria to the investigation subject, and stores it in the memory. [0054] According to one embodiment of the invention, a GUID is regenerated if any of the investigation criteria options are changed. Thus, according to one embodiment of the invention, the GUID is not mathematically related to the investigation criteria. Therefore, the GUID- Subject relationship tracks identity rather than equality. No two subjects are designed to have the same GUID, even if they happen to have the same settings. Thus the GUID provides a mechanism for tracking the creation and application of the investigation subject to the set of universe of files that need to be considered.
[0055] Once the investigation subject has been defined, it may be selected to conduct an investigation. In this regard, the eDiscovery module 254 determines, in step 212, whether an investigation is to be conducted. In this regard, the eDiscovery module 254 determines whether a particular investigation subject has been selected and a command was provided to initiate the investigation.
[0056] If the answer is YES, the eDiscovery module 254 generates, in step 214, a target list of the target machines 117 that are to be scanned for the investigation. The target list may be a list of network addresses, machine names, and/or address ranges provided by the examiner.
[0057] In step 216, the eDiscovery module 256 establishes a secure communication with the secure server 111 which then brokers safe communication between the examining machine 115 and the target machines in the target list as is described in further detail below with respect to FIGS. 1 IA-I IB. According to one embodiment of the invention, the secure server 111 allows the examining machine 115 to be concurrently connected with a plurality of the identified target machines.
[0058] According to one embodiment of the invention, if a machine name is used to connect to a particular target machine, the eDiscovery module 256 verifies, upon connection, that the machine name that was used matches the actual name that is stored in the target machine. If the machine name values do not match, the target machine is not scanned as it is not in the target list.
[0059] In addition, in order to keep track of the status of the machines that have been scanned during the investigation, the examining machine 115 maintains a database with information on the target machines that have been scanned and information related to the devices associated with each target machine. After each device of a target machine is scanned, it is marked as complete in the database. Once all the devices in the target machine have been scanned, then the machine itself is marked as complete in the database.
[0060] In step 218, the eDiscovery module 256 applies the investigation subject to the files in the verified target machines. If the application of the investigation subject results in a file being identified as being responsive, it copied back to the examining machine.
[0061] Performing a collection of a single target computer may take several hours, due to the target computer's own processing characteristics (e.g. processor speed, amount of random access memory), the bandwidth and latency characteristics of the network connection 105, and the amount of data that is identified as responsive. On large collections, it is desirable to process several target computers simultaneously. However, each target computer should preferably be scanned only once, and only one connection should be made to the target system at any given time. Additionally, for very large collections, the amount of work done by the eDiscovery module 254 and the client software 116, and the amount of data sent over the network 105 to the examiner computer 115 may exceed the computational resources of a single examiner computer 115.
[0062] According to one embodiment of the invention, in order to scale to large collections, the eDiscovery module 254 is configured to be run in parallel, on several examiner machines 115, against the set of target machines 117. To coordinate the activity of the parallel instances of the eDiscovery module 254, a simple database, such as the one mentioned above for keeping status information, is used to maintain information about the target machines. Each parallel instance is configured to connect to every target machine. When the connection is made, the instance queries the simple database for any information related to the target machine. One of the fields associated with the target machine information denotes whether the target is currently being processed by an eDiscovery module instance. [0063] If this fields signifies that the current target machine is not being processed, then the connected eDiscovery module instance proceeds to update this field in the database so that it has exclusive access. However, consider that another instance of the eDiscovery module could also be doing exactly the same thing. Without some level of concurrency control, both instances could mark the machine as being in progress and both would then begin scanning the machine. This is obviously inefficient.
[0064] While many database systems have functionality for ensuring that updates such as this occur atomically, such that an update from one eDiscovery module instance would succeed and the other fail, not all do, especially mass-market consumer databases (e.g. Microsoft Access). The eDiscovery module does not use the database heavily, so it is desirable for the module to work with as many databases as possible. Even if a particular database system does not allow for atomic transactions, almost all provide for a data field known as an "auto-increment" value. If such a field exists in the database, then each new record that is created in the database is assigned a new number by the database for this field, and the value is guaranteed, according to one embodiment, to be unique to the database. Typically the values are integer numbers which increase by one value (e.g. "1, 2, 3, 4...") with each new record. It is a property of the database that this value is updated atomically, such that no two records will ever have the same value.
[0065] Therefore the eDiscovey module takes advantage of this limited atomic property of the database to synchronize access to target machines. When the eDiscovery module instance 254 connects to the machine and determines that the target 115 is not currently being scanned, it generates a temporary globally unique identifier (GUID) and then creates a new record in the database containing the GUID and examining machine name. This record is automatically assigned an auto-increment value by the database. The eDiscovery module 254 then issues a new query to the database to retrieve any records related to that particular machine name. If any other instances of the eDiscovery module are trying to gain access to the target at the same time, they will have performed the same operation, and therefore multiple records will have been created by the eDiscovery instances. The instance then examines each record returned. According to one embodiment, only if its own GUID matches the record with the lowest auto-increment value does the eDiscovery module instance go forward with processing the target machine (and in so doing, updating the machine information in the database related to marking the machine as being in progress). If an eDiscovery module instance determines that the record with its GUID was not the first record related to the target machine, then it does try to process that particular target machine and continues its attempt to process other target machines. In this manner, the eDiscovery Suite module is able to synchronize access to target machines while utilizing almost any database system, even ones with degraded transactional abilities.
[0066] According to one embodiment, given that the multiple instances of the eDiscovery module 254 may be scanning the set of target machines 115, consideration is be given to the distribution of the target machines on the network 105 in order to maximize the throughput of the entire collection. Some target machines may exist on a segment of the network 105 with restricted bandwidth such that all targets on the segment may not be able to be scanned simultaneously without creating a network bottleneck. In practice this is quite common with collections at large organizations which have many network segments distributed geographically across the world.
[0067] Accordingly, the set of target machines is often provided to the eDiscovery module in non-random order. It is very common for the list to be ordered by organizational department, which has a high degree of correlation to geographic and network distribution, or explicitly by geographic location. If multiple instances of the eDiscovery module are used, then each would proceed through the list in order and many machines on the same network segment would be processed simultaneously, thus straining network traffic on particular segments. As network traffic becomes constrained, the total collection is delayed. [0068] To avoid this situation, information' concerning the network distribution of the target machines could be provided as input to the eDiscovery module and that information could be taken into consideration as the eDiscovery module instances connected to the targets. However, counsel is typically unaware of how the network is segmented and the capabilities of each segment. Additionally, the network and the distribution of the target machines on it is often highly dynamic (e.g. custodians with laptops may change their location on the network frequently). Gathering this information and keeping it updated is burdensome.
[0069] Therefore, according to one embodiment of the invention, a much simpler strategy is utilized by the eDiscovery module to avoid the aforementioned situation, where network traffic becomes a bottleneck on a particular segment. When each eDiscovery module instance is provided with the list of target machines, it randomly permutes the ordering of the machines internally to itself and then connects to the machines according to the new permutation. According to one embodiment, the permutation is not retained. Each eDiscovery module instance will likely create a unique permutation. Therefore in the aggregate, network utilization will be fairly distributed across the global network 105 avoiding bottlenecks on particular segments. Additionally, if any one network segment is degraded or becomes unavailable, scanning of target machines on other network segments is unaffected. In this way the aggregate collection proceeds in a robust manner and throughput it maximized. [0070] FIG. 2B is a more detailed flow diagram of step 220 of FIG. 2A of applying the investigation subject according to one embodiment of the invention.
[0071] In step 221, the eDiscovery module 256 determines whether the selected investigation subject includes one or more filter conditions. If the answer is YES, the eDiscovery module 256 identifies, in step 222, the files that satisfy the filter conditions. In this regard, the eDiscovery module obtains from each of the identified target machines a copy of their full filesystem metadata and compares the filter conditions against the obtained metadata.
[0072] In step 223, the eDiscovery module 256 determines whether the selected investigation subject includes any search keywords. If the answer is NO, the metadata of the files identified in step 222 is collected and preserved in an evidence file associated with the investigation subject along with the investigation GUID. If the content of the identified files is to also be collected, acquisition commands are transmitted to the corresponding servlets for acquiring the indicated files. Each acquired file is stamped with the GUID by associating the GUID to the acquired file. The files may be acquired, for example, in the manner described in the above-referenced U.S. Patent Application Ser. No. 11/315,761. The acquired files are then stored in the evidence file associated with the investigation subject. [0073] If the investigation subject does, however, specify a keyword set, the eDiscovery module 254 transmits commands to one or more servlets associated with the files identified in step 222, for collecting and preserving from the identified files, the files and/or their metadata that also contain the set of keywords. According to one embodiment of the invention, a multiple number of the identified servlets may be concurrently invoked for performing searches of the identified files for the specified keyword set.
[0074] According to one embodiment of the invention, the servlet conducts a non- indexed search of the specified files. The search makes use of optimized algorithms for determining responsiveness to the keywords during a single pass through the files. At the end of this pass, information about search hits is retained and returned to the eDiscovery module 254. According to one embodiment of the invention, the servlet returns for each hit, a file offset, in bytes, that denotes the start of the hit, the length of the hit in bytes, and the keyword that generated the hit. The returned data may then be used to acquire a copy of the corresponding file.
[0075] The eDiscovery module 254 collects the metadata of the files that have resulted in a hit in the evidence file associated with the investigation subject along with the investigation GUID. The examining machine then transmits a command to acquire the content of such files such content is to be preserved. The acquired files are stamped with the GUID and stored in the evidence file associated with the investigation subject. [0076] In step 228, the eDiscovery module 254 determines whether the investigation subject includes a source file to be searched for a match. If the answer is YES, the eDiscovery module proceeds to collect and preserve, in step 230, the files matching the source file and/or their metadata. In this regard, the eDiscovery module 254 compares the size of the source file against the size of the files to be investigated. The file size information is maintained by the filesystem obtained from the target devices. If the size of a given file does not match the file size of the source file, then a determination is made that the files are different.
[0077] Otherwise, if there is a match of the file sizes, the digital signatures, such as, for example, hash values, of the source file and the candidate file are generated for determining a match. In this regard, the eDiscovery module 254 commands the servlet associated with the candidate file to compute a hash value of the candidate file. If a hash value of the source file has not yet been computed, the eDiscovery module proceeds to do this as well. [0078] The eDisclovery module 256 compares the returned hash value against the hash value of the source file. If there is a match, the files are the same, and the metadata of the matching file is collected and preserved in the evidence file associated with the investigation subject along with the investigation GUID. If the matching file is also to be acquired, the eDiscovery module 254 transmits a command to the servlet to acquire the matching file. The acquired file is then stamped with the GUID and stored in the evidence file associated with the investigation subject.
[0079] According to one embodiment of the invention, the eDiscovery module 254 takes a timestamp of the date and time in which the metadata and/or files were acquired, and stores this timestamp information with the corresponding metadata. The eDiscovery module 254 also stores information of the investigation subject applied to the investigation in the evidence file. In this manner, the files that have been preserved during the investigation may be easily and reliably associated with the investigation subject. This provides complete tracking of why each file was collected.
[0080] According to one embodiment of the invention, the eDiscovery module may optionally create a log file for each target machine listing the full path (directory structure and file name) of every file considered, whether the file was identified or rejected based on the investigation subject. Information about the date and time of when the determination was made is also included into the log file. In this way, the eDiscovery module 256 may provide a detailed accounting of all files that were or were not responsive, aiding the defensibility of a particular investigation.
[0081] According to one embodiment of the invention, the eDiscovery module 256 further allows for on-the-ffy expansion of compressed container files (e.g. zip archives) in the midst of a current scan of the target machine. In this manner, the investigation subject may be applied against the individual files within the container file. Otherwise, the entire container would have to be collected, even if it contained irrelevant data, in order to avoid risking defensibility of the investigation. According to one embodiment of the invention, the eDiscovery module provides an option as to whether to preserve entire container file if it contains a responsive file, or only the particular responsive files within the container file. [0082] According to one embodiment of the invention, the eDiscovery module 256 constantly monitors its connection to the target machines. If the connection is disrupted, then the incomplete results are destroyed and the target machine is kept marked as incomplete. When a connection can be re-established at a later time, the target machine is re-scanned from scratch. In this way, the collection for each traget machine is performed in one continuous period of time, eliminating any questions about the defensibility of the individual collection having occurred over several discontinuous time ranges.
[0083] According to one embodiment of the invention, the GUID may be retrieved from the evidence file and used to reconstruct the investigation subject including the set of investigation criteria. In this manner, the eDiscovery module 254 allows for searches to be done based on subject name and/or GUID. The reconstructed investigation criteria may then be displayed on a display.
[0084] The GUID associated with each file that is collected may be automatically compared to the GUID in an examiner report to verify that the investigation criteria was consistently applied to the collected files, and to verify that indeed the correct files have been produced in court or to an opposing party.
[0085] FIG. 3 is a photograph of a GUI screen for viewing details of an investigation subject according to one embodiment of the invention. The investigation subject includes a name 300 and a GUID 302. According to one embodiment of the invention, the GUID 302 is automatically generated upon an indication that a new investigation subject is to be generated, or upon edits of the investigation criteria of a previously generated investigation subject.
[0086] In order to associate different investigation criteria to the investigation subject, the examiner starts by selecting a condition tab 304 followed by a keyword search tab 306 and a matching files tab 308. The investigation criteria may also be defined in any other order as will be appreciated by a person of skill in the art.
[0087] FIGS. 4A-4D are photographs of the GUI screens for setting filter conditions upon selection of the conditions tab 304 according to one embodiment of the invention. A name 320 is selected for the filter conditions that are to be defined as is illustrated in FIG. 4A. The examiner then sets various filter properties and the values of those properties as is illustrated in FIGS. 4B-4C. In FIG. 4B, a file extension property is set for filtering based on the identified file extensions. In FIG. 4C, a full path property is set for filtering based on the identified logical path of the files. [0088] FIG. 4D illustrates the combining of various filter conditions via a boolean expression such as "and" or "or" to create a set of complex filter conditions. The illustrated filter conditions specify the following criteria:
- Microsoft office files (.doc, .xls, .pps, .ppt and .mdb) OR
- Enterprise E-mail archives (.pst and .nsf) AND
- Which were created between 1/1/2005 and 12/31/2005 AND
- Which are NOT deleted, NOT system files in the root Windows directory or are NOT Lost Files (files recovered on the drive with no logical parent folder.
[0089] FIGS. 5A-5D are photographs of the GUI screens for setting search keywords according to one embodiment of the invention. According to one embodiment, a keyword list is initially generated and saved in a keyword file as illustrated in FIGS. 5A-5B. In the illustrated example, the keywords that are generated are saved in an "eDiscovery" file. The keyword file may then be used for different investigation subjects.
[0090] FIGS. 5C-5D illustrate the use of a generated keyword list for specifying a keyword search for a particular investigation subject. In this regard, after the filter conditions have been specified as illustrated in FIGS. 4A-4D, a name 322 assigned to the filter conditions is highlighted as is illustrated in FIG. 5C, and the keyword search tab 306 selected for selecting the appropriate keyword file.
[0091] FIG. 5D illustrates the selection of the "eDiscovery" keyword file, and the selection of particular keywords within the file for the current investigation subject. [0092] In order to search for a match of specific source files, the matching files tab 308 is selected and an add files command 324 given as is illustrated in FIG. 6, to browse to a folder containing a list of possible source files. The files within the selected folders are then displayed in a matching files window 323. The specific files to be searched for a match are then selected, and an OK button 326 selected to save the selected source files for the particular investigation subject.
[0093] FIG. 7 is a photograph of a GUI screen for viewing and/or editing an investigation subject after it has been created according to one embodiment of the invention. The subject name assigned to the investigation subject appears in a name field 600, and the GUID assigned to the investigation subject appears in a GUID field 602. Area 604 identifies the investigation subject as being locked or unlocked. According to one embodiment of the invention, a locked state indicates that the investigation subject may not be modified. According to one embodiment of the invention, the subject is locked the first time the investigation subject is selected and a command given to run an investigation based on the subject. According to one embodiment of the invention, the subject is locked even before a connection is made to a first target computer. Thereafter looking at the subject in the GUI will display it as locked, even if the subject is transferred to another examiner computer. An unlocked state indicates that the investigation subject may be modified. According to one embodiment of the invention, if any of the investigation criteria options are modified, a new GUID is generated and associated with the investigation subject.
[0094] A criteria area 608 identifies one or more investigation criteria sets defined for the investigation subject. According to one embodiment of the invention, more than one investigation criteria set may be defined for a particular investigation subject. [0095] A criteria name identified in a criteria name field 606 identifies each investigation criteria set. In the illustrated embodiment, a single criteria set is defined. Selection of a particular investigation criteria set in the criteria name field 606 causes display of various filter conditions 610 and keyword options 612 defined for the criteria set. In this manner the investigation criteria defined for a particular investigation subject may be reconstructed and/or verified in an reliable and easy manner.
[0096] FIG. 8 is GUI screen for applying an investigation subject to an investigation of one or more target machines according to one embodiment of the invention. In the illustrated example, an investigation subject with the name "ACME Corporation" is selected, and a command to initiate the investigation is provided via selection of a next button 350. Additional GUI screens (not shown) are then displayed for allowing the examiner to select the target machines to be investigated, and to connect to the secure server 111. [0097] The results of the investigation are preserved in an evidence file referred to as a logical evidence file (LEF). According to one embodiment of the invention, the LEF is a container file for storing the metadata of the collected files as well as the content of those files. The LEF presents the metadata of the preserved files in such a manner as to allow for the recreation of how the files existed on the original device.
[0098] Included in the LEF are various files, including a metadata file for each preserved file, and files containing the content of each the preserved file. According to one embodiment of the invention, the metadata file for each preserved file is kept hidden and not presented as a contained file. The metadata is extracted by the client software 116 and presented as fields associated with the contained file to recreate the file in its original context on the target device. The LEF may also take the form of a folder or other evidence data store as will be appreciated by a person of skill in the art.
[0099] FIG. 9 is a layout diagram of metadata preserved in an LEF according to one embodiment of the invention. The metadata includes, for each file that is responsive to the investigation subject, a file name 500, file type 502, and MAC (modified, accessed, created) dates 504. The file's hash value 506, full directly structure 508, and ownership and access permission information 510 are also preserved in the LEF. In addition, the LEF further stores the investigation subject including a subject name 516, GUID 512, and the date and time 514 in which the metadata information was acquired. In this manner, the file corresponding to the preserved metadata becomes associated with the investigation subject stored in the LEF.
[00100] FIG. 10 is a flow diagram of a process for verifying evidence produced in response to a discovery order according to one embodiment of the invention. Although the process is described as a software process implemented by a post-processing module in a third party computer, a person of skill in the art should recognize that the process may also be implemented by the eDiscovery module 254 or another module in the examining machine
115.
[00101] In step 400, the post-processing module receives an examiner report with information on the investigation that was conducted responsive to the discovery order.
According to one embodiment of the invention, the examiner report includes the GUID of the investigation subject that was applied for the investigation.
[00102] The post-processing module also receives the LEF containing the metadata and the files that have been preserved as a result of the investigation.
[00103] In step 402, the post-processing module retrieves the GUID from the examiner report.
[00104] In step 404, the post-processing module retrieves the GUID from the LEF.
[00105] In step 406, the post-processing module compares the GUID in the examiner report to the GUID in the LEF.
[00106] In step 408, the post-processing module determines whether the GUIDs match. If the answer is YES, the post-processing module outputs a verification message in step 410. If the answer is NO, the post-processing module outputs, in step 412, a message that the investigation could not be verified.
[00107] According to one embodiment of the invention, the post-processing module may also recompute the hash value of the stored file contents to compare them against the stored hash value at the time of collection, in order to detect corruption in the LEF.
[00108] FIG. 1 IA is a flow diagram of a process for establishing secure communication between the examining machine 115 and the secure server 111 according to one embodiment of the invention. The client software 116 or reconnect module 254 (hereinafter collectively referred to as the client software) resident in the examining machine 115 is invoked for establishing the secure communication. In general terms, the client software, in step 900, generates an examiner's random number "Erand" and includes it into a packet along with the examiner's user name. In step 902, the client software signs the packet with a user authentication private key as is understood by those of skill in the art. In step 904, the client software encrypts the signed packet with the secure server's public key according to conventional mechanisms, and transmits the encrypted, signed packet to the secure server 111 in step 906. [00109] In step 908, the secure server 111 receives the packet and invokes its computer investigation software 109 to decrypt the packet using the server's private key. In step 910 the software 109 retrieves the examiner's user name from the packet and searches the server's database for a match. The matched name in the server's database includes a public user authentication key which is used in step 912 to verify the user's signature on the packet according to conventional mechanisms. If the signature is not verified, as determined in step 914, the client software cannot be authenticated and a connection between the client software and the secure server is denied in step 916.
[00110] If, however, the signature is verified, the client software may be authenticated, and the computer investigation software 109 stores the examiner's random number in step 918. In step 920, the processor generates its own server random number "Srand" and a server-to- examiner session encryption key "SEkey" to be used to encrypt future communications between the server and the examiner. These values, as well as the original examiner's random number are signed with the server's private key in step 922, encrypted with the user's public key in step 924, and transmitted to the client software in step 926. [00111] In step 928, the client software 116 receives the packet from the secure server and decrypts it using the user's private key. In step 930, the client software verifies the server's signature with the server's public key according to conventional mechanisms. In step 932, a determination is made as to whether the signature may be verified. If the answer is YES, the server is authenticated, and the client software verifies the examiner's random number that is transmitted by the server to confirm that it is, in fact, the same number that was sent to the server. If the number may be confirmed, as is determined in step 934, the examiner creates another packet to send back to the server 111. This packet includes the server random number which is encrypted, in step 936, with the server-to-examiner session key. The encrypted packet is then transmitted to the server.
[00112] In step 938, the server's computer investigation software 109 decrypts the packet containing the server random number with the server-to-examiner session key. If the received server random number is the same number originally generated and sent to the client software as is determined in step 940, the number is confirmed, and a secure connection is established in step 942. The process for establishing a secure connection between the client software and the secure server 111 is described in more detail in the above-referenced U.S. Patent No. 6,792,545.
[00113] Once a secure connection is established, an examiner may use its client software 116 to request investigation of the target machines across the network in support of incident response, information auditing, and forensic discovery. The secure server 111 authorizes and securely brokers requests and communications from the client software to the target machines. The communication between the server and the client software is encrypted using the server-to-examiner session encryption key. [00114] FIG. 1 IB is a flow diagram of a process for establishing a secure communication between the secure server 111 and the servlet 118 according to one embodiment of the invention. A number of such secure communications may be established concurrently. The communication is established in response to request by the examining machine to investigate the target machine running the servlet.
[00115] In step 1000, the server's computer investigation software 109 generates a second server random number "Srand2," and signs the packet with the server's private key in step 1002. In step 1004, the software 109 transmits the signed packet to the servlet. [00116] The servlet receives the packet signed with the second server random number, and in step 1006, verifies the signature with the server's public key. If the signature cannot be verified, as is determined in step 1008, a safe connection between the secure server 111 and the servlet 118 is denied in step 1010.
[00117] If, however, the server's signature is verified, the servlet generates a servlet-to- server session encryption key in step 1012 and inserts it into a packet in step 1014 along with the second server random number. The servlet encrypts the packet in step 1016 with the server's public key, and transmits the packet to the server 111.
[00118] In step 1018, the server's computer investigation software 109 receives the encrypted packet and decrypts it with the server's private key. The processor further confirms in step 1020, whether the second server random number is the same number that was originally sent to the servlet. If the answer is YES, the processor generates a server-to-servlet session encryption key in step 1022, and encrypts the server-to-servlet session encryption key with the servlet-to-server session encryption key in step 1024. In step 1026, the encrypted packet is transmitted to the servlet.
[00119] In step 1028, the servlet decrypts the packet with the servlet-to-server session key, and stores the server-to-servlet session key in step 1030. In step 1031, a secure connection is established, and all subsequent data exchanges between the server and the servlet are encrypted using the server-to-servlet session key. The establishment of a secure connection between the secure server 111 and the servlet 118 is described in more detail in the above- referenced U.S. Patent No. 6,792,545.
[00120] Once the server 111 has successfully established secure connections with the examining machine 115 and one or more servlets, the examining machine 115 and the servlets may communicate directly in effectuating searches of dynamic and/or static data stored in the target devices.
[00121] Although this invention has been described in certain specific embodiments, those skilled in the art will have no difficulty devising variations to the described embodiment which in no way depart from the scope and spirit of the present invention. Furthermore, to those skilled in the various arts, the invention itself herein will suggest solutions to other tasks and adaptations for other applications. It is the Applicant's intention to cover by claims all such uses of the invention and those changes and modifications which could be made to the embodiments of the invention herein chosen for the purpose of disclosure without departing from the spirit and scope of the invention. Thus, the present embodiments of the invention should be considered in all respects as illustrative and not restrictive, the scope of the invention to be indicated by the appended claims and their equivalents rather than the foregoing description.

Claims

WHAT IS CLAIMED IS:
1. A computer-implemented method for conducting investigations of one or more target machines in a data communications network via an examining machine, the method comprising: defining, under control of the examining machine, a set of investigation criteria; automatically generating by the examining machine, a unique identifier for the set of investigation criteria; automatically investigating the one or more target machines based on the set of investigation criteria and outputting results of the investigation; preserving the results of the investigation in an evidence data store; and storing the unique identifier in the evidence data store in association with the preserved results for tracing the results to the set of investigation criteria.
2. The method of claim I5 wherein the evidence data store is a container file storing metadata of files that are responsive to the investigation criteria and copies of the responsive files.
3. The method of claim 1 , wherein the unique identifier is randomly generated by the examining machine.
4. The method of claim 1, wherein the investigation criteria includes one or more filter conditions for identifying one or more files based on the filter conditions.
5. The method of claim 4, wherein the filter conditions specify file metadata, the method further comprising: retrieving by the examining computer fϊlesystem data of the one or more target machines; comparing by the examining computer the fϊlesystem data with the specified file metadata; and identifying the one or more files based on the comparison.
6. The method of claim 1, wherein the investigation criteria includes one or more keywords for identifying files in the one or more target machines containing the one or more keywords.
7. The method of claim I5 wherein the investigation criteria identifies a specific source file for identifying a file in the one or more target machines matching the source file.
8. The method of claim 7 further comprising: identifying lengths of the source file and a file in a particular target machine that is being considered for a match; determining whether the lengths are equal; if the lengths are equal, invoking the target machine to compute a digital signature value of the file being considered for a match; retrieving the digital signature values of the source file and the file being considered for a match; determining whether the digital signature values are equal; and if the digital signature values are equal, identifying the file being considered for a match as a matching file.
9. The method of claim 1 further comprising: receiving an examiner report including the unique identifier for the set of investigation criteria used for the investigating; comparing the unique identifier in the examiner report with the unique identifier in the evidence data store; and outputting a verification message based on the comparison.
10. The method of claim 1 further comprising: retrieving the unique identifier from the evidence data store; and reconstructing the set of investigation criteria based on the unique identifier.
11. An examining machine conducting investigations of one or more target machines in a data communications network, the examining machine comprising: a processor; a memory operably coupled to the processor storing program instructions therein, the processor being operable to execute the program instructions, the program instructions including: defining a set of investigation criteria; automatically generating a unique identifier for the set of investigation criteria; automatically investigating the one or more target machines based on the set of investigation criteria and outputting results of the investigation; preserving the results of the investigation in an evidence data store; and storing the unique identifier in the evidence data store in association with the preserved results for tracing the results to the set of investigation criteria.
12. The examining machine of claim 11, wherein the evidence data store is a container file storing metadata of files that are responsive to the investigation criteria and copies of the responsive files.
13. The examining machine of claim 11 , wherein the unique identifier is randomly generated by the examining machine.
14. The examining machine of claim 11, wherein the investigation criteria includes one or more filter conditions for identifying one or more files based on the filter conditions.
15. The examining machine of claim 14, wherein the filter conditions specify file metadata, the program instructions further including: retrieving filesystem data of the one or more target machines; comparing the filesystem data with the specified file metadata; and identifying the one or more files based on the comparison.
16. The examining machine of claim 11, wherein the investigation criteria includes one or more keywords for identifying files in the one or more target machines containing the one or more keywords.
17. The examining machine of claim 11, wherein the investigation criteria identifies a specific source file for identifying a file in the one or more target machines matching the source file.
18. The examining machine of claim 17, wherein the program instructions further include: identifying lengths of the source file and a file in a particular target machine that is being considered for a match; determining whether the lengths are equal; if the lengths are equal, invoking the target machine to compute a digital signature value of the file being considered for a match; retrieving the digital signature values of the source file and the file being considered for a match; determining whether the digital signature values are equal; and if the digital signature values are equal, identifying the file being considered for a match as a matching file.
19. The examining machine of claim 11 further comprising a second processor configured to execute a second set of program instructions including: receiving an examiner report including the unique identifier for the set of investigation criteria used for the investigating; comparing the unique identifier in the examiner report with the unique identifier in the evidence data store; and outputting a verification message based on the comparison.
20. The examining machine of claim 11, wherein the program instructions further include: retrieving the unique identifier from the evidence data store; and reconstructing the set of investigation criteria based on the unique identifier.
PCT/US2006/039527 2005-10-06 2006-10-06 Electronic discovery system and method WO2007044709A2 (en)

Priority Applications (1)

Application Number Priority Date Filing Date Title
EP06816612A EP1934840A4 (en) 2005-10-06 2006-10-06 Electronic discovery system and method

Applications Claiming Priority (2)

Application Number Priority Date Filing Date Title
US72462305P 2005-10-06 2005-10-06
US60/724,623 2005-10-06

Publications (2)

Publication Number Publication Date
WO2007044709A2 true WO2007044709A2 (en) 2007-04-19
WO2007044709A3 WO2007044709A3 (en) 2009-04-23

Family

ID=37943469

Family Applications (1)

Application Number Title Priority Date Filing Date
PCT/US2006/039527 WO2007044709A2 (en) 2005-10-06 2006-10-06 Electronic discovery system and method

Country Status (3)

Country Link
US (2) US7809686B2 (en)
EP (1) EP1934840A4 (en)
WO (1) WO2007044709A2 (en)

Cited By (17)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
US8200635B2 (en) 2009-03-27 2012-06-12 Bank Of America Corporation Labeling electronic data in an electronic discovery enterprise system
US8250037B2 (en) 2009-03-27 2012-08-21 Bank Of America Corporation Shared drive data collection tool for an electronic discovery system
EP2509024A1 (en) * 2010-03-29 2012-10-10 Ubic, Inc. Forensic system, method and program
US8364681B2 (en) 2009-03-27 2013-01-29 Bank Of America Corporation Electronic discovery system
US8417716B2 (en) 2009-03-27 2013-04-09 Bank Of America Corporation Profile scanner
US20130117218A1 (en) * 2011-11-03 2013-05-09 Microsoft Corporation Cross-store electronic discovery
US8504489B2 (en) 2009-03-27 2013-08-06 Bank Of America Corporation Predictive coding of documents in an electronic discovery system
US8549327B2 (en) 2008-10-27 2013-10-01 Bank Of America Corporation Background service process for local collection of data in an electronic discovery system
US8572376B2 (en) 2009-03-27 2013-10-29 Bank Of America Corporation Decryption of electronic communication in an electronic discovery enterprise system
US8572227B2 (en) 2009-03-27 2013-10-29 Bank Of America Corporation Methods and apparatuses for communicating preservation notices and surveys
US8793277B2 (en) 2010-03-29 2014-07-29 Ubic, Inc. Forensic system, forensic method, and forensic program
US8806358B2 (en) 2009-03-27 2014-08-12 Bank Of America Corporation Positive identification and bulk addition of custodians to a case within an electronic discovery system
US9053454B2 (en) 2009-11-30 2015-06-09 Bank Of America Corporation Automated straight-through processing in an electronic discovery system
US9330374B2 (en) 2009-03-27 2016-05-03 Bank Of America Corporation Source-to-processing file conversion in an electronic discovery enterprise system
US9721227B2 (en) 2009-03-27 2017-08-01 Bank Of America Corporation Custodian management system
US9817898B2 (en) 2011-11-14 2017-11-14 Microsoft Technology Licensing, Llc Locating relevant content items across multiple disparate content sources
US20200005324A1 (en) * 2013-09-09 2020-01-02 UnitedLex Corp. Organization based on hash values

Families Citing this family (58)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
US6792545B2 (en) * 2002-06-20 2004-09-14 Guidance Software, Inc. Enterprise computer investigation system
US7711728B2 (en) * 2002-06-20 2010-05-04 Guidance Software, Inc. System and method for searching for static data in a computer investigation system
US20070011450A1 (en) * 2004-09-14 2007-01-11 Mccreight Shawn System and method for concurrent discovery and survey of networked devices
US7941386B2 (en) * 2005-10-19 2011-05-10 Adf Solutions, Inc. Forensic systems and methods using search packs that can be edited for enterprise-wide data identification, data sharing, and management
US7603344B2 (en) * 2005-10-19 2009-10-13 Advanced Digital Forensic Solutions, Inc. Methods for searching forensic data
US20070139231A1 (en) * 2005-10-19 2007-06-21 Advanced Digital Forensic Solutions, Inc. Systems and methods for enterprise-wide data identification, sharing and management in a commercial context
US8892735B2 (en) * 2006-09-28 2014-11-18 Guidance Software, Inc. Phone home servlet in a computer investigation system
JP2008146601A (en) * 2006-12-13 2008-06-26 Canon Inc Information processor and information processing method
US20080294492A1 (en) * 2007-05-24 2008-11-27 Irina Simpson Proactively determining potential evidence issues for custodial systems in active litigation
US8417776B2 (en) * 2007-08-25 2013-04-09 Vere Software, Inc. Online evidence collection
US8572043B2 (en) * 2007-12-20 2013-10-29 International Business Machines Corporation Method and system for storage of unstructured data for electronic discovery in external data stores
US8112406B2 (en) 2007-12-21 2012-02-07 International Business Machines Corporation Method and apparatus for electronic data discovery
US8140494B2 (en) 2008-01-21 2012-03-20 International Business Machines Corporation Providing collection transparency information to an end user to achieve a guaranteed quality document search and production in electronic data discovery
US8275720B2 (en) 2008-06-12 2012-09-25 International Business Machines Corporation External scoping sources to determine affected people, systems, and classes of information in legal matters
US9830563B2 (en) 2008-06-27 2017-11-28 International Business Machines Corporation System and method for managing legal obligations for data
US8327384B2 (en) 2008-06-30 2012-12-04 International Business Machines Corporation Event driven disposition
US8484069B2 (en) 2008-06-30 2013-07-09 International Business Machines Corporation Forecasting discovery costs based on complex and incomplete facts
US8515924B2 (en) 2008-06-30 2013-08-20 International Business Machines Corporation Method and apparatus for handling edge-cases of event-driven disposition
US20100017239A1 (en) * 2008-06-30 2010-01-21 Eric Saltzman Forecasting Discovery Costs Using Historic Data
US8073729B2 (en) 2008-09-30 2011-12-06 International Business Machines Corporation Forecasting discovery costs based on interpolation of historic event patterns
US8489439B2 (en) 2008-06-30 2013-07-16 International Business Machines Corporation Forecasting discovery costs based on complex and incomplete facts
US8090705B1 (en) * 2008-09-15 2012-01-03 Symantec Corporation Method and apparatus for processing electronically stored information for electronic discovery
US8204869B2 (en) * 2008-09-30 2012-06-19 International Business Machines Corporation Method and apparatus to define and justify policy requirements using a legal reference library
US8321860B2 (en) * 2008-10-27 2012-11-27 Bank Of America Corporation Local collector
US8086694B2 (en) 2009-01-30 2011-12-27 Bank Of America Network storage device collector
US8504580B2 (en) * 2009-03-03 2013-08-06 Ilya Geller Systems and methods for creating an artificial intelligence
US20100250509A1 (en) * 2009-03-27 2010-09-30 Bank Of America Corporation File scanning tool
US20100250455A1 (en) * 2009-03-27 2010-09-30 Bank Of America Corporation Suggesting potential custodians for cases in an enterprise-wide electronic discovery system
US8224924B2 (en) * 2009-03-27 2012-07-17 Bank Of America Corporation Active email collector
US20100250266A1 (en) * 2009-03-27 2010-09-30 Bank Of America Corporation Cost estimations in an electronic discovery system
US20100250735A1 (en) * 2009-03-27 2010-09-30 Bank Of America Corporation Monitoring an enterprise network for determining specified computing device usage
US20100250456A1 (en) * 2009-03-27 2010-09-30 Bank Of America Corporation Suggesting preservation notice and survey recipients in an electronic discovery system
US8339680B2 (en) * 2009-04-02 2012-12-25 Xerox Corporation Printer image log system for document gathering and retention
US20110040600A1 (en) * 2009-08-17 2011-02-17 Deidre Paknad E-discovery decision support
US8250041B2 (en) 2009-12-22 2012-08-21 International Business Machines Corporation Method and apparatus for propagation of file plans from enterprise retention management applications to records management systems
US8655856B2 (en) 2009-12-22 2014-02-18 International Business Machines Corporation Method and apparatus for policy distribution
US8566903B2 (en) * 2010-06-29 2013-10-22 International Business Machines Corporation Enterprise evidence repository providing access control to collected artifacts
US8832148B2 (en) * 2010-06-29 2014-09-09 International Business Machines Corporation Enterprise evidence repository
US8402359B1 (en) 2010-06-30 2013-03-19 International Business Machines Corporation Method and apparatus for managing recent activity navigation in web applications
US20120278761A1 (en) * 2011-04-29 2012-11-01 Symantec Corporation Method and system for managing duplicate item display
US9177011B2 (en) * 2011-12-22 2015-11-03 Magnet Forensics Inc. Systems and methods for locating application specific data
US9158825B1 (en) * 2012-11-09 2015-10-13 Symantec Corporation Search validity in data backup systems
US20140244699A1 (en) * 2013-02-26 2014-08-28 Jonathan Grier Apparatus and Methods for Selective Location and Duplication of Relevant Data
US10498777B2 (en) * 2014-03-17 2019-12-03 Citrix Systems, Inc. Real-time push notifications for cloud-based applications
US10635645B1 (en) 2014-05-04 2020-04-28 Veritas Technologies Llc Systems and methods for maintaining aggregate tables in databases
US10078668B1 (en) * 2014-05-04 2018-09-18 Veritas Technologies Llc Systems and methods for utilizing information-asset metadata aggregated from multiple disparate data-management systems
CN105224572B (en) * 2014-06-30 2019-11-15 北京金山安全软件有限公司 Method and device for identifying garbage catalogue
US10826930B2 (en) 2014-07-22 2020-11-03 Nuix Pty Ltd Systems and methods for parallelized custom data-processing and search
US10346550B1 (en) * 2014-08-28 2019-07-09 X1 Discovery, Inc. Methods and systems for searching and indexing virtual environments
US10191907B2 (en) * 2015-02-27 2019-01-29 Ricoh Company, Ltd. Legal discovery tool implemented in a mobile device
US20160253346A1 (en) * 2015-02-27 2016-09-01 Ricoh Company, Ltd. Legal Discovery Tool
US9680844B2 (en) * 2015-07-06 2017-06-13 Bank Of America Corporation Automation of collection of forensic evidence
CN105631327A (en) * 2015-12-16 2016-06-01 北京奇虎科技有限公司 Virus checking and killing method and system as well as client
US10430361B1 (en) 2015-12-17 2019-10-01 Cru Acquisition Group, Llc Combination write blocker
DE102019134590A1 (en) * 2019-12-16 2021-06-17 Thomas Schmalz Device for collecting IT forensically potentially relevant data, methods, computer program product and storage unit
EP3910511A1 (en) * 2020-05-13 2021-11-17 Magnet Forensics Inc. System and method for identifying files based on hash values
US11461490B1 (en) 2020-09-23 2022-10-04 Cru Data Security Group, Llc Systems, methods, and devices for conditionally allowing processes to alter data on a storage device
CN112533010A (en) * 2020-11-23 2021-03-19 北京北笛科技有限公司 Automatic evidence obtaining method and device for audio and video data in network live broadcast service

Family Cites Families (33)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
US5475625A (en) 1991-01-16 1995-12-12 Siemens Nixdorf Informationssysteme Aktiengesellschaft Method and arrangement for monitoring computer manipulations
US5491750A (en) * 1993-12-30 1996-02-13 International Business Machines Corporation Method and apparatus for three-party entity authentication and key distribution using message authentication codes
US5623652A (en) * 1994-07-25 1997-04-22 Apple Computer, Inc. Method and apparatus for searching for information in a network and for controlling the display of searchable information on display devices in the network
US5944794A (en) * 1994-09-30 1999-08-31 Kabushiki Kaisha Toshiba User identification data management scheme for networking computer systems using wide area network
US5715174A (en) * 1994-11-15 1998-02-03 Absolute Software Corporation Security apparatus and method
US5928323A (en) * 1996-05-30 1999-07-27 Sun Microsystems, Inc. Apparatus and method for dynamically generating information with server-side software objects
US5944791A (en) * 1996-10-04 1999-08-31 Contigo Software Llc Collaborative web browser
AU738458B2 (en) * 1996-11-26 2001-09-20 British Telecommunications Public Limited Company Communications system
US6084969A (en) * 1997-12-31 2000-07-04 V-One Corporation Key encryption system and method, pager unit, and pager proxy for a two-way alphanumeric pager network
US6012098A (en) * 1998-02-23 2000-01-04 International Business Machines Corp. Servlet pairing for isolation of the retrieval and rendering of data
WO1999066383A2 (en) * 1998-06-15 1999-12-23 Dmw Worldwide, Inc. Method and apparatus for assessing the security of a computer system
US6665702B1 (en) * 1998-07-15 2003-12-16 Radware Ltd. Load balancing
US20010011349A1 (en) * 1998-09-03 2001-08-02 Greg B. Garrison System and method for encrypting a data session between a client and a server
US6601061B1 (en) * 1999-06-18 2003-07-29 Surfwax, Inc. Scalable information search and retrieval including use of special purpose searching resources
US6647400B1 (en) * 1999-08-30 2003-11-11 Symantec Corporation System and method for analyzing filesystems to detect intrusions
US6874088B1 (en) * 1999-10-22 2005-03-29 Mission Critical Linux, Llc Secure remote servicing of a computer system over a computer network
US7120692B2 (en) * 1999-12-02 2006-10-10 Senvid, Inc. Access and control system for network-enabled devices
US20030208689A1 (en) * 2000-06-16 2003-11-06 Garza Joel De La Remote computer forensic evidence collection system and process
US20020174295A1 (en) * 2001-01-29 2002-11-21 Ulrich Thomas R. Enhanced file system failure tolerance
US6944760B2 (en) * 2001-05-24 2005-09-13 Openwave Systems Inc. Method and apparatus for protecting identities of mobile devices on a wireless network
US7096503B1 (en) * 2001-06-29 2006-08-22 Mcafee, Inc. Network-based risk-assessment tool for remotely detecting local computer vulnerabilities
US7146642B1 (en) * 2001-06-29 2006-12-05 Mcafee, Inc. System, method and computer program product for detecting modifications to risk assessment scanning caused by an intermediate device
US7228566B2 (en) * 2001-07-10 2007-06-05 Core Sdi, Incorporated Automated computer system security compromise
US20030196123A1 (en) * 2002-03-29 2003-10-16 Rowland Craig H. Method and system for analyzing and addressing alarms from network intrusion detection systems
US20070011450A1 (en) * 2004-09-14 2007-01-11 Mccreight Shawn System and method for concurrent discovery and survey of networked devices
US7711728B2 (en) * 2002-06-20 2010-05-04 Guidance Software, Inc. System and method for searching for static data in a computer investigation system
US6792545B2 (en) * 2002-06-20 2004-09-14 Guidance Software, Inc. Enterprise computer investigation system
US7370072B2 (en) * 2002-07-08 2008-05-06 Electronic Evidence Discovery, Inc. System and method for collecting electronic evidence data
US7308492B2 (en) * 2002-10-02 2007-12-11 Sony Corporation Method and apparatus for use in remote diagnostics
US7043476B2 (en) * 2002-10-11 2006-05-09 International Business Machines Corporation Method and apparatus for data mining to discover associations and covariances associated with data
US6968335B2 (en) * 2002-11-14 2005-11-22 Sesint, Inc. Method and system for parallel processing of database queries
US7496959B2 (en) * 2003-06-23 2009-02-24 Architecture Technology Corporation Remote collection of computer forensic evidence
US8892735B2 (en) * 2006-09-28 2014-11-18 Guidance Software, Inc. Phone home servlet in a computer investigation system

Non-Patent Citations (1)

* Cited by examiner, † Cited by third party
Title
See references of EP1934840A4 *

Cited By (31)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
US8549327B2 (en) 2008-10-27 2013-10-01 Bank Of America Corporation Background service process for local collection of data in an electronic discovery system
US8805832B2 (en) 2009-03-27 2014-08-12 Bank Of America Corporation Search term management in an electronic discovery system
US8364681B2 (en) 2009-03-27 2013-01-29 Bank Of America Corporation Electronic discovery system
US8868561B2 (en) 2009-03-27 2014-10-21 Bank Of America Corporation Electronic discovery system
US8417716B2 (en) 2009-03-27 2013-04-09 Bank Of America Corporation Profile scanner
US8806358B2 (en) 2009-03-27 2014-08-12 Bank Of America Corporation Positive identification and bulk addition of custodians to a case within an electronic discovery system
US8504489B2 (en) 2009-03-27 2013-08-06 Bank Of America Corporation Predictive coding of documents in an electronic discovery system
US8250037B2 (en) 2009-03-27 2012-08-21 Bank Of America Corporation Shared drive data collection tool for an electronic discovery system
US9934487B2 (en) 2009-03-27 2018-04-03 Bank Of America Corporation Custodian management system
US8572376B2 (en) 2009-03-27 2013-10-29 Bank Of America Corporation Decryption of electronic communication in an electronic discovery enterprise system
US8572227B2 (en) 2009-03-27 2013-10-29 Bank Of America Corporation Methods and apparatuses for communicating preservation notices and surveys
US8688648B2 (en) 2009-03-27 2014-04-01 Bank Of America Corporation Electronic communication data validation in an electronic discovery enterprise system
US9721227B2 (en) 2009-03-27 2017-08-01 Bank Of America Corporation Custodian management system
US9547660B2 (en) 2009-03-27 2017-01-17 Bank Of America Corporation Source-to-processing file conversion in an electronic discovery enterprise system
US8200635B2 (en) 2009-03-27 2012-06-12 Bank Of America Corporation Labeling electronic data in an electronic discovery enterprise system
US9542410B2 (en) 2009-03-27 2017-01-10 Bank Of America Corporation Source-to-processing file conversion in an electronic discovery enterprise system
US9330374B2 (en) 2009-03-27 2016-05-03 Bank Of America Corporation Source-to-processing file conversion in an electronic discovery enterprise system
US8903826B2 (en) 2009-03-27 2014-12-02 Bank Of America Corporation Electronic discovery system
US9171310B2 (en) 2009-03-27 2015-10-27 Bank Of America Corporation Search term hit counts in an electronic discovery system
US9053454B2 (en) 2009-11-30 2015-06-09 Bank Of America Corporation Automated straight-through processing in an electronic discovery system
US9244920B2 (en) 2010-03-29 2016-01-26 Ubic, Inc. Forensic system, forensic method, and forensic program
EP2509024A1 (en) * 2010-03-29 2012-10-10 Ubic, Inc. Forensic system, method and program
US8799317B2 (en) 2010-03-29 2014-08-05 Ubic, Inc. Forensic system, forensic method, and forensic program
US8793277B2 (en) 2010-03-29 2014-07-29 Ubic, Inc. Forensic system, forensic method, and forensic program
EP2509024A4 (en) * 2010-03-29 2013-10-09 Ubic Inc Forensic system, method and program
US20130117218A1 (en) * 2011-11-03 2013-05-09 Microsoft Corporation Cross-store electronic discovery
US9817898B2 (en) 2011-11-14 2017-11-14 Microsoft Technology Licensing, Llc Locating relevant content items across multiple disparate content sources
US9996618B2 (en) 2011-11-14 2018-06-12 Microsoft Technology Licensing, Llc Locating relevant content items across multiple disparate content sources
US11978057B2 (en) 2013-09-09 2024-05-07 UnitedLex Corp. Single instance storage of metadata and extracted text
US20200005324A1 (en) * 2013-09-09 2020-01-02 UnitedLex Corp. Organization based on hash values
US11803860B2 (en) 2013-09-09 2023-10-31 UnitedLex Corp. Email mappings

Also Published As

Publication number Publication date
WO2007044709A3 (en) 2009-04-23
US7809686B2 (en) 2010-10-05
EP1934840A4 (en) 2010-12-15
EP1934840A2 (en) 2008-06-25
US20110047177A1 (en) 2011-02-24
US20070112783A1 (en) 2007-05-17

Similar Documents

Publication Publication Date Title
US7809686B2 (en) Electronic discovery system and method
US20230376523A1 (en) Event notification in interconnected content-addressable storage systems
US7693814B2 (en) Data repository and method for promoting network storage of data
JPH04310188A (en) Library service method for document/image library
US9807135B1 (en) Methods and computing systems for sharing cloud files using a social network
KR101311145B1 (en) Security in peer to peer synchronization applications
US10536459B2 (en) Document management systems and methods
US7711728B2 (en) System and method for searching for static data in a computer investigation system
EP1866797A2 (en) System and method for searching for static data in a computer investigation system
JP4409874B2 (en) Data transmitting apparatus and method, data receiving apparatus and method, and data distribution system and method
US9195846B2 (en) Access grants
Ieong et al. Deriving cse-specific live forensics investigation procedures from FORZA
Harrison The digital detective: An introduction to digital forensics.
Li Enhancement of the CINDI system

Legal Events

Date Code Title Description
121 Ep: the epo has been informed by wipo that ep was designated in this application
REEP Request for entry into the european phase

Ref document number: 2006816612

Country of ref document: EP

WWE Wipo information: entry into national phase

Ref document number: 2006816612

Country of ref document: EP

NENP Non-entry into the national phase

Ref country code: DE