US20140351890A1 - Systems and Methods for Publishing Datasets - Google Patents
Systems and Methods for Publishing Datasets Download PDFInfo
- Publication number
- US20140351890A1 US20140351890A1 US14/452,481 US201414452481A US2014351890A1 US 20140351890 A1 US20140351890 A1 US 20140351890A1 US 201414452481 A US201414452481 A US 201414452481A US 2014351890 A1 US2014351890 A1 US 2014351890A1
- Authority
- US
- United States
- Prior art keywords
- datasets
- dataset
- index
- collected
- core
- Prior art date
- Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
- Abandoned
Links
Images
Classifications
-
- H—ELECTRICITY
- H04—ELECTRIC COMMUNICATION TECHNIQUE
- H04L—TRANSMISSION OF DIGITAL INFORMATION, e.g. TELEGRAPHIC COMMUNICATION
- H04L63/00—Network architectures or network communication protocols for network security
- H04L63/10—Network architectures or network communication protocols for network security for controlling access to devices or network resources
-
- G—PHYSICS
- G06—COMPUTING; CALCULATING OR COUNTING
- G06Q—INFORMATION AND COMMUNICATION TECHNOLOGY [ICT] SPECIALLY ADAPTED FOR ADMINISTRATIVE, COMMERCIAL, FINANCIAL, MANAGERIAL OR SUPERVISORY PURPOSES; SYSTEMS OR METHODS SPECIALLY ADAPTED FOR ADMINISTRATIVE, COMMERCIAL, FINANCIAL, MANAGERIAL OR SUPERVISORY PURPOSES, NOT OTHERWISE PROVIDED FOR
- G06Q10/00—Administration; Management
- G06Q10/10—Office automation; Time management
-
- G—PHYSICS
- G06—COMPUTING; CALCULATING OR COUNTING
- G06F—ELECTRIC DIGITAL DATA PROCESSING
- G06F16/00—Information retrieval; Database structures therefor; File system structures therefor
- G06F16/20—Information retrieval; Database structures therefor; File system structures therefor of structured data, e.g. relational data
- G06F16/22—Indexing; Data structures therefor; Storage structures
- G06F16/2228—Indexing structures
-
- G—PHYSICS
- G06—COMPUTING; CALCULATING OR COUNTING
- G06F—ELECTRIC DIGITAL DATA PROCESSING
- G06F16/00—Information retrieval; Database structures therefor; File system structures therefor
- G06F16/20—Information retrieval; Database structures therefor; File system structures therefor of structured data, e.g. relational data
- G06F16/27—Replication, distribution or synchronisation of data between databases or within a distributed database system; Distributed database system architectures therefor
- G06F16/275—Synchronous replication
-
- G—PHYSICS
- G06—COMPUTING; CALCULATING OR COUNTING
- G06F—ELECTRIC DIGITAL DATA PROCESSING
- G06F16/00—Information retrieval; Database structures therefor; File system structures therefor
- G06F16/90—Details of database functions independent of the retrieved data types
- G06F16/95—Retrieval from the web
- G06F16/958—Organisation or management of web site content, e.g. publishing, maintaining pages or automatic linking
- G06F16/972—Access to data in other repository systems, e.g. legacy data or dynamic Web page generation
-
- G06F17/30321—
-
- H—ELECTRICITY
- H04—ELECTRIC COMMUNICATION TECHNIQUE
- H04L—TRANSMISSION OF DIGITAL INFORMATION, e.g. TELEGRAPHIC COMMUNICATION
- H04L63/00—Network architectures or network communication protocols for network security
- H04L63/02—Network architectures or network communication protocols for network security for separating internal from external traffic, e.g. firewalls
Definitions
- the present technology relates generally to systems and methods for information review and analysis, and more specifically, but not by way of limitation, to systems and methods for publishing datasets.
- the systems and methods provided herein may be configured to index and/or collect and publish datasets.
- electronic discovery, regulatory and investigative processes involve sensitive document review and analysis processes which often involve numerous time consuming and error prone steps associated with the gathering of relevant datasets in a legally defensible manner.
- Dataset collection, review, and analysis often begins with the filing of a lawsuit, commencement of an investigation or other significant event that requires the review and analysis of certain types of datasets.
- datasets may include many divers of similar data formats associated with user created files, electronic mail, audio, video, and the like.
- Datasets may be collected and analyzed from a variety of document sources such as email systems and repositories, file shares, storage media, third party web sites/services, and so forth. Because document review and analysis processes rely heavily on manual human action for certain steps, such as and the publishing of the collected and processed data to document review location, these processes are often error prone and time consuming. For example, datasets may be mistakenly omitted (i.e., not collected or improperly reviewed), leading to costly errors.
- the collected datasets may be published to a virtual segregated document review environment or other secure document review construct that provides a secure location for the review and analysis of the collected datasets.
- These segregated document review environments provide a location (either secured or unsecured) where dataset review personnel may analyze and evaluate datasets that have been identified and selected for review.
- the present technology may be directed to methods for publishing datasets.
- the methods may comprise: (a) receiving a request to publish a dataset to at least one of an internal environment located within a secured zone and an external environment located outside the secured zone, the request comprising at least one selection criteria; (b) selecting the dataset based upon the at least one selection criteria, the dataset being selected from an index of collected datasets; and (c) responsive to the request, publishing the dataset to at least one of the internal environment and the external environment.
- the present technology may be directed to an early case assessment system for publishing datasets.
- These systems may include: (a) a memory for storing executable instructions for publishing datasets; (b) a processor for executing the instructions stored in memory, the executable instructions comprising: (1) a core gather module that receives requests to publish a dataset to at least one of an internal environment located within a secured zone of the early case assessment system and an external environment located outside the secured zone, the request comprising at least one selection criteria; and (2) a core analysis module communicatively coupled to the core gather module that (i) selects the dataset based upon a request received by the core gather module, the dataset assembly module selecting the dataset from an index of collected datasets based upon the at least one selection criteria; and (ii) publishes the dataset to at least one of the internal environment and the external environment based upon a request received by the core gather module.
- FIG. 1 is a schematic diagram of an exemplary architecture of a system for publishing datasets. The system may be utilized to practice aspects of the present technology.
- FIG. 2 is an exemplary flow diagram of a method for publishing datasets.
- FIG. 3 is a block diagram of an exemplary computing system that may be utilized to practice aspects of the present disclosure.
- the systems provided herein may include electronic dataset review systems that identify, process, and publish datasets (e.g., documents, electronic mail, and so forth) from a variety of data sources such as an enterprise email server, document repositories, and the like. All datasets identified and/or collected comprise a corpus of reviewable datasets that are available for analysis and/or review. According to some embodiments, each dataset may be crawled and processed by the system according to guidelines established for the system by one or more dataset review administrators.
- datasets e.g., documents, electronic mail, and so forth
- All datasets identified and/or collected comprise a corpus of reviewable datasets that are available for analysis and/or review.
- each dataset may be crawled and processed by the system according to guidelines established for the system by one or more dataset review administrators.
- the systems and methods provided herein may automatically and continuously review datasets as they become available. It will be understood that the term “review” may be understood to include additional functions such as analyze, organize, enrich, prioritize, code, batch, and so forth. Datasets may be evaluated for keywords, concepts or other metadata that define the content or subject matter of the dataset and potential relevance of individual pieces of data. For example, the system may determine keywords in the body of a particular electronic mail communication. The subject matter of the email communication may be inferred from the occurrence of particular words or phrases or combinations thereof within the dataset. In some embodiments, the system may utilize any one of a number of algorithms to process and evaluate the subject matter and content of the datasets and their associated metadata to create an index of collected datasets according to the parameters set by the user. For example, email communications may be parsed for terms, dates, subject matter, email addresses, and the like. These terms are then associated with the email communication in the index such that searching the index for a term that is included in the email will yield the email as a search result.
- system users may search the index of datasets for relevant documents according to criteria. Criteria may include keyword searching, filtering according to parameters, and so forth. Once relevant data have been determined from the index and/or original location, the system may copy the relevant datasets from index and publish the same to a document review location.
- a triggering event e.g., commencement of a lawsuit, investigation or other similar event
- the document review location may include a virtual segregated document review environment, a secure document review location within the system, and/or a third party document review location.
- the relevant documents may be communicated to the computing system of the user, the computing system of the user's legal counsel, to regulatory bodies, or the relevant documents may be retained for review and analysis within the system itself.
- FIG. 1 illustrates an exemplary architecture 100 that may include an early case assessment system, hereinafter “system 105 ” that, in turn, includes a data collection module 110 , a core analysis and publish module 115 , an internal review environment 120 , and an external review environment 125 , can be utilized as a part of the early case assessment system, system 105 , or as an external review environment to which data is published from the early case assessment system 105 . It is noteworthy that the system 105 may include additional modules, engines, or components, and still fall within the scope of the present technology.
- system 105 may include additional modules, engines, or components, and still fall within the scope of the present technology.
- module may also refer to any of an application-specific integrated circuit (“ASIC”), an electronic circuit, a processor (shared, dedicated, or group) that executes one or more software or firmware programs, a combinational logic circuit, and/or other suitable components that provide the described functionality.
- ASIC application-specific integrated circuit
- processor shared, dedicated, or group
- individual modules of the system 105 may include separately configured web servers.
- the system 105 may communicatively couple with any one of a plurality of client devices such as an end user node 130 a , an attorney node 130 b , an IT node 130 c , and an investigator node 130 d . While these nodes represent descriptive or exemplary end user client devices, one of ordinary skill in the art will appreciate that many other types of client device nodes may likewise be utilized in accordance with the present technology. It is further noteworthy to mention that each of the nodes 130 a - d may include an end user computing system such as computing system 300 , which will be described in greater detail with reference to FIG. 3 .
- Each of the nodes 130 a - d may communicatively couple with the system 105 via any of a private and/or public communications network such as the Internet.
- the system 105 may include a cloud based computing environment that collects, processes, analyzes, and publishes datasets.
- a cloud-based computing environment is a resource that typically combines the computational power of a large grouping of processors and/or that combines the storage capacity of a large grouping of computer memories or storage devices.
- systems that provide a cloud resource may be utilized exclusively by their owners, such as GoogleTM or Yahoo!TM; or such systems may be accessible to outside users who deploy applications within the computing infrastructure to obtain the benefit of large computational or storage resources.
- the cloud may be formed, for example, by a network of web servers with each server (or at least a plurality thereof) providing processor and/or storage resources. These servers may manage workloads provided by multiple users (e.g., cloud resource customers or other users). Typically, each user places workload demands upon the cloud that vary in real-time, sometimes dramatically. The nature and extent of these variations typically depend on the type of business associated with the user.
- the system 105 may generate a variety of graphical user interfaces that allow end users to interact with the system 105 .
- the system may generate a web interface 140 that allows end users to enter search queries for searching indexes, establish dataset collection criteria, or automatically obtain and publish datasets to at least one of an internal document review location, such as an internal review environment 120 , and an external review environment 125 .
- an internal document review location such as an internal review environment 120
- an external review environment 125 may be located outside a firewall 145 of the system 105 .
- accessing datasets published to the internal review environment 120 may require the use of security credentials (e.g., secure communication protocols) to gain access through the firewall 145 .
- security credentials e.g., secure communication protocols
- components or modules behind the firewall 145 of the system 105 may be referred to as being located within a “secure zone” of the system 105 .
- the internal review environment 120 may be understood as being located within the secure zone because it is located behind the firewall 145 , or other suitable security devices, systems, or processes.
- the external review environment 125 may be understood to be outside the secure zone of the system 105 because the external review environment 125 is located outside the firewall 145 .
- the web interface 140 may be accessed by a web browser application of any of the nodes 130 a - d , or may be accessed by an application resident on a mobile device such as a smartphone.
- Data collection software or modules that may communicatively couple with the data collection module 110 may be installed on, and reside on the computing systems of the client (e.g., the entity for which datasets are to be collected), such as targets 150 a - n .
- the data collection module 110 may utilize one or more crawlers 155 a - n that may communicatively couple with the targets 150 a - n to receive datasets therefrom.
- the communication of datasets from targets 150 a - n to the data collection module 110 may occur automatically and continuously over time (e.g., synchronously), or may occur in batches (e.g., asynchronously).
- target may be understood to include any data source such as an email repository, storage media, and so forth.
- the core analysis and publish module 115 of the data collection module 110 may create an index of the received datasets and associated meta-data and then apply one or more algorithms to process and evaluate the subject matter and content of the datasets and their associated metadata and to create an index of collected datasets, as described above. For example, these evaluative algorithms may determine information such as keywords, dates, times, and other pertinent information included in the dataset.
- the core analysis and publish module 115 may also analyze the metadata of the dataset.
- the core gather module 160 may establish and maintain individual indexes for each of targets 150 a - n , or may utilize one index for each client that includes information for each target 150 a - n of the client.
- the core gather module 160 may store the collected datasets in a storage medium (not shown) communicatively coupled with the system 105 .
- a user may interact with the system 105 via their client device, such as end user node 130 a . That is, users may access the system 105 via the web interface 140 utilizing their end user node 130 a . Users may review, analyze and code relevant datasets by first searching or analyzing filtered datasets from an index or a combination of indexes.
- the end user may desire to locate all datasets that mention a particular keyword or group of keywords or relate to a particular concept.
- the core gather module 160 of the system 105 may compare the query to the index to locate and obtain relevant datasets that correspond to received queries.
- the core analysis and publish module 115 may make a copy of the index or indexes and select datasets before obtaining the datasets for publishing. Additionally, the core gather module 160 may lock each selected dataset to prevent tampering or modification to the datasets. Moreover, the core gather module 160 may utilize any other necessary protective or security related procedures to ensure that the datasets are obtained and maintained in a legally defensible manner.
- the core analysis and publish module 115 may prepare the datasets for export or publishing to a document review or analysis location by encapsulating the datasets into files and combining the datasets into batches.
- datasets may be published in their raw or native format.
- the core analysis and publish module 115 may publish the datasets to at least one of the internal review environment 120 and the external review environment 125 .
- the core analysis and publish module 115 may publish the datasets to a document review location via any one of a number of methods. For example, the core analysis and publish module 115 may publish the datasets by uploading the datasets via a file transfer protocol (“FTP”), or other computer networking protocol to one of the aforementioned locations. In other embodiments, the core analysis and publish module 115 may publish the datasets to a third party location, such as the external review environment 125 . In additional embodiments, the core analysis and publish module 115 may stream the datasets to the desired review location, either in batches or as individual datasets.
- FTP file transfer protocol
- the core analysis and publish module 115 may stream the datasets to the desired review location, either in batches or as individual datasets.
- the core analysis and publish module 115 may publish the datasets to an external review environment 125 that exists beyond or outside of the firewall 145 of the system 105 .
- Exemplary external nodes may include, but are not limited to, a web server or storage media associated with a user's web server or a third party web-based destination per the user's selection, and so forth.
- system 105 indexes datasets on a periodic or continual basis, requests to obtain and publish datasets may result in the system 105 automatically obtaining datasets from the corpus of available datasets without need for further evaluation of the datasets. That is, the system 105 may pre-analyze and indexes the datasets prior to the occurrence of a triggering event such that upon the occurrence of a triggering event, the datasets are easily and seamlessly obtained (should it meet certain defined parameters or criteria) and published to the appropriate document review location(s).
- a document review team 165 may access the datasets within the environment or may transfer the datasets from the environment for use on a local computing system. It will be understood that the document review team 165 may include a plurality of individual computing systems. The document review team 165 may access the external review environment 125 utilizing secure communications protocols, although in some embodiments, the document review team 165 may access the external review environment 125 without the requirement of a secure communications path. This scenario is more common when the datasets do not include sensitive information such as credit card data, social security numbers, or health record data.
- users may access datasets via the internal review environment 120 from a web based interface, such as web interface 140 , or by way of an additional review application (not shown) that may reside within a cloud based computing environment.
- a web based interface such as web interface 140
- an additional review application (not shown) that may reside within a cloud based computing environment.
- the external review environment 125 may include a cloud based computing environment that is configured to store selected and published datasets.
- the datasets may be stored in the cloud computing environment in virtual storage media or physical storage media that are distributed across a plurality of web servers.
- FIG. 2 illustrates a flowchart of an exemplary method 200 for publishing a dataset.
- the method may begin with the step 205 of crawling (or otherwise parsing, evaluating, analyzing, and so forth) the datasets from one or more data sources or targets.
- the method may include a step 210 of indexing the datasets. Indexing of datasets may include applying statistical or analytical algorithms or other mechanisms for determining the subject matter and/or content of the crawled datasets. Keywords, dates, times, and other descriptive information relative to the datasets may be stored in the index and associated with the datasets from which they were extracted.
- the method may include a step 215 of the system receiving a request to publish a dataset to at least one of an internal environment located within a secured zone and an external environment located outside the secured zone of the system.
- the method may include the step 220 of selecting or obtaining the dataset based upon the at least one selection criteria. It is noteworthy to mention that the dataset may be selected from the index of collected datasets that is maintained by the system.
- the method may include a step 225 of publishing the dataset to at least one of the internal environment and the external environment.
- the method 200 may then return to step 205 in anticipation of an additional triggering event.
- the method may include additional or fewer steps.
- other exemplary methods may include only the steps of receiving requests, selecting datasets, and publishing the selected datasets.
- system may collect and/or store datasets from one or more targets before the datasets are indexed by the system.
- FIG. 3 illustrates an exemplary computing system 300 that may be used to implement an embodiment of the present technology.
- the computing system 300 of FIG. 3 includes one or more processors 310 and main memory 320 .
- Main memory 320 stores, in part, instructions and data for execution by processor 310 .
- Main memory 320 can store the executable code when the computing system 300 is in operation.
- the computing system 300 of FIG. 3 may further include a mass storage device 330 , portable storage devices 340 , output devices 350 , user input devices 360 , a graphics display system 370 , and other peripheral devices 380 .
- the components shown in FIG. 3 are depicted as being connected via a single bus 390 .
- the components may be connected through one or more data transport means.
- One or more processors 310 and main memory 320 may be connected via a local microprocessor bus, and the mass storage device 330 , peripheral device(s) 380 , portable storage device(s) 340 , and graphics display system 370 may be connected via one or more input/output (I/O) buses.
- I/O input/output
- Mass storage device 330 which may be implemented with a magnetic disk drive or an optical disk drive, is a non-volatile storage device for storing data and instructions for use by one or more processors 310 . Mass storage device 330 can store the system software for implementing embodiments of the present technology for purposes of loading that software into main memory 320 .
- Portable storage device(s) 340 operates in conjunction with a portable non-volatile storage media, such as a floppy disk, compact disk or digital video disc, to input and output data and code to and from the computing system 300 of FIG. 3 .
- the system software for implementing embodiments of the present technology may be stored on such a portable media and input to the computing system 300 via the portable storage device(s) 340 .
- User input devices 360 provide a portion of a user interface.
- User input devices 360 may include an alphanumeric keypad, such as a keyboard, for inputting alphanumeric and other information, or a pointing device, such as a mouse, a trackball, stylus, or cursor direction keys.
- the computing system 300 as shown in FIG. 3 includes output devices 350 . Suitable output devices include speakers, printers, network interfaces, and monitors.
- Graphics display system 370 may include a liquid crystal display (LCD) or other suitable display device. Graphics display system 370 receives textual and graphical information, and processes the information for output to the display device.
- LCD liquid crystal display
- Peripheral devices 380 may include any type of computer support device to add additional functionality to the computing system 300 .
- Peripheral device(s) 380 may include a modem or a router.
- the components contained in the computing system 300 of FIG. 3 are those typically found in computer systems that may be suitable for use with embodiments of the present technology and are intended to represent a broad category of such computer components that are well known in the art.
- the computing system 300 of FIG. 3 can be a personal computer, hand held computing system, telephone, mobile computing system, workstation, server, minicomputer, mainframe computer, or any other computing system.
- the computer can also include different bus configurations, networked platforms, multi-processor platforms, etc.
- Various operating systems can be used including UNIX, Linux, Windows, Macintosh OS, Palm OS, and other suitable operating systems.
- Some of the above-described functions may be composed of instructions that are stored on storage media (e.g., computer-readable media).
- the instructions may be retrieved and executed by the one or more processors 310 .
- Some examples of storage media are memory devices, tapes, disks, and the like.
- the instructions are operational when executed by the one or more processors 310 to direct the one or more processors 310 to operate in accord with the technology.
- Those skilled in the art are familiar with instructions, processor(s), and storage media.
- Non-volatile media include, for example, optical or magnetic disks, such as a fixed disk.
- Volatile media include dynamic memory, such as system RAM.
- Transmission media include coaxial cables, copper wire and fiber optics, among others, including the wires that comprise one embodiment of a bus.
- Transmission media can also take the form of acoustic or light waves, such as those generated during radio frequency (RF) and infrared (IR) data communications.
- RF radio frequency
- IR infrared
- Common forms of computer-readable media include, for example, a floppy disk, a flexible disk, a hard disk, magnetic tape, any other magnetic media, a CD-ROM disk, digital video disk (DVD), any other optical media, any other physical media with patterns of marks or holes, a RAM, a PROM, an EPROM, an EEPROM, a FLASHEPROM, any other memory chip or data exchange adapter, a carrier wave, or any other media from which a computer can read.
- a bus carries the data to system RAM, from which a CPU retrieves and executes the instructions.
- the instructions received by system RAM can optionally be stored on a fixed disk either before or after execution by a CPU.
Abstract
Systems and methods for publishing datasets are provided herein. According to some embodiments, methods for publishing datasets may include receiving a request to publish a dataset to at least one of an internal environment located within a secured zone and an external environment located outside the secured zone, the request comprising at least one selection criteria, selecting the dataset based upon the at least one selection criteria, the dataset being selected from an index of collected datasets, and responsive to the request, publishing the dataset to at least one of the internal environment and the external environment.
Description
- The present application is a continuation of and claims the priority benefit of U.S. patent application Ser. No. 13/745,625, filed Jan. 18, 2013 and titled Systems and Methods for Publishing Datasets, which is a continuation of and claims the priority benefit of U.S. patent application Ser. No. 13/153,420, filed Jun. 4, 2011 and titled Systems and Methods for Publishing Datasets, the disclosures of which are incorporated herein by reference including all references cited therein.
- The present technology relates generally to systems and methods for information review and analysis, and more specifically, but not by way of limitation, to systems and methods for publishing datasets.
- According to some embodiments, the systems and methods provided herein may be configured to index and/or collect and publish datasets. As background, electronic discovery, regulatory and investigative processes involve sensitive document review and analysis processes which often involve numerous time consuming and error prone steps associated with the gathering of relevant datasets in a legally defensible manner.
- Dataset collection, review, and analysis often begins with the filing of a lawsuit, commencement of an investigation or other significant event that requires the review and analysis of certain types of datasets. These datasets may include many divers of similar data formats associated with user created files, electronic mail, audio, video, and the like. Datasets may be collected and analyzed from a variety of document sources such as email systems and repositories, file shares, storage media, third party web sites/services, and so forth. Because document review and analysis processes rely heavily on manual human action for certain steps, such as and the publishing of the collected and processed data to document review location, these processes are often error prone and time consuming. For example, datasets may be mistakenly omitted (i.e., not collected or improperly reviewed), leading to costly errors.
- In some instances, the collected datasets may be published to a virtual segregated document review environment or other secure document review construct that provides a secure location for the review and analysis of the collected datasets. These segregated document review environments provide a location (either secured or unsecured) where dataset review personnel may analyze and evaluate datasets that have been identified and selected for review.
- According to some embodiments, the present technology may be directed to methods for publishing datasets. The methods may comprise: (a) receiving a request to publish a dataset to at least one of an internal environment located within a secured zone and an external environment located outside the secured zone, the request comprising at least one selection criteria; (b) selecting the dataset based upon the at least one selection criteria, the dataset being selected from an index of collected datasets; and (c) responsive to the request, publishing the dataset to at least one of the internal environment and the external environment.
- According to additional exemplary embodiments, the present technology may be directed to an early case assessment system for publishing datasets. These systems may include: (a) a memory for storing executable instructions for publishing datasets; (b) a processor for executing the instructions stored in memory, the executable instructions comprising: (1) a core gather module that receives requests to publish a dataset to at least one of an internal environment located within a secured zone of the early case assessment system and an external environment located outside the secured zone, the request comprising at least one selection criteria; and (2) a core analysis module communicatively coupled to the core gather module that (i) selects the dataset based upon a request received by the core gather module, the dataset assembly module selecting the dataset from an index of collected datasets based upon the at least one selection criteria; and (ii) publishes the dataset to at least one of the internal environment and the external environment based upon a request received by the core gather module.
-
FIG. 1 is a schematic diagram of an exemplary architecture of a system for publishing datasets. The system may be utilized to practice aspects of the present technology. -
FIG. 2 is an exemplary flow diagram of a method for publishing datasets. -
FIG. 3 is a block diagram of an exemplary computing system that may be utilized to practice aspects of the present disclosure. - While this technology is susceptible of embodiment in many different forms, there is shown in the figures and will herein be described in detail several specific embodiments with the understanding that the present disclosure is to be considered as an exemplification of the principles of the technology and is not intended to limit the technology to the embodiments illustrated.
- Generally speaking, the systems provided herein may include electronic dataset review systems that identify, process, and publish datasets (e.g., documents, electronic mail, and so forth) from a variety of data sources such as an enterprise email server, document repositories, and the like. All datasets identified and/or collected comprise a corpus of reviewable datasets that are available for analysis and/or review. According to some embodiments, each dataset may be crawled and processed by the system according to guidelines established for the system by one or more dataset review administrators.
- The systems and methods provided herein may automatically and continuously review datasets as they become available. It will be understood that the term “review” may be understood to include additional functions such as analyze, organize, enrich, prioritize, code, batch, and so forth. Datasets may be evaluated for keywords, concepts or other metadata that define the content or subject matter of the dataset and potential relevance of individual pieces of data. For example, the system may determine keywords in the body of a particular electronic mail communication. The subject matter of the email communication may be inferred from the occurrence of particular words or phrases or combinations thereof within the dataset. In some embodiments, the system may utilize any one of a number of algorithms to process and evaluate the subject matter and content of the datasets and their associated metadata to create an index of collected datasets according to the parameters set by the user. For example, email communications may be parsed for terms, dates, subject matter, email addresses, and the like. These terms are then associated with the email communication in the index such that searching the index for a term that is included in the email will yield the email as a search result.
- Upon the occurrence of a triggering event (e.g., commencement of a lawsuit, investigation or other similar event), system users may search the index of datasets for relevant documents according to criteria. Criteria may include keyword searching, filtering according to parameters, and so forth. Once relevant data have been determined from the index and/or original location, the system may copy the relevant datasets from index and publish the same to a document review location.
- According to some embodiments, the document review location may include a virtual segregated document review environment, a secure document review location within the system, and/or a third party document review location. For example, the relevant documents may be communicated to the computing system of the user, the computing system of the user's legal counsel, to regulatory bodies, or the relevant documents may be retained for review and analysis within the system itself.
- It is noteworthy to mention that the more frequently a dataset is communicated between computing systems, the more likely that the dataset may be subject to unwanted exposure to third parties. Therefore, sensitive datasets may be retained within a secure zone of the system, or secure communications protocols may be established for accessing the datasets if they are published outside the secure zone of the system.
-
FIG. 1 illustrates anexemplary architecture 100 that may include an early case assessment system, hereinafter “system 105” that, in turn, includes adata collection module 110, a core analysis and publishmodule 115, aninternal review environment 120, and anexternal review environment 125, can be utilized as a part of the early case assessment system,system 105, or as an external review environment to which data is published from the earlycase assessment system 105. It is noteworthy that thesystem 105 may include additional modules, engines, or components, and still fall within the scope of the present technology. As used herein, the term “module” may also refer to any of an application-specific integrated circuit (“ASIC”), an electronic circuit, a processor (shared, dedicated, or group) that executes one or more software or firmware programs, a combinational logic circuit, and/or other suitable components that provide the described functionality. In other embodiments, individual modules of thesystem 105 may include separately configured web servers. - Generally speaking, the
system 105 may communicatively couple with any one of a plurality of client devices such as an end user node 130 a, anattorney node 130 b, anIT node 130 c, and aninvestigator node 130 d. While these nodes represent descriptive or exemplary end user client devices, one of ordinary skill in the art will appreciate that many other types of client device nodes may likewise be utilized in accordance with the present technology. It is further noteworthy to mention that each of the nodes 130 a-d may include an end user computing system such ascomputing system 300, which will be described in greater detail with reference toFIG. 3 . - Each of the nodes 130 a-d may communicatively couple with the
system 105 via any of a private and/or public communications network such as the Internet. - According to some embodiments, the
system 105 may include a cloud based computing environment that collects, processes, analyzes, and publishes datasets. In general, a cloud-based computing environment is a resource that typically combines the computational power of a large grouping of processors and/or that combines the storage capacity of a large grouping of computer memories or storage devices. For example, systems that provide a cloud resource may be utilized exclusively by their owners, such as Google™ or Yahoo!™; or such systems may be accessible to outside users who deploy applications within the computing infrastructure to obtain the benefit of large computational or storage resources. - The cloud may be formed, for example, by a network of web servers with each server (or at least a plurality thereof) providing processor and/or storage resources. These servers may manage workloads provided by multiple users (e.g., cloud resource customers or other users). Typically, each user places workload demands upon the cloud that vary in real-time, sometimes dramatically. The nature and extent of these variations typically depend on the type of business associated with the user.
- The
system 105 may generate a variety of graphical user interfaces that allow end users to interact with thesystem 105. For example, the system may generate aweb interface 140 that allows end users to enter search queries for searching indexes, establish dataset collection criteria, or automatically obtain and publish datasets to at least one of an internal document review location, such as aninternal review environment 120, and anexternal review environment 125. It is noteworthy to mention that theexternal review environment 125 may be located outside afirewall 145 of thesystem 105. As such, accessing datasets published to theinternal review environment 120 may require the use of security credentials (e.g., secure communication protocols) to gain access through thefirewall 145. - It will be understood that components or modules behind the
firewall 145 of thesystem 105 may be referred to as being located within a “secure zone” of thesystem 105. For example, theinternal review environment 120 may be understood as being located within the secure zone because it is located behind thefirewall 145, or other suitable security devices, systems, or processes. In contrast, theexternal review environment 125 may be understood to be outside the secure zone of thesystem 105 because theexternal review environment 125 is located outside thefirewall 145. - The
web interface 140 may be accessed by a web browser application of any of the nodes 130 a-d, or may be accessed by an application resident on a mobile device such as a smartphone. - Data collection software or modules that may communicatively couple with the
data collection module 110 may be installed on, and reside on the computing systems of the client (e.g., the entity for which datasets are to be collected), such as targets 150 a-n. In other embodiments, thedata collection module 110 may utilize one or more crawlers 155 a-n that may communicatively couple with the targets 150 a-n to receive datasets therefrom. The communication of datasets from targets 150 a-n to thedata collection module 110 may occur automatically and continuously over time (e.g., synchronously), or may occur in batches (e.g., asynchronously). The term “target” may be understood to include any data source such as an email repository, storage media, and so forth. - As datasets are located by the crawlers 155 a-n, the core analysis and publish
module 115 of thedata collection module 110 may create an index of the received datasets and associated meta-data and then apply one or more algorithms to process and evaluate the subject matter and content of the datasets and their associated metadata and to create an index of collected datasets, as described above. For example, these evaluative algorithms may determine information such as keywords, dates, times, and other pertinent information included in the dataset. The core analysis and publishmodule 115 may also analyze the metadata of the dataset. - The core gather
module 160 may establish and maintain individual indexes for each of targets 150 a-n, or may utilize one index for each client that includes information for each target 150 a-n of the client. - Additionally, the core gather
module 160 may store the collected datasets in a storage medium (not shown) communicatively coupled with thesystem 105. - Upon the occurrence of a triggering event, a user may interact with the
system 105 via their client device, such as end user node 130 a. That is, users may access thesystem 105 via theweb interface 140 utilizing their end user node 130 a. Users may review, analyze and code relevant datasets by first searching or analyzing filtered datasets from an index or a combination of indexes. - For example, the end user may desire to locate all datasets that mention a particular keyword or group of keywords or relate to a particular concept. The core gather
module 160 of thesystem 105 may compare the query to the index to locate and obtain relevant datasets that correspond to received queries. - In some embodiments, the core analysis and publish
module 115 may make a copy of the index or indexes and select datasets before obtaining the datasets for publishing. Additionally, the core gathermodule 160 may lock each selected dataset to prevent tampering or modification to the datasets. Moreover, the core gathermodule 160 may utilize any other necessary protective or security related procedures to ensure that the datasets are obtained and maintained in a legally defensible manner. - After selecting the desired datasets, the core analysis and publish
module 115 may prepare the datasets for export or publishing to a document review or analysis location by encapsulating the datasets into files and combining the datasets into batches. In other embodiments, datasets may be published in their raw or native format. - When the core analysis and publish
module 115 has prepared the datasets for publishing, the core analysis and publishmodule 115 may publish the datasets to at least one of theinternal review environment 120 and theexternal review environment 125. - According to some embodiments, the core analysis and publish
module 115 may publish the datasets to a document review location via any one of a number of methods. For example, the core analysis and publishmodule 115 may publish the datasets by uploading the datasets via a file transfer protocol (“FTP”), or other computer networking protocol to one of the aforementioned locations. In other embodiments, the core analysis and publishmodule 115 may publish the datasets to a third party location, such as theexternal review environment 125. In additional embodiments, the core analysis and publishmodule 115 may stream the datasets to the desired review location, either in batches or as individual datasets. - As stated previously, the core analysis and publish
module 115 may publish the datasets to anexternal review environment 125 that exists beyond or outside of thefirewall 145 of thesystem 105. Exemplary external nodes may include, but are not limited to, a web server or storage media associated with a user's web server or a third party web-based destination per the user's selection, and so forth. - It is noteworthy to mention that, because the
system 105 indexes datasets on a periodic or continual basis, requests to obtain and publish datasets may result in thesystem 105 automatically obtaining datasets from the corpus of available datasets without need for further evaluation of the datasets. That is, thesystem 105 may pre-analyze and indexes the datasets prior to the occurrence of a triggering event such that upon the occurrence of a triggering event, the datasets are easily and seamlessly obtained (should it meet certain defined parameters or criteria) and published to the appropriate document review location(s). - Assuming datasets have been published to an
external review environment 125, adocument review team 165 may access the datasets within the environment or may transfer the datasets from the environment for use on a local computing system. It will be understood that thedocument review team 165 may include a plurality of individual computing systems. Thedocument review team 165 may access theexternal review environment 125 utilizing secure communications protocols, although in some embodiments, thedocument review team 165 may access theexternal review environment 125 without the requirement of a secure communications path. This scenario is more common when the datasets do not include sensitive information such as credit card data, social security numbers, or health record data. - Alternatively, users may access datasets via the
internal review environment 120 from a web based interface, such asweb interface 140, or by way of an additional review application (not shown) that may reside within a cloud based computing environment. - It will be understood that according to some embodiments, the
external review environment 125 may include a cloud based computing environment that is configured to store selected and published datasets. The datasets may be stored in the cloud computing environment in virtual storage media or physical storage media that are distributed across a plurality of web servers. -
FIG. 2 illustrates a flowchart of anexemplary method 200 for publishing a dataset. The method may begin with thestep 205 of crawling (or otherwise parsing, evaluating, analyzing, and so forth) the datasets from one or more data sources or targets. As datasets are crawled, the method may include astep 210 of indexing the datasets. Indexing of datasets may include applying statistical or analytical algorithms or other mechanisms for determining the subject matter and/or content of the crawled datasets. Keywords, dates, times, and other descriptive information relative to the datasets may be stored in the index and associated with the datasets from which they were extracted. - Upon the occurrence of a triggering event (such as the commencement of a lawsuit), the method may include a
step 215 of the system receiving a request to publish a dataset to at least one of an internal environment located within a secured zone and an external environment located outside the secured zone of the system. - Next, the method may include the
step 220 of selecting or obtaining the dataset based upon the at least one selection criteria. It is noteworthy to mention that the dataset may be selected from the index of collected datasets that is maintained by the system. - After the selection of datasets, the method may include a
step 225 of publishing the dataset to at least one of the internal environment and the external environment. Themethod 200 may then return to step 205 in anticipation of an additional triggering event. - It will be understood that in some embodiments, the method may include additional or fewer steps. For example, other exemplary methods may include only the steps of receiving requests, selecting datasets, and publishing the selected datasets.
- It is noteworthy to mention that in alternative embodiments, the system may collect and/or store datasets from one or more targets before the datasets are indexed by the system.
-
FIG. 3 illustrates anexemplary computing system 300 that may be used to implement an embodiment of the present technology. Thecomputing system 300 ofFIG. 3 includes one ormore processors 310 andmain memory 320.Main memory 320 stores, in part, instructions and data for execution byprocessor 310.Main memory 320 can store the executable code when thecomputing system 300 is in operation. Thecomputing system 300 ofFIG. 3 may further include amass storage device 330,portable storage devices 340,output devices 350,user input devices 360, agraphics display system 370, and otherperipheral devices 380. - The components shown in
FIG. 3 are depicted as being connected via asingle bus 390. The components may be connected through one or more data transport means. One ormore processors 310 andmain memory 320 may be connected via a local microprocessor bus, and themass storage device 330, peripheral device(s) 380, portable storage device(s) 340, andgraphics display system 370 may be connected via one or more input/output (I/O) buses. -
Mass storage device 330, which may be implemented with a magnetic disk drive or an optical disk drive, is a non-volatile storage device for storing data and instructions for use by one ormore processors 310.Mass storage device 330 can store the system software for implementing embodiments of the present technology for purposes of loading that software intomain memory 320. - Portable storage device(s) 340 operates in conjunction with a portable non-volatile storage media, such as a floppy disk, compact disk or digital video disc, to input and output data and code to and from the
computing system 300 ofFIG. 3 . The system software for implementing embodiments of the present technology may be stored on such a portable media and input to thecomputing system 300 via the portable storage device(s) 340. -
User input devices 360 provide a portion of a user interface.User input devices 360 may include an alphanumeric keypad, such as a keyboard, for inputting alphanumeric and other information, or a pointing device, such as a mouse, a trackball, stylus, or cursor direction keys. Additionally, thecomputing system 300 as shown inFIG. 3 includesoutput devices 350. Suitable output devices include speakers, printers, network interfaces, and monitors. - Graphics display
system 370 may include a liquid crystal display (LCD) or other suitable display device. Graphics displaysystem 370 receives textual and graphical information, and processes the information for output to the display device. -
Peripheral devices 380 may include any type of computer support device to add additional functionality to thecomputing system 300. Peripheral device(s) 380 may include a modem or a router. - The components contained in the
computing system 300 ofFIG. 3 are those typically found in computer systems that may be suitable for use with embodiments of the present technology and are intended to represent a broad category of such computer components that are well known in the art. Thus, thecomputing system 300 ofFIG. 3 can be a personal computer, hand held computing system, telephone, mobile computing system, workstation, server, minicomputer, mainframe computer, or any other computing system. The computer can also include different bus configurations, networked platforms, multi-processor platforms, etc. Various operating systems can be used including UNIX, Linux, Windows, Macintosh OS, Palm OS, and other suitable operating systems. - Some of the above-described functions may be composed of instructions that are stored on storage media (e.g., computer-readable media). The instructions may be retrieved and executed by the one or
more processors 310. Some examples of storage media are memory devices, tapes, disks, and the like. The instructions are operational when executed by the one ormore processors 310 to direct the one ormore processors 310 to operate in accord with the technology. Those skilled in the art are familiar with instructions, processor(s), and storage media. - It is noteworthy that any hardware platform suitable for performing the processing described herein is suitable for use with the technology. The terms “computer-readable storage media” and “computer-readable storage media” as used herein refer to any media or media that participate in providing instructions to a CPU for execution. Such media can take many forms, including, but not limited to, non-volatile media, volatile media and transmission media. Non-volatile media include, for example, optical or magnetic disks, such as a fixed disk. Volatile media include dynamic memory, such as system RAM. Transmission media include coaxial cables, copper wire and fiber optics, among others, including the wires that comprise one embodiment of a bus. Transmission media can also take the form of acoustic or light waves, such as those generated during radio frequency (RF) and infrared (IR) data communications. Common forms of computer-readable media include, for example, a floppy disk, a flexible disk, a hard disk, magnetic tape, any other magnetic media, a CD-ROM disk, digital video disk (DVD), any other optical media, any other physical media with patterns of marks or holes, a RAM, a PROM, an EPROM, an EEPROM, a FLASHEPROM, any other memory chip or data exchange adapter, a carrier wave, or any other media from which a computer can read.
- Various forms of computer-readable media may be involved in carrying one or more sequences of one or more instructions to a CPU for execution. A bus carries the data to system RAM, from which a CPU retrieves and executes the instructions. The instructions received by system RAM can optionally be stored on a fixed disk either before or after execution by a CPU.
- The above description is illustrative and not restrictive. Many variations of the technology will become apparent to those of skill in the art upon review of this disclosure. The scope of the technology should, therefore, be determined not with reference to the above description, but instead should be determined with reference to the appended claims along with their full scope of equivalents.
Claims (25)
1.-8. (canceled)
9. A method for publishing datasets, comprising:
receiving a request to publish a dataset to an early case assessment system having both an internal environment and an external environment separated by a firewall, the internal environment located within a secured zone, the secured zone being defined by the firewall, and the external environment located outside the secured zone, the request comprising at least one selection criteria;
selecting, by a processor of the early case assessment system, the dataset based upon the at least one selection criteria, the dataset being selected from at least one index of collected datasets, the at least one index of collected datasets including terms associated with communications in the collected datasets; and
responsive to the request, publishing, via a module stored in memory and executable by a processor, the dataset to at least one of the internal environment and the external environment.
10. The method of claim 9 , wherein the at least one index comprises an index of collected datasets exists for each of a plurality of targets.
11. The method of claim 9 , wherein the at least one index comprises an index of collected datasets is associated with a single client.
12. The method of claim 9 , wherein indexing occurs on a periodic basis.
13. The method of claim 9 , wherein a sensitive dataset is transmitted using a secure protocol when published outside of the secured zone.
14. The method of claim 9 , wherein upon an occurrence of a triggering event, the at least one index of collected datasets is searched for relevant documents according to criteria.
15. The method of claim 14 , wherein the triggering event includes a lawsuit.
16. The method of claim 14 , wherein the triggering event includes an investigation.
17. An early case assessment system for publishing datasets, comprising:
a memory for storing executable instructions for publishing datasets; and
a processor for executing the instructions stored in memory, the executable instructions comprising:
a core gather module that receives requests to publish a dataset to an internal environment or an external environment of the early case assessment system, wherein the internal and external environments are separated by a firewall, the internal environment located within a secured zone, the secured zone being defined by the firewall, and an external environment located outside the secured zone, the request comprising at least one selection criteria; and
a core analysis module communicatively coupled to the core gather module that:
selects the dataset based upon a request received by the core gather module, the core analysis module selecting the dataset being from at least one index of collected datasets based upon the at least one selection criteria, the at least one index of collected datasets including terms associated with communications in the collected datasets; and
publishes the dataset to at least one of the internal environment and the external environment based upon a request received by the core gather module.
18. The system of claim 17 , wherein the at least one index comprises an index of collected datasets exists for each of a plurality of targets.
19. The system of claim 17 , wherein the at least one index comprises an index of collected datasets is associated with a single client.
20. The system of claim 17 , wherein indexing occurs on a periodic basis.
21. The system of claim 17 , wherein a sensitive dataset is transmitted using a secure protocol when published outside of the secured zone.
22. The system of claim 17 , wherein upon an occurrence of a triggering event, the at least one index of collected datasets is searched for relevant documents according to criteria.
23. The system of claim 22 , wherein the triggering event includes a lawsuit.
24. The system of claim 22 , wherein the triggering event includes an investigation.
25. A system for publishing datasets, comprising:
an internal environment and an external environment that are separated from one another by a firewall;
a memory for storing executable instructions for publishing datasets; and
a processor for executing the instructions stored in memory, the executable instructions comprising:
a core gather module that receives requests to publish a dataset to the internal environment located within a secured zone of the system, the secure zone being defined by the firewall, the request comprising at least one selection criteria, the core gather module locking the dataset to prevent modification, wherein the dataset includes a sensitive dataset; and
a core analysis module communicatively coupled to the core gather module that:
selects the dataset based upon a request received by the core gather module, the core analysis module selecting the dataset being from at least one index of collected dataset based upon the at least one selection criteria, the at least one index of collected datasets formed by parsing communications in the at least one index of collected datasets and including terms associated with the communications; and
publishes the dataset to the internal environment based upon a request received by the core gather module, wherein non-sensitive datasets are published to the external environment.
26. The system of claim 25 , wherein the at least one index comprises an index of collected datasets exists for each of a plurality of targets.
27. The system of claim 25 , wherein the at least one index comprises an index of collected datasets is associated with a single client.
28. The system of claim 25 , wherein indexing occurs on a periodic basis.
29. The system of claim 25 , wherein a sensitive dataset is transmitted using a secure protocol when published outside of the secure zone.
30. The system of claim 25 , wherein upon the occurrence of a triggering event, the at least one index of collected datasets is searched for relevant documents according to criteria.
31. The system of claim 30 , wherein the triggering event includes a lawsuit.
32. The system of claim 30 , wherein the triggering event includes an investigation.
Priority Applications (1)
Application Number | Priority Date | Filing Date | Title |
---|---|---|---|
US14/452,481 US20140351890A1 (en) | 2011-06-04 | 2014-08-05 | Systems and Methods for Publishing Datasets |
Applications Claiming Priority (3)
Application Number | Priority Date | Filing Date | Title |
---|---|---|---|
US13/153,420 US8793215B2 (en) | 2011-06-04 | 2011-06-04 | Systems and methods for publishing datasets |
US13/745,625 US8843442B2 (en) | 2011-06-04 | 2013-01-18 | Systems and methods for publishing datasets |
US14/452,481 US20140351890A1 (en) | 2011-06-04 | 2014-08-05 | Systems and Methods for Publishing Datasets |
Related Parent Applications (1)
Application Number | Title | Priority Date | Filing Date |
---|---|---|---|
US13/745,625 Continuation US8843442B2 (en) | 2011-06-04 | 2013-01-18 | Systems and methods for publishing datasets |
Publications (1)
Publication Number | Publication Date |
---|---|
US20140351890A1 true US20140351890A1 (en) | 2014-11-27 |
Family
ID=47262438
Family Applications (4)
Application Number | Title | Priority Date | Filing Date |
---|---|---|---|
US13/153,420 Active 2032-02-15 US8793215B2 (en) | 2011-06-04 | 2011-06-04 | Systems and methods for publishing datasets |
US13/745,625 Active US8843442B2 (en) | 2011-06-04 | 2013-01-18 | Systems and methods for publishing datasets |
US14/298,669 Active US9374375B2 (en) | 2011-06-04 | 2014-06-06 | Systems and methods for publishing datasets |
US14/452,481 Abandoned US20140351890A1 (en) | 2011-06-04 | 2014-08-05 | Systems and Methods for Publishing Datasets |
Family Applications Before (3)
Application Number | Title | Priority Date | Filing Date |
---|---|---|---|
US13/153,420 Active 2032-02-15 US8793215B2 (en) | 2011-06-04 | 2011-06-04 | Systems and methods for publishing datasets |
US13/745,625 Active US8843442B2 (en) | 2011-06-04 | 2013-01-18 | Systems and methods for publishing datasets |
US14/298,669 Active US9374375B2 (en) | 2011-06-04 | 2014-06-06 | Systems and methods for publishing datasets |
Country Status (3)
Country | Link |
---|---|
US (4) | US8793215B2 (en) |
EP (1) | EP2718802A4 (en) |
WO (1) | WO2012170050A1 (en) |
Cited By (1)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
US9374375B2 (en) | 2011-06-04 | 2016-06-21 | Recommind, Inc. | Systems and methods for publishing datasets |
Families Citing this family (11)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
US9485300B2 (en) * | 2012-03-13 | 2016-11-01 | Yahoo! Inc. | Publish-subscribe platform for cloud file distribution |
US9229935B2 (en) * | 2013-01-24 | 2016-01-05 | International Business Machines Corporation | Simulating accesses for archived content |
US20150149410A1 (en) * | 2013-11-27 | 2015-05-28 | Microsoft Corporation | Publishing to a content delivery system |
US10877955B2 (en) * | 2014-04-29 | 2020-12-29 | Microsoft Technology Licensing, Llc | Using lineage to infer data quality issues |
US10089409B2 (en) | 2014-04-29 | 2018-10-02 | Microsoft Technology Licensing, Llc | Event-triggered data quality verification |
US11012806B2 (en) * | 2015-01-09 | 2021-05-18 | Ariba, Inc. | Multi-adapter support in the cloud |
US11080284B2 (en) * | 2015-05-01 | 2021-08-03 | Microsoft Technology Licensing, Llc | Hybrid search connector |
US10055301B2 (en) * | 2015-06-15 | 2018-08-21 | Open Text Sa Ulc | Systems and methods for content server make disk image operation |
US10417439B2 (en) | 2016-04-08 | 2019-09-17 | Google Llc | Post-hoc management of datasets |
US10043031B2 (en) | 2016-11-08 | 2018-08-07 | Ebay Inc. | Secure management of user addresses in network service |
CN107172075B (en) * | 2017-06-26 | 2021-09-21 | 努比亚技术有限公司 | Communication method, system and readable storage medium based on network isolation |
Family Cites Families (26)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
US6944662B2 (en) | 2000-08-04 | 2005-09-13 | Vinestone Corporation | System and methods providing automatic distributed data retrieval, analysis and reporting services |
US6725393B1 (en) | 2000-11-06 | 2004-04-20 | Hewlett-Packard Development Company, L.P. | System, machine, and method for maintenance of mirrored datasets through surrogate writes during storage-area network transients |
US7178099B2 (en) | 2001-01-23 | 2007-02-13 | Inxight Software, Inc. | Meta-content analysis and annotation of email and other electronic documents |
EA003114B1 (en) | 2001-04-19 | 2003-02-27 | Сергей Владимирович Исаков | Method for publishing, retrieving, collecting and exchanging information in global and local networks |
US8176334B2 (en) * | 2002-09-30 | 2012-05-08 | Guardian Data Storage, Llc | Document security system that permits external users to gain access to secured files |
US6993534B2 (en) | 2002-05-08 | 2006-01-31 | International Business Machines Corporation | Data store for knowledge-based data mining system |
US7792828B2 (en) | 2003-06-25 | 2010-09-07 | Jericho Systems Corporation | Method and system for selecting content items to be presented to a viewer |
US7974850B2 (en) | 2003-09-26 | 2011-07-05 | Brideway Software, Inc. | Method of early case assessment in law suits |
US7552109B2 (en) | 2003-10-15 | 2009-06-23 | International Business Machines Corporation | System, method, and service for collaborative focused crawling of documents on a network |
US7383253B1 (en) | 2004-12-17 | 2008-06-03 | Coral 8, Inc. | Publish and subscribe capable continuous query processor for real-time data streams |
US20110110568A1 (en) | 2005-04-08 | 2011-05-12 | Gregory Vesper | Web enabled medical image repository |
US7599934B2 (en) | 2005-09-27 | 2009-10-06 | Microsoft Corporation | Server side filtering and sorting with field level security |
US7680937B2 (en) * | 2005-12-22 | 2010-03-16 | Microsoft Corporation | Content publication |
US7627432B2 (en) | 2006-09-01 | 2009-12-01 | Spss Inc. | System and method for computing analytics on structured data |
WO2008033424A2 (en) * | 2006-09-12 | 2008-03-20 | Foleeo, Inc. | Hive-based peer-to-peer network |
US7908281B2 (en) | 2006-11-22 | 2011-03-15 | Architecture Technology Corporation | Dynamic assembly of information pedigrees |
US7769806B2 (en) | 2007-10-24 | 2010-08-03 | Social Communications Company | Automated real-time data stream switching in a shared virtual area communication environment |
US7934066B2 (en) | 2008-04-30 | 2011-04-26 | Netapp, Inc. | Extensible application backup system and method |
US8548950B2 (en) | 2008-05-22 | 2013-10-01 | The Board Of Trustees Of The Leland Stanford Junior University | Method and system for data archiving |
US8301759B2 (en) | 2008-10-24 | 2012-10-30 | Microsoft Corporation | Monitoring agent programs in a distributed computing platform |
US8208459B2 (en) | 2008-11-10 | 2012-06-26 | Qualcomm Incorporated | Partitioned traffic segment communications methods and apparatus |
US20100205153A1 (en) | 2009-02-12 | 2010-08-12 | Accenture Global Services Gmbh | Data System Architecture to Analyze Distributed Data Sets |
US20110004590A1 (en) | 2009-03-02 | 2011-01-06 | Lilley Ventures, Inc. Dba Workproducts, Inc. | Enabling management of workflow |
CA2763364C (en) | 2009-05-29 | 2017-08-22 | Walter H. Runkis | Secure storage and accelerated transmission of information over communication networks |
US20100333116A1 (en) | 2009-06-30 | 2010-12-30 | Anand Prahlad | Cloud gateway system for managing data storage to cloud storage sites |
US8793215B2 (en) | 2011-06-04 | 2014-07-29 | Recommind, Inc. | Systems and methods for publishing datasets |
-
2011
- 2011-06-04 US US13/153,420 patent/US8793215B2/en active Active
- 2011-07-29 WO PCT/US2011/046021 patent/WO2012170050A1/en unknown
- 2011-07-29 EP EP11867142.9A patent/EP2718802A4/en not_active Ceased
-
2013
- 2013-01-18 US US13/745,625 patent/US8843442B2/en active Active
-
2014
- 2014-06-06 US US14/298,669 patent/US9374375B2/en active Active
- 2014-08-05 US US14/452,481 patent/US20140351890A1/en not_active Abandoned
Cited By (1)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
US9374375B2 (en) | 2011-06-04 | 2016-06-21 | Recommind, Inc. | Systems and methods for publishing datasets |
Also Published As
Publication number | Publication date |
---|---|
US20140304274A1 (en) | 2014-10-09 |
US8843442B2 (en) | 2014-09-23 |
US9374375B2 (en) | 2016-06-21 |
EP2718802A4 (en) | 2014-11-05 |
US20120310884A1 (en) | 2012-12-06 |
US8793215B2 (en) | 2014-07-29 |
EP2718802A1 (en) | 2014-04-16 |
WO2012170050A1 (en) | 2012-12-13 |
US20130132344A1 (en) | 2013-05-23 |
Similar Documents
Publication | Publication Date | Title |
---|---|---|
US9374375B2 (en) | Systems and methods for publishing datasets | |
US11921672B2 (en) | Query execution at a remote heterogeneous data store of a data fabric service | |
US20200050607A1 (en) | Reassigning processing tasks to an external storage system | |
US10366053B1 (en) | Consistent randomized record-level splitting of machine learning data | |
US10713589B1 (en) | Consistent sort-based record-level shuffling of machine learning data | |
US10540606B2 (en) | Consistent filtering of machine learning data | |
US9135261B2 (en) | Systems and methods for facilitating data discovery | |
US20150379072A1 (en) | Input processing for machine learning | |
US11374948B2 (en) | Computer security system with network traffic analysis | |
JP2006107446A (en) | Batch indexing system and method for network document | |
US9177249B2 (en) | Scientometric methods for identifying emerging technologies | |
US11949702B1 (en) | Analysis and mitigation of network security risks | |
US11720824B1 (en) | Visualizing outliers from timestamped event data using machine learning-based models | |
US11681707B1 (en) | Analytics query response transmission | |
US11450419B1 (en) | Medication security and healthcare privacy systems | |
US11567735B1 (en) | Systems and methods for integration of multiple programming languages within a pipelined search query | |
US11151308B2 (en) | Electronic document processing system | |
US11494381B1 (en) | Ingestion and processing of both cloud-based and non-cloud-based data by a data intake and query system | |
US11748634B1 (en) | Systems and methods for integration of machine learning components within a pipelined search query to generate a graphic visualization | |
US20220103577A1 (en) | Threat Mapping Engine | |
Quick et al. | Quick analysis of digital forensic data | |
US8086993B2 (en) | Sorting tables in a page based approach | |
JP2020154381A (en) | Information processing system, information processing device, information processing method, and program | |
US11829378B1 (en) | Automated generation of insights for machine generated data | |
US11836146B1 (en) | Storing indexed fields per source type as metadata at the bucket level to facilitate search-time field learning |
Legal Events
Date | Code | Title | Description |
---|---|---|---|
AS | Assignment |
Owner name: RECOMMIND, INC., CALIFORNIA Free format text: ASSIGNMENT OF ASSIGNORS INTEREST;ASSIGNOR:TENNANT, ROBERT;REEL/FRAME:033584/0663 Effective date: 20110726 |
|
STCB | Information on status: application discontinuation |
Free format text: ABANDONED -- FAILURE TO RESPOND TO AN OFFICE ACTION |