US20180218004A1 - Analytics and deduplication for air-gapped log analysis - Google Patents

Analytics and deduplication for air-gapped log analysis Download PDF

Info

Publication number
US20180218004A1
US20180218004A1 US15/420,855 US201715420855A US2018218004A1 US 20180218004 A1 US20180218004 A1 US 20180218004A1 US 201715420855 A US201715420855 A US 201715420855A US 2018218004 A1 US2018218004 A1 US 2018218004A1
Authority
US
United States
Prior art keywords
log entry
log
image
component
significant
Prior art date
Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
Abandoned
Application number
US15/420,855
Inventor
Itzhack Goldberg
Harry R. McGregor
Christopher B. Moore
Neil Sondhi
Current Assignee (The listed assignees may be inaccurate. Google has not performed a legal analysis and makes no representation or warranty as to the accuracy of the list.)
International Business Machines Corp
Original Assignee
International Business Machines Corp
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
Filing date
Publication date
Application filed by International Business Machines Corp filed Critical International Business Machines Corp
Priority to US15/420,855 priority Critical patent/US20180218004A1/en
Assigned to INTERNATIONAL BUSINESS MACHINES CORPORATION reassignment INTERNATIONAL BUSINESS MACHINES CORPORATION ASSIGNMENT OF ASSIGNORS INTEREST (SEE DOCUMENT FOR DETAILS). Assignors: MCGREGOR, HARRY R., GOLDBERG, ITZHACK, SONDHI, NEIL, MOORE, CHRISTOPHER B.
Priority to US15/697,897 priority patent/US9996545B1/en
Priority to US15/970,319 priority patent/US10255289B2/en
Priority to US15/970,360 priority patent/US10248658B2/en
Publication of US20180218004A1 publication Critical patent/US20180218004A1/en
Abandoned legal-status Critical Current

Links

Images

Classifications

    • G06F17/30156
    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06FELECTRIC DIGITAL DATA PROCESSING
    • G06F21/00Security arrangements for protecting computers, components thereof, programs or data against unauthorised activity
    • G06F21/60Protecting data
    • G06F21/602Providing cryptographic facilities or services
    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06FELECTRIC DIGITAL DATA PROCESSING
    • G06F16/00Information retrieval; Database structures therefor; File system structures therefor
    • G06F16/10File systems; File servers
    • G06F16/17Details of further file system functions
    • G06F16/174Redundancy elimination performed by the file system
    • G06F16/1748De-duplication implemented within the file system, e.g. based on file segments
    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06FELECTRIC DIGITAL DATA PROCESSING
    • G06F16/00Information retrieval; Database structures therefor; File system structures therefor
    • G06F16/50Information retrieval; Database structures therefor; File system structures therefor of still image data
    • G06F16/58Retrieval characterised by using metadata, e.g. metadata not derived from the content or metadata generated manually
    • G06F16/583Retrieval characterised by using metadata, e.g. metadata not derived from the content or metadata generated manually using metadata automatically derived from the content
    • G06F16/5846Retrieval characterised by using metadata, e.g. metadata not derived from the content or metadata generated manually using metadata automatically derived from the content using extracted text
    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06FELECTRIC DIGITAL DATA PROCESSING
    • G06F17/00Digital computing or data processing equipment or methods, specially adapted for specific functions
    • G06F17/30253
    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06KGRAPHICAL DATA READING; PRESENTATION OF DATA; RECORD CARRIERS; HANDLING RECORD CARRIERS
    • G06K19/00Record carriers for use with machines and with at least a part designed to carry digital markings
    • G06K19/06Record carriers for use with machines and with at least a part designed to carry digital markings characterised by the kind of the digital marking, e.g. shape, nature, code
    • G06K19/06009Record carriers for use with machines and with at least a part designed to carry digital markings characterised by the kind of the digital marking, e.g. shape, nature, code with optically detectable marking
    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06KGRAPHICAL DATA READING; PRESENTATION OF DATA; RECORD CARRIERS; HANDLING RECORD CARRIERS
    • G06K7/00Methods or arrangements for sensing record carriers, e.g. for reading patterns
    • G06K7/10Methods or arrangements for sensing record carriers, e.g. for reading patterns by electromagnetic radiation, e.g. optical sensing; by corpuscular radiation
    • G06K7/14Methods or arrangements for sensing record carriers, e.g. for reading patterns by electromagnetic radiation, e.g. optical sensing; by corpuscular radiation using light without selection of wavelength, e.g. sensing reflected white light
    • G06K7/1404Methods for optical code recognition
    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06FELECTRIC DIGITAL DATA PROCESSING
    • G06F21/00Security arrangements for protecting computers, components thereof, programs or data against unauthorised activity
    • G06F21/60Protecting data
    • G06K2209/01

Definitions

  • the present disclosure relates to data analytics and, more specifically, to deduplication of air-gapped system logs.
  • Technical support for users' computer systems can be provided when system logs are relayed to a technical support group.
  • Information in the user's system logs can allow the technical support group to identify problems that the system may be experiencing, allowing the support group to provide solutions to these problems.
  • the transmission of the system logs from the user to the technical support group is automatic.
  • a user may determine whether to send logs to a technical support group.
  • these solutions do not work on air-gapped systems, which are physically separated from other networks, because the user does not have a network connection to the support group.
  • An image of a user's system logs can be captured by an image acquisition component.
  • the image can also be encrypted by the image acquisition component.
  • Text of system log entries in the image can be converted to machine-encoded text by an image processing component.
  • the machine-encoded text of the log entries can additionally be encrypted by the image processing component.
  • a log scanning component can scan the machine-encoded text of the log entries and remove redundant copes of a log entry.
  • a log analysis component in the log processing device or the text processing device can scan the log entries and determine whether a log entry is significant. If the log entry is significant, it can be flagged, and the user can be alerted.
  • FIG. 1 is a block diagram illustrating a first system log processing environment, according to embodiments of the present disclosure.
  • FIG. 2 is a block diagram illustrating a second system log processing environment, according to embodiments of the present disclosure.
  • FIG. 3 is a flowchart illustrating a process of system log deduplication and analysis, according to embodiments of the present disclosure.
  • FIG. 4 illustrates a computer system, according to embodiments of the present disclosure.
  • System logs are records of various events that take place in a computer system, and these events can be related to problems with the system. Sharing log entries related to problems in a system with a technical support group can help the support group diagnose and solve the problem. However, sharing log entries becomes more complicated when a user has an air-gapped system (e.g., one that is physically separated from other networks). If the system cannot connect to the technical support group via a network, alternative means of transmitting log entries to the group must be employed.
  • a user with an air-gapped system collects system log entries, and sends the entries to a technical support group without connecting the air-gapped system to a network.
  • the user's system logs visible on a console display, can be recorded as photographic images by a device that is not connected to the system. Photographic images of the log entries can also be collected as screenshots and transferred to a device that is not connected to the system via air-gapped transfer methods. Text in the photographic images of the logs can be converted to machine-encoded text, stored, and analyzed in the same device or in another device.
  • the system logs can additionally be deduplicated, and displayed to a user.
  • the user may have sensitive information in their system logs that they do not wish to share with a technical support group. If so, the user can sanitize the logs so that log entries containing sensitive information are removed. The user can select which logs will be sent to a technical support group, and send the logs without using a network connection between their system and the system of the technical support group.
  • FIG. 1 is a block diagram illustrating a first system log processing environment 100 , according to some embodiments.
  • Environment 100 includes a console display 105 and a log processing device 110 .
  • the log processing device 110 includes a camera 115 , a mounting adaptor 117 , an image acquisition component 120 , an image processing component 125 , and a text processing component 130 .
  • the text processing component includes a log scanning component 135 and a log analysis component 140 .
  • FIG. 1 illustrates the camera 115 , mounting adaptor 117 , image acquisition component 120 , image processing component 125 , and text processing component 130 as belonging to one device 110 , multiple devices can be involved in other embodiments.
  • An example of a processing environment 200 that includes multiple devices is discussed in greater detail with respect to FIG. 2 .
  • the console display 105 illustrated in FIG. 1 can be any display that shows a user's system logs.
  • the console display 105 is a non-cathode ray tube (non-CRT) display.
  • non-CRT displays can include plasma displays, liquid crystal displays (LCDs), light-emitting diode (LED) displays, etc.
  • System log entries can be records of system events, changes, operations, etc. The log entries can contain information about a problem with the system, and analyzing the logs can help a user or technical support group determine the best solution to the problem. In some instances, a system can produce redundant log entries. For example, a log entry may be repeated many times if it records a significant event or problem with the system.
  • An image of the user's system logs is captured from the console display 105 by the log processing device 110 as a photographic image, such as a video or a still image.
  • the image can be captured by another device, and transferred to the log processing device 110 .
  • the log processing device 110 also converts text in the captured image to machine-encoded text 127 , and the log processing device 110 processes the text, as discussed below.
  • the log processing device 110 can utilize the camera 115 , the mounting adaptor 117 , and the image acquisition component 120 to capture a photographic image.
  • the camera 115 can be included within the log processing device 110 , and, in other embodiments, the camera 115 can be externally connected to the device 110 .
  • the log processing device 110 can be positioned at a distance from the console display 105 that allows the camera 115 to capture videos and/or still images of log entry text with sufficient focus so that the log entry can be analyzed. In other embodiments, the log processing device 110 and the camera 115 can be positioned without employing the mounting adaptor 117 .
  • the image acquisition component 120 in the log processing device 110 facilitates transmission of the system log image from the camera 115 into the log processing device 110 .
  • the image acquisition component 120 encrypts images that are to be stored on the device 110 .
  • the encryption can protect the security of the images in situations where the log processing device 110 is compromised.
  • visual secret sharing encryption is used. However, any type of encryption can be employed.
  • Text in the photographic image is converted to machine-encoded text 127 by the image processing component 125 .
  • One approach to this conversion is by optical character recognition (OCR), which is a method of recognizing printed characters, though any method of converting printed characters to machine encoded text can be used.
  • OCR converts the printed characters in the photographic image of the system log to machine-encoded text 127 using photoelectric devices and computer software.
  • the image processing component 125 can additionally encrypt the machine-encoded text 127 so that it cannot be read if the log processing device 110 is compromised.
  • Text in the images captured by the image acquisition component 120 can be processed in real-time, or the images can be stored for later analysis.
  • images of the log system text can be captured from screenshots, rather than the camera 115 , and transferred to the log processing device 110 via other air-gapped file transfer methods. Examples of these methods include transfer using removable storage devices such as USB devices, flash memory cards, external hard drives, optical disks, magnetic tape, etc. in read only mode.
  • the log processing device 110 also includes a text processing component 130 , which scans and analyzes the machine-encoded text 127 generated by the image processing component 125 .
  • the text processing component 130 comprises a log scanning component 135 and a log analysis component 140 , which are discussed in greater detail below.
  • Machine-encoded system log text 127 produced by the image processing component 125 is monitored by the log scanning component 135 .
  • the log scanning component 135 determines whether there are redundant copies of log entries, and deduplicates the system logs by removing the redundant log entries.
  • the log scanning component 135 determines whether log entries are duplicates of one another by comparing the machine-encoded text 127 of the entries. For example, the log scanning component 135 can detect duplicates when log entries each contain some or all of the same words.
  • the log scanning component 135 can scan the complete machine-encoded text 127 of a log for matching words and phrases, or it can scan portions of the text. In some instances, the log scanning component 135 can detect that logs are duplicates if they each contain certain predefined keywords. For example, the log scanning component 140 can determine that logs are duplicates if each log contains the same error message.
  • all redundant copies of log entries are removed during deduplication. However, in other embodiments, a portion of the redundant copies of log entries are removed. Instructions directing the extent of deduplication can be input by a user or organization. For example, a user may specify that log entries should be deduplicated when the number of log entries surpasses a threshold number. The portion size can be any number of system log copies, and the number can be preprogrammed or specified by the user. Additionally, instructions could specify that log entries recorded at particular times be deduplicated.
  • the log scanning component 135 can additionally record the time span of deduplication and the quantity of redundant log entries that are removed.
  • the log scanning component 135 records that a log entry has a number of redundant copies that surpasses a threshold number of copies over a designated time span, this log entry may be considered significant or in some way indicative of a problem with the system.
  • a threshold number and time span is 1 , 000 redundant copies of a log entry in one hour, though other threshold numbers and time spans can be used.
  • Determination of the significance of a log entry is carried out by the log analysis component 140 . If the log analysis component 140 detects a log entry that may be significant, it flags the entry. In some cases, a log entry can be flagged when the log scanning component 135 detects many copies of it and/or its repetition over a long time span (e.g., at least one copy detected every hour for more than eight hours). There may also be cases in which a user designates certain types of log entry as significant (e.g., log entries that record system errors, scheduled updates, input from external devices, etc.). The log analysis component 140 can detect and flag these significant log entries. When a log entry is flagged, an alert can be sent to a user. In some cases, a flag can indicate that a problem in the system has been detected. Additionally, in some embodiments, user-input settings can instruct the log scanning component 135 to refrain from deduplicating all or a portion of log entries that are considered significant.
  • the user can examine the deduplicated system logs and decide what, if any, actions to take.
  • the user is alerted by sounds, pop-up messages, emails, short message service (‘SMS’ or text) messages, flashing lights, force or haptic feedback, electric impulses, etc.
  • SMS short message service
  • the user may implement a solution, or send the entry to a technical support group. Sending the log entry to a support group, such as the system vendor's technical support group, allows the group to diagnose the problem, and offer improvements to the system.
  • the user Before sending log entries to a technical support group, the user can sanitize the logs.
  • the log analysis component 140 can detect a sensitive log entry, it can automatically sanitize the entry.
  • the log analysis component 140 can analyze the machine encoded text 127 as it is generated, while creating an additional document for the automatically sanitized logs. The log analysis component 140 can keep a record of this automatic sanitization for the user's review, and it can optionally alert the user.
  • FIG. 2 is a block diagram illustrating a second system log processing environment 200 , according to some embodiments.
  • the same processes can be carried out in processing environment 200 as in processing environment 100 .
  • the image acquisition component 120 , image processing component 125 , and text processing component 130 are not contained in a single device.
  • the system includes at least one image acquisition device 212 - 1 , 212 - 2 , 212 -N, (collectively 212 ), an image processing device 222 , and a text processing device 232 .
  • Processing environment 200 also includes at least one console display 105 - 1 , 105 - 2 , 105 -N, (collectively 105 ).
  • the components of processing environment 200 are arranged so that the image acquisition device 212 and the image processing device 222 together form an optical character recognition (OCR) processing environment.
  • OCR optical character recognition
  • the OCR processing environment 202 is illustrated as containing one image processing device 222 and at least one image acquisition device 212 , but other arrangements are possible.
  • the OCR processing environment 202 can have more than one image processing device 222 .
  • the image acquisition device 212 includes an image acquisition component 120 , a camera 115 , and a mounting adaptor 117 . These components are discussed in greater detail with respect to FIG. 1 . Though FIG. 2 illustrates one camera 115 , each image acquisition device 212 can include a camera 115 .
  • One or more image acquisition devices 212 collect images of system logs from the console display 105 , and transfer the images to the image processing device 222 .
  • the image processing device 222 contains the image processing component 125 , which converts system log text in the images to machine-encoded text 127 , as discussed with respect to FIG. 1 .
  • Machine-encoded text 127 of the system logs produced by the image processing device 222 is transferred from the OCR processing environment 202 to the text processing device 232 .
  • multiple OCR processing environments 202 send the machine-encoded text 127 of log entries to a central text processing device 232 .
  • the text processing device 232 includes a log scanning component 135 and a log analysis component 140 , which monitor, deduplicate, analyze, and flag system logs.
  • FIG. 3 is a flow diagram illustrating a process 300 of system log collection, deduplication, and analysis, according to some embodiments.
  • operation 310 photographic images of log entries from the user's system are captured as they appear on a console display 105 .
  • the image collection can occur continuously or at user-specified times.
  • the photographic images can be still images or videos, and they can be collected by the log processing device 110 or the image acquisition device 212 .
  • the log processing device 110 and the image acquisition device 212 each employ the camera 115 , image acquisition component 120 , and mounting adaptor 117 to capture the images.
  • Photographic images of the system log entries collected by the log processing device 110 or the image acquisition device 212 in operation 310 are processed in operation 320 , producing machine-encoded text 127 of the log entries.
  • the processing operation 320 is carried out by the image processing component 120 .
  • the image processing component 125 converts the images of system log text into machine-encoded text 127 .
  • the machine-encoded text 127 produced by the image processing component 120 in operation 320 is scanned in operation 330 .
  • the log scanning operation 330 determines whether there are redundant copies of system logs.
  • the log scanning operation 330 is carried out by the log scanning component 135 , and it can occur continuously as images appear or at user-specified times. Additionally, the system logs can be scanned in real-time, or they can be stored, and scanned at another time.
  • the log scanning component 135 can be in the text processing device 232 or the log processing device 110 . Additionally, the log scanning component 135 can record the number of copies of a log entry, as well as the time period in which the copies appeared.
  • the log scanning component 135 deduplicates the system logs in operation 335 . This step removes redundant copies of a log entry. As discussed in greater detail with respect to FIG. 1 , the log scanning component can locate redundant copies by comparing the machine-encoded text 127 of multiple log entries, and locating matching portions. In some embodiments, all redundant copies can be removed, while, in other embodiments, a portion of the redundant copies can be removed. The extent of deduplication can be specified by user-input settings, and may depend on the content of the log entry.
  • the deduplicated system logs are analyzed by the log analysis component 140 in operation 340 , and log entries that may be significant to a user are flagged. Whether the log entries are significant is determined by the log analysis component 140 , which can be in the text processing device 232 or the log processing device 110 .
  • the log analysis component 140 can flag log entries that have been repeated in great quantities or over long time spans, as was discussed with respect to FIG. 1 . These quantities and time spans can be designated by user-input and/or preprogrammed settings.
  • the log analysis component 140 can be configured to flag log entries that have features the user considers significant. For example, the user may consider log entries that report system errors to be significant, and program the log analysis component 140 to flag these log entries, regardless of the number of redundant copies.
  • alerts can include sounds, pop-up messages, emails, short message service (‘SMS’ or text) messages, flashing lights, force or haptic feedback, electric impulses, etc.
  • SMS short message service
  • the same type of alert can be used for each flag, or different alerts can be used for different types of flags.
  • a pop-up message could be used when a log entry that the user has designated as important is flagged, and an email sent when a log entry is flagged because multiple copies of the log entry have been detected in a designated time span.
  • the user selects and shares flagged log entries in operation 360 .
  • the user sends the selected log entries to a technical support group so that the group can diagnose the problem and offer a solution.
  • the user may have sensitive information within the system logs that he or she does not want to share with the technical support group. If so, the user can examine and sanitize the deduplicated and flagged log entries before sending remaining log entries to the technical support group. Additionally, in some cases, the user can implement a solution to a system error that resulted in a flagged log entry without sending the log entries to the technical support group.
  • FIG. 4 shows a high-level block diagram of an example computer system 400 that may be used in implementing one or more of the methods, tools, and modules, and any related functions, described herein (e.g., using one or more processor circuits or computer processors of the computer), in accordance with embodiments of the present disclosure.
  • the major components of the computer system 400 may comprise one or more processors 402 , a memory subsystem 404 , a terminal interface 412 , a storage interface 416 , an I/O (input/output) device interface 414 , and a network interface 418 .
  • the computer system 400 can also be connected to an internal or external camera 115 .
  • camera 115 can be connected to the computer system 400 by the I/O device interface 414 .
  • the computer system 400 may contain one or more general-purpose programmable central processing units (CPUs) 402 - 1 , 402 - 2 , and 402 -N, herein generically referred to as the CPU 402 .
  • the computer system 400 may contain multiple processors typical of a relatively large system; however, in other embodiments the computer system 400 may alternatively be a single CPU system.
  • Each CPU 402 may execute instructions stored in the memory subsystem 404 and may include one or more levels of on-board cache.
  • the memory 404 may include a random-access semiconductor memory, storage device, or storage medium (either volatile or non-volatile) for storing or encoding data and programs.
  • the memory 404 represents the entire virtual memory of the computer system 400 , and may also include the virtual memory of other computer systems coupled to the computer system 400 or connected via a network.
  • the memory 404 is conceptually a single monolithic entity, but, in other embodiments, the memory 404 is a more complex arrangement, such as a hierarchy of caches and other memory devices. For example, memory may be in multiple levels of caches, and these caches may be further divided by function, so that one cache holds instructions while another holds non-instruction data, which is used by the processor or processors.
  • the memory 404 may store all or a portion of the following: an image acquisition component 120 , an image processing component 125 , and a text processing component 130 .
  • the text processing component 130 can include the log scanning component 135 and the log analysis component 140 , though these are not illustrated in FIG. 4 .
  • Components 120 , 125 , 130 , 135 , and 140 were discussed in greater detail with respect to FIG. 1 .
  • components 120 , 125 , and 130 are illustrated as being included within the memory 404 in the computer system 400 , in other embodiments, some or all of these components may be on different computer systems, and may be accessed remotely, e.g., via a network.
  • the computer system 400 may use virtual addressing mechanisms that allow the programs of the computer system 400 to behave as if they only have access to a large, single storage entity instead of access to multiple, smaller storage entities.
  • the image acquisition component 120 , the image processing component 125 , and the text processing component 130 are illustrated as being included within the memory 404 , these components are not necessarily all completely contained in the same storage device at the same time.
  • the image acquisition component 120 , the image processing component 125 , and the text processing component 130 are illustrated as being separate entities, in other embodiments some of these components, portions of some of these components, or all of these components may be packaged together.
  • the image acquisition component 120 , the image processing component 125 , and the text processing component 130 include instructions that execute on the processor 402 or instructions that are interpreted by instructions that execute on the processor 402 to carry out the functions as further described in this disclosure.
  • the image acquisition component 120 , the image processing component 125 , and the text processing component 130 are implemented in hardware via semiconductor devices, chips, logical gates, circuits, circuit cards, and/or other physical hardware devices in lieu of, or in addition to, a processor-based system.
  • image acquisition component 120 , image processing component 125 , and text processing component 130 include data in addition to instructions or statements.
  • the memory bus 403 may, in some embodiments, include multiple different buses or communication paths, which may be arranged in any of various forms, such as point-to-point links in hierarchical, star or web configurations, multiple hierarchical buses, parallel and redundant paths, or any other appropriate type of configuration.
  • the I/O bus interface 410 and the I/O bus 408 are shown as single respective units, the computer system 400 may, in some embodiments, contain multiple I/O bus interface units 410 , multiple I/O buses 408 , or both. Further, while multiple I/O interface units are shown, which separate the I/O bus 408 from various communications paths running to the various I/O devices, in other embodiments some or all of the I/O devices may be connected directly to one or more system I/O buses.
  • the computer system 400 may include a bus interface unit 407 to handle communications among the processor 402 , the memory 404 , a display system 406 , and the I/O bus interface unit 410 .
  • the I/O bus interface unit 410 may be coupled with the I/O bus 408 for transferring data to and from the various I/O units.
  • the I/O bus interface unit 410 communicates with multiple I/O interface units 412 , 414 , 416 , and 418 , which are also known as I/O processors (IOPs) or I/O adapters (IOAs), through the I/O bus 408 .
  • the display system 406 may include a display controller. The display controller may provide visual, audio, or both types of data to a display device 405 .
  • the display system 406 may be coupled with a display device 405 , such as a standalone display screen, computer monitor, television, or a tablet or handheld device display. In alternate embodiments, one or more of the functions provided by the display system 406 may be on board a processor 402 integrated circuit. In addition, one or more of the functions provided by the bus interface unit 407 may be on board a processor 402 integrated circuit.
  • the computer system 400 may be a multi-user mainframe computer system, a single-user system, or a server computer or similar device that has little or no direct user interface, but receives requests from other computer systems (clients). Further, in some embodiments, the computer system 400 may be implemented as a desktop computer, portable computer, laptop or notebook computer, tablet computer, pocket computer, telephone, smart phone, network switches or routers, or any other appropriate type of electronic device.
  • FIG. 4 is intended to depict the representative major components of an exemplary computer system 400 . In some embodiments, however, individual components may have greater or lesser complexity than as represented in FIG. 4 , components other than or in addition to those shown in FIG. 4 may be present, and the number, type, and configuration of such components may vary.
  • the present disclosure may be a system, a method, and/or a computer program product.
  • the computer program product may include a computer readable storage medium (or media) having computer readable program instructions thereon for causing a processor to carry out aspects of the present disclosure.
  • the computer readable storage medium can be a tangible device that can retain and store instructions for use by an instruction execution device.
  • the computer readable storage medium may be, for example, but is not limited to, an electronic storage device, a magnetic storage device, an optical storage device, an electromagnetic storage device, a semiconductor storage device, or any suitable combination of the foregoing.
  • a non-exhaustive list of more specific examples of the computer readable storage medium includes the following: a portable computer diskette, a hard disk, a random access memory (RAM), a read-only memory (ROM), an erasable programmable read-only memory (EPROM or Flash memory), a static random access memory (SRAM), a portable compact disc read-only memory (CD-ROM), a digital versatile disk (DVD), a memory stick, a floppy disk, a mechanically encoded device such as punch-cards or raised structures in a groove having instructions recorded thereon, and any suitable combination of the foregoing.
  • RAM random access memory
  • ROM read-only memory
  • EPROM or Flash memory erasable programmable read-only memory
  • SRAM static random access memory
  • CD-ROM compact disc read-only memory
  • DVD digital versatile disk
  • memory stick a floppy disk
  • a mechanically encoded device such as punch-cards or raised structures in a groove having instructions recorded thereon
  • a computer readable storage medium is not to be construed as being transitory signals per se, such as radio waves or other freely propagating electromagnetic waves, electromagnetic waves propagating through a waveguide or other transmission media (e.g., light pulses passing through a fiber-optic cable), or electrical signals transmitted through a wire.
  • Computer readable program instructions described herein can be downloaded to respective computing/processing devices from a computer readable storage medium or to an external computer or external storage device via a network, for example, the Internet, a local area network, a wide area network and/or a wireless network.
  • the network may comprise copper transmission cables, optical transmission fibers, wireless transmission, routers, firewalls, switches, gateway computers, and/or edge servers.
  • a network adapter card or network interface in each computing/processing device receives computer readable program instructions from the network and forwards the computer readable program instructions for storage in a computer readable storage medium within the respective computing/processing device.
  • Computer readable program instructions for carrying out steps of the present disclosure may be assembler instructions, instruction-set-architecture (ISA) instructions, machine instructions, machine dependent instructions, microcode, firmware instructions, state-setting data, or either source code or object code written in any combination of one or more programming languages, including an object oriented programming language such as Smalltalk, C++ or the like, and conventional procedural programming languages, such as the “C” programming language or similar programming languages.
  • the computer readable program instructions may execute entirely on the user's computer, partly on the user's computer, as a stand-alone software package, partly on the user's computer and partly on a remote computer or entirely on the remote computer or server.
  • the remote computer may be connected to the user's computer through any type of network, including a local area network (LAN) or a wide area network (WAN), or the connection may be made to an external computer (for example, through the Internet using an Internet Service Provider).
  • electronic circuitry including, for example, programmable logic circuitry, field-programmable gate arrays (FPGA), or programmable logic arrays (PLA) may execute the computer readable program instructions by utilizing state information of the computer readable program instructions to personalize the electronic circuitry, in order to perform aspects of the present disclosure.
  • These computer readable program instructions may be provided to a processor of a general purpose computer, special purpose computer, or other programmable data processing apparatus to produce a machine, such that the instructions, which execute via the processor of the computer or other programmable data processing apparatus, create means for implementing the functions/acts specified in the flowchart and/or block diagram block or blocks.
  • These computer readable program instructions may also be stored in a computer readable storage medium that can direct a computer, a programmable data processing apparatus, and/or other devices to function in a particular manner, such that the computer readable storage medium having instructions stored therein comprises an article of manufacture including instructions which implement aspects of the function/act specified in the flowchart and/or block diagram block or blocks.
  • the computer readable program instructions may also be loaded onto a computer, other programmable data processing apparatus, or other device to cause a series of steps to be performed on the computer, other programmable apparatus or other device to produce a computer implemented process, such that the instructions which execute on the computer, other programmable apparatus, or other device implement the functions/acts specified in the flowchart and/or block diagram block or blocks.
  • each block in the flowchart or block diagrams may represent a component, segment, or portion of instructions, which comprises one or more executable instructions for implementing the specified logical function(s).
  • the functions noted in the block may occur out of the order noted in the figures.
  • two blocks shown in succession may, in fact, be executed substantially concurrently, or the blocks may sometimes be executed in the reverse order, depending upon the functionality involved.
  • the present disclosure may be a system, a method, and/or a computer program product at any possible technical detail level of integration
  • the computer program product may include a computer readable storage medium (or media) having computer readable program instructions thereon for causing a processor to carry out aspects of the present disclosure
  • the computer readable storage medium can be a tangible device that can retain and store instructions for use by an instruction execution device.
  • the computer readable storage medium may be, for example, but is not limited to, an electronic storage device, a magnetic storage device, an optical storage device, an electromagnetic storage device, a semiconductor storage device, or any suitable combination of the foregoing.
  • a non-exhaustive list of more specific examples of the computer readable storage medium includes the following: a portable computer diskette, a hard disk, a random access memory (RAM), a read-only memory (ROM), an erasable programmable read-only memory (EPROM or Flash memory), a static random access memory (SRAM), a portable compact disc read-only memory (CD-ROM), a digital versatile disk (DVD), a memory stick, a floppy disk, a mechanically encoded device such as punch-cards or raised structures in a groove having instructions recorded thereon, and any suitable combination of the foregoing.
  • RAM random access memory
  • ROM read-only memory
  • EPROM or Flash memory erasable programmable read-only memory
  • SRAM static random access memory
  • CD-ROM compact disc read-only memory
  • DVD digital versatile disk
  • memory stick a floppy disk
  • a mechanically encoded device such as punch-cards or raised structures in a groove having instructions recorded thereon
  • a computer readable storage medium is not to be construed as being transitory signals per se, such as radio waves or other freely propagating electromagnetic waves, electromagnetic waves propagating through a waveguide or other transmission media (e.g., light pulses passing through a fiber-optic cable), or electrical signals transmitted through a wire.
  • Computer readable program instructions described herein can be downloaded to respective computing/processing devices from a computer readable storage medium or to an external computer or external storage device via a network, for example, the Internet, a local area network, a wide area network and/or a wireless network.
  • the network may comprise copper transmission cables, optical transmission fibers, wireless transmission, routers, firewalls, switches, gateway computers and/or edge servers.
  • a network adapter card or network interface in each computing/processing device receives computer readable program instructions from the network and forwards the computer readable program instructions for storage in a computer readable storage medium within the respective computing/processing device.
  • Computer readable program instructions for carrying out steps of the present disclosure may be assembler instructions, instruction-set-architecture (ISA) instructions, machine instructions, machine dependent instructions, microcode, firmware instructions, state-setting data, configuration data for integrated circuitry, or either source code or object code written in any combination of one or more programming languages, including an object oriented programming language such as Smalltalk, C++, or the like, and procedural programming languages, such as the “C” programming language or similar programming languages.
  • the computer readable program instructions may execute entirely on the user's computer, partly on the user's computer, as a stand-alone software package, partly on the user's computer and partly on a remote computer or entirely on the remote computer or server.
  • the remote computer may be connected to the user's computer through any type of network, including a local area network (LAN) or a wide area network (WAN), or the connection may be made to an external computer (for example, through the Internet using an Internet Service Provider).
  • electronic circuitry including, for example, programmable logic circuitry, field-programmable gate arrays (FPGA), or programmable logic arrays (PLA) may execute the computer readable program instructions by utilizing state information of the computer readable program instructions to personalize the electronic circuitry, in order to perform aspects of the present disclosure.
  • These computer readable program instructions may be provided to a processor of a general purpose computer, special purpose computer, or other programmable data processing apparatus to produce a machine, such that the instructions, which execute via the processor of the computer or other programmable data processing apparatus, create means for implementing the functions/acts specified in the flowchart and/or block diagram block or blocks.
  • These computer readable program instructions may also be stored in a computer readable storage medium that can direct a computer, a programmable data processing apparatus, and/or other devices to function in a particular manner, such that the computer readable storage medium having instructions stored therein comprises an article of manufacture including instructions which implement aspects of the function/act specified in the flowchart and/or block diagram block or blocks.
  • the computer readable program instructions may also be loaded onto a computer, other programmable data processing apparatus, or other device to cause a series of steps to be performed on the computer, other programmable apparatus or other device to produce a computer implemented process, such that the instructions which execute on the computer, other programmable apparatus, or other device implement the functions/acts specified in the flowchart and/or block diagram block or blocks.
  • each block in the flowchart or block diagrams may represent a component, segment, or portion of instructions, which comprises one or more executable instructions for implementing the specified logical function(s).
  • the functions noted in the blocks may occur out of the order noted in the Figures.
  • two blocks shown in succession may, in fact, be executed substantially concurrently, or the blocks may sometimes be executed in the reverse order, depending upon the functionality involved.

Abstract

A system and method for deduplicating analyzing system logs is disclosed. The method includes capturing, by an image acquisition component, an image of at least one log entry, converting, by an image processing component, text in the image of the at least one log entry to machine-encoded text, and scanning, by a log scanning component, the machine-encoded text for the at least one redundant copy of the at least one log entry. The method also includes removing, by the log scanning component, the at least one redundant copy of the at least one log entry, determining, by a log analysis component, if the at least one log entry is significant, flagging the at least one log entry if the log entry is significant, and alerting a user if the log entry is flagged.

Description

    BACKGROUND
  • The present disclosure relates to data analytics and, more specifically, to deduplication of air-gapped system logs.
  • Technical support for users' computer systems can be provided when system logs are relayed to a technical support group. Information in the user's system logs can allow the technical support group to identify problems that the system may be experiencing, allowing the support group to provide solutions to these problems. In some cases, the transmission of the system logs from the user to the technical support group is automatic. In other cases, a user may determine whether to send logs to a technical support group. However, these solutions do not work on air-gapped systems, which are physically separated from other networks, because the user does not have a network connection to the support group.
  • SUMMARY
  • Various embodiments are directed to a method for deduplicating and analyzing system logs. An image of a user's system logs can be captured by an image acquisition component. The image can also be encrypted by the image acquisition component. Text of system log entries in the image can be converted to machine-encoded text by an image processing component. The machine-encoded text of the log entries can additionally be encrypted by the image processing component. A log scanning component can scan the machine-encoded text of the log entries and remove redundant copes of a log entry. Additionally, a log analysis component in the log processing device or the text processing device can scan the log entries and determine whether a log entry is significant. If the log entry is significant, it can be flagged, and the user can be alerted.
  • BRIEF DESCRIPTION OF THE DRAWINGS
  • FIG. 1 is a block diagram illustrating a first system log processing environment, according to embodiments of the present disclosure.
  • FIG. 2 is a block diagram illustrating a second system log processing environment, according to embodiments of the present disclosure.
  • FIG. 3 is a flowchart illustrating a process of system log deduplication and analysis, according to embodiments of the present disclosure.
  • FIG. 4 illustrates a computer system, according to embodiments of the present disclosure.
  • DETAILED DESCRIPTION
  • System logs are records of various events that take place in a computer system, and these events can be related to problems with the system. Sharing log entries related to problems in a system with a technical support group can help the support group diagnose and solve the problem. However, sharing log entries becomes more complicated when a user has an air-gapped system (e.g., one that is physically separated from other networks). If the system cannot connect to the technical support group via a network, alternative means of transmitting log entries to the group must be employed.
  • According to some embodiments of the present disclosure, a user with an air-gapped system collects system log entries, and sends the entries to a technical support group without connecting the air-gapped system to a network. The user's system logs, visible on a console display, can be recorded as photographic images by a device that is not connected to the system. Photographic images of the log entries can also be collected as screenshots and transferred to a device that is not connected to the system via air-gapped transfer methods. Text in the photographic images of the logs can be converted to machine-encoded text, stored, and analyzed in the same device or in another device. The system logs can additionally be deduplicated, and displayed to a user. The user may have sensitive information in their system logs that they do not wish to share with a technical support group. If so, the user can sanitize the logs so that log entries containing sensitive information are removed. The user can select which logs will be sent to a technical support group, and send the logs without using a network connection between their system and the system of the technical support group.
  • FIG. 1 is a block diagram illustrating a first system log processing environment 100, according to some embodiments. Environment 100 includes a console display 105 and a log processing device 110. The log processing device 110 includes a camera 115, a mounting adaptor 117, an image acquisition component 120, an image processing component 125, and a text processing component 130. The text processing component includes a log scanning component 135 and a log analysis component 140. Though FIG. 1 illustrates the camera 115, mounting adaptor 117, image acquisition component 120, image processing component 125, and text processing component 130 as belonging to one device 110, multiple devices can be involved in other embodiments. An example of a processing environment 200 that includes multiple devices is discussed in greater detail with respect to FIG. 2.
  • The console display 105 illustrated in FIG. 1, can be any display that shows a user's system logs. In some embodiments, the console display 105 is a non-cathode ray tube (non-CRT) display. Examples of non-CRT displays can include plasma displays, liquid crystal displays (LCDs), light-emitting diode (LED) displays, etc. System log entries can be records of system events, changes, operations, etc. The log entries can contain information about a problem with the system, and analyzing the logs can help a user or technical support group determine the best solution to the problem. In some instances, a system can produce redundant log entries. For example, a log entry may be repeated many times if it records a significant event or problem with the system.
  • An image of the user's system logs is captured from the console display 105 by the log processing device 110 as a photographic image, such as a video or a still image. However, other means of capture can be used. For example, the image can be captured by another device, and transferred to the log processing device 110. The log processing device 110 also converts text in the captured image to machine-encoded text 127, and the log processing device 110 processes the text, as discussed below. The log processing device 110 can utilize the camera 115, the mounting adaptor 117, and the image acquisition component 120 to capture a photographic image. In some embodiments, the camera 115 can be included within the log processing device 110, and, in other embodiments, the camera 115 can be externally connected to the device 110. With the mounting adaptor 117, the log processing device 110 can be positioned at a distance from the console display 105 that allows the camera 115 to capture videos and/or still images of log entry text with sufficient focus so that the log entry can be analyzed. In other embodiments, the log processing device 110 and the camera 115 can be positioned without employing the mounting adaptor 117.
  • The image acquisition component 120 in the log processing device 110 facilitates transmission of the system log image from the camera 115 into the log processing device 110. In some embodiments, the image acquisition component 120 encrypts images that are to be stored on the device 110. The encryption can protect the security of the images in situations where the log processing device 110 is compromised. In some embodiments, visual secret sharing encryption is used. However, any type of encryption can be employed.
  • Text in the photographic image is converted to machine-encoded text 127 by the image processing component 125. One approach to this conversion is by optical character recognition (OCR), which is a method of recognizing printed characters, though any method of converting printed characters to machine encoded text can be used. OCR converts the printed characters in the photographic image of the system log to machine-encoded text 127 using photoelectric devices and computer software. The image processing component 125 can additionally encrypt the machine-encoded text 127 so that it cannot be read if the log processing device 110 is compromised.
  • Text in the images captured by the image acquisition component 120 can be processed in real-time, or the images can be stored for later analysis. In some embodiments, images of the log system text can be captured from screenshots, rather than the camera 115, and transferred to the log processing device 110 via other air-gapped file transfer methods. Examples of these methods include transfer using removable storage devices such as USB devices, flash memory cards, external hard drives, optical disks, magnetic tape, etc. in read only mode.
  • The log processing device 110 also includes a text processing component 130, which scans and analyzes the machine-encoded text 127 generated by the image processing component 125. The text processing component 130 comprises a log scanning component 135 and a log analysis component 140, which are discussed in greater detail below.
  • Machine-encoded system log text 127 produced by the image processing component 125 is monitored by the log scanning component 135. The log scanning component 135 determines whether there are redundant copies of log entries, and deduplicates the system logs by removing the redundant log entries. The log scanning component 135 determines whether log entries are duplicates of one another by comparing the machine-encoded text 127 of the entries. For example, the log scanning component 135 can detect duplicates when log entries each contain some or all of the same words. The log scanning component 135 can scan the complete machine-encoded text 127 of a log for matching words and phrases, or it can scan portions of the text. In some instances, the log scanning component 135 can detect that logs are duplicates if they each contain certain predefined keywords. For example, the log scanning component 140 can determine that logs are duplicates if each log contains the same error message.
  • In some embodiments, all redundant copies of log entries are removed during deduplication. However, in other embodiments, a portion of the redundant copies of log entries are removed. Instructions directing the extent of deduplication can be input by a user or organization. For example, a user may specify that log entries should be deduplicated when the number of log entries surpasses a threshold number. The portion size can be any number of system log copies, and the number can be preprogrammed or specified by the user. Additionally, instructions could specify that log entries recorded at particular times be deduplicated. The log scanning component 135 can additionally record the time span of deduplication and the quantity of redundant log entries that are removed. If the log scanning component 135 records that a log entry has a number of redundant copies that surpasses a threshold number of copies over a designated time span, this log entry may be considered significant or in some way indicative of a problem with the system. One example of a threshold number and time span is 1,000 redundant copies of a log entry in one hour, though other threshold numbers and time spans can be used.
  • Determination of the significance of a log entry is carried out by the log analysis component 140. If the log analysis component 140 detects a log entry that may be significant, it flags the entry. In some cases, a log entry can be flagged when the log scanning component 135 detects many copies of it and/or its repetition over a long time span (e.g., at least one copy detected every hour for more than eight hours). There may also be cases in which a user designates certain types of log entry as significant (e.g., log entries that record system errors, scheduled updates, input from external devices, etc.). The log analysis component 140 can detect and flag these significant log entries. When a log entry is flagged, an alert can be sent to a user. In some cases, a flag can indicate that a problem in the system has been detected. Additionally, in some embodiments, user-input settings can instruct the log scanning component 135 to refrain from deduplicating all or a portion of log entries that are considered significant.
  • If the user is alerted to the presence of a significant log entry, the user can examine the deduplicated system logs and decide what, if any, actions to take. In some embodiments, the user is alerted by sounds, pop-up messages, emails, short message service (‘SMS’ or text) messages, flashing lights, force or haptic feedback, electric impulses, etc. If a flagged log entry reports a problem in the system, the user may implement a solution, or send the entry to a technical support group. Sending the log entry to a support group, such as the system vendor's technical support group, allows the group to diagnose the problem, and offer improvements to the system. Before sending log entries to a technical support group, the user can sanitize the logs. This can allow the user to eliminate sensitive information, and send only non-sensitive, relevant, or necessary log entries to the technical support group. In other embodiments, if the log analysis component 140 detects a sensitive log entry, it can automatically sanitize the entry. In some examples, the log analysis component 140 can analyze the machine encoded text 127 as it is generated, while creating an additional document for the automatically sanitized logs. The log analysis component 140 can keep a record of this automatic sanitization for the user's review, and it can optionally alert the user.
  • FIG. 2 is a block diagram illustrating a second system log processing environment 200, according to some embodiments. The same processes can be carried out in processing environment 200 as in processing environment 100. However, in processing environment 200, the image acquisition component 120, image processing component 125, and text processing component 130 are not contained in a single device. Rather, the system includes at least one image acquisition device 212-1, 212-2, 212-N, (collectively 212), an image processing device 222, and a text processing device 232. Processing environment 200 also includes at least one console display 105-1, 105-2, 105-N, (collectively 105).
  • The components of processing environment 200 are arranged so that the image acquisition device 212 and the image processing device 222 together form an optical character recognition (OCR) processing environment. There is at least one OCR processing environment 202-1, 202-2, 202-N, (collectively 202), and each OCR processing environment 202 is connected to a central text processing device 232. Though not illustrated in FIG. 2, in some embodiments, there can be more than one text processing device 232. The OCR processing environment 202 is illustrated as containing one image processing device 222 and at least one image acquisition device 212, but other arrangements are possible. For example, in some embodiments, the OCR processing environment 202 can have more than one image processing device 222. There may also be instances in which a user has more than one console display 105, and each console display 105 is paired with an OCR processing environment 202.
  • The image acquisition device 212 includes an image acquisition component 120, a camera 115, and a mounting adaptor 117. These components are discussed in greater detail with respect to FIG. 1. Though FIG. 2 illustrates one camera 115, each image acquisition device 212 can include a camera 115. One or more image acquisition devices 212 collect images of system logs from the console display 105, and transfer the images to the image processing device 222. The image processing device 222 contains the image processing component 125, which converts system log text in the images to machine-encoded text 127, as discussed with respect to FIG. 1.
  • Machine-encoded text 127 of the system logs produced by the image processing device 222 is transferred from the OCR processing environment 202 to the text processing device 232. In some embodiments, multiple OCR processing environments 202 send the machine-encoded text 127 of log entries to a central text processing device 232. The text processing device 232 includes a log scanning component 135 and a log analysis component 140, which monitor, deduplicate, analyze, and flag system logs.
  • FIG. 3 is a flow diagram illustrating a process 300 of system log collection, deduplication, and analysis, according to some embodiments. In operation 310, photographic images of log entries from the user's system are captured as they appear on a console display 105. The image collection can occur continuously or at user-specified times. The photographic images can be still images or videos, and they can be collected by the log processing device 110 or the image acquisition device 212. In this example, the log processing device 110 and the image acquisition device 212 each employ the camera 115, image acquisition component 120, and mounting adaptor 117 to capture the images.
  • Photographic images of the system log entries collected by the log processing device 110 or the image acquisition device 212 in operation 310 are processed in operation 320, producing machine-encoded text 127 of the log entries. The processing operation 320 is carried out by the image processing component 120. As discussed in greater detail with respect to FIG. 1, the image processing component 125 converts the images of system log text into machine-encoded text 127.
  • The machine-encoded text 127 produced by the image processing component 120 in operation 320 is scanned in operation 330. The log scanning operation 330 determines whether there are redundant copies of system logs. The log scanning operation 330 is carried out by the log scanning component 135, and it can occur continuously as images appear or at user-specified times. Additionally, the system logs can be scanned in real-time, or they can be stored, and scanned at another time. The log scanning component 135 can be in the text processing device 232 or the log processing device 110. Additionally, the log scanning component 135 can record the number of copies of a log entry, as well as the time period in which the copies appeared.
  • If the log scanning component 135 locates redundant copies of a log entry, the log scanning component 135 deduplicates the system logs in operation 335. This step removes redundant copies of a log entry. As discussed in greater detail with respect to FIG. 1, the log scanning component can locate redundant copies by comparing the machine-encoded text 127 of multiple log entries, and locating matching portions. In some embodiments, all redundant copies can be removed, while, in other embodiments, a portion of the redundant copies can be removed. The extent of deduplication can be specified by user-input settings, and may depend on the content of the log entry.
  • The deduplicated system logs are analyzed by the log analysis component 140 in operation 340, and log entries that may be significant to a user are flagged. Whether the log entries are significant is determined by the log analysis component 140, which can be in the text processing device 232 or the log processing device 110. The log analysis component 140 can flag log entries that have been repeated in great quantities or over long time spans, as was discussed with respect to FIG. 1. These quantities and time spans can be designated by user-input and/or preprogrammed settings. In some cases, there can be a threshold number of log entries for a period of time. For example, a threshold can be set so that, if 50 or more copies of a log entry are produced in 30 minutes or less, the log entry is flagged. Further, in some embodiments, the log analysis component 140 can be configured to flag log entries that have features the user considers significant. For example, the user may consider log entries that report system errors to be significant, and program the log analysis component 140 to flag these log entries, regardless of the number of redundant copies.
  • The user is alerted to flagged log entries in operation 350. Examples of alerts can include sounds, pop-up messages, emails, short message service (‘SMS’ or text) messages, flashing lights, force or haptic feedback, electric impulses, etc. The same type of alert can be used for each flag, or different alerts can be used for different types of flags. For example, a pop-up message could be used when a log entry that the user has designated as important is flagged, and an email sent when a log entry is flagged because multiple copies of the log entry have been detected in a designated time span.
  • The user selects and shares flagged log entries in operation 360. In some embodiments, the user sends the selected log entries to a technical support group so that the group can diagnose the problem and offer a solution. However, the user may have sensitive information within the system logs that he or she does not want to share with the technical support group. If so, the user can examine and sanitize the deduplicated and flagged log entries before sending remaining log entries to the technical support group. Additionally, in some cases, the user can implement a solution to a system error that resulted in a flagged log entry without sending the log entries to the technical support group.
  • FIG. 4 shows a high-level block diagram of an example computer system 400 that may be used in implementing one or more of the methods, tools, and modules, and any related functions, described herein (e.g., using one or more processor circuits or computer processors of the computer), in accordance with embodiments of the present disclosure. In some embodiments, the major components of the computer system 400 may comprise one or more processors 402, a memory subsystem 404, a terminal interface 412, a storage interface 416, an I/O (input/output) device interface 414, and a network interface 418. These components may be communicatively coupled, directly or indirectly, for inter-component communication via a memory bus 403, an I/O bus 408, bus interface unit 407, and an I/O bus interface unit 410. The computer system 400 can also be connected to an internal or external camera 115. In some cases, camera 115 can be connected to the computer system 400 by the I/O device interface 414.
  • The computer system 400 may contain one or more general-purpose programmable central processing units (CPUs) 402-1, 402-2, and 402-N, herein generically referred to as the CPU 402. In some embodiments, the computer system 400 may contain multiple processors typical of a relatively large system; however, in other embodiments the computer system 400 may alternatively be a single CPU system. Each CPU 402 may execute instructions stored in the memory subsystem 404 and may include one or more levels of on-board cache.
  • The memory 404 may include a random-access semiconductor memory, storage device, or storage medium (either volatile or non-volatile) for storing or encoding data and programs. In some embodiments, the memory 404 represents the entire virtual memory of the computer system 400, and may also include the virtual memory of other computer systems coupled to the computer system 400 or connected via a network. The memory 404 is conceptually a single monolithic entity, but, in other embodiments, the memory 404 is a more complex arrangement, such as a hierarchy of caches and other memory devices. For example, memory may be in multiple levels of caches, and these caches may be further divided by function, so that one cache holds instructions while another holds non-instruction data, which is used by the processor or processors. Memory may be further distributed and associated with different CPUs or sets of CPUs, as is known in any of various so-called non-uniform memory access (NUMA) computer architectures. The memory 404 may store all or a portion of the following: an image acquisition component 120, an image processing component 125, and a text processing component 130. The text processing component 130 can include the log scanning component 135 and the log analysis component 140, though these are not illustrated in FIG. 4. Components 120, 125, 130, 135, and 140 were discussed in greater detail with respect to FIG. 1.
  • While components 120, 125, and 130 are illustrated as being included within the memory 404 in the computer system 400, in other embodiments, some or all of these components may be on different computer systems, and may be accessed remotely, e.g., via a network. The computer system 400 may use virtual addressing mechanisms that allow the programs of the computer system 400 to behave as if they only have access to a large, single storage entity instead of access to multiple, smaller storage entities. Thus, while the image acquisition component 120, the image processing component 125, and the text processing component 130 are illustrated as being included within the memory 404, these components are not necessarily all completely contained in the same storage device at the same time. Further, although the image acquisition component 120, the image processing component 125, and the text processing component 130 are illustrated as being separate entities, in other embodiments some of these components, portions of some of these components, or all of these components may be packaged together.
  • In an embodiment, the image acquisition component 120, the image processing component 125, and the text processing component 130 include instructions that execute on the processor 402 or instructions that are interpreted by instructions that execute on the processor 402 to carry out the functions as further described in this disclosure. In another embodiment, the image acquisition component 120, the image processing component 125, and the text processing component 130 are implemented in hardware via semiconductor devices, chips, logical gates, circuits, circuit cards, and/or other physical hardware devices in lieu of, or in addition to, a processor-based system. In another embodiment, image acquisition component 120, image processing component 125, and text processing component 130 include data in addition to instructions or statements.
  • Although the memory bus 403 is shown in FIG. 4 as a single bus structure providing a direct communication path among the CPUs 402, the memory subsystem 404, the display system 406, the bus interface 407, and the I/O bus interface 410, the memory bus 403 may, in some embodiments, include multiple different buses or communication paths, which may be arranged in any of various forms, such as point-to-point links in hierarchical, star or web configurations, multiple hierarchical buses, parallel and redundant paths, or any other appropriate type of configuration. Furthermore, while the I/O bus interface 410 and the I/O bus 408 are shown as single respective units, the computer system 400 may, in some embodiments, contain multiple I/O bus interface units 410, multiple I/O buses 408, or both. Further, while multiple I/O interface units are shown, which separate the I/O bus 408 from various communications paths running to the various I/O devices, in other embodiments some or all of the I/O devices may be connected directly to one or more system I/O buses.
  • The computer system 400 may include a bus interface unit 407 to handle communications among the processor 402, the memory 404, a display system 406, and the I/O bus interface unit 410. The I/O bus interface unit 410 may be coupled with the I/O bus 408 for transferring data to and from the various I/O units. The I/O bus interface unit 410 communicates with multiple I/ O interface units 412, 414, 416, and 418, which are also known as I/O processors (IOPs) or I/O adapters (IOAs), through the I/O bus 408. The display system 406 may include a display controller. The display controller may provide visual, audio, or both types of data to a display device 405. The display system 406 may be coupled with a display device 405, such as a standalone display screen, computer monitor, television, or a tablet or handheld device display. In alternate embodiments, one or more of the functions provided by the display system 406 may be on board a processor 402 integrated circuit. In addition, one or more of the functions provided by the bus interface unit 407 may be on board a processor 402 integrated circuit.
  • In some embodiments, the computer system 400 may be a multi-user mainframe computer system, a single-user system, or a server computer or similar device that has little or no direct user interface, but receives requests from other computer systems (clients). Further, in some embodiments, the computer system 400 may be implemented as a desktop computer, portable computer, laptop or notebook computer, tablet computer, pocket computer, telephone, smart phone, network switches or routers, or any other appropriate type of electronic device.
  • It is noted that FIG. 4 is intended to depict the representative major components of an exemplary computer system 400. In some embodiments, however, individual components may have greater or lesser complexity than as represented in FIG. 4, components other than or in addition to those shown in FIG. 4 may be present, and the number, type, and configuration of such components may vary.
  • The present disclosure may be a system, a method, and/or a computer program product. The computer program product may include a computer readable storage medium (or media) having computer readable program instructions thereon for causing a processor to carry out aspects of the present disclosure.
  • The computer readable storage medium can be a tangible device that can retain and store instructions for use by an instruction execution device. The computer readable storage medium may be, for example, but is not limited to, an electronic storage device, a magnetic storage device, an optical storage device, an electromagnetic storage device, a semiconductor storage device, or any suitable combination of the foregoing. A non-exhaustive list of more specific examples of the computer readable storage medium includes the following: a portable computer diskette, a hard disk, a random access memory (RAM), a read-only memory (ROM), an erasable programmable read-only memory (EPROM or Flash memory), a static random access memory (SRAM), a portable compact disc read-only memory (CD-ROM), a digital versatile disk (DVD), a memory stick, a floppy disk, a mechanically encoded device such as punch-cards or raised structures in a groove having instructions recorded thereon, and any suitable combination of the foregoing. A computer readable storage medium, as used herein, is not to be construed as being transitory signals per se, such as radio waves or other freely propagating electromagnetic waves, electromagnetic waves propagating through a waveguide or other transmission media (e.g., light pulses passing through a fiber-optic cable), or electrical signals transmitted through a wire.
  • Computer readable program instructions described herein can be downloaded to respective computing/processing devices from a computer readable storage medium or to an external computer or external storage device via a network, for example, the Internet, a local area network, a wide area network and/or a wireless network. The network may comprise copper transmission cables, optical transmission fibers, wireless transmission, routers, firewalls, switches, gateway computers, and/or edge servers. A network adapter card or network interface in each computing/processing device receives computer readable program instructions from the network and forwards the computer readable program instructions for storage in a computer readable storage medium within the respective computing/processing device.
  • Computer readable program instructions for carrying out steps of the present disclosure may be assembler instructions, instruction-set-architecture (ISA) instructions, machine instructions, machine dependent instructions, microcode, firmware instructions, state-setting data, or either source code or object code written in any combination of one or more programming languages, including an object oriented programming language such as Smalltalk, C++ or the like, and conventional procedural programming languages, such as the “C” programming language or similar programming languages. The computer readable program instructions may execute entirely on the user's computer, partly on the user's computer, as a stand-alone software package, partly on the user's computer and partly on a remote computer or entirely on the remote computer or server. In the latter scenario, the remote computer may be connected to the user's computer through any type of network, including a local area network (LAN) or a wide area network (WAN), or the connection may be made to an external computer (for example, through the Internet using an Internet Service Provider). In some embodiments, electronic circuitry including, for example, programmable logic circuitry, field-programmable gate arrays (FPGA), or programmable logic arrays (PLA) may execute the computer readable program instructions by utilizing state information of the computer readable program instructions to personalize the electronic circuitry, in order to perform aspects of the present disclosure.
  • Aspects of the present disclosure are described herein with reference to flowchart illustrations and/or block diagrams of methods, apparatus (systems), and computer program products according to embodiments of the disclosure. It will be understood that each block of the flowchart illustrations and/or block diagrams, and combinations of blocks in the flowchart illustrations and/or block diagrams, can be implemented by computer readable program instructions.
  • These computer readable program instructions may be provided to a processor of a general purpose computer, special purpose computer, or other programmable data processing apparatus to produce a machine, such that the instructions, which execute via the processor of the computer or other programmable data processing apparatus, create means for implementing the functions/acts specified in the flowchart and/or block diagram block or blocks. These computer readable program instructions may also be stored in a computer readable storage medium that can direct a computer, a programmable data processing apparatus, and/or other devices to function in a particular manner, such that the computer readable storage medium having instructions stored therein comprises an article of manufacture including instructions which implement aspects of the function/act specified in the flowchart and/or block diagram block or blocks.
  • The computer readable program instructions may also be loaded onto a computer, other programmable data processing apparatus, or other device to cause a series of steps to be performed on the computer, other programmable apparatus or other device to produce a computer implemented process, such that the instructions which execute on the computer, other programmable apparatus, or other device implement the functions/acts specified in the flowchart and/or block diagram block or blocks.
  • The flowchart and block diagrams in the Figures illustrate the architecture, functionality, and step of possible implementations of systems, methods, and computer program products according to various embodiments of the present disclosure. In this regard, each block in the flowchart or block diagrams may represent a component, segment, or portion of instructions, which comprises one or more executable instructions for implementing the specified logical function(s). In some alternative implementations, the functions noted in the block may occur out of the order noted in the figures. For example, two blocks shown in succession may, in fact, be executed substantially concurrently, or the blocks may sometimes be executed in the reverse order, depending upon the functionality involved. It will also be noted that each block of the block diagrams and/or flowchart illustration, and combinations of blocks in the block diagrams and/or flowchart illustration, can be implemented by special purpose hardware-based systems that perform the specified functions or acts or carry out combinations of special purpose hardware and computer instructions.
  • The descriptions of the various embodiments of the present disclosure have been presented for purposes of illustration, but are not intended to be exhaustive or limited to the embodiments disclosed. Many modifications and variations will be apparent to those of ordinary skill in the art without departing from the scope and spirit of the described embodiments. The terminology used herein was chosen to best explain the principles of the embodiments, the practical application or technical improvement over technologies found in the marketplace, or to enable others of ordinary skill in the art to understand the embodiments disclosed herein.
  • Although the present disclosure has been described in terms of specific embodiments, it is anticipated that alterations and modification thereof will become apparent to the skilled in the art. Therefore, it is intended that the following claims be interpreted as covering all such alterations and modifications as fall within the true spirit and scope of the disclosure.
  • The present disclosure may be a system, a method, and/or a computer program product at any possible technical detail level of integration. The computer program product may include a computer readable storage medium (or media) having computer readable program instructions thereon for causing a processor to carry out aspects of the present disclosure.
  • The computer readable storage medium can be a tangible device that can retain and store instructions for use by an instruction execution device. The computer readable storage medium may be, for example, but is not limited to, an electronic storage device, a magnetic storage device, an optical storage device, an electromagnetic storage device, a semiconductor storage device, or any suitable combination of the foregoing. A non-exhaustive list of more specific examples of the computer readable storage medium includes the following: a portable computer diskette, a hard disk, a random access memory (RAM), a read-only memory (ROM), an erasable programmable read-only memory (EPROM or Flash memory), a static random access memory (SRAM), a portable compact disc read-only memory (CD-ROM), a digital versatile disk (DVD), a memory stick, a floppy disk, a mechanically encoded device such as punch-cards or raised structures in a groove having instructions recorded thereon, and any suitable combination of the foregoing. A computer readable storage medium, as used herein, is not to be construed as being transitory signals per se, such as radio waves or other freely propagating electromagnetic waves, electromagnetic waves propagating through a waveguide or other transmission media (e.g., light pulses passing through a fiber-optic cable), or electrical signals transmitted through a wire.
  • Computer readable program instructions described herein can be downloaded to respective computing/processing devices from a computer readable storage medium or to an external computer or external storage device via a network, for example, the Internet, a local area network, a wide area network and/or a wireless network. The network may comprise copper transmission cables, optical transmission fibers, wireless transmission, routers, firewalls, switches, gateway computers and/or edge servers. A network adapter card or network interface in each computing/processing device receives computer readable program instructions from the network and forwards the computer readable program instructions for storage in a computer readable storage medium within the respective computing/processing device.
  • Computer readable program instructions for carrying out steps of the present disclosure may be assembler instructions, instruction-set-architecture (ISA) instructions, machine instructions, machine dependent instructions, microcode, firmware instructions, state-setting data, configuration data for integrated circuitry, or either source code or object code written in any combination of one or more programming languages, including an object oriented programming language such as Smalltalk, C++, or the like, and procedural programming languages, such as the “C” programming language or similar programming languages. The computer readable program instructions may execute entirely on the user's computer, partly on the user's computer, as a stand-alone software package, partly on the user's computer and partly on a remote computer or entirely on the remote computer or server. In the latter scenario, the remote computer may be connected to the user's computer through any type of network, including a local area network (LAN) or a wide area network (WAN), or the connection may be made to an external computer (for example, through the Internet using an Internet Service Provider). In some embodiments, electronic circuitry including, for example, programmable logic circuitry, field-programmable gate arrays (FPGA), or programmable logic arrays (PLA) may execute the computer readable program instructions by utilizing state information of the computer readable program instructions to personalize the electronic circuitry, in order to perform aspects of the present disclosure.
  • Aspects of the present disclosure are described herein with reference to flowchart illustrations and/or block diagrams of methods, apparatus (systems), and computer program products according to embodiments of the disclosure. It will be understood that each block of the flowchart illustrations and/or block diagrams, and combinations of blocks in the flowchart illustrations and/or block diagrams, can be implemented by computer readable program instructions.
  • These computer readable program instructions may be provided to a processor of a general purpose computer, special purpose computer, or other programmable data processing apparatus to produce a machine, such that the instructions, which execute via the processor of the computer or other programmable data processing apparatus, create means for implementing the functions/acts specified in the flowchart and/or block diagram block or blocks. These computer readable program instructions may also be stored in a computer readable storage medium that can direct a computer, a programmable data processing apparatus, and/or other devices to function in a particular manner, such that the computer readable storage medium having instructions stored therein comprises an article of manufacture including instructions which implement aspects of the function/act specified in the flowchart and/or block diagram block or blocks.
  • The computer readable program instructions may also be loaded onto a computer, other programmable data processing apparatus, or other device to cause a series of steps to be performed on the computer, other programmable apparatus or other device to produce a computer implemented process, such that the instructions which execute on the computer, other programmable apparatus, or other device implement the functions/acts specified in the flowchart and/or block diagram block or blocks.
  • The flowchart and block diagrams in the Figures illustrate the architecture, functionality, and step of possible implementations of systems, methods, and computer program products according to various embodiments of the present disclosure. In this regard, each block in the flowchart or block diagrams may represent a component, segment, or portion of instructions, which comprises one or more executable instructions for implementing the specified logical function(s). In some alternative implementations, the functions noted in the blocks may occur out of the order noted in the Figures. For example, two blocks shown in succession may, in fact, be executed substantially concurrently, or the blocks may sometimes be executed in the reverse order, depending upon the functionality involved. It will also be noted that each block of the block diagrams and/or flowchart illustration, and combinations of blocks in the block diagrams and/or flowchart illustration, can be implemented by special purpose hardware-based systems that perform the specified functions or acts or carry out combinations of special purpose hardware and computer instructions.
  • The descriptions of the various embodiments of the present disclosure have been presented for purposes of illustration, but are not intended to be exhaustive or limited to the embodiments disclosed. Many modifications and variations will be apparent to those of ordinary skill in the art without departing from the scope and spirit of the described embodiments. The terminology used herein was chosen to best explain the principles of the embodiments, the practical application or technical improvement over technologies found in the marketplace, or to enable others of ordinary skill in the art to understand the embodiments disclosed herein.

Claims (20)

1. A method for deduplicating and analyzing system logs, comprising:
capturing an image of at least one log entry;
converting text in the image of the at least one log entry to machine-encoded text;
scanning the machine-encoded text for at least one redundant copy of the at least one log entry;
removing the at least one redundant copy of the at least one log entry;
determining if the at least one log entry is significant;
flagging the at least one log entry if the at least one log entry is significant; and
alerting a user if the at least one log entry is flagged.
2. The method of claim 1, wherein the determining determines that the at least one log entry is significant if a number of the at least one redundant copy of the at least one log entry is greater than a threshold number of redundant copies of log entries in a designated time span.
3. The method of claim 1, wherein the determining determines that the at least one log entry is significant if the at least one log entry reports a system error.
4. The method of claim 1, wherein the image of the at least one log entry is a video.
5. The method of claim 1, wherein the image of the at least one log entry is a still image.
6. The method of claim 1, further comprising encrypting the image of the at least one log entry.
7. The method of claim 1, further comprising encrypting the machine-encoded text of the at least one log entry.
8. A system comprising:
at least one processing component;
at least one memory component;
at least one image acquisition component configured to capture an image of at least one log entry;
at least one image processing component configured to convert text in the image of the at least one log entry to machine-encoded text;
at least one log scanning component configured to scan the machine-encoded text for at least one redundant copy of the at least one log entry, and further configured to remove the at least one redundant copy of the at least one log entry; and
at least one log analysis component configured to determine if the at least one log entry is significant, and further configured to alert a user if the at least one log entry is flagged.
9. The system of claim 8, wherein the log analysis component determines that the at least one log entry is significant if a number of the at least one redundant copy of the at least one log entry is greater than a threshold number of redundant copies of log entries in a designated time span.
10. The system of claim 8, wherein the log analysis component determines that the at least one log entry is significant if the at least one log entry reports a system error.
11. The system of claim 8, wherein the image acquisition component, the image processing component, the log scanning component, and the log analysis component are in a log processing device.
12. The system of claim 8, wherein the image acquisition component is further configured to encrypt the image of the at least one log entry.
13. The system of claim 8, wherein the image processing component is further configured to encrypt the machine-encoded text of the at least one log entry.
14. The system of claim 8, further comprising a camera.
15. The system of claim 8, wherein the image acquisition component is in an image acquisition device.
16. A computer program product for deduplicating and analyzing system logs, the computer program product comprising a computer readable storage medium having program instructions embodied therewith, the program instructions executable by a processor to cause the device to perform a method comprising:
capturing an image of at least one log entry;
converting text in the image of the at least one log entry to machine-encoded text;
scanning the machine-encoded text for at least one redundant copy of the at least one log entry;
removing the at least one redundant copy of the at least one log entry;
determining if the at least one log entry is significant;
flagging the at least one log entry if the at least one log entry is significant; and
alerting a user to a flagged log entry.
17. The computer program product of claim 16, wherein the determining determines that the at least one log entry is significant if a number of the at least one redundant copy of the at least one log entry is greater than a threshold number of redundant copies of log entries in a designated time span.
18. The computer program product of claim 16, wherein the determining determines that the at least one log entry is significant if the at least one log entry reports a system error.
19. The computer program product of claim 16, further comprising encrypting the image of the at least one log entry.
20. The computer program product of claim 16, further comprising encrypting the machine-encoded text of the at least one log entry.
US15/420,855 2017-01-31 2017-01-31 Analytics and deduplication for air-gapped log analysis Abandoned US20180218004A1 (en)

Priority Applications (4)

Application Number Priority Date Filing Date Title
US15/420,855 US20180218004A1 (en) 2017-01-31 2017-01-31 Analytics and deduplication for air-gapped log analysis
US15/697,897 US9996545B1 (en) 2017-01-31 2017-09-07 Analytics and deduplication for air-gapped log analysis
US15/970,319 US10255289B2 (en) 2017-01-31 2018-05-03 Analytics and deduplication for air-gapped log analysis
US15/970,360 US10248658B2 (en) 2017-01-31 2018-05-03 Analytics and deduplication for air-gapped log analysis

Applications Claiming Priority (1)

Application Number Priority Date Filing Date Title
US15/420,855 US20180218004A1 (en) 2017-01-31 2017-01-31 Analytics and deduplication for air-gapped log analysis

Related Child Applications (1)

Application Number Title Priority Date Filing Date
US15/697,897 Continuation US9996545B1 (en) 2017-01-31 2017-09-07 Analytics and deduplication for air-gapped log analysis

Publications (1)

Publication Number Publication Date
US20180218004A1 true US20180218004A1 (en) 2018-08-02

Family

ID=62485734

Family Applications (4)

Application Number Title Priority Date Filing Date
US15/420,855 Abandoned US20180218004A1 (en) 2017-01-31 2017-01-31 Analytics and deduplication for air-gapped log analysis
US15/697,897 Expired - Fee Related US9996545B1 (en) 2017-01-31 2017-09-07 Analytics and deduplication for air-gapped log analysis
US15/970,360 Expired - Fee Related US10248658B2 (en) 2017-01-31 2018-05-03 Analytics and deduplication for air-gapped log analysis
US15/970,319 Expired - Fee Related US10255289B2 (en) 2017-01-31 2018-05-03 Analytics and deduplication for air-gapped log analysis

Family Applications After (3)

Application Number Title Priority Date Filing Date
US15/697,897 Expired - Fee Related US9996545B1 (en) 2017-01-31 2017-09-07 Analytics and deduplication for air-gapped log analysis
US15/970,360 Expired - Fee Related US10248658B2 (en) 2017-01-31 2018-05-03 Analytics and deduplication for air-gapped log analysis
US15/970,319 Expired - Fee Related US10255289B2 (en) 2017-01-31 2018-05-03 Analytics and deduplication for air-gapped log analysis

Country Status (1)

Country Link
US (4) US20180218004A1 (en)

Families Citing this family (2)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
US20180218004A1 (en) * 2017-01-31 2018-08-02 International Business Machines Corporation Analytics and deduplication for air-gapped log analysis
US10698756B1 (en) * 2017-12-15 2020-06-30 Palantir Technologies Inc. Linking related events for various devices and services in computer log files on a centralized server

Citations (2)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
US8488916B2 (en) * 2011-07-22 2013-07-16 David S Terman Knowledge acquisition nexus for facilitating concept capture and promoting time on task
US9280397B2 (en) * 2007-06-27 2016-03-08 Intel Corporation Using buffered stores or monitoring to filter redundant transactional accesses and mechanisms for mapping data to buffered metadata

Family Cites Families (10)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
US8266114B2 (en) 2008-09-22 2012-09-11 Riverbed Technology, Inc. Log structured content addressable deduplicating storage
US8108343B2 (en) 2009-04-23 2012-01-31 Microsoft Corporation De-duplication and completeness in multi-log based replication
JP2011128995A (en) 2009-12-18 2011-06-30 Canon Inc Log information management system, control method therefor, and storage medium
US20130227352A1 (en) 2012-02-24 2013-08-29 Commvault Systems, Inc. Log monitoring
US8949561B2 (en) 2012-08-07 2015-02-03 Netapp, Inc. Systems, methods, and computer program products providing change logging in a deduplication process
WO2015105493A1 (en) 2014-01-09 2015-07-16 Hewlett-Packard Development Company, L.P. Support data deduplication
US9195674B1 (en) 2014-09-24 2015-11-24 Logzilla Corporation Systems and methods for large-scale system log analysis, deduplication and management
CN104636242A (en) 2015-02-06 2015-05-20 浪潮电子信息产业股份有限公司 Method for automatically deleting repeated content in system logs on basis of Linux operating system
US10360184B2 (en) 2015-06-24 2019-07-23 International Business Machines Corporation Log file analysis to locate anomalies
US20180218004A1 (en) * 2017-01-31 2018-08-02 International Business Machines Corporation Analytics and deduplication for air-gapped log analysis

Patent Citations (2)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
US9280397B2 (en) * 2007-06-27 2016-03-08 Intel Corporation Using buffered stores or monitoring to filter redundant transactional accesses and mechanisms for mapping data to buffered metadata
US8488916B2 (en) * 2011-07-22 2013-07-16 David S Terman Knowledge acquisition nexus for facilitating concept capture and promoting time on task

Also Published As

Publication number Publication date
US10255289B2 (en) 2019-04-09
US20180232391A1 (en) 2018-08-16
US9996545B1 (en) 2018-06-12
US10248658B2 (en) 2019-04-02
US20180232390A1 (en) 2018-08-16

Similar Documents

Publication Publication Date Title
US10248689B2 (en) Supplementing candidate answers
ES2253531T5 (en) Storage management based on content
KR101410442B1 (en) A Digital Forensic Audit System for Analyzing User’s Behaviors
US20160342502A1 (en) How to track operator behavior via metadata
US11042447B2 (en) Retention rule compliance of record deletion based on deletion log
CN111629165B (en) Alarm video processing method, device, equipment and storage medium
US10255289B2 (en) Analytics and deduplication for air-gapped log analysis
US20210243233A1 (en) Method and sytem for protecting privacy of users in session recordings
US20170262329A1 (en) Configuring and utilizing call-home systems
US20210142121A1 (en) Image classification masking
WO2016138609A1 (en) Uploading user and system data from a source location to a destination location
EP4020284A1 (en) Systems and methods for cross-referencing forensic snapshot over time for root-cause analysis
JP6213676B2 (en) Analysis device, analysis method, and analysis program
US9244918B2 (en) Locating electronic documents
US20180314837A1 (en) Secure file wrapper for tiff images
US10762203B2 (en) Reducing impact of malware/ransomware in caching environment
US20170372079A1 (en) Selective data encryption
US20220171677A1 (en) Servicing data storage devices in a data storage array
US10970415B2 (en) Sensitive data redaction in memory dump
US11347849B2 (en) Ransomware detection and prevention
US10031811B1 (en) Systems and methods for enhancing electronic discovery searches
CN112506490A (en) Interface generation method and device, electronic equipment and storage medium
US10606985B2 (en) Secure file wrapper for TIFF images
US9952935B2 (en) Excluding content items from a backup operation
Schroader et al. Alternate data storage forensics

Legal Events

Date Code Title Description
AS Assignment

Owner name: INTERNATIONAL BUSINESS MACHINES CORPORATION, NEW Y

Free format text: ASSIGNMENT OF ASSIGNORS INTEREST;ASSIGNORS:GOLDBERG, ITZHACK;MCGREGOR, HARRY R.;MOORE, CHRISTOPHER B.;AND OTHERS;SIGNING DATES FROM 20170126 TO 20170131;REEL/FRAME:041136/0436

STCB Information on status: application discontinuation

Free format text: ABANDONED -- FAILURE TO PAY ISSUE FEE