WO2013025211A1 - Tracing data block operations - Google Patents

Tracing data block operations Download PDF

Info

Publication number
WO2013025211A1
WO2013025211A1 PCT/US2011/048105 US2011048105W WO2013025211A1 WO 2013025211 A1 WO2013025211 A1 WO 2013025211A1 US 2011048105 W US2011048105 W US 2011048105W WO 2013025211 A1 WO2013025211 A1 WO 2013025211A1
Authority
WO
WIPO (PCT)
Prior art keywords
data block
record
virtual machine
computer apparatus
processor
Prior art date
Application number
PCT/US2011/048105
Other languages
French (fr)
Inventor
Chun Hui SUEN
Peter JAGADPRAMANA
Kok Leong Ryan KO
Bu Sung Lee
Original Assignee
Hewlett-Packard Development Company, L.P.
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
Filing date
Publication date
Application filed by Hewlett-Packard Development Company, L.P. filed Critical Hewlett-Packard Development Company, L.P.
Priority to PCT/US2011/048105 priority Critical patent/WO2013025211A1/en
Priority to CN201180072892.0A priority patent/CN103748551A/en
Priority to EP11870999.7A priority patent/EP2745199A4/en
Priority to US14/237,953 priority patent/US9213842B2/en
Publication of WO2013025211A1 publication Critical patent/WO2013025211A1/en

Links

Classifications

    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06FELECTRIC DIGITAL DATA PROCESSING
    • G06F21/00Security arrangements for protecting computers, components thereof, programs or data against unauthorised activity
    • G06F21/50Monitoring users, programs or devices to maintain the integrity of platforms, e.g. of processors, firmware or operating systems
    • G06F21/57Certifying or maintaining trusted computer platforms, e.g. secure boots or power-downs, version controls, system software checks, secure updates or assessing vulnerabilities
    • G06F21/577Assessing vulnerabilities and evaluating computer system security
    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06FELECTRIC DIGITAL DATA PROCESSING
    • G06F9/00Arrangements for program control, e.g. control units
    • G06F9/06Arrangements for program control, e.g. control units using stored programs, i.e. using an internal store of processing equipment to receive or retain programs
    • G06F9/44Arrangements for executing specific programs
    • G06F9/455Emulation; Interpretation; Software simulation, e.g. virtualisation or emulation of application or operating system execution engines
    • G06F9/45533Hypervisors; Virtual machine monitors
    • G06F9/45558Hypervisor-specific management and integration aspects
    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06FELECTRIC DIGITAL DATA PROCESSING
    • G06F11/00Error detection; Error correction; Monitoring
    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06FELECTRIC DIGITAL DATA PROCESSING
    • G06F16/00Information retrieval; Database structures therefor; File system structures therefor
    • G06F16/10File systems; File servers
    • G06F16/18File system types
    • G06F16/188Virtual file systems
    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06FELECTRIC DIGITAL DATA PROCESSING
    • G06F21/00Security arrangements for protecting computers, components thereof, programs or data against unauthorised activity
    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06FELECTRIC DIGITAL DATA PROCESSING
    • G06F9/00Arrangements for program control, e.g. control units
    • G06F9/06Arrangements for program control, e.g. control units using stored programs, i.e. using an internal store of processing equipment to receive or retain programs
    • G06F9/44Arrangements for executing specific programs
    • G06F9/455Emulation; Interpretation; Software simulation, e.g. virtualisation or emulation of application or operating system execution engines
    • G06F9/45533Hypervisors; Virtual machine monitors
    • G06F9/45558Hypervisor-specific management and integration aspects
    • G06F2009/45579I/O management, e.g. providing access to device drivers or storage
    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06FELECTRIC DIGITAL DATA PROCESSING
    • G06F9/00Arrangements for program control, e.g. control units
    • G06F9/06Arrangements for program control, e.g. control units using stored programs, i.e. using an internal store of processing equipment to receive or retain programs
    • G06F9/44Arrangements for executing specific programs
    • G06F9/455Emulation; Interpretation; Software simulation, e.g. virtualisation or emulation of application or operating system execution engines
    • G06F9/45533Hypervisors; Virtual machine monitors
    • G06F9/45558Hypervisor-specific management and integration aspects
    • G06F2009/45591Monitoring or debugging support
    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06FELECTRIC DIGITAL DATA PROCESSING
    • G06F3/00Input arrangements for transferring data to be processed into a form capable of being handled by the computer; Output arrangements for transferring data from processing unit to output unit, e.g. interface arrangements
    • G06F3/06Digital input from, or digital output to, record carriers, e.g. RAID, emulated record carriers or networked record carriers
    • G06F3/0601Interfaces specially adapted for storage systems
    • G06F3/0628Interfaces specially adapted for storage systems making use of a particular technique
    • G06F3/0655Vertical data movement, i.e. input-output transfer; data movement between one or more hosts and one or more storage devices
    • G06F3/0659Command handling arrangements, e.g. command buffers, queues, command scheduling

Definitions

  • Cloud computing has increased in popularity in recent years as more applications and data services are being managed remotely on a server rather than locally on a client. For example, when a user wishes to create a document, a suitable application running on the server displays the document created by the user on the client web browser. Memory is allocated on a client device to display application data on a screen, but calculations are carried out by one or more remote computers on a network. Moreover, files may be stored remotely on cloud servers, including files that may contain sensitive or personal data.
  • FIG. 1 illustrates an example of a cloud system in accordance with aspects of the application.
  • FIG. 2 is an example of a cloud server in accordance with aspects of the application.
  • FIG. 3 is an example of a virtual machine hosted by a computer apparatus in accordance with aspects of the application .
  • FIG. 4 is a flow diagram of an illustrative method of storing trace information associated with a block operation .
  • FIG. 5 is a functional diagram of a virtual machine hosted by a computer apparatus in accordance with aspects of the application.
  • FIGS. 6A-B are schematic diagrams of examples of a file before and after execution of a block operation in accordance with aspects of the application. DETAILED DESCRIPTION
  • a computer apparatus may execute at least one virtual machine that emulates an independent computer apparatus.
  • data block operations within the computer apparatus and virtual machines therein may be intercepted. Attributes associated with the data block operations may be retrieved and appended to the targeted data block so as to imbed trace information therein.
  • the embedded information may be later utilized for purposes of forensic analyzes of the cloud network .
  • FIG. 1 presents a schematic diagram of an illustrative cloud system 100 depicting various computing devices used in a networked configuration.
  • FIG. 1 illustrates a plurality of computers 102, 104, 106 and 108.
  • Each computer may be a node of the cloud and may comprise any device capable of processing instructions and transmitting data to and from other computers, including a laptop, a full- sized personal computer, a high-end server, or a network computer lacking local storage capability.
  • a node may comprise a mobile phone 113 or a mobile device 114 capable of wirelessly exchanging data with a server.
  • Mobile device 114 may be a wireless-enabled PDA or a tablet PC.
  • the computers or devices disclosed in FIG. 1 may be interconnected via a network 112, which may be a local area network ("LAN”), wide area network ("WAN”), the Internet, etc.
  • Network 112 and intervening nodes may also use various protocols including virtual private networks, local Ethernet networks, private networks using communication protocols proprietary to one or more companies, cellular and wireless networks, instant messaging, HTTP and SMTP, and various combinations of the foregoing.
  • LAN local area network
  • WAN wide area network
  • Internet etc.
  • Network 112 and intervening nodes may also use various protocols including virtual private networks, local Ethernet networks, private networks using communication protocols proprietary to one or more companies, cellular and wireless networks, instant messaging, HTTP and SMTP, and various combinations of the foregoing.
  • FIG. 1 Although only a few computers are depicted in FIG. 1, it should be appreciated that a typical cloud system can include a large number of interconnected computers .
  • each computer or device shown in FIG. 1 may be at one node of cloud system 100 and capable of directly or indirectly communicating with other computers or devices of the system.
  • computer 104 may be a cloud server capable of communicating with a client computer such that computer 104 uses network 112 to transmit information for presentation to a user.
  • computer 104 may be used to generate requested information for display via, for example, a web browser executing on computer 102.
  • Any one of the computers 102, 104, 106, and 108 may also comprise a plurality of computers, such as a load balancing network, that exchange information with different nodes of a network for the purpose of receiving, processing, and transmitting data to multiple client computers.
  • the client computers will typically still be at different nodes of the network than any of the computers comprising computers 102, 104, 106 and 108.
  • FIG. 2 presents a close up illustration of computer 104.
  • computer 104 is a computer apparatus configured as a cloud server and may contain a processor 202, memory 204, and other components typically present in a computer. Other components may include a display
  • Memory 204 may store information accessible by processor 202, including instructions, which may be executed by processor 202.
  • the memory 204 may be any type of device capable of storing information accessible by the processor, such as a hard-drive, ROM, RAM, CD-ROM, flash memories, write-capable or read-only memories.
  • Storage device 220 may contain data that may be retrieved, manipulated, or stored by the processor and may comprise any non-volatile storage device.
  • the processor 202 may comprise any number of well known processors or a dedicated controller for executing operations, such as an ASIC. Systems and methods may include different combinations of the foregoing, whereby different portions of the instructions and data are stored on different types of media.
  • Network interface 222 of computer 104 may comprise circuitry suitable for communication with other computers or devices on the cloud system 100.
  • Network interface 222 may be an Ethernet interface that implements a standard encompassed by the Institute of Electrical and Electronic Engineers
  • network interface 222 may be a wireless fidelity (“Wi-Fi”) interface in accordance with the IEEE 802.11 suite of standards. It is understood that other standards or protocols may be utilized, such as Bluetooth or token ring.
  • Wi-Fi wireless fidelity
  • FIG. 2 functionally illustrates the processor 202 and memory 204 as being within the same block, it will be understood that the processor and memory may actually comprise multiple processors and memories that may or may not be stored within the same physical housing.
  • any one of the memories may be a hard drive or other storage media located in a server farm of a data center.
  • references to a processor, computer, or memory will be understood to include references to a collection of processors or computers or memories that may or may not operate in parallel.
  • storage device 220 may be at a location physically remote from, but still accessible by, the processor 202.
  • the instructions disclosed herein may be any set of instructions to be executed directly (such as machine code) or indirectly (such as scripts) by processor 202.
  • the instructions may be stored as computer code on a computer- readable medium.
  • the terms "instructions, " "programs,” or “modules” may be used interchangeably herein.
  • the instructions may be stored in object code format for direct processing by the processor, or in any other computer language including scripts or collections of independent source code modules that are interpreted on demand or compiled in advance.
  • examples herein can be realized in the form of software, hardware, or a combination of hardware and software. Functions, methods and routines of the instructions are explained in more detail below .
  • Virtualization allows a processor to emulate an independent computer apparatus in accordance with instructions, such as virtual machine instructions 212 and 214.
  • Operations on a cloud system may occur on a physical computer apparatus or on a virtual machine (e.g., hosted by a physical computer apparatus) .
  • Each virtual machine may have its own operating system, storage device, and network resources.
  • a separate portion of memory 204 and network interface 222 may be dedicated to each virtual machine.
  • FIG. 2 depicts two virtual machine instructions 212 and 214 that may be used to emulate two separate computers on the same physical computer apparatus 104. While two virtual machines are depicted in FIG.
  • a cloud server may execute any number of virtual machines each of which may be dedicated to different client requests on the cloud network. For example, while the remaining portions of computer 104 may serve the requests of computer 102 of cloud system 100, virtual machine 212 may simultaneously serve the requests of another client computer, such as computer 108. Each virtual machine may serve additional client requests simultaneously and may act as an independent computer apparatus with an operating system different than that of the physical computer apparatus or of other virtual machines.
  • Host monitor instructions 216 may supervise the resources being employed by different processes, including virtual machines, executing in computer 104.
  • Kernel 219 may be any set of instructions suitable for managing the resources of computer 104 and allowing other programs to utilize those resources. Kernel 219 may be a central component of operating system 217.
  • Operating system 217 represents a collection of programs that when executed by processor 202 serve as a platform on which instructions can execute. Examples of operating systems include, but are not limited to, various versions of Microsoft's Windows® and Linux®.
  • Storage Service 218 may be instructions that interface with kernel 219 to manage file operations executing on computer 104. While storage service 218 may execute locally in computer 104, storage service 218 may also execute on a remote computer.
  • the file operations may be any transaction associated with a file on computer 104 (e.g., read, write, copy, rename etc.) .
  • a file operation may comprise one or more operations aimed at one or more data blocks of a particular file (i.e., block operations) in storage device 220 or memory 204.
  • a data block is typically a sequence of bytes or bits of a nominal size.
  • FIG. 3 one example of a virtual machine is provided. While FIG. 3 focuses on virtual machine 212 for ease of illustration, it is understood that virtual machine 214 or any other co-existing virtual machine may include features similar to virtual machine 212 of FIG. 3.
  • virtual machine 212 may have an operating system 302 and a kernel 305.
  • File subsystem 308 may organize data in storage device 220 in a way that is accessible by virtual machine 212. The data arrangement utilized by file subsystem 308 is typically dependent on operating system 302. Examples of file subsystems include, but are not limited, to file allocation table (“FAT”) system or the UNIX file system.
  • Storage driver instructions 303 may provide an interface between storage device 220 and virtual machine 212.
  • FIG. 4 illustrates a flow diagram of a process to trace data block operations on a physical and virtual computer apparatus.
  • FIG. 5 illustrates aspects of the virtual machine and the physical computer apparatus.
  • FIGS. 6A-6B are close up illustrations of a file before and after implementation of multiple block operations. The actions shown in FIGS. 5-6 will be discussed below with regard to the flow diagram of FIG. 4.
  • a data block operation request and a first record may be received, as shown in block 402.
  • the first record may comprise at least one attribute associated with the data block operation and may be generated by the same entity making the request.
  • the requesting entity may be a virtual machine executing in computer apparatus 104, such as virtual machines 212 and 214.
  • the requesting entity may be a process on a remote computer on the cloud, such as computer 108.
  • the attributes of the record may include an identifier associated with a process making the request, such as process 304, an identifier associated with a user who initiated the request, a timestamp indicating the time in which a process started or terminated the request, resources utilized by the process making the request, or the configuration of the entity making the request (e.g., virtual or actual hardware model, CPU, network configuration, etc.) .
  • a second record may be generated, as shown in block 404.
  • the second record may be generated by host monitor 216 of computer apparatus 104.
  • the attributes of the second record may include host attributes that correspond to those of the first record.
  • the second record may contain resources utilized in the computer apparatus to serve the request, the host machine configuration, or an identifier associated with the host machine.
  • the aforementioned attributes of the first and second record are merely illustrative, and it should be understood that attributes may be added or deleted.
  • virtual machine 212 is the entity requesting the data block operation.
  • Process 304 is shown requesting a file operation (e.g., a file create, a file write, a file deletion, a file rename, etc.) and submitting the request to file subsystem 308.
  • File subsystem 308 may gather attributes associated with the operation.
  • file subsystem 308 may convert the file operation into block operations on one or more blocks of data within the targeted file.
  • Each block operation and its associated attributes may be forwarded to storage driver 303.
  • storage driver 303 may transmit the same to storage service 218 executing on physical computer apparatus 104, via communication channel 502.
  • communication channel 502 may be, for example, a virtual serial link, such as Citrix Xen V4V or a VMWare virtual machine communication interface ("VMCI") enabled for inter- domain communication. If storage service 218 executes in a remote computer, communication channel 502 may be a link over network 112 via network interface 222.
  • VMCI virtual machine communication interface
  • Host monitor instructions 216 may intercept each block operation and the attributes received from virtual machine 212. Furthermore, host monitor instructions 216 may generate the second record containing corresponding host machine attributes. Referring back to FIG. 4, the first record and the second record may be concatenated or attached to each data block targeted by each block operation, as shown in block 406. The first and second record may be pre-pended or appended to each data block. Alternatively, a first pointer may be associated with the first record and a second pointer may be associated with the second record. In this example, the first pointer and the second pointer may be pre- pended or appended to each data block targeted by each block operation. Each pointer may be indicative of an address of the record's location. In this regard, the first and second record may be stored in, for example, memory 204. In block 408 of FIG. 4, each data block operation may be executed upon each targeted data block.
  • Host monitor 216 may forward the data blocks with the appended records to storage service 218, which may implement the operations on one or more targeted data blocks in storage device 220.
  • FIGS. 6A-6B illustrate blocks of data before and after implementing block operation requests.
  • FIG. 6A shows storage device 220 containing a file 602 divided into individual data blocks 604-614 before receipt of data block operations.
  • FIG. 6B shows file 602 after executing one or more block operations upon file 602.
  • the data in blocks 605, 611, and 614 were updated as a result of block operations in accordance with the instructions of storage service 218.
  • Blocks 605, 611, and 614 are shown with appended records 615, 616, and 617 respectively.
  • Each record 615-617 may be a concatenation of the first record generated in virtual machine 212 and the second record generated in computer apparatus 104.
  • the appended records may be retrieved by another process to display the trace information to a user investigating behavior occurring in the cloud system.
  • the appended records may be encrypted such that only authorized users having access to a decryption key can view the trace information.
  • the above-described system enables the tracking of data block operations occurring on the cloud network.
  • users may have greater confidence that sensitive files stored in the cloud can be traced in case of theft or loss of data .

Landscapes

  • Engineering & Computer Science (AREA)
  • Theoretical Computer Science (AREA)
  • General Engineering & Computer Science (AREA)
  • Software Systems (AREA)
  • Physics & Mathematics (AREA)
  • General Physics & Mathematics (AREA)
  • Computer Security & Cryptography (AREA)
  • Computer Hardware Design (AREA)
  • Databases & Information Systems (AREA)
  • Data Mining & Analysis (AREA)
  • Computing Systems (AREA)
  • Quality & Reliability (AREA)
  • Information Retrieval, Db Structures And Fs Structures Therefor (AREA)
  • Debugging And Monitoring (AREA)

Abstract

An apparatus and related method to track data block operations in a cloud system are provided. Attributes associated with the data block operation may be attached to each individual data block targeted by the data block operation.

Description

TRACING DATA BLOCK OPERATIONS
BACKGROUND
[0001] Cloud computing has increased in popularity in recent years as more applications and data services are being managed remotely on a server rather than locally on a client. For example, when a user wishes to create a document, a suitable application running on the server displays the document created by the user on the client web browser. Memory is allocated on a client device to display application data on a screen, but calculations are carried out by one or more remote computers on a network. Moreover, files may be stored remotely on cloud servers, including files that may contain sensitive or personal data.
BRIEF DESCRIPTION OF THE DRAWINGS
[0002] FIG. 1 illustrates an example of a cloud system in accordance with aspects of the application.
[0003] FIG. 2 is an example of a cloud server in accordance with aspects of the application.
[0004] FIG. 3 is an example of a virtual machine hosted by a computer apparatus in accordance with aspects of the application .
[0005] FIG. 4 is a flow diagram of an illustrative method of storing trace information associated with a block operation .
[0006] FIG. 5 is a functional diagram of a virtual machine hosted by a computer apparatus in accordance with aspects of the application.
[0007] FIGS. 6A-B are schematic diagrams of examples of a file before and after execution of a block operation in accordance with aspects of the application. DETAILED DESCRIPTION
[0008] While cloud computing has been praised for promoting scalability and simplifying maintenance, it has also been criticized for potential security risks including exposing information to unlawful monitoring and theft. Aspects of the application provide techniques for tracking data block operations in a cloud system. In one aspect, a computer apparatus may execute at least one virtual machine that emulates an independent computer apparatus. In another aspect, data block operations within the computer apparatus and virtual machines therein may be intercepted. Attributes associated with the data block operations may be retrieved and appended to the targeted data block so as to imbed trace information therein. The embedded information may be later utilized for purposes of forensic analyzes of the cloud network .
[0009] FIG. 1 presents a schematic diagram of an illustrative cloud system 100 depicting various computing devices used in a networked configuration. For example, FIG. 1 illustrates a plurality of computers 102, 104, 106 and 108. Each computer may be a node of the cloud and may comprise any device capable of processing instructions and transmitting data to and from other computers, including a laptop, a full- sized personal computer, a high-end server, or a network computer lacking local storage capability. Moreover, a node may comprise a mobile phone 113 or a mobile device 114 capable of wirelessly exchanging data with a server. Mobile device 114 may be a wireless-enabled PDA or a tablet PC.
[0010] The computers or devices disclosed in FIG. 1 may be interconnected via a network 112, which may be a local area network ("LAN"), wide area network ("WAN"), the Internet, etc. Network 112 and intervening nodes may also use various protocols including virtual private networks, local Ethernet networks, private networks using communication protocols proprietary to one or more companies, cellular and wireless networks, instant messaging, HTTP and SMTP, and various combinations of the foregoing. Although only a few computers are depicted in FIG. 1, it should be appreciated that a typical cloud system can include a large number of interconnected computers .
[0011] As noted above, each computer or device shown in FIG. 1 may be at one node of cloud system 100 and capable of directly or indirectly communicating with other computers or devices of the system. For example, computer 104 may be a cloud server capable of communicating with a client computer such that computer 104 uses network 112 to transmit information for presentation to a user. Accordingly, computer 104 may be used to generate requested information for display via, for example, a web browser executing on computer 102. Any one of the computers 102, 104, 106, and 108 may also comprise a plurality of computers, such as a load balancing network, that exchange information with different nodes of a network for the purpose of receiving, processing, and transmitting data to multiple client computers. In this instance, the client computers will typically still be at different nodes of the network than any of the computers comprising computers 102, 104, 106 and 108.
[0012] FIG. 2 presents a close up illustration of computer 104. In the example of FIG. 2, computer 104 is a computer apparatus configured as a cloud server and may contain a processor 202, memory 204, and other components typically present in a computer. Other components may include a display
(e.g., a monitor having a screen, a touch-screen, a projector, a television, a computer printer or any other electrical device that is operable to display information) , and a user input (e.g., a mouse, keyboard, touch-screen or microphone). Memory 204 may store information accessible by processor 202, including instructions, which may be executed by processor 202. The memory 204 may be any type of device capable of storing information accessible by the processor, such as a hard-drive, ROM, RAM, CD-ROM, flash memories, write-capable or read-only memories. Storage device 220 may contain data that may be retrieved, manipulated, or stored by the processor and may comprise any non-volatile storage device. The processor 202 may comprise any number of well known processors or a dedicated controller for executing operations, such as an ASIC. Systems and methods may include different combinations of the foregoing, whereby different portions of the instructions and data are stored on different types of media.
[0013] Network interface 222 of computer 104 may comprise circuitry suitable for communication with other computers or devices on the cloud system 100. Network interface 222 may be an Ethernet interface that implements a standard encompassed by the Institute of Electrical and Electronic Engineers
(IEEE), standard 802.3. In another example, network interface 222 may be a wireless fidelity ("Wi-Fi") interface in accordance with the IEEE 802.11 suite of standards. It is understood that other standards or protocols may be utilized, such as Bluetooth or token ring.
[0014] Although FIG. 2 functionally illustrates the processor 202 and memory 204 as being within the same block, it will be understood that the processor and memory may actually comprise multiple processors and memories that may or may not be stored within the same physical housing. For example, any one of the memories may be a hard drive or other storage media located in a server farm of a data center. Accordingly, references to a processor, computer, or memory will be understood to include references to a collection of processors or computers or memories that may or may not operate in parallel. Furthermore, storage device 220 may be at a location physically remote from, but still accessible by, the processor 202.
[0015] The instructions disclosed herein may be any set of instructions to be executed directly (such as machine code) or indirectly (such as scripts) by processor 202. For example, the instructions may be stored as computer code on a computer- readable medium. In that regard, the terms "instructions, " "programs," or "modules" may be used interchangeably herein. The instructions may be stored in object code format for direct processing by the processor, or in any other computer language including scripts or collections of independent source code modules that are interpreted on demand or compiled in advance. However, it will be appreciated that examples herein can be realized in the form of software, hardware, or a combination of hardware and software. Functions, methods and routines of the instructions are explained in more detail below .
[0016] The capacity of servers on the cloud may be utilized through a technique known as virtualization . Virtualization allows a processor to emulate an independent computer apparatus in accordance with instructions, such as virtual machine instructions 212 and 214. Operations on a cloud system may occur on a physical computer apparatus or on a virtual machine (e.g., hosted by a physical computer apparatus) . Each virtual machine may have its own operating system, storage device, and network resources. A separate portion of memory 204 and network interface 222 may be dedicated to each virtual machine. FIG. 2 depicts two virtual machine instructions 212 and 214 that may be used to emulate two separate computers on the same physical computer apparatus 104. While two virtual machines are depicted in FIG. 2, a cloud server may execute any number of virtual machines each of which may be dedicated to different client requests on the cloud network. For example, while the remaining portions of computer 104 may serve the requests of computer 102 of cloud system 100, virtual machine 212 may simultaneously serve the requests of another client computer, such as computer 108. Each virtual machine may serve additional client requests simultaneously and may act as an independent computer apparatus with an operating system different than that of the physical computer apparatus or of other virtual machines.
[0017] Host monitor instructions 216 may supervise the resources being employed by different processes, including virtual machines, executing in computer 104. Kernel 219 may be any set of instructions suitable for managing the resources of computer 104 and allowing other programs to utilize those resources. Kernel 219 may be a central component of operating system 217. Operating system 217 represents a collection of programs that when executed by processor 202 serve as a platform on which instructions can execute. Examples of operating systems include, but are not limited to, various versions of Microsoft's Windows® and Linux®. Storage Service 218 may be instructions that interface with kernel 219 to manage file operations executing on computer 104. While storage service 218 may execute locally in computer 104, storage service 218 may also execute on a remote computer. The file operations may be any transaction associated with a file on computer 104 (e.g., read, write, copy, rename etc.) . A file operation may comprise one or more operations aimed at one or more data blocks of a particular file (i.e., block operations) in storage device 220 or memory 204. A data block is typically a sequence of bytes or bits of a nominal size.
[0018] Referring to FIG. 3, one example of a virtual machine is provided. While FIG. 3 focuses on virtual machine 212 for ease of illustration, it is understood that virtual machine 214 or any other co-existing virtual machine may include features similar to virtual machine 212 of FIG. 3. As with the physical computer apparatus 104, virtual machine 212 may have an operating system 302 and a kernel 305. File subsystem 308 may organize data in storage device 220 in a way that is accessible by virtual machine 212. The data arrangement utilized by file subsystem 308 is typically dependent on operating system 302. Examples of file subsystems include, but are not limited, to file allocation table ("FAT") system or the UNIX file system. Storage driver instructions 303 may provide an interface between storage device 220 and virtual machine 212.
[0019] FIG. 4 illustrates a flow diagram of a process to trace data block operations on a physical and virtual computer apparatus. FIG. 5 illustrates aspects of the virtual machine and the physical computer apparatus. FIGS. 6A-6B are close up illustrations of a file before and after implementation of multiple block operations. The actions shown in FIGS. 5-6 will be discussed below with regard to the flow diagram of FIG. 4.
[0020] Referring to FIG. 4, a data block operation request and a first record may be received, as shown in block 402. The first record may comprise at least one attribute associated with the data block operation and may be generated by the same entity making the request. The requesting entity may be a virtual machine executing in computer apparatus 104, such as virtual machines 212 and 214. Alternatively, the requesting entity may be a process on a remote computer on the cloud, such as computer 108. The attributes of the record may include an identifier associated with a process making the request, such as process 304, an identifier associated with a user who initiated the request, a timestamp indicating the time in which a process started or terminated the request, resources utilized by the process making the request, or the configuration of the entity making the request (e.g., virtual or actual hardware model, CPU, network configuration, etc.) . Referring back to FIG. 4, a second record may be generated, as shown in block 404. The second record may be generated by host monitor 216 of computer apparatus 104. The attributes of the second record may include host attributes that correspond to those of the first record. For example, the second record may contain resources utilized in the computer apparatus to serve the request, the host machine configuration, or an identifier associated with the host machine. The aforementioned attributes of the first and second record are merely illustrative, and it should be understood that attributes may be added or deleted.
[0021] In the example of FIG. 5, virtual machine 212 is the entity requesting the data block operation. Process 304 is shown requesting a file operation (e.g., a file create, a file write, a file deletion, a file rename, etc.) and submitting the request to file subsystem 308. File subsystem 308 may gather attributes associated with the operation. Furthermore, file subsystem 308 may convert the file operation into block operations on one or more blocks of data within the targeted file. Each block operation and its associated attributes may be forwarded to storage driver 303. In turn, storage driver 303 may transmit the same to storage service 218 executing on physical computer apparatus 104, via communication channel 502. If storage service 218 executes locally in computer 104, communication channel 502 may be, for example, a virtual serial link, such as Citrix Xen V4V or a VMWare virtual machine communication interface ("VMCI") enabled for inter- domain communication. If storage service 218 executes in a remote computer, communication channel 502 may be a link over network 112 via network interface 222.
[0022] Host monitor instructions 216 may intercept each block operation and the attributes received from virtual machine 212. Furthermore, host monitor instructions 216 may generate the second record containing corresponding host machine attributes. Referring back to FIG. 4, the first record and the second record may be concatenated or attached to each data block targeted by each block operation, as shown in block 406. The first and second record may be pre-pended or appended to each data block. Alternatively, a first pointer may be associated with the first record and a second pointer may be associated with the second record. In this example, the first pointer and the second pointer may be pre- pended or appended to each data block targeted by each block operation. Each pointer may be indicative of an address of the record's location. In this regard, the first and second record may be stored in, for example, memory 204. In block 408 of FIG. 4, each data block operation may be executed upon each targeted data block.
[0023] Host monitor 216 may forward the data blocks with the appended records to storage service 218, which may implement the operations on one or more targeted data blocks in storage device 220.
[0024] FIGS. 6A-6B illustrate blocks of data before and after implementing block operation requests. FIG. 6A shows storage device 220 containing a file 602 divided into individual data blocks 604-614 before receipt of data block operations. FIG. 6B shows file 602 after executing one or more block operations upon file 602. In the example of FIG. 6B, the data in blocks 605, 611, and 614 were updated as a result of block operations in accordance with the instructions of storage service 218. Blocks 605, 611, and 614 are shown with appended records 615, 616, and 617 respectively. Each record 615-617 may be a concatenation of the first record generated in virtual machine 212 and the second record generated in computer apparatus 104. The appended records may be retrieved by another process to display the trace information to a user investigating behavior occurring in the cloud system. In one example, the appended records may be encrypted such that only authorized users having access to a decryption key can view the trace information.
[0025] The above-described system enables the tracking of data block operations occurring on the cloud network. In this regard, users may have greater confidence that sensitive files stored in the cloud can be traced in case of theft or loss of data .
[0026] Although the application herein has been described with reference to particular examples, it is to be understood that these examples are merely illustrative of the principles and applications of the disclosure. It is therefore to be understood that numerous modifications may be made to the illustrative examples and that other arrangements may be devised without departing from the spirit and scope of the application as defined by the appended claims. Furthermore, while particular processes are shown in a specific order in the appended drawings, such processes are not limited to any particular order unless such order is expressly set forth herein. Rather, various steps can be handled in a different order or simultaneously, and steps may be omitted or added.

Claims

1. A computer apparatus comprising:
a processor:
to access a first record generated by an entity requesting a data block operation, the first record comprising at least one attribute associated with the data block operation;
to generate a second record comprising at least one complementary attribute corresponding to the at least one attribute of the first record;
to attach the first record and the second record to a data block, the data block being a target of the data block operation; and
to execute the requested data block operation.
2. The computer apparatus of claim 1, wherein the processor :
to execute at least one virtual machine, the at least one virtual machine emulating an independent computer apparatus, the virtual machine being the entity requesting the data block operation .
3. The computer apparatus of claim 2, wherein the virtual machine:
to request a file operation; and
to translate the request for the file operation to requests for one or more block operations.
4. The computer apparatus of claim 3, wherein the at least one attribute comprises:
an identifier associated with a process that originated the file operation in the virtual machine; and
an indication of resources utilized by the process during origination of the file operation.
5. The computer apparatus of claim 1, wherein the entity requesting the data block operation is a process.
6. The computer apparatus of claim 5, wherein the process executes on a remote computer apparatus.
7. A computer apparatus comprising:
a processor:
to access a first record generated by an entity requesting a data block operation, the first record comprising at least one attribute associated with the data block operation;
to generate a second record comprising at least one complementary attribute corresponding to the at least one attribute of the first record;
to associate the first record with a first pointer, the first pointer being indicative of an address of the first record;
to associate the second record with a second pointer, the second pointer being indicative of an address of the second record;
to attach the first pointer and the second pointer to a data block, the data block being a target of the data block operation; and
to execute the requested data block operation.
8. The computer apparatus of claim 7, wherein the processor to execute at least one virtual machine, the at least one virtual machine emulating an independent computer apparatus, the virtual machine being the entity requesting the data block operation.
9. The computer apparatus of claim 8, wherein the virtual machine:
to request a file operation; and
to translate the request for the file operation to requests for one or more block operations.
10. The computer apparatus of claim 9, wherein the at least one attribute comprises:
an identifier associated with the process that originated the file operation in the virtual machine; and
an indication of resources utilized by the process during origination of the file operation.
11. A method comprising:
receiving a data block operation, the data block operation being requested by an entity;
receiving a first record comprising at least one attribute associated with the data block operation, the first record being generated by the entity requesting the data block operation;
generating, using a processor, a second record comprising at least one complementary attribute corresponding to the at least one attribute of the first record,
attaching, using the processor, the first record and the second record to a data block, the data block being a target of the data block operation; and
executing, using the processor, the data block operation .
12. The method of claim 11, wherein the entity requesting the data block operation is a virtual machine emulating an independent computer apparatus .
13. The method of claim 12, wherein the requesting of the data block operation by the virtual machine comprises:
requesting, using the processor, a file operation; and translating, using the processor, the request for the file operation to requests for one or more block operations.
14. The method of claim 13, wherein the at least one attribute comprises:
an identifier associated with a process that originated the file operation in the virtual machine; and
resources utilized by the process during origination of the file operation.
15. The method of claim 11, wherein the entity requesting the data block operation is a process executing in a remote computer.
PCT/US2011/048105 2011-08-17 2011-08-17 Tracing data block operations WO2013025211A1 (en)

Priority Applications (4)

Application Number Priority Date Filing Date Title
PCT/US2011/048105 WO2013025211A1 (en) 2011-08-17 2011-08-17 Tracing data block operations
CN201180072892.0A CN103748551A (en) 2011-08-17 2011-08-17 Tracing data block operations
EP11870999.7A EP2745199A4 (en) 2011-08-17 2011-08-17 Tracing data block operations
US14/237,953 US9213842B2 (en) 2011-08-17 2011-08-17 Tracing data block operations

Applications Claiming Priority (1)

Application Number Priority Date Filing Date Title
PCT/US2011/048105 WO2013025211A1 (en) 2011-08-17 2011-08-17 Tracing data block operations

Publications (1)

Publication Number Publication Date
WO2013025211A1 true WO2013025211A1 (en) 2013-02-21

Family

ID=47715334

Family Applications (1)

Application Number Title Priority Date Filing Date
PCT/US2011/048105 WO2013025211A1 (en) 2011-08-17 2011-08-17 Tracing data block operations

Country Status (4)

Country Link
US (1) US9213842B2 (en)
EP (1) EP2745199A4 (en)
CN (1) CN103748551A (en)
WO (1) WO2013025211A1 (en)

Families Citing this family (1)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN104361284B (en) * 2014-10-26 2018-02-13 深圳润迅数据通信有限公司 To third party's intrusion detection method of cloud storage packet

Citations (4)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
US20020010622A1 (en) * 2000-07-18 2002-01-24 Fumino Okamoto System and method capable of appropriately managing customer information and computer-readable recording medium having customer information management program recorded therein
US20020059287A1 (en) * 1999-08-31 2002-05-16 Fujitsu Limited File device and file access method
US20080270730A1 (en) * 2007-04-30 2008-10-30 Sandisk Il Ltd. Method for efficient storage of metadata in flash memory
US20090300607A1 (en) * 2008-05-29 2009-12-03 James Michael Ferris Systems and methods for identification and management of cloud-based virtual machines

Family Cites Families (8)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN100483983C (en) * 2003-08-27 2009-04-29 华为技术有限公司 User side data tracking method
US7797335B2 (en) * 2007-01-18 2010-09-14 International Business Machines Corporation Creation and persistence of action metadata
US8214343B2 (en) * 2008-03-19 2012-07-03 Microsoft Corporation Purposing persistent data through hardware metadata tagging
US8443166B2 (en) 2009-03-06 2013-05-14 Vmware, Inc. Method for tracking changes in virtual disks
US8234518B2 (en) * 2009-07-21 2012-07-31 Vmware, Inc. Method for voting with secret shares in a distributed system
US8589913B2 (en) 2009-10-14 2013-11-19 Vmware, Inc. Tracking block-level writes
WO2013009300A1 (en) * 2011-07-12 2013-01-17 Hewlett-Packard Development Company, L.P. Tracing operations in a cloud system
RU2523113C1 (en) * 2012-12-25 2014-07-20 Закрытое акционерное общество "Лаборатория Касперского" System and method for target installation of configured software

Patent Citations (4)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
US20020059287A1 (en) * 1999-08-31 2002-05-16 Fujitsu Limited File device and file access method
US20020010622A1 (en) * 2000-07-18 2002-01-24 Fumino Okamoto System and method capable of appropriately managing customer information and computer-readable recording medium having customer information management program recorded therein
US20080270730A1 (en) * 2007-04-30 2008-10-30 Sandisk Il Ltd. Method for efficient storage of metadata in flash memory
US20090300607A1 (en) * 2008-05-29 2009-12-03 James Michael Ferris Systems and methods for identification and management of cloud-based virtual machines

Non-Patent Citations (1)

* Cited by examiner, † Cited by third party
Title
See also references of EP2745199A4 *

Also Published As

Publication number Publication date
CN103748551A (en) 2014-04-23
EP2745199A1 (en) 2014-06-25
EP2745199A4 (en) 2015-04-01
US9213842B2 (en) 2015-12-15
US20140208432A1 (en) 2014-07-24

Similar Documents

Publication Publication Date Title
US11916920B2 (en) Account access security using a distributed ledger and/or a distributed file system
US12079342B2 (en) Data lineage management
EP2880589B1 (en) Trusted execution environment virtual machine cloning
EP2302509B1 (en) Synchronization of server-side cookies with client-side cookies
US8769269B2 (en) Cloud data management
US20160364200A1 (en) Remote desktop exporting
US10083245B1 (en) Providing secure storage of content and controlling content usage by social media applications
US10346618B1 (en) Data encryption for virtual workspaces
WO2015062339A1 (en) Method and device for running remote application program
KR102134491B1 (en) Network based management of protected data sets
US10216601B2 (en) Agent dynamic service
JP2022522678A (en) Secure execution guest owner environment control
EP3709199A1 (en) Container security policy handling method and related device
US9591079B2 (en) Method and apparatus for managing sessions of different websites
US10291721B2 (en) Remote document signing
Xiong et al. Design and implementation of a prototype cloud video surveillance system
US20220300159A1 (en) Backup services for distributed file systems in cloud computing environments
EP3213198B1 (en) Monitoring a mobile device application
JP6418419B2 (en) Method and apparatus for hard disk to execute application code
US10534626B2 (en) Methods for facilitating self-service automation utilities and devices thereof
US12124616B2 (en) Enhancement of trustworthiness of artificial intelligence systems through data quality assessment
US9213842B2 (en) Tracing data block operations
JP6205013B1 (en) Application usage system
CN117751347A (en) Techniques for distributed interface component generation
CN115113800A (en) Multi-cluster management method and device, computing equipment and storage medium

Legal Events

Date Code Title Description
121 Ep: the epo has been informed by wipo that ep was designated in this application

Ref document number: 11870999

Country of ref document: EP

Kind code of ref document: A1

REEP Request for entry into the european phase

Ref document number: 2011870999

Country of ref document: EP

WWE Wipo information: entry into national phase

Ref document number: 2011870999

Country of ref document: EP

WWE Wipo information: entry into national phase

Ref document number: 14237953

Country of ref document: US

NENP Non-entry into the national phase

Ref country code: DE