AU2003241506B2 - Mechanism for applying transforms to multi-part files - Google Patents

Mechanism for applying transforms to multi-part files Download PDF

Info

Publication number
AU2003241506B2
AU2003241506B2 AU2003241506A AU2003241506A AU2003241506B2 AU 2003241506 B2 AU2003241506 B2 AU 2003241506B2 AU 2003241506 A AU2003241506 A AU 2003241506A AU 2003241506 A AU2003241506 A AU 2003241506A AU 2003241506 B2 AU2003241506 B2 AU 2003241506B2
Authority
AU
Australia
Prior art keywords
data
stream
transform
computer
transforms
Prior art date
Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
Ceased
Application number
AU2003241506A
Other versions
AU2003241506B8 (en
AU2003241506A1 (en
Inventor
Jason M. Cahill
Roger Cheng
Michael J. Hillberg
David B. Ornstein
Current Assignee (The listed assignees may be inaccurate. Google has not performed a legal analysis and makes no representation or warranty as to the accuracy of the list.)
Microsoft Technology Licensing LLC
Original Assignee
Microsoft Corp
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
Filing date
Publication date
Application filed by Microsoft Corp filed Critical Microsoft Corp
Publication of AU2003241506A1 publication Critical patent/AU2003241506A1/en
Application granted granted Critical
Publication of AU2003241506B2 publication Critical patent/AU2003241506B2/en
Publication of AU2003241506B8 publication Critical patent/AU2003241506B8/en
Assigned to MICROSOFT TECHNOLOGY LICENSING, LLC reassignment MICROSOFT TECHNOLOGY LICENSING, LLC Request for Assignment Assignors: MICROSOFT CORPORATION
Anticipated expiration legal-status Critical
Ceased legal-status Critical Current

Links

Classifications

    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06FELECTRIC DIGITAL DATA PROCESSING
    • G06F9/00Arrangements for program control, e.g. control units
    • G06F9/06Arrangements for program control, e.g. control units using stored programs, i.e. using an internal store of processing equipment to receive or retain programs
    • G06F9/46Multiprogramming arrangements
    • G06F9/54Interprogram communication
    • G06F9/543User-generated data transfer, e.g. clipboards, dynamic data exchange [DDE], object linking and embedding [OLE]
    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06FELECTRIC DIGITAL DATA PROCESSING
    • G06F16/00Information retrieval; Database structures therefor; File system structures therefor
    • G06F16/20Information retrieval; Database structures therefor; File system structures therefor of structured data, e.g. relational data
    • G06F16/25Integrating or interfacing systems involving database management systems
    • G06F16/258Data format conversion from or to a database
    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06FELECTRIC DIGITAL DATA PROCESSING
    • G06F21/00Security arrangements for protecting computers, components thereof, programs or data against unauthorised activity
    • G06F21/60Protecting data
    • G06F21/62Protecting access to data via a platform, e.g. using keys or access control rules
    • G06F21/6209Protecting access to data via a platform, e.g. using keys or access control rules to a single file or object, e.g. in a secure envelope, encrypted and accessed using a key, or with access control rules appended to the object itself
    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06FELECTRIC DIGITAL DATA PROCESSING
    • G06F9/00Arrangements for program control, e.g. control units
    • G06F9/06Arrangements for program control, e.g. control units using stored programs, i.e. using an internal store of processing equipment to receive or retain programs
    • G06F9/46Multiprogramming arrangements
    • G06F9/54Interprogram communication
    • G06F9/545Interprogram communication where tasks reside in different layers, e.g. user- and kernel-space
    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06FELECTRIC DIGITAL DATA PROCESSING
    • G06F2209/00Indexing scheme relating to G06F9/00
    • G06F2209/54Indexing scheme relating to G06F9/54
    • G06F2209/542Intercept

Description

P:OPER\SE W\2009OctobeMI246734 amended pages Is p doc-1/10/2M MECHANISM FOR APPLYING TRANSFORMS TO MULTI-PART FILES Background of the Invention Computer systems today typically store a large amount of data in several files. The 5 format for the files may be one of several different formats that are compatible with various applications, such as word processors, spreadsheets, and the like. Many times it is necessary to transmit a file to another computer so that another user may see or manipulate the data within the file. Sometimes, when the file is quite large, a transformation (e.g., compression) is performed on the file before sending the file to the other computer. By 10 compressing the file, less bandwidth is needed to send the data to the other computer. In other situations, another transformation (e.g., encryption) may be performed to protect the data from being seen by unauthorized users. Some of these transformations have specific encoding methods and use a separate file (e.g., dictionary) to store information about the specific encoding method. The separate 15 file must be used when accessing the transformed file. If the separate file becomes corrupted, lost, or otherwise unavailable, the transformed file becomes useless. In addition, because some of these transformations define their own specific encoding methods for interleaving encoded data and processing information, once the file is transformed, the file can not be shared or have common processing performed on it. In addition, before 20 transforming a file, current transformations require that the data within the file to be arranged in contiguous bytes. Ensuring that the bytes for the file remain contiguous consumes a lot of overhead and is not viable for files that are edited quite often. Thus, while these transformations are very useful, the way in which they are implemented do not offer a versatile experience to users. 25 Summary of the Invention According to the present invention, there is provided a computer-implemented method, comprising: receiving a request to access a stream within a multi-part file; wherein the multi 30 part file includes streams,-data spaces and transformation instances; wherein each stream is associated with a data space; wherein each data space is associated with a list of P:OPER\SEW\209\Ociobe 246740 amended paes i s doc-8/JI(V21K9 transformation instances; and wherein each transformation instance includes a description of a transformation to be applied in a specific order to data within a stream; identifying a data space that is associated with the stream, the data space being identified from within the multi-part file; and 5 performing the transforms in the order specified in the list of transforms on data before completing the request. The present invention also provides a computer system having a mechanism for applying transforms to multi-part files, the computer system comprising: a processor; and 10 a memory, the memory being allocated for a plurality of computer-executable instructions which are loaded into the memory for execution by the processor, the computer-executable instructions performing a method comprising: receiving a request to access a stream within a multi-part file; wherein the multi part file includes streams and data spaces; wherein each data space specifies a list of 15 transforms to be applied in a specific order to data within a stream; identifying the list of transforms associated with the stream in response to the data space specification of the list of transformations, the list being identified from within the multi-part file; and performing the transforms in the order specified in the list of transforms on data 20 before completing the request. The present invention further provides a computer-readable storage medium encoded with a data structure, comprising: a first stream included within a multipart file, wherein the first stream includes a map that correlates a stream within the multi-part file with a name for a data space that 25 specifies a list of transforms to be applied in a particular order to the stream within the multipart file; a second stream included within the multipart file, wherein the second stream describes the data space by listing each of the transforms included within the data space in the particular order that they are to be applied to the stream; and 2 P.0PER\SEW\2009\OctoberA 2467341 amended pges Is spa doc-S/ IV2W a third stream, included within the multipart file, for each of the transforms, wherein the third stream identifies information that describes the transformations to be applied to streams associated with the transform. Embodiments of the present invention are directed at a system and method for 5 implementing transformations that provide greater flexibility to users. Briefly stated, one embodiment of the present invention provides a mechanism for storing transformation information associated with one or more transformations within a multi-part file. The multi-part file also contains the data upon which the one or more transformations are applied. Thus, one embodiment of the present invention provides a file format for the 10 multi-part file so that applications accessing the data may easily access the transformed data. In accordance with one embodiment of the invention, multiple data transforms may be chained together. These chained data transforms are referred to as a "data spaces". Each data space has a unique order and type for the transforms that are chained together. For example, two data spaces may specify the same transforms, but specify a different order 15 for applying the transforms. The transformation information contains information about the data spaces. In accordance with another embodiment of the invention, the multi-part file contains a plurality of streams. Each stream may be associated with one of the data spaces. Thus, in accordance with one embodiment of the present invention some streams within 20 the multi-part file may be transformed while other streams may remain in their native format. This ability to transform specific streams without requiring transformation of the entire multi-part file offers great flexibility to users, such as allowing the user to encrypt only the sensitive information within the multi-part file (e.g., redacting documents). Thus, embodiments of the present invention are directed at a system and method 25 for applying transforms to multi-part files. A request is received to access a stream within a multi-part file. Upon receipt of the request, a list of transforms associated with the stream is identified. The list is also included within the multi-part file. The transforms specified in the list of transforms are performed on data before completing the request. If the request is a write, the transforms encode the data. If the request is a read, the transforms decode the 30 data. The list of transforms is order dependent. The list of transforms includes a data structure having a first stream that includes a map that correlates the stream with a name 3 P OPER\SEW\09\ctoberk 2467340a mnded pges Ii spdoc-8/10/2009 for the list of transforms. A second stream that lists each of the transforms for the stream. A third stream for each of the transforms listed that identifies information associated with the transform. 5 Brief Description of the Drawings Embodiments of the present invention are hereinafter described, by way of example only, with reference to the accompanying drawings, wherein: FIGURE 1 is a functional block diagram that illustrates a computing device that may be used in implementations of the present invention. 10 FIGURE 2 is a functional flow diagram generally illustrating an overview of a transformation process in accordance with the present invention. FIGURE 3 is a graphical representation of an exemplary tree hierarchy that represents the transform metadata shown in FIGURE 2. FIGURE 4 is a graphical depiction of the transformation process. 15 FIGURE 5 is a logical flow diagram generally illustrating a process for accessing transformed data within a multi-part file, in accordance with one embodiment of the invention. Detailed Description of the Preferred Embodiment 20 The invention provides a mechanism for applying transforms to multi-part files. The mechanism provides a structure for specifying transformation information. The transformation information and the transformed data co-exist within the same document. The mechanism of the invention is preferably based on a multi-part file format that allows multiple types of streams within one document. The inventors have determined that the 25 Object Linking and Embedding (OLE) compound file format is especially well suited to implementations of the invention. Thus, the following discussion describes the invention using the compound file format. However, those skilled in the art, after a careful reading of the following description, will recognize that other multi-file formats may implement the present invention with various modifications to the mechanism described below to 30 accommodate the other multi-file formats. Thus, it will be appreciated that embodiments of the invention are not limited to those described here. 3a P\OPER\SEW\2009\Octobe6I12467340 amended pages I sp doc-K/I0/2(O9 The invention will be described here first with reference to one example of an illustrative computing environment in which embodiments of the invention can be implemented. Next, a detailed example of one specific implementation of the invention will be described. Alternative implementations may also be included with respect to certain 5 details of the specific implementation. 3b Illustrative Computing Environment of the Invention FIGURE 1 is a functional block diagram that illustrates a computing device that may be used in implementations of the present invention. FIGURE 1 illustrates an exemplary computing device that may be used in illustrative 5 implementations of the present invention. With reference to FIGURE 1, in a very basic configuration, computing device 100 typically includes at least one processing unit 102 and system memory 104. Depending on the exact configuration and type of computing device 100, system memory 104 may be volatile (such as RAM), non volatile (such as ROM, flash memory, etc.) or some combination of the two. System 10 memory 104 typically includes an operating system 105, one or more program modules 106, and may include program data 107. Examples of program modules 106 include a browser application, a finance management application, a word processor, and the like. This basic configuration is illustrated in FIGURE 1 by those components within dashed line 108 15 Computing device 100 may have additional features or functionality. For example, computing device 100 may also include additional data storage devices (removable and/or non-removable) such as, for example, magnetic disks, optical disks, or tape. Such additional storage is illustrated in FIGURE 1 by removable storage 109 and non-removable storage 110. Computer storage media may include 20 volatile and nonvolatile, removable and non-removable media implemented in any method or technology for storage of information, such as computer readable instructions, data structures, program modules, or other data. System memory 104, removable storage 109 and non-removable storage 110 are all examples of computer storage media. Computer storage media includes, but is not limited to, RAM, ROM, 25 EEPROM, flash memory or other memory technology, CD-ROM, digital versatile disks (DVD) or other optical storage, magnetic cassettes, magnetic tape, magnetic disk storage or other magnetic storage devices, or any other medium which can be used to store the desired information and which can be accessed by computing device 100. Any such computer storage media may be part of device 100. 30 Computing device 100 may also have input device(s) 112 such as keyboard, mouse, pen, voice input device, touch input device, etc. Output device(s) 114 such as a display, speakers, printer, etc. may also be included. These devices are well know in the art and need not be discussed at length here. 4 Computing device 100 may also contain communication connections 116 that allow the device 100 to communicate with other computing devices 118, such as over a network. Communication connections 116 are one example of communication media. Communication media may typically be embodied by 5 computer readable instructions, data structures, program modules, or other data in a modulated data signal, such as a carrier wave or other transport mechanism, and includes any information delivery media. The term "modulated data signal" means a signal that has one or more of its characteristics set or changed in such a manner as to encode information in the signal. By way of example, and not limitation, 10 communication media includes wired media such as a wired network or direct-wired connection, and wireless media such as acoustic, RF, infrared and other wireless media. The term computer readable media as used herein includes both storage media and communication media. General Discussion of Components 15 FIGURE 2 is a functional flow diagram generally illustrating an overview of components of an environment implementing the present invention. Illustrated is a multi-part file 202, preferably an OLE compound file. The OLE document model is known in the art and is widely recognized as a mechanism for containing many disparate types of data within a single document. Conventionally, 20 the OLE compound file is used in conjunction with having several embedded files or other support content associated with a single document. Each element in the compound file is stored in a manner such that it can be manipulated by the application that created the element. Each element is stored as a stream, such as streams 204, 206, and 208 shown in FIGURE 2. As mentioned above, each stream 25 may be one of several types. For instance, stream 1 204 may be a word processing document, stream2 206 may be a spreadsheet, and streamZ 208 may be a graphics file. In the past, upon requesting a transformation on the multi-part file 202, the entire content of multi-part file 202 (i.e., streams 204-208) would have been 30 required to be contiguous and would have been transformed together. However, in accordance with the present invention, the streams 204-208 need not be contiguous. Rather, the streams 204-208 may be sector-based. For the following discussion, sector-based files refer to files having multiple chunks of data that are stored and 5 that represent the entire stream. The multiple chunks may be stored contiguously, but typically are stored non-contiguously. In one embodiment, the chunks may be fixed size, such a fixed at 512 bytes. Alternatively, the chunks may be variably sized without departing from the scope of the present invention. When the stream is 5 edited, a new chunk of data may be created and stored in non-contiguous bytes in relations to the other chunks of data for the stream. Thus, sector-based files allow for easy editing of the stream without the overhead of ensuring that the stream remains contiguous. As will be described in detail below, the present invention allows 10 specified chunks of data 240 associated with a stream (e.g., stream 206) within the multi-part file 202 to be transformed without transforming other streams. Because the present invention allows specified streams to be transformed independent of other streams, the invention provides a great flexibility for securing and controlling data. For example, FIGURE 2 illustrates stream2 206 undergoing a transformation 15 process. Stream2 206 may represents a spreadsheet containing the costs associated with a particular item. Therefore, it may be desirable to secure this cost information so that unauthorized users can not view the costs. Thus, the data 240 destined for stream2 206 undergoes a chain of transforms (i.e., transforms 220-224). As one skilled in the art will appreciate, any number of transforms may be chained and may 20 be chained in any order. The specific transforms that are chained and the order in which the transforms are chained represent a data space 230. In general, a data space may specify one transform or may specify multiple transforms. In the above example, the last transform (e.g., transform 224) writes the transformed data to the stream2 206, which may reside on a disk (not shown). One embodiment for using 25 the mechanism for applying transforms to multi-part files is described in detail in conjunction with FIGURE 4 below. Discussion of a Particular Embodiment of the Invention FIGURE 3 is a graphical representation of one embodiment of a tree hierarchy that represents the transform metadata 210 shown in FIGURE 2. In 30 general, the tree hierarchy may be included within the multi-part file in any manner compatible with the multi-part file. The following discussion, describes the tree hierarchy with reference to compound files. In overview, compound files are commonly considered as a "file system within a file." Within the compound file is a 6 hierarchy of "storages," which are analogous to directories in a file system, and "streams," which are analogous to files in a file system. For FIGURE 3, rectangular boxes represent the streams and ovals represent the storage. Before describing the transform metadata 210 of the present invention, one will note that the streams 204 5 208 (shown in FIGURE 2) are illustrated under the root 302 in this example hierarchy. Defining streams under the root is a common technique for compound file formats. The transform metadata 210 provided by the present invention is now discussed in further detail. A special storage, named "\006DataSpaces" 310 off the 10 root 302 stores the transform metadata 210. The \006DataSpace storage 310 contains a DataSpaceMap stream 320, a DataSpacelnfo storage 330, and a TransformInfo storage 340. For this embodiment, the name chosen for the special storage, "\006DataSpaces", is written in context of the C Programming language. Thus, in this embodiment, the name begins with a single non-alphanumeric token 15 and a token value of 6. In general, the name assigned to the special storage is arbitrary and depends on the user's implementation. The DataSpaceMap stream 320 maps the streams (e.g., streams 204 208) with their associated data space. In one embodiment, the DataSpaceMap stream 320 is a table having two columns: a stream reference column 322 and a 20 DataSpaceName column 324. The contents within the stream reference column 322 refer to one of the streams (e.g., streams 204-208) stored within the compound document. The contents within the DataSpaceName refer to a specific data space that has been defined for the associated stream identified within the stream reference column 322. One data space may be associated with any number of streams. For 25 example, as shown in FIGURE 3, the data space identified as "DataSpaceNamel" is associated with Streaml 204 and Stream2 206. While the above description of the DataSpaceMap stream 320 describes the DataSpaceMap stream 320 as a table, those skilled in the art will appreciated that other data formats may also be used to identify and correlate the stream with a data space. 30 The DataSpaceInfo storage 330 contains one or more DataSpaceName streams (e.g., DataSpaceName stream 332 and 334). For the described embodiment, the DataSpaceName stream is named in accordance with standard, compound-file short name conventions. Each DataSpaceName stream 332 7 and 334 identifies a list 336 of transforms associated with the respective DataSpaceName stream 332 and 334. In one embodiment, each of the DataSpaceName streams 332 and 334 may be an ordered list of the transforms that make up the data space. Because transforms stack, the order within the list 336 is 5 important. In one embodiment, the first transform 337 within the list 336 is referred to as the "bottom" transform, which means the transform 337 is closest to the bits in the underlying data stream (e.g., stream 204). The last transform 339 within the list 336 is referred to as the "top" transform, which means the transform 339 is the closest to the consumer/producer of the data (e.g., an application). As will be 10 described in detail below in conjunction with FIGURE 4, the order specified in the list 336 determines the flow of data through the transforms. The TransformInfo storage 340 contains one or more TransformInstance storages (e.g., TransformInstance storage 342, 344, and 346). In one embodiment, the names of these substorages are the names of the transforms. 15 Within each of the TransformInstance storages 342, 344, and 346, there is at least one stream named "\006Primary" 350. The \006Primary stream 350 contains pertinent information about the specific transform, such as TransformClass Type 354 and TransformClass Name 356. The TransformClass Type 354 denotes a particular transform class that implements a particular transform (e.g., LZ 20 compression, Digital Rights Management (DRM) protection, and the like). In one embodiment, the TransformClass Name 356 is specified as a string that uniquely identifies the class (i.e., type) of the transform. The string that identifies the class may be a class name for the class that implements the transform. The TransformClass Type 354 specifies a type indicator that tells how to interpret the 25 string specified in the TransformClass Name 356. The \006Primary stream 350 may also contain space for TransformInstance Data 358. The TransformlnstanceData 358 stores information specified to the transform specified by the TransformClass Name 356 and TransformClass Type 354. For example, if the transform is a compression transform, the TransformInstanceData 358 may contain a window size 30 and the like. For certain transforms, the TransformInstanceData 358 may not allow sufficient space to store the necessary information. Thus, as a further refinement, the present invention allows transforms to store additional information 8 in a TransformInstanceData stream (e.g., TransformInstanceData stream 370). This is allowed as long as there are no name collisions with the \006Primary stream 350. The nature of the TransformInstanceData will vary depending on the type of transform. 5 While the above tree hierarchy describes one embodiment of a document format for storing transformed data with its transformation information, those skilled in the art will appreciate that the hierarchy may be changed without impacting the operation of the present invention. Therefore, any tree hierarchy in which transformation information is stored along with the transfonned data does not 10 depart from the present invention. FIGURE 4 is a graphical depiction of the transformation process in which the mechanism for formatting documents having transformed data in accordance with the present invention is used. In this illustrative transformation process, an application 400 attempts to read and write to the multi part file 202 described in FIGURE 2. In general, each instance of a transform class 15 takes an IStream interface as input, and outputs the encoded (i.e., transformed) data to another IStream interface. The transforms (e.g., transforms 420 and 422) have been registered and the data space associated with stream 206 as already been specified, such as via application programming interfaces provided by OLE compound documents. For example, when stream 206 was first created, the 20 application that created the stream 206 within multi-part document 202 was responsible for specifying which transforms to apply to the data. This may have occurred via an argument list, where each argument referred to a transform. The read and write access is via an OS layer. In the past, a write operation would have accessed stream2 206 via IStream interface 414. However, in 25 accordance with the present invention, one or more transforms may be inserted before the IStream interface 414. Each transform (e.g., transform 420 and 422) takes an IStream interface as input (IStream interface 410 and 412, respectively), and output their encoded (i.e., transformed) data to another IStream interface (IStream interface 412 and 414, respectively). 30 Likewise, when application 400 attempts to read stream2 206 within multi-part file 202, one or more inverse transforms (e.g., inverse transforms 450 and 452) may be inserted. The number of inverse transforms is identical to the number of transforms in order for the data to be properly decoded so that the application can 9 understand the data. The manner in which the transforms are inserted between the application 400 and the stream 206 is now described in conjunction with FIGURE 5. FIGURE 5 is a logical flow diagram generally illustrating a process for accessing transformed data within a compound file, in accordance with the 5 invention. The process 500 begins at a starting block 501 where an application has requested an access to data within in stream of a multi-part file. The transform information 210 has already been specified for the stream. At decision block, a detennination is made whether the stream is a member of a data space. Referring to FIGURE 3, for one embodiment, this 10 determination is made by searching within the DataSpaceMap for the stream reference 322 that identifies the requested stream. If the stream reference 322 associated with the stream is not found, the stream does not have any transforms defined and processing proceeds to the end. In this situation, the application accesses the data in the way in which it was done before the present invention. 15 However, if the stream reference 322 is contained within the DataSpaceMap, processing continues at block 504. At block 504, the DataSpaceName associated with the stream reference 322 is obtained. The DataSpaceName may be a string or any other format. At block 506, using the DataSpaceName obtained from block 504, 20 the DataSpaceInfo storage is searched to identify the DataSpaceName stream associated with the DataSpaceName identified within the DataSpaceMap. The DataSpaceName stream contains a list of transforms associated with this data space name. At block 508, a transform from within the list is identified. 25 Depending on whether the access is a write or a read, the transform may encode the data or may decode the data, respectively. The DataSpaceName stream lists each transform in a specific order. If the access is a write, the order is from top to bottom. If the access is a read, the order is from bottom to top. At block 510, the identified transform is applied. When applying the 30 transform, the transform instance data is used to properly transform the data. If the access is a write, the transform (encode) is applied. If the access is a read, the inverse transform (decode) is applied. 10 P:\OPER\SEW\200)9%Octobem246734)amcndpages Is spa doc-R/IW209 At decision block 512, a determination is made whether the data space includes any further chained transforms. This may be determined by seeing whether list 336 references any more transform instances. If the last transform in the data space has been applied, the last transform outputs the data and the process is complete. However, if there is another 5 transform listed, processing loops back to block 508 and proceeds as described above until the last transform has been applied. In addition, one skilled in the art will appreciate that the functionality provided by process 300 may be implemented in various ways. For example, there may be a mapping directly from the stream name to a transform list (skipping the use of a data space). Thus, 10 the present invention includes this and other embodiments for mapping the stream to its transform information. Process 500 illustrates one such embodiment. The above specification, examples and data provide a complete description of the manufacture and use of the composition of the invention. Since many embodiments of the invention can be made without departing from the spirit and scope of the invention, the 15 invention resides in the claims hereinafter appended. Throughout this specification and the claims which follow, unless the context requires otherwise, the word "comprise", and variations such as "comprises" and "comprising", will be understood to imply the inclusion of a stated integer or step or group of integers or steps but not the exclusion of any other integer or step or group of integers or 20 steps. The reference in this specification to any prior publication (or information derived from it), or to any matter which is known, is not, and should not be taken as an acknowledgment or admission or any form of suggestion that that prior publication (or information derived from it) or known matter forms part of the common general 25 knowledge in the field of endeavour to which this specification relates. 11

Claims (28)

1. A computer-implemented method, comprising: receiving a request to access a stream within a multi-part file; wherein the multi 5 part file includes streams,-data spaces and transformation instances; wherein each stream is associated with a data space; wherein each data space is associated with a list of transformation instances; and wherein each transformation instance includes a description of a transformation to be applied in a specific order to data within a stream; identifying a data space that is associated with the stream, the data space being 10 identified from within the multi-part file; and performing the transforms in the order specified in the list of transforms on data before completing the request.
2. The computer-implemented method of claim 1, wherein the multi-part file includes 15 an OLE compound file.
3. The computer-implemented method of claim 1, wherein identifying the data space comprises searching a data space map for the stream, the data space map providing a correlation between the stream and a data space name associated with the data space. 20
4. The computer-implemented method of claim 1, wherein the list includes a type and a name for each transform listed.
5. The computer-implemented method of claim 1, wherein the list further includes 25 transform instance data for one of the transforms identified in the list, the transform instance data specifies pertinent information for decoding data transformed by the one transform.
6. The computer-implemented method of claim 1, wherein the request is a write 30 request of a data chunk and the at least one transform is performed on the data chunk to create encoded data that is written to a storage medium. 12 P OPER\SEW\2I9\Octobe:\ 246134( amended pages is 6pa do-X/)f/2 9
7. The computer-implemented method of claim 6, wherein the data chunk is not contiguous with other data chunks previously written to the storage medium for the stream. 5
8. The computer-implemented method of claim 6, wherein the data chunk is a fixed size chunk with other data chunks for the stream.
9. The computer-implemented method of claim 1, wherein the list includes a compression transform and an encryption transform. 10
10. The computer-implemented method of claim 1, wherein the request is a read request of a data chunk and the at least one transform performs an inverse transform on the data chunk retrieved from the stream to provide decoded data to the requestor. 15
11. A computer system having a mechanism for applying transforms to multi-part files, the computer system comprising: a processor; and a memory, the memory being allocated for a plurality of computer-executable instructions which are loaded into the memory for execution by the processor, the 20 computer-executable instructions performing a method comprising: receiving a request to access a stream within a multi-part file; wherein the multi part file includes streams and data spaces; wherein each data space specifies a list of transforms to be applied in a specific order to data within a stream; identifying the list of transforms associated with the stream in response to the data 25 space specification of the list of transformations, the list being identified from within the multi-part file; and performing the transforms in the order specified in the list of transforms on data before completing the request. 30
12. The computer system of claim 11, wherein the multi-part file includes an OLE compound file. 13 P:\OPER\SEW2009\Octobc61246734tl amended pcs it spa doc-8/10/2MY
13. The computer system of claim 11, wherein identifying the list of transforms comprises searching a data space map for the stream, the data space map providing a correlation between the stream and a data space name associated with the data space. 5
14. The computer system of claim 11, wherein the list of transforms includes a type and a name for each transform listed.
15. The computer system of claim 11, wherein the list further includes transform 10 instance data for one of the transforms identified in the list, the transform instance data specifies pertinent information for decoding data transformed by the one transform.
16. The computer system of claim 11, wherein the request is a write request of a data chunk and the at least one transform is performed on the data chunk to create encoded data 15 that is written to a storage medium.
17. The computer system of claim 16, wherein the data chunk is not contiguous with other data chunks previously written to the storage medium for the stream. 20
18. The computer system of claim 16, wherein the data chunk is a fixed size chunk with other data chunks for the stream.
19. The computer system of claim 11, wherein the list includes a compression transform and an encryption transform. 25
20. The computer system of claim 11, wherein the request is a read request of a data chunk and the at least one transform performs an inverse transform on the data chunk retrieved from the stream to provide decoded data to the requestor. 30
21. A computer-readable storage medium encoded with a data structure, comprising: 14 PAOPERSE W2009\OctobeM2467340amened pags Is a doc-8/I020)9 a first stream included within a multipart file, wherein the first stream includes a map that correlates a stream within the multi-part file with a name for a data space that specifies a list of transforms to be applied in a particular order to the stream within the multipart file; 5 a second stream included within the multipart file, wherein the second stream describes the data space by listing each of the transforms included within the data space in the particular order that they are to be applied to the stream; and a third stream, included within the multipart file, for each of the transforms, wherein the third stream identifies information that describes the transformations to be 10 applied to streams associated with the transform.
22. The computer-readable storage medium of claim 21, wherein the data structure is included within the multi-part file. 15
23. The computer-readable storage medium of claim 21, wherein the name for the list of transforms may be correlated with multiple streams within the multi-part file.
24. The computer-readable storage medium of claim 21, wherein the list of transforms is order dependent. 20
25. The computer-readable storage medium of claim 21, wherein the information identifies a class name and a class type for the transform.
26. The computer-readable storage medium of claim 25, wherein the information 25 further identifies instance data associated with the transform.
27. The computer-readable storage medium of claim 26, wherein the transform includes a compression transform and the instance data includes a window size for the compression transform. 30 15 P:\OPER\SEW\209\OctoberAt 2467340 amended pages Isa spadoc.R/In/20U9
28. A computer-implemented method, or a computer system, or a computer-readable storage medium substantially as hereinbefore described with reference to the accompanying drawings. 16
AU2003241506A 2003-05-17 2003-05-17 Mechanism for applying transforms to multi-part files Ceased AU2003241506B8 (en)

Applications Claiming Priority (1)

Application Number Priority Date Filing Date Title
PCT/US2003/015707 WO2004107198A1 (en) 2003-05-17 2003-05-17 Mechanism for applying transorms to multi-part files

Publications (3)

Publication Number Publication Date
AU2003241506A1 AU2003241506A1 (en) 2005-01-21
AU2003241506B2 true AU2003241506B2 (en) 2009-11-26
AU2003241506B8 AU2003241506B8 (en) 2010-01-07

Family

ID=33449011

Family Applications (1)

Application Number Title Priority Date Filing Date
AU2003241506A Ceased AU2003241506B8 (en) 2003-05-17 2003-05-17 Mechanism for applying transforms to multi-part files

Country Status (9)

Country Link
EP (1) EP1625510A4 (en)
JP (1) JP4378342B2 (en)
KR (1) KR100965709B1 (en)
CN (1) CN100430916C (en)
AU (1) AU2003241506B8 (en)
BR (1) BR0306999A (en)
CA (1) CA2470720C (en)
MX (1) MXPA04007407A (en)
WO (1) WO2004107198A1 (en)

Families Citing this family (5)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
JP4862079B2 (en) * 2006-05-31 2012-01-25 サイトリックス システムズ, インコーポレイテッド System and method for determining character set encoding for request submission decoding at gateway
US8255373B2 (en) 2008-10-24 2012-08-28 Microsoft Corporation Atomic multiple modification of data in a distributed storage system
US8886796B2 (en) 2008-10-24 2014-11-11 Microsoft Corporation Load balancing when replicating account data
US9996572B2 (en) 2008-10-24 2018-06-12 Microsoft Technology Licensing, Llc Partition management in a partitioned, scalable, and available structured storage
CN108337208B (en) * 2017-01-19 2020-11-24 中国电信股份有限公司 Data protection method and replacement device based on cloud storage and cloud storage system

Citations (4)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
WO2001037135A2 (en) * 1999-11-16 2001-05-25 Informatica Corporation System and method for managing a database
US20020003886A1 (en) * 2000-04-28 2002-01-10 Hillegass James C. Method and system for storing multiple media tracks in a single, multiply encrypted computer file
US6357042B2 (en) * 1998-09-16 2002-03-12 Anand Srinivasan Method and apparatus for multiplexing separately-authored metadata for insertion into a video data stream
US6529948B1 (en) * 1999-08-31 2003-03-04 Accenture Llp Multi-object fetch component

Family Cites Families (7)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
NL8901827A (en) * 1989-07-14 1991-02-01 Oce Nederland Bv SYSTEM FOR PROCESSING FILE ORGANIZED DATA, MANAGEMENT MODULE USED THEREIN AND STORAGE MEDIA, PROVIDED WITH THE PROGRAMMING OF THIS MANAGEMENT MODULE.
JP3308770B2 (en) * 1994-07-22 2002-07-29 三菱電機株式会社 Information processing apparatus and calculation method in information processing apparatus
US5742818A (en) * 1995-12-15 1998-04-21 Microsoft Corporation Method and system of converting data from a source file system to a target file system
US20020001412A1 (en) * 1998-07-21 2002-01-03 Hewlett-Packard Company System for variable quantization in jpeg for compound documents
SE513356C2 (en) * 1998-11-20 2000-08-28 Ericsson Telefon Ab L M Method and apparatus for encrypting images
US7644366B1 (en) * 1999-07-30 2010-01-05 Computer Associates Think, Inc. Method and system for displaying a plurality of discrete files in a compound file
EP1189432A3 (en) * 2000-08-14 2004-10-20 Matsushita Electric Industrial Co., Ltd. A hierarchical encryption scheme for secure distribution of predetermined content

Patent Citations (4)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
US6357042B2 (en) * 1998-09-16 2002-03-12 Anand Srinivasan Method and apparatus for multiplexing separately-authored metadata for insertion into a video data stream
US6529948B1 (en) * 1999-08-31 2003-03-04 Accenture Llp Multi-object fetch component
WO2001037135A2 (en) * 1999-11-16 2001-05-25 Informatica Corporation System and method for managing a database
US20020003886A1 (en) * 2000-04-28 2002-01-10 Hillegass James C. Method and system for storing multiple media tracks in a single, multiply encrypted computer file

Also Published As

Publication number Publication date
MXPA04007407A (en) 2005-02-17
JP2006526191A (en) 2006-11-16
WO2004107198A1 (en) 2004-12-09
AU2003241506B8 (en) 2010-01-07
KR20060037217A (en) 2006-05-03
EP1625510A1 (en) 2006-02-15
JP4378342B2 (en) 2009-12-02
AU2003241506A1 (en) 2005-01-21
KR100965709B1 (en) 2010-06-24
EP1625510A4 (en) 2008-03-12
CA2470720A1 (en) 2004-11-17
BR0306999A (en) 2005-02-09
CN100430916C (en) 2008-11-05
CA2470720C (en) 2011-10-18
CN1615477A (en) 2005-05-11

Similar Documents

Publication Publication Date Title
US7523221B2 (en) Mechanism for applying transforms to multi-part files
US7631022B2 (en) Information processing apparatus and recording medium
US7496586B1 (en) Method and apparatus for compressing data in a file system
US7188118B2 (en) System and method for detecting file content similarity within a file system
US7293150B2 (en) Method and system for creating and restoring an image file
US9460060B2 (en) System and method for managing dynamic document references
US10726147B2 (en) File encryption support for FAT file systems
EP1586988A2 (en) Information processing apparatus, information processing method, program and recording medium used therewith
US6775678B1 (en) Data structure and method of storing a digital document
US20070156745A1 (en) System and method for managing dynamic document references
JP2016509309A (en) Method and system for storing and retrieving data
JP2008537259A (en) Efficient description of relationships between resources
CA2542162A1 (en) Preload library for transparent file transformation
US8250468B2 (en) System and method for managing dynamic document references
US20070150807A1 (en) System and method for managing dynamic document references
US7415480B2 (en) System and method for providing programming-language-independent access to file system content
AU2003241506B8 (en) Mechanism for applying transforms to multi-part files
US20110271064A1 (en) Storage device and method for accessing the same
JP2005108239A (en) Storage system for storing data in hierarchical data structure
CN105243090A (en) Exclusive file acquisition method and system
Munegowda et al. Directory compaction techniques for space optimizations in ExFAT and FAT file systems for embedded storage devices
RU2348069C2 (en) Mechanism of modification application to files consisting of multiple parts

Legal Events

Date Code Title Description
TH Corrigenda

Free format text: IN VOL 23, NO 46, PAGE(S) 11223 UNDER THE HEADING APPLICATIONS ACCEPTED - NAME INDEX UNDER THE NAME MICROSOFT CORPORATION, APPLICATION NO. 2003241506, UNDER INID (72), CORRECT THE CO-INVENTORS TO ORNSTEIN, DAVID B.; CHENG, ROGER; HILLBERG, MICHAEL J.: AND CAHILL, JASON M.

FGA Letters patent sealed or granted (standard patent)
PC Assignment registered

Owner name: MICROSOFT TECHNOLOGY LICENSING, LLC

Free format text: FORMER OWNER WAS: MICROSOFT CORPORATION

MK14 Patent ceased section 143(a) (annual fees not paid) or expired