CN101419626B

CN101419626B - File system name space management method orienting application

Info

Publication number: CN101419626B
Application number: CN2008102366002A
Authority: CN
Inventors: 冯丹; 施展; 朱春霖; 李志超; 赵恒�; 李勇; 邓聪林
Original assignee: Huazhong University of Science and Technology
Current assignee: Huazhong University of Science and Technology
Priority date: 2008-11-28
Filing date: 2008-11-28
Publication date: 2010-06-09
Anticipated expiration: 2028-11-28
Also published as: CN101419626A

Abstract

The invention relates to an application-oriented file system namespace management method which belongs to the technical field of computer storage and aims at solving the problem that the file application semantics is not included in the metadata extension contents in the existing file systems so as to shorten the metadata access time and improve the application access efficiency. The method comprises three steps, i.e. extracting application semantics, cutting namespace and accessing cut namespace. By extracting the application semantics between the processes of the application program and the file accessed by the application program, the file system namespace that faces the processes of each application program is cut so that the metadata access time of the application program is shortened and the access efficiency is improved, and the method is suitable for the file system with more and more complex namespace.

Description

A kind of application oriented file system name space management method

Technical field

The invention belongs to computer memory technical field, be specifically related to a kind of application oriented file system name space management method.

Background technology

Along with the growth in PC market and improving constantly of storage device data storage capacity, the personal data quantitative change gets increasing.File system is as the basis of leading subscriber data, and its name space becomes and becomes increasingly complex, and this causes the file system metadata access performance to become more and more lower.

In computer system, file comprises metadata and data two parts, and metadata is the description to attributes such as file size, types, and the name of file is in the metadata, is used to identify a file; File system is the set of file, and the name space of file system is the set of the name of its included file.

The name space that the process of application program is visited is the set of the name of this process institute access file.

File system metadata occupies very consequence in semantic file systems, many semantic informations all are to represent by the form of metadata.Current file system does not give metadata enough attention, carries out content search so they can not make full use of the metadata of file system, and some senior correlation data accesses based on semanteme more can not be provided.Present many semantic file systems all are the modes by extended file attributes in metadata, comprising polytype semantic information, and utilize the metadata of these expansions to improve the recall precision of file.

The content of metadata expansion can comprise following a few class substantially in the semantic file systems at present:

1 running state information.Many fields all need running state information, such as application program and compiler.Application program need be to the short-term of status information and long-term storage, such as license information, and user profile, password, DNS or smtp server, ORB information etc.Compiler also can produce the information of relevant procedures data type.

2 data models.Data base management system (DBMS), windows registry, the interface storage vault is all being safeguarded the information about data structure and pattern, data type for example, index, constraint, contact and interface etc.

3 multimedias.The image pattern sheet, video, the sense of hearing such as music and visual information can not therefrom extract the useful form that is used to inquire about easily.Therefore just need metadata information to be used to provide relevant visit.Common this information may be imported after manually extracting the feature pattern according to input after the key class data or by picture processing program.

Though current many file system, such as the WinFS of Microsoft, the Spotlight of apple, semantic information in a way all is provided, but the semantic file systems that makes up than the open widely data model method of a use, the semantic information that they comprise all also is nowhere near.Most of file system are used the hierarchical directory structure, and this structure only is only applicable on a small scale or the file system of medium-scale name space.In the large-scale data file system, the classification of file and retrieval will become very difficult.The user needs more effectively constituent act, the method for retrieving files badly.

Summary of the invention

The present invention proposes a kind of application oriented file system name space management method, and the content that solves metadata expansion in the existing file system is the problem of the application semantics of include file not, to shorten the metadata access time and to improve the efficient of application access.

A kind of application oriented file system name space management method of the present invention comprises the following steps:

(1) application semantics extraction step: move each application program in advance, follow the tracks of of the visit of the process of each application program file, and increase hereof one the record metadata be used for preserving the process name of the application program of visiting this document; Then according to all record metadata, extract application semantics, every application semantics is described the relation between a process and the institute's access file, and the form of each application semantics with the XML file is saved in the database, and wait is called;

(2) name space cutting step: when this step is called, process name according to application program takes out its corresponding application semantics from database, cut out name space with this process context according to application semantics, its form with dom tree is resided in the internal memory, return then;

(3) accessing step of cutting name space: actual motion application program, when the process access file of application program, file system is redirected to name space with the cutting of this process context with the request of access that receives, and is redirected successfully, then finishes request of access; Otherwise call name space cutting step, send request of access to file system again.

Described application oriented file system name space management method is characterized in that:

(1) described application semantics extraction step, order is carried out following process:

(1.1) start each application program, when file was accessed, order was carried out;

(1.2) title of interpolation visit process in the record metadata of accessed file F, the form of record metadata is A:B:C: ..., wherein F is accessed filename, A, B, C are respectively the process name of the application program of this file of visit;

(1.3) all application programs finish, and according to the record metadata of all accessed files, extract application semantics, and its form is A=F1:F2:F3: ..., wherein A is a process name, F1, F2, F3 are accessed filename;

(1.4) with the application semantics of the process of each application program of being extracted, write as the XML file, be saved in the database, finished;

(2) described name space cutting step, order is carried out following process:

(2.1) when this step is called, obtain the ID parameter of program process;

(2.2) according to the ID parameter of described process, visit its data structure, obtain process name A;

(2.3) from database, read the XML file of the application semantics that comprises process name A;

(2.4) the XML document analysis of reading is become dom tree, the All Files name metadata corresponding that the application semantics of process name A comprises is added in this dom tree, and this dom tree is resided in the internal memory, obtain the name space of the cutting related, finish with A;

(3) accessing step of described cutting name space, carry out following process:

(3.1) during the process A access file of application program, send request of access, carry out next step to file system;

(3.2) file system obtains the ID parameter of process A from request package, request of access is redirected to the name space of the cutting related with cA again, carries out next step;

(3.3) redirect operation checks in internal memory whether the dom tree corresponding with the ID parameter of process A exists, and exists then to turn over journey (3.5), does not exist and then carries out next step;

(3.4) call name space cutting step, obtain the name space of the cutting related, turn over journey (3.2) with process A;

(3.5) in the name space of the cutting related, finish the request of access of file, proceed application program with process A;

(3.6) application program is finished when withdrawing from, and removes the name space of the cutting relevant with its all included processes in internal memory, finishes.

The present invention is saved in each application semantics in the database with the XML file by extracting the process and the related information between its access file (being the application semantics of file) of application program, avoids the extraction that repeats that causes because system restarts; And utilize the name space of the process context of the application semantics cutting of these files and application program, taken into full account and utilized the greatest differences of name space of the process of system and application program, shorten the time of the process accesses meta-data of application program, improve the access efficiency performance of metadata, be applicable to the file system that current name space becomes and becomes increasingly complex.

Description of drawings

Fig. 1 is a schematic flow sheet of the present invention;

Fig. 2 is an application semantics extraction step schematic flow sheet of the present invention;

Fig. 3 is a name space cutting steps flow chart synoptic diagram of the present invention;

Fig. 4 is the accessing step schematic flow sheet of cutting name space of the present invention;

Embodiment

Suppose that an application program is made up of 2 processes, process name is respectively PA, PB; Visit 5 files altogether, be respectively File1, File2, File3, File4, File5.

Fig. 2 is an application semantics extraction step synoptic diagram;

(1) start this application program, when file was accessed, order was carried out;

(2) title of interpolation visit process in the record metadata of accessed file;

(3) all application programs finish, and according to the record metadata of all accessed files, extract application semantics, and application semantics is PA=File1:File2:File3, PB=File4:File5;

(4) with the application semantics of the process of each application program of being extracted, write as the XML file, be saved in the database;

(5) finish.

Fig. 3 is a name space cutting flow chart of steps;

(1) when this step is called, obtains the ID parameter of program process;

(2) according to the ID parameter of described process, visit its data structure, obtain process name PA;

(3) from database, read the XML file of the application semantics that comprises process name PA;

(4) the XML document analysis of reading is become dom tree, File1, File2, File3 metadata corresponding that the application semantics of process name PA comprises are added in this dom tree, and this dom tree is resided in the internal memory, obtain the name space of the cutting related with PA;

(5) finish.

Fig. 4 is the access process process flow diagram of cutting name space;

(1) during the process PA access file File1 of application program, sends request of access, carry out next step to file system;

(2) file system obtains the ID parameter of process PA from request package, request of access is redirected to the name space of the cutting related with process PA again, carries out next step;

(3) redirect operation checks in internal memory whether the dom tree corresponding with the ID parameter of process PA exists, and exists then to turn over journey (5), does not exist and then carries out next step;

(4) call name space cutting step, obtain the name space of the cutting related, turn over journey (2) with process PA;

(5) in the name space of the cutting related, finish the request of access of file File1, proceed application program with process PA;

(6) application program is finished when withdrawing from, and removes the name space of the cutting relevant with its included process PA in internal memory;

(7) finish.

Claims

1. an application oriented file system name space management method comprises the following steps:

(2) name space cutting step, order is carried out following process:

(2.1) when this step is called, obtain the ID parameter of program process;

(2.4) the XML document analysis of reading is become dom tree, the All Files name metadata corresponding that the application semantics of process name A comprises is added in this dom tree, and this dom tree is resided in the internal memory, obtain the name space of the cutting related, return then with A;

(3) accessing step of cutting name space, carry out following process:

(3.2) file system obtains the ID parameter of process A from request package, request of access is redirected to the name space of the cutting related with process A again, carries out next step;

2. application oriented file system name space management method as claimed in claim 1 is characterized in that:

(1.2) title of interpolation visit process in the record metadata of accessed file F, the form of record metadata is A:B:C:..., and wherein F is accessed filename, and A, B, C are respectively the process name of the application program of this file of visit;

(1.3) all application programs finish, and according to the record metadata of all accessed files, extract application semantics, and its form is A=F1:F2:F3: ..., wherein A is a process name, F1, and F2, F3 are accessed filename;

(1.4) with the application semantics of the process of each application program of being extracted, write as the XML file, be saved in the database, finished.