CN101419626B - File system name space management method orienting application - Google Patents

Info

Publication number
CN101419626B
Authority
CN
China
Prior art keywords
application
file
name
name space
application program
Prior art date
Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
Expired - Fee Related
Application number
CN2008102366002A
Other languages
Chinese (zh)
Other versions
CN101419626A (en)
Inventor
冯丹
施展
朱春霖
李志超
赵恒�
李勇
邓聪林
Current Assignee (The listed assignees may be inaccurate. Google has not performed a legal analysis and makes no representation or warranty as to the accuracy of the list.)
Huazhong University of Science and Technology
Original Assignee
Huazhong University of Science and Technology
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
Filing date
Publication date
Application filed by Huazhong University of Science and Technology
Priority to CN2008102366002A
Publication of CN101419626A
Application granted
Publication of CN101419626B
Expired - Fee Related
Anticipated expiration

Landscapes

  • Information Retrieval, Db Structures And Fs Structures Therefor (AREA)

Abstract

The invention relates to an application-oriented file system namespace management method in the technical field of computer storage. It addresses the problem that the extended metadata of existing file systems does not include the application semantics of files, and thereby shortens metadata access time and improves application access efficiency. The method comprises three steps: extracting application semantics, cutting the namespace, and accessing the cut namespace. By extracting the application semantics that relate each application process to the files it accesses, the method cuts the file system namespace down to the part relevant to each application process, which shortens the application's metadata access time and improves access efficiency. The method is suited to file systems whose namespaces are growing increasingly complex.

Description

An application-oriented file system namespace management method
Technical field
The invention belongs to the technical field of computer storage and specifically relates to an application-oriented file system namespace management method.
Background
With the growth of the PC market and the steady increase in the capacity of storage devices, the volume of personal data keeps growing. The file system is the foundation for managing user data, and its namespace is becoming increasingly complex, which steadily degrades the performance of file system metadata access.
In a computer system, a file consists of two parts, metadata and data. Metadata describes attributes such as file size and type. The name of a file is part of its metadata and is used to identify the file. A file system is a set of files, and the namespace of a file system is the set of the names of the files it contains.
The namespace accessed by an application process is the set of the names of the files that the process accesses.
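The two definitions above can be read as plain set relations. The following is a minimal sketch only; the file names and the process PA are hypothetical values chosen for illustration, not data from the patent.

```python
# Hypothetical file names, for illustration only.
file_system_namespace = {"File1", "File2", "File3", "File4", "File5", "File6"}

# Names of the files that a hypothetical process PA actually accesses.
process_namespace = {"File1", "File2", "File3"}

# The namespace of a process is a subset of the file system namespace.
is_subset = process_namespace <= file_system_namespace

# Names the process never touches; a per-process view can leave them out.
unused_names = file_system_namespace - process_namespace
```

The gap between the two sets is what the method described later exploits: a process only needs metadata for the names it actually uses.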
File system metadata plays a very important role in semantic file systems, where much of the semantic information is represented as metadata. Current file systems do not give metadata enough attention, so they cannot make full use of file system metadata for content search, let alone provide advanced, semantics-based access to related data. Many existing semantic file systems extend the file attributes in the metadata to include several types of semantic information, and use this extended metadata to improve the efficiency of file retrieval.
The extended metadata in current semantic file systems falls broadly into the following categories:
1. Runtime state information. Many domains need runtime state information, for example applications and compilers. Applications need short-term and long-term storage of state information such as license information, user information, passwords, DNS or SMTP servers, and ORB information. Compilers also produce information about the data types of a program.
2. Data models. Database management systems (DBMS), the Windows registry, and interface repositories all maintain information about data structures and schemas, for example data types, indexes, constraints, relationships, and interfaces.
3. Multimedia. Auditory and visual information such as images, video, and music does not readily yield a form that is useful for queries, so metadata is needed to support access to it. This information is typically entered manually as keyword-like data, or entered after a picture-processing program has extracted feature patterns.
Many current file systems and search facilities, such as Microsoft's WinFS and Apple's Spotlight, provide semantic information to some extent, but compared with a semantic file system built on a widely open data model, the semantic information they contain is far from sufficient. Most file systems use a hierarchical directory structure, which suits only file systems with small or medium-sized namespaces. In a file system holding large-scale data, classifying and retrieving files becomes very difficult. Users urgently need more effective methods for organizing and retrieving files.
Summary of the invention
The invention proposes an application-oriented file system namespace management method. It solves the problem that the extended metadata of existing file systems does not include the application semantics of files, so as to shorten metadata access time and improve the efficiency of application access.
The application-oriented file system namespace management method of the invention comprises the following steps:
(1) Application semantics extraction step: run each application in advance, trace the file accesses of each application's processes, and add to each file a record metadata item that stores the names of the application processes that access the file; then extract the application semantics from all record metadata, where each application semantic describes the relationship between one process and the files it accesses, and save each application semantic to a database in the form of an XML file, where it awaits use;
(2) Namespace cutting step: when this step is invoked, fetch the application semantic corresponding to the process name of the application from the database, cut out the namespace associated with the process according to that application semantic, keep it resident in memory in the form of a DOM tree, and then return;
(3) Step of accessing the cut namespace: run the application for real; when a process of the application accesses a file, the file system redirects the access request it receives to the cut namespace associated with that process; if the redirection succeeds, the access request is completed; otherwise the namespace cutting step is invoked and the access request is sent to the file system again.
The application-oriented file system namespace management method is further characterized as follows:
(1) The application semantics extraction step performs the following sub-steps in order:
(1.1) start each application; whenever a file is accessed, continue with the next sub-step;
(1.2) add the name of the accessing process to the record metadata of the accessed file F; the format of the record metadata is A:B:C:..., where F is the name of the accessed file and A, B, C are the names of the application processes that access this file;
(1.3) once all applications have finished, extract the application semantics from the record metadata of all accessed files; the format is A=F1:F2:F3:..., where A is a process name and F1, F2, F3 are the names of the accessed files;
(1.4) write the extracted application semantic of each application process as an XML file and save it to the database; end;
(2) The namespace cutting step performs the following sub-steps in order:
(2.1) when this step is invoked, obtain the ID parameter of the application process;
(2.2) using the ID parameter of the process, access its data structure and obtain the process name A;
(2.3) read from the database the XML file that contains the application semantic of process name A;
(2.4) parse the XML file into a DOM tree, add to this DOM tree the metadata corresponding to every file name contained in the application semantic of process name A, and keep the DOM tree resident in memory, which yields the cut namespace associated with A; end;
(3) The step of accessing the cut namespace performs the following sub-steps:
(3.1) when process A of the application accesses a file, it sends an access request to the file system; continue with the next sub-step;
(3.2) the file system obtains the ID parameter of process A from the request packet and redirects the access request to the cut namespace associated with process A; continue with the next sub-step;
(3.3) the redirection operation checks whether a DOM tree corresponding to the ID parameter of process A exists in memory; if it exists, go to sub-step (3.5); if not, continue with the next sub-step;
(3.4) invoke the namespace cutting step to obtain the cut namespace associated with process A, then go to sub-step (3.2);
(3.5) complete the file access request within the cut namespace associated with process A, and continue running the application;
(3.6) when the application finishes and exits, remove from memory the cut namespaces associated with all of its processes; end.
By extracting the association between each application process and the files it accesses (that is, the application semantics of the files) and saving each application semantic to a database as an XML file, the invention avoids the repeated extraction that a system restart would otherwise cause. It uses these application semantics to cut the namespace associated with each application process, taking full advantage of the large difference between the namespace of the whole system and the namespace of an application's processes. This shortens the time that application processes spend accessing metadata and improves the efficiency of metadata access, so the method suits current file systems whose namespaces are becoming increasingly complex.
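The point about surviving a restart can be illustrated with a small sketch. The patent does not name a database engine or a schema; the use of SQLite, the table name `semantics`, its columns, and the XML layout below are all assumptions made for illustration.

```python
import os
import sqlite3
import tempfile

# Hypothetical database file, table layout, and XML layout (all assumed).
db_path = os.path.join(tempfile.mkdtemp(), "semantics.db")
xml_pa = '<semantics process="PA"><file name="File1"/></semantics>'

# First run: store the extracted application semantic of process PA.
conn = sqlite3.connect(db_path)
conn.execute(
    "CREATE TABLE IF NOT EXISTS semantics "
    "(process TEXT PRIMARY KEY, xml TEXT NOT NULL)"
)
conn.execute("INSERT OR REPLACE INTO semantics VALUES (?, ?)", ("PA", xml_pa))
conn.commit()
conn.close()

# After a restart: reopen the database and read the stored semantic back,
# without tracing the application again.
conn = sqlite3.connect(db_path)
row = conn.execute(
    "SELECT xml FROM semantics WHERE process = ?", ("PA",)
).fetchone()
conn.close()
restored = row[0]
```

Because the semantic is keyed by process name rather than process ID, it stays valid across runs in which the same program receives a different ID.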
Description of drawings
Fig. 1 is a flow diagram of the invention;
Fig. 2 is a flow diagram of the application semantics extraction step of the invention;
Fig. 3 is a flow diagram of the namespace cutting step of the invention;
Fig. 4 is a flow diagram of the step of accessing the cut namespace of the invention.
Detailed description of the embodiment
Suppose that an application consists of two processes named PA and PB, and that it accesses five files in total: File1, File2, File3, File4, and File5.
Fig. 2 shows the flow of the application semantics extraction step:
(1) start the application; whenever a file is accessed, continue with the next step;
(2) add the name of the accessing process to the record metadata of the accessed file;
(3) once all applications have finished, extract the application semantics from the record metadata of all accessed files; the application semantics are PA=File1:File2:File3 and PB=File4:File5;
(4) write the extracted application semantic of each application process as an XML file and save it to the database;
(5) end.
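The extraction steps above can be sketched as follows. This is an illustration under stated assumptions, not the patented implementation: the access trace is simulated in memory instead of being captured from a real file system, and the XML element and attribute names (`semantics`, `file`, `process`, `name`) are hypothetical, since the patent does not specify the XML layout.

```python
import xml.etree.ElementTree as ET
from collections import defaultdict


def record_access(record_metadata, file_name, process_name):
    """Step (2): add the accessing process name to the file's record metadata."""
    names = record_metadata.setdefault(file_name, [])
    if process_name not in names:
        names.append(process_name)


def extract_semantics(record_metadata):
    """Step (3): invert file -> processes into process -> files."""
    semantics = defaultdict(list)
    for file_name, process_names in record_metadata.items():
        for process_name in process_names:
            semantics[process_name].append(file_name)
    return dict(semantics)


def semantics_to_xml(process_name, file_names):
    """Step (4): serialize one application semantic (element names assumed)."""
    root = ET.Element("semantics", {"process": process_name})
    for file_name in file_names:
        ET.SubElement(root, "file", {"name": file_name})
    return ET.tostring(root, encoding="unicode")


# Simulated trace of the example: PA and PB access five files in total.
trace = [("PA", "File1"), ("PA", "File2"), ("PA", "File3"),
         ("PB", "File4"), ("PB", "File5")]
record_metadata = {}
for process_name, file_name in trace:
    record_access(record_metadata, file_name, process_name)

semantics = extract_semantics(record_metadata)
# Render each semantic in the A=F1:F2:F3 format used in the text.
formatted = {p: p + "=" + ":".join(files) for p, files in semantics.items()}
xml_pa = semantics_to_xml("PA", semantics["PA"])
```

Here `formatted["PA"]` reproduces the string PA=File1:File2:File3 from step (3), and `xml_pa` is the text that step (4) would store in the database.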
Fig. 3 shows the flow of the namespace cutting step:
(1) when this step is invoked, obtain the ID parameter of the application process;
(2) using the ID parameter of the process, access its data structure and obtain the process name PA;
(3) read from the database the XML file that contains the application semantic of process name PA;
(4) parse the XML file into a DOM tree, add to this DOM tree the metadata corresponding to File1, File2, and File3, which are contained in the application semantic of process name PA, and keep the DOM tree resident in memory, which yields the cut namespace associated with PA;
(5) end.
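A minimal sketch of the cutting step follows, using the standard `xml.dom.minidom` parser to build the DOM tree. The process table, the metadata values, the process ID 1001, and the XML layout are hypothetical stand-ins; a real implementation would obtain them from the operating system, the file system, and the database.

```python
from xml.dom import minidom

# Hypothetical stand-ins for the process table and the file system metadata.
PROCESS_NAMES = {1001: "PA"}
FILE_METADATA = {
    "File1": {"size": "120", "type": "text"},
    "File2": {"size": "2048", "type": "image"},
    "File3": {"size": "64", "type": "text"},
}
# Hypothetical database content: process name -> XML text (layout assumed).
SEMANTICS_DB = {
    "PA": '<semantics process="PA"><file name="File1"/>'
          '<file name="File2"/><file name="File3"/></semantics>',
}

# Cut namespaces kept resident in memory, keyed by process ID.
cut_namespaces = {}


def cut_namespace(pid):
    """Build and cache the cut namespace (a DOM tree) for one process."""
    process_name = PROCESS_NAMES[pid]                      # steps (1)-(2)
    dom = minidom.parseString(SEMANTICS_DB[process_name])  # steps (3)-(4)
    for node in dom.getElementsByTagName("file"):
        # Attach the metadata of each listed file to its DOM node.
        metadata = FILE_METADATA[node.getAttribute("name")]
        for key, value in metadata.items():
            node.setAttribute(key, value)
    cut_namespaces[pid] = dom                              # keep in memory
    return dom


dom = cut_namespace(1001)
names = [n.getAttribute("name") for n in dom.getElementsByTagName("file")]
```

The resulting tree holds metadata for only the three files PA uses, so a lookup on behalf of PA never has to traverse the rest of the file system namespace.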
Fig. 4 shows the flow of accessing the cut namespace:
(1) when process PA of the application accesses file File1, it sends an access request to the file system; continue with the next step;
(2) the file system obtains the ID parameter of process PA from the request packet and redirects the access request to the cut namespace associated with process PA; continue with the next step;
(3) the redirection operation checks whether a DOM tree corresponding to the ID parameter of process PA exists in memory; if it exists, go to step (5); if not, continue with the next step;
(4) invoke the namespace cutting step to obtain the cut namespace associated with process PA, then go to step (2);
(5) complete the access request for file File1 within the cut namespace associated with process PA, and continue running the application;
(6) when the application finishes and exits, remove from memory the cut namespace associated with its process PA;
(7) end.
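The access flow above amounts to a cache lookup with a fill on miss and a cleanup on exit. The sketch below is illustrative only: a plain dictionary stands in for the DOM tree, and the process ID 1001, the metadata values, and all function names are hypothetical. The patent does not say what happens when a requested file is absent from the cut namespace; returning `None` here is an assumption.

```python
# Cut namespaces held in memory, keyed by process ID (hypothetical data).
cut_namespaces = {}
# Hypothetical application semantics, as if already loaded from the database.
SEMANTICS = {1001: {"File1": {"size": 120}, "File2": {"size": 2048}}}
cut_calls = []  # records each invocation of the cutting step


def cut_namespace(pid):
    """Stand-in for the namespace cutting step (a dict replaces the DOM tree)."""
    cut_calls.append(pid)
    cut_namespaces[pid] = dict(SEMANTICS[pid])


def access(pid, file_name):
    """Steps (2)-(5): redirect the request to the cut namespace of the process."""
    if pid not in cut_namespaces:      # step (3): is the namespace in memory?
        cut_namespace(pid)             # step (4): build it before retrying
    # Step (5): serve the request from the cut namespace.
    # Assumption: a name missing from the cut namespace yields None.
    return cut_namespaces[pid].get(file_name)


def on_exit(pid):
    """Step (6): drop the cut namespace when the application exits."""
    cut_namespaces.pop(pid, None)


first = access(1001, "File1")    # miss: triggers the cutting step
second = access(1001, "File2")   # hit: served from memory
on_exit(1001)
```

Only the first request pays the cost of the cutting step; later requests from the same process are answered from memory until the application exits.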

Claims (2)

1. An application-oriented file system namespace management method, comprising the following steps:
(1) an application semantics extraction step: run each application in advance, trace the file accesses of each application's processes, and add to each file a record metadata item that stores the names of the application processes that access the file; then extract the application semantics from all record metadata, where each application semantic describes the relationship between one process and the files it accesses, and save each application semantic to a database in the form of an XML file, where it awaits use;
(2) a namespace cutting step, which performs the following sub-steps in order:
(2.1) when this step is invoked, obtain the ID parameter of the application process;
(2.2) using the ID parameter of the process, access its data structure and obtain the process name A;
(2.3) read from the database the XML file that contains the application semantic of process name A;
(2.4) parse the XML file into a DOM tree, add to this DOM tree the metadata corresponding to every file name contained in the application semantic of process name A, and keep the DOM tree resident in memory, which yields the cut namespace associated with A; then return;
(3) a step of accessing the cut namespace, which performs the following sub-steps:
(3.1) when process A of the application accesses a file, it sends an access request to the file system; continue with the next sub-step;
(3.2) the file system obtains the ID parameter of process A from the request packet and redirects the access request to the cut namespace associated with process A; continue with the next sub-step;
(3.3) the redirection operation checks whether a DOM tree corresponding to the ID parameter of process A exists in memory; if it exists, go to sub-step (3.5); if not, continue with the next sub-step;
(3.4) invoke the namespace cutting step to obtain the cut namespace associated with process A, then go to sub-step (3.2);
(3.5) complete the file access request within the cut namespace associated with process A, and continue running the application;
(3.6) when the application finishes and exits, remove from memory the cut namespaces associated with all of its processes; end.
2. The application-oriented file system namespace management method of claim 1, characterized in that:
(1) the application semantics extraction step performs the following sub-steps in order:
(1.1) start each application; whenever a file is accessed, continue with the next sub-step;
(1.2) add the name of the accessing process to the record metadata of the accessed file F; the format of the record metadata is A:B:C:..., where F is the name of the accessed file and A, B, C are the names of the application processes that access this file;
(1.3) once all applications have finished, extract the application semantics from the record metadata of all accessed files; the format is A=F1:F2:F3:..., where A is a process name and F1, F2, F3 are the names of the accessed files;
(1.4) write the extracted application semantic of each application process as an XML file and save it to the database; end.
CN2008102366002A 2008-11-28 2008-11-28 File system name space management method orienting application Expired - Fee Related CN101419626B (en)

Priority Applications (1)

Application Number Priority Date Filing Date Title
CN2008102366002A CN101419626B (en) 2008-11-28 2008-11-28 File system name space management method orienting application

Publications (2)

Publication Number Publication Date
CN101419626A CN101419626A (en) 2009-04-29
CN101419626B CN101419626B (en) 2010-06-09

Family

ID=40630415

Family Applications (1)

Application Number Title Priority Date Filing Date
CN2008102366002A Expired - Fee Related CN101419626B (en) 2008-11-28 2008-11-28 File system name space management method orienting application

Country Status (1)

Country Link
CN (1) CN101419626B (en)

Citations (1)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN101248414A (en) * 2005-04-01 2008-08-20 Microsoft Corporation User data profile namespace

Non-Patent Citations (1)

* Cited by examiner, † Cited by third party
Title
JP特開2004-334479A 2004.11.25

Also Published As

Publication number Publication date
CN101419626A (en) 2009-04-29

Legal Events

Date Code Title Description
C06 Publication
PB01 Publication
C10 Entry into substantive examination
SE01 Entry into force of request for substantive examination
C14 Grant of patent or utility model
GR01 Patent grant
CF01 Termination of patent right due to non-payment of annual fee

Granted publication date: 20100609

Termination date: 20201128
