CN101022396A - Grid data duplicate management system - Google Patents

Publication number
CN101022396A
Authority
CN
China
Prior art keywords
server
copy
unit
data
daily record
Prior art date
Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
Granted
Application number
CNA2007100380725A
Other languages
Chinese (zh)
Other versions
CN100518131C (en
Inventor
黄林鹏
杨欢
Current Assignee (The listed assignees may be inaccurate. Google has not performed a legal analysis and makes no representation or warranty as to the accuracy of the list.)
Shanghai Jiaotong University
Original Assignee
Shanghai Jiaotong University
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
Filing date
Publication date
Application filed by Shanghai Jiaotong University
Priority to CNB2007100380725A (patent CN100518131C)
Publication of CN101022396A
Application granted
Publication of CN100518131C
Legal status: Expired - Fee Related
Anticipated expiration

Landscapes

  • Information Retrieval, Db Structures And Fs Structures Therefor (AREA)

Abstract

A grid data replica management system comprises a processing server, a log repository server, an application analysis server, a metadata server, a replica location server, a replica selection server, a replica maintenance server, data stores, and a display terminal. A management method for the system is also disclosed.

Description

Grid data duplicate management system
Technical field
The present invention relates to a system in the field of telecommunication technology, specifically a grid data replica (duplicate) management system.
Background technology
Grid computing environments store large-scale data. By organizing and managing this data effectively, a grid provides high-quality service to data-intensive applications, and how to manage large-scale data in a grid effectively has become a research focus. Current research on grid data management concentrates on two aspects: improving large-scale data transfer performance, for example through better data transfer protocols, and improving data availability by replicating large-scale datasets. Of the two, data replication is widely used. To improve the availability of data and the performance of data access, several replicas of a large dataset are usually created and stored on different data nodes, which gives rise to the need for a data replica management system. Some existing grid data management systems include replica management, but these systems share several problems. Their replica management is mostly designed at the middleware level without regard to application requirements, so it adapts poorly to dynamically changing application demand. Their design also concentrates on topics such as consistency models and pays little attention to replica placement strategies or to combining data replication with data transfer. Because well-placed replicas improve the efficiency with which applications access data, the prior art needs a data replica management system that organizes replicas on demand and manages them effectively, so that grid-based data-intensive applications receive high-quality service.
A search of the prior-art literature found that Bill Allcock et al., in "Data Management and Transfer in High-Performance Computational Grid Environments", Parallel Computing Journal, 2002, No. 28, pages 749 to 771, discuss the implementation of a replica management system. That system comprises five components: the application, a metadata service, a replica management service, a replica selection service, and an information service. Its function is to let an application identify the best storage location of the dataset it needs. The system has the following deficiencies: (1) it gives the user no means of managing data replicas manually, so the user cannot browse replica storage locations or manually create, delete, or migrate replicas; (2) it cannot manage data replicas automatically on demand, that is, it cannot automatically create, delete, and migrate replicas according to the real-time performance with which applications access them; (3) it cannot store data replicas on heterogeneous data storage nodes; (4) it has no user-friendly graphical client.
Summary of the invention
To address the deficiencies and defects of the prior art, the present invention provides a grid replica management system for managing large-scale data replicas. It applies current business intelligence techniques to grid computing: by analyzing the logs of application access to data replicas, it intelligently predicts the overall demand of applications for those replicas and, on that basis, places replicas in an optimized, on-demand manner. This improves the performance with which applications access data replicas and promotes the application and development of grid computing and data replica management.
The present invention is achieved through the following technical solution. The system comprises a processing server, a log repository server, an application analysis server, a metadata server, a replica location server, a replica selection server, a replica maintenance server, data stores, and a display terminal. The display terminal is connected to the processing server. The processing server is connected to the log repository server, the application analysis server, the metadata server, the replica location server, and the replica selection server. The log repository server is connected to the application analysis server. The application analysis server, metadata server, replica location server, and replica selection server are all connected to the replica maintenance server, which in turn is connected to one or more data stores.
The processing server accepts replica management and data access requests from the display terminal or from external applications and, depending on the request, interacts with the metadata server, the replica location server, and the replica selection server to handle it. The processing server also generates data access log records and processing history records, transmits the generated logs to the log repository server, and issues a log storage command; the log repository server stores the logs on receiving the command. When the log analysis start condition is met, the application analysis server starts a log analysis process, which analyzes and processes the log information to produce an optimized replica placement strategy. The replica maintenance server carries out this strategy and reorganizes the data replicas. The updated replica locations, the derivation process, and related information are all presented to the user through the display terminal. When a user requests data, or when the replica maintenance server carries out a replica placement strategy, the data store accepts the data transfer control command sent by the replica maintenance server and starts the transfer between the data store and the user or between data stores.
The processing server comprises a task unit and a log generation unit. The task unit is connected to the display terminal, the metadata server, the replica location server, the replica selection server, and the log generation unit. The log generation unit is connected to the log repository server and the application analysis server. User input or an application's data access request triggers the task unit to start a task processing thread, which completes the task. The task processing thread then triggers the log generation unit according to the task and its completion status; the log generation unit generates a log and sends a log storage request to the log repository server. Finally the task unit outputs the task completion status to the display terminal.
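The request path through the processing server can be sketched as follows. This is a minimal illustration, not the patented implementation: the class name `ProcessingServer`, the `submit` method, and the use of a plain list as a stand-in for the log repository server are all assumptions made for the example.

```python
import threading
import time


class ProcessingServer:
    """Hypothetical stand-in for the task unit plus log generation unit."""

    def __init__(self, log_sink):
        # A plain list stands in for the log repository server (assumption).
        self._log_sink = log_sink

    def submit(self, name, task):
        """Task unit: run `task` in its own thread and return its outcome."""
        outcome = {}

        def worker():
            try:
                outcome["result"] = task()
                outcome["status"] = "ok"
            except Exception as exc:  # report failures instead of losing them
                outcome["status"] = f"failed: {exc}"
            # Log generation unit: record the task and how it ended.
            self._log_sink.append(
                {"task": name, "status": outcome["status"], "time": time.time()}
            )

        thread = threading.Thread(target=worker)
        thread.start()
        thread.join()  # simplification: wait for the task before replying
        return outcome
```

Joining the thread immediately keeps the sketch short; a real server would more likely hand the outcome back asynchronously.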
The log repository server comprises a log operation unit and a log storage unit. The input of the log operation unit is connected to the processing server and the application analysis server, and its output is connected to the log storage unit. The log storage unit is connected only to the log operation unit. The log operation unit first accepts a log operation request from its input, then analyzes the request, and finally operates the log storage unit to carry it out. The function of the log repository server is to accept, identify, and complete log operation commands from its input and to store the logs physically.
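The accept, analyze, execute sequence of the log operation unit might look like the sketch below. The request format (a dictionary with an `op` key taking the values `store` and `read`) and the class names are assumptions for illustration; the patent does not define a wire format.

```python
from dataclasses import dataclass, field
from typing import Any


@dataclass
class LogStorageUnit:
    """Holds log records; a list stands in for physical storage (assumption)."""
    records: list[dict[str, Any]] = field(default_factory=list)


class LogOperationUnit:
    """Accepts a request, analyzes its type, then drives the storage unit."""

    def __init__(self, storage: LogStorageUnit) -> None:
        self._storage = storage

    def handle(self, request: dict[str, Any]) -> list[dict[str, Any]]:
        op = request.get("op")
        if op == "store":  # e.g. sent by the processing server
            self._storage.records.append(request["record"])
            return []
        if op == "read":  # e.g. sent by the application analysis server
            start = request.get("since", 0)
            return self._storage.records[start:]
        raise ValueError(f"unknown log operation: {op!r}")
```

Because only the operation unit touches the storage unit, the storage backend could be swapped without changing either client.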
The application analysis server comprises a log reading unit and a logical inference unit. The input of the log reading unit is connected to the log repository server and its output to the logical inference unit; the input of the logical inference unit is connected to the log reading unit and its output to the replica maintenance server. The log reading unit reads log information from the log repository server and preprocesses it. The logical inference unit applies techniques such as data analysis and data mining to the preprocessed log information, predicts the demand of applications for access to data replicas, and infers an optimized replica placement scheme. The workflow of the application analysis server is as follows: the log reading unit reads logs from the log repository server, preprocesses them, and passes them to the logical inference unit; the logical inference unit analyzes the logs, generates the optimized replica placement scheme, and outputs it to the replica maintenance server, which carries it out.
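The patent names data analysis and data mining only in general terms and does not fix an algorithm. As one possible illustration, the sketch below uses a simple frequency heuristic: propose a new replica at any site that accessed a file often but holds no replica of it. The function names, the log record fields (`op`, `file`, `site`), and the `min_hits` threshold are all assumptions.

```python
from collections import Counter


def preprocess(raw_logs):
    """Log reading unit: keep only well-formed access records."""
    return [
        (r["file"], r["site"])
        for r in raw_logs
        if r.get("op") == "access" and "file" in r and "site" in r
    ]


def infer_placement(accesses, current, min_hits=3):
    """Inference unit (heuristic stand-in): propose a replica at every site
    that accessed a file at least `min_hits` times but holds no replica yet.

    `current` maps file -> set of sites that already hold a replica.
    """
    hits = Counter(accesses)
    plan = []
    for (file, site), count in sorted(hits.items()):
        if count >= min_hits and site not in current.get(file, set()):
            plan.append(("create", file, site))
    return plan
```

The returned plan is a list of actions that a replica maintenance component could execute; deletion and migration rules could be added in the same style.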
The metadata server comprises a metadata query unit, a metadata update unit, and a metadata storage unit. The metadata query unit is connected to the processing server and the metadata storage unit. The metadata update unit is connected to the processing server, the replica maintenance server, and the metadata storage unit. The metadata storage unit is connected to the metadata query unit and the metadata update unit. The function of the metadata server is to maintain the metadata of the replica management system, that is, to query and update the mapping between the logical name and the physical name of each data replica. To query metadata, the processing server sends a request to the metadata query unit, the metadata query unit operates the metadata storage unit, and the metadata query unit then returns the query result to the processing server. To update metadata, the processing server first sends a request to the metadata update unit; the metadata update unit then sends a request to the metadata storage unit; next, if the operation requires it, the metadata update unit sends a request to the replica maintenance server; finally the metadata update unit returns the result to the processing server.
The replica location server comprises a replica physical location query unit, a replica physical location update unit, and a replica physical location storage unit. The query unit is connected to the processing server and the storage unit. The update unit is connected to the processing server, the replica maintenance server, and the storage unit. The storage unit is connected to the query unit and the update unit. The replica location server queries and updates the mapping between the physical name of a data replica and the physical locations where it is stored. To query a replica's physical location, the processing server sends a request to the query unit, the query unit operates the storage unit, and the query result is returned to the processing server. To update a replica's physical location, the processing server first sends a request to the update unit; the update unit then sends a request to the storage unit; next, if the operation requires it, the update unit sends a request to the replica maintenance server; finally the update unit returns the result to the processing server.
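Taken together, the metadata server and the replica location server form a two-level lookup: logical name to physical name, then physical name to storage locations. The sketch below folds both into one hypothetical `ReplicaCatalog` class for brevity; in the system described they are separate servers, and the example names and URLs are invented for illustration.

```python
class ReplicaCatalog:
    """Hypothetical two-level catalog combining both mappings."""

    def __init__(self):
        self._physical_name = {}  # metadata storage: logical -> physical name
        self._locations = {}      # location storage: physical name -> locations

    def register(self, logical_name, physical_name, location):
        """Update path: record a replica of `logical_name` at `location`."""
        self._physical_name[logical_name] = physical_name
        self._locations.setdefault(physical_name, set()).add(location)

    def unregister(self, physical_name, location):
        """Update path: forget one stored replica, if it was known."""
        self._locations.get(physical_name, set()).discard(location)

    def locate(self, logical_name):
        """Query path: resolve a logical name to its sorted locations."""
        physical = self._physical_name.get(logical_name)
        if physical is None:
            return []
        return sorted(self._locations.get(physical, set()))
```

Keeping the two mappings separate means a replica can be added or removed at a location without touching the logical name that applications use.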
The replica selection server is connected to the processing server and the replica maintenance server. It analyzes the performance with which an application can access multiple replicas held at different physical locations and ranks the replicas by that performance. The processing server sends the physical locations of the replicas of one dataset to the replica selection server. The replica selection server, through its connection to the replica maintenance server, predicts the performance of the application's access to each replica, ranks the replicas from best to worst, and returns the ranking.
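The patent does not say how access performance is predicted. One common first approximation, assumed here purely for illustration, is startup latency plus size divided by bandwidth; the function names and the shape of the `links` argument are hypothetical.

```python
def estimated_seconds(size_mb, latency_s, bandwidth_mb_s):
    """Predicted time to fetch a replica: startup latency plus transfer time."""
    return latency_s + size_mb / bandwidth_mb_s


def rank_replicas(size_mb, links):
    """Return locations ordered from best to worst predicted access time.

    `links` maps location -> (latency_s, bandwidth_mb_s), as measured or
    estimated from the requesting application's point of view.
    """
    return sorted(links, key=lambda loc: estimated_seconds(size_mb, *links[loc]))
```

Under this model the best replica depends on the dataset size: a high-bandwidth but distant store wins for large transfers, while a nearby low-latency store wins for small ones.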
The replica maintenance server comprises a data transmission unit and a data fusion unit. The data transmission unit is connected to the processing server and the replica maintenance server and physically carries out data transfer and data deletion. The data fusion unit is connected to the processing server and the replica maintenance server and merges data held on heterogeneous storage media. Together the two units perform replica maintenance, namely creating, deleting, and migrating data replicas.
The data store is the physical storage object for data, namely a GridFTP server, an FTP server, or a database. The system contains one or more data stores, each connected to the replica maintenance server. During a replica maintenance task, the data stores taking part in the task are connected to one another.
The display terminal comprises a management unit and a display unit, both connected to the processing server. The display terminal is presented as a graphical user interface. Through this interface the user accesses the replica management system and performs operations such as creating replicas, deleting replicas, querying replicas, and browsing replica locations.
Compared with the prior art, the present invention manages data replicas in a grid environment and offers both manual replica management and automatic on-demand replica management. Its user interface is visual and intuitive: under the graphical interface the user can view the physical location and derivation process of each replica. It also offers good compatibility, strong extensibility, and low maintenance cost. The system achieves on-demand management of data replicas in a grid environment; through optimized replica placement strategies inferred in real time, it supports high-performance transfer and high availability of data and improves the performance of application data access.
In addition, the invention proposes a data replica management system oriented to grid environments. It adopts a distributed architecture, is built in an object-oriented and service-oriented manner, and is based on international standards such as the Simple Object Access Protocol (SOAP) and the Extensible Markup Language (XML). The system supports remote management and processing. The processing server, log repository server, application analysis server, metadata server, replica location server, replica selection server, replica maintenance server, data stores, and display terminal may all reside on one computer or be distributed across multiple nodes and devices in the grid environment. The system is easy to integrate into other grid applications in a plug-and-play manner, can incorporate existing infrastructure, reduces the cost of building a grid computing application environment, and accommodates future growth.
The beneficial effects of the present invention are as follows:
1) Intelligence: by analyzing existing data access logs, the invention intelligently predicts the demand of applications for access to data replicas and from this derives an optimized replica placement scheme;
2) Efficiency: the invention adopts a replica placement scheme optimized in real time and organizes replicas well, which improves the efficiency of application data access;
3) Flexibility and customizability: the invention provides both manual replica management and intelligent automatic replica management, so the user can choose freely between them, and the replica management strategy can be customized by the user;
4) Extensibility: the invention is based on a distributed system, so new data stores, metadata servers, replica location servers, and the like are easy to add, which gives the system good extensibility;
5) Ease of use and management: the user operates and manages the system through the display terminal, which is based on a friendly, simple graphical user interface, so the system is easy to use and to manage.
Description of drawings
Fig. 1 is the system architecture diagram of the present invention;
Fig. 2 is the connection diagram of the subsystems of the present invention;
Fig. 3 is the flow chart of a user querying data replicas in the present invention;
Fig. 4 is the flow chart of a user manually managing data replicas in the present invention;
Fig. 5 is the flow chart of an application accessing data replicas in the present invention;
Fig. 6 is the flow chart of automatic on-demand management of data replicas in the present invention.
Embodiment
An embodiment of the invention is described in detail below with reference to the accompanying drawings. The embodiment is implemented on the basis of the technical solution of the invention and gives a detailed implementation and process, but the scope of protection of the invention is not limited to the following embodiment.
As shown in Fig. 1, the embodiment comprises: processing server 1, log repository server 2, application analysis server 3, metadata server 4, replica location server 5, replica selection server 6, replica maintenance server 7, data store 8, and display terminal 9.
Fig. 2 shows the connections between the subsystems of the invention. The user connects to processing server 1 through display terminal 9, and applications connect to processing server 1 through an interface. Processing server 1 is connected to log repository server 2, application analysis server 3, metadata server 4, replica location server 5, and replica selection server 6. Log repository server 2 is connected to application analysis server 3. Application analysis server 3, metadata server 4, replica location server 5, and replica selection server 6 are all connected to replica maintenance server 7. Replica maintenance server 7 is connected to one or more data stores 8.
Processing server 1 consists of a task unit and a log generation unit. The task unit is connected to display terminal 9, metadata server 4, replica location server 5, replica selection server 6, and the log generation unit. The log generation unit is connected to log repository server 2 and application analysis server 3. In response to user input transmitted from display terminal 9, or to an application access request, the task unit creates a task processing thread, which completes the task. The task processing thread then triggers the log generation unit according to the task and its completion status; the log generation unit generates a log and sends a log storage request to log repository server 2. Finally the task unit outputs the task completion status to display terminal 9.
Log repository server 2 consists of a log operation unit and a log storage unit. The input of the log operation unit is connected to processing server 1 and application analysis server 3, and its output is connected to the log storage unit. The log storage unit is connected only to the log operation unit. The log operation unit accepts log operation requests from processing server 1 and application analysis server 3, analyzes each request, and then operates the log storage unit to carry it out.
Application analysis server 3 consists of a log reading unit and a logical inference unit. The input of the log reading unit is connected to log repository server 2 and its output to the logical inference unit; the input of the logical inference unit is connected to the log reading unit and its output to replica maintenance server 7. The log reading unit reads logs from log repository server 2 and preprocesses them. The logical inference unit analyzes the logs, generates an optimized replica placement scheme, and passes the scheme to replica maintenance server 7 for execution.
Metadata server 4 consists of a metadata query unit, a metadata update unit, and a metadata storage unit. The metadata query unit is connected to processing server 1 and the metadata storage unit. The metadata update unit is connected to processing server 1, replica maintenance server 7, and the metadata storage unit. The metadata storage unit is connected to the metadata query unit and the metadata update unit. The metadata query unit accepts metadata query requests from processing server 1, queries the metadata storage unit, and returns the metadata obtained to processing server 1. The metadata update unit accepts metadata update requests from processing server 1, updates the metadata storage unit, sends a message to replica maintenance server 7, and finally returns the result to processing server 1.
Replica location server 5 consists of a replica physical location query unit, a replica physical location update unit, and a replica physical location storage unit. The query unit is connected to processing server 1 and the storage unit. The update unit is connected to processing server 1, replica maintenance server 7, and the storage unit. The storage unit is connected to the query unit and the update unit. The query unit accepts replica location requests from processing server 1, queries the storage unit for the physical storage location of the data replica, and returns the result to processing server 1. The update unit accepts replica physical location update requests from processing server 1, updates the storage unit, sends a message to replica maintenance server 7, and finally returns the result to processing server 1.
Replica maintenance server 7 consists of a data transmission unit and a data fusion unit. The data transmission unit is connected to processing server 1 and replica maintenance server 7 and physically carries out data transfer and data deletion; it is connected to data stores 8 and transfers data from some data stores to others. The data fusion unit is connected to the data stores and performs conversion and fusion of the same data replica across multiple heterogeneous data stores.
Display terminal 9 comprises a management unit and a display unit, both connected to the processing server. The display terminal is presented as a graphical user interface. Through this interface the user accesses the replica management system and performs operations such as creating replicas, deleting replicas, querying replicas, and browsing replica locations.
As shown in Fig. 3, the user browses and queries replicas as follows. The user enters replica query conditions through the display unit of display terminal 9, and the display unit sends the request to the task unit of processing server 1. The task unit determines the request type and sends a replica query command to the metadata query unit of metadata server 4, which returns the replicas that satisfy the conditions. The task unit then sends a command to the replica physical location query unit of replica location server 5 to look up the locations of those replicas. Once the location information is returned, the task unit triggers the log generation unit of processing server 1, which generates a log of the query and sends a log storage command to the log operation unit of log repository server 2; the log operation unit writes the log to the log storage unit. Finally the task unit returns the replicas and their locations to the display unit of display terminal 9, which shows the replica information on the graphical interface for the user to browse.
As shown in Fig. 4, the user manages replicas manually as follows. The user issues a replica management instruction, namely to create or delete a data replica, through the management unit of display terminal 9. The task unit of processing server 1 accepts the request, obtains the physical location of the replica from the replica physical location query unit of replica location server 5, and then has the data transmission unit of replica maintenance server 7 update the data at that physical location on data store 8. When the operation completes, the task unit triggers the log generation unit of processing server 1, which generates a log of the operation and sends a log storage command to the log operation unit of log repository server 2; the log operation unit writes the log to the log storage unit. The task unit then returns the result to display terminal 9, whose display unit reports the outcome of the operation to the user on the graphical interface.
As shown in Fig. 5, an application accesses a data replica as follows. The application sends a data request to the task unit of processing server 1. The task unit queries the metadata query unit of metadata server 4 to obtain the physical file name of the replica, then uses that name to look up the replica's physical locations through the replica physical location query unit of replica location server 5. When the replica exists at several physical locations, the task unit obtains the location with the highest access efficiency from replica selection server 6. The task unit then logs the operation and returns the replica data to the application.
As shown in Fig. 6, automatic on-demand replica management works as follows. The task unit of processing server 1 first presets the analysis start condition of application analysis server 3, for example starting once every fixed interval or once whenever the number of log records grows by a set amount. When the start condition is met, the log reading unit of application analysis server 3 reads the log records from log repository server 2 and preprocesses them. The processed logs pass to the logical inference unit, which starts a data analysis instance and infers an optimized replica placement strategy. The strategy is handed to replica maintenance server 7 for execution: its data transmission unit sends data transfer commands to data stores 8, and the actual transfer begins between the data stores. When the transfer finishes, replica maintenance server 7 notifies processing server 1, whose task unit tells the replica physical location update unit of replica location server 5 to update the replica location mapping and records an operation log.
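The two example start conditions, a fixed time interval or a set growth in log records, can be sketched as a small trigger object. The class name `AnalysisTrigger`, its parameters, and the choice to start when either condition holds are assumptions for illustration; the embodiment gives them only as examples.

```python
class AnalysisTrigger:
    """Hypothetical start condition for the log analysis process."""

    def __init__(self, interval_s, record_step):
        self.interval_s = interval_s    # start at least this often (seconds)
        self.record_step = record_step  # or after this many new log records
        self._last_time = 0.0
        self._last_count = 0

    def should_start(self, now, log_count):
        """Return True when either condition is met, then reset both baselines."""
        due = (
            now - self._last_time >= self.interval_s
            or log_count - self._last_count >= self.record_step
        )
        if due:
            self._last_time, self._last_count = now, log_count
        return due
```

The caller supplies the current time and log count, which keeps the trigger easy to test without a real clock or log repository.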

Claims (10)

1. A grid data duplicate management system, comprising: a processing server (1), a log repository server (2), an application analysis server (3), a metadata server (4), a copy location server (5), a copy selection server (6), a copy maintenance server (7), a data storage (8), and a display terminal (9), characterized in that the display terminal (9) is connected to the processing server (1); the processing server (1) is simultaneously connected to the log repository server (2), the application analysis server (3), the metadata server (4), the copy location server (5), and the copy selection server (6); the log repository server (2) is connected to the application analysis server (3); the application analysis server (3), the metadata server (4), the copy location server (5), and the copy selection server (6) are all connected to the copy maintenance server (7); and the copy maintenance server (7) is connected to one or more data storages, wherein: the processing server (1) accepts data copy management and data access requests from the display terminal (9) or from external applications and, depending on the request, interacts with the metadata server (4), the copy location server (5), and the copy selection server (6) to process it; the processing server (1) also generates data access log records and processing history records, transmits the generated logs to the log repository server (2), and sends a log storage command; the log repository server (2) stores the logs after receiving the command; when the start condition for log analysis is satisfied, the application analysis server (3) starts the log analysis process and, by analyzing and processing the log information, produces an optimized copy placement strategy; the copy maintenance server (7) implements this strategy and reorganizes the data copies; the updated copy location information and the derivation process are both presented to the user through the display terminal (9); when a user requests data, or when the copy maintenance server (7) implements a copy placement strategy, the data storage (8) accepts the data transmission control command sent by the copy maintenance server (7) and starts data transmission between the data storage (8) and the user, or between one data storage (8) and another.
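The logging path in claim 1 can be illustrated with a minimal Python sketch. It is not the patented implementation: the class names are hypothetical stand-ins for the processing server (1) and the log repository server (2), and the start condition for log analysis, which the claim leaves open, is assumed here to be a simple record-count threshold.

```python
from dataclasses import dataclass, field


@dataclass
class LogRepository:
    """Hypothetical stand-in for the log repository server (2)."""
    records: list = field(default_factory=list)

    def store(self, record: dict) -> None:
        # Corresponds to executing a log storage command.
        self.records.append(record)


@dataclass
class ProcessingServer:
    """Hypothetical stand-in for the processing server (1)."""
    log_repo: LogRepository
    analysis_threshold: int = 3  # assumed start condition, not from the claim

    def handle(self, user: str, logical_name: str) -> bool:
        # Record the access, then report whether log analysis should start.
        self.log_repo.store({"user": user, "file": logical_name})
        return len(self.log_repo.records) >= self.analysis_threshold


repo = LogRepository()
server = ProcessingServer(repo)
triggers = [server.handle("alice", "dataset.dat") for _ in range(3)]
print(triggers)  # [False, False, True]
```

In this sketch the third request is the one that satisfies the assumed start condition; a real deployment could equally use a timer or a storage-size limit.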
2. The grid data duplicate management system according to claim 1, characterized in that the processing server (1) comprises a task unit and a log generation unit; the task unit is connected to the display terminal (9), the metadata server (4), the copy location server (5), the copy selection server (6), and the log generation unit; the log generation unit is connected to the log repository server (2) and the application analysis server (3); wherein the task unit creates a task processing thread according to the user input or application access request transmitted from the display terminal (9) and completes the task; the task processing thread then triggers the log generation unit according to the task and its completion status, and the log generation unit generates a log and sends a log storage request to the log repository server (2); finally, the task unit outputs the task completion status to the display terminal (9).
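The thread-per-task behaviour of claim 2 might look roughly like the following Python sketch. The `TaskUnit` class, its method names, and the use of a plain list in place of the log repository server (2) are all assumptions made for illustration; the claim does not prescribe any of them.

```python
import threading


class TaskUnit:
    """Hypothetical task unit: one processing thread per request."""

    def __init__(self, log_sink):
        self.log_sink = log_sink  # list standing in for the log repository server (2)
        self._lock = threading.Lock()

    def _run(self, request, handler, result):
        try:
            result["output"] = handler(request)
            result["status"] = "done"
        except Exception as exc:
            result["status"] = f"failed: {exc}"
        # Log generation step: record the task and its completion status.
        with self._lock:
            self.log_sink.append({"request": request, "status": result["status"]})

    def submit(self, request, handler):
        result = {}
        worker = threading.Thread(target=self._run, args=(request, handler, result))
        worker.start()
        worker.join()  # wait so the status can be returned to the display terminal
        return result


logs = []
unit = TaskUnit(logs)
ok = unit.submit("query copy:dataset.dat", str.upper)
bad = unit.submit(None, str.upper)  # handler raises, status records the failure
```

Both the successful and the failed request leave a log entry, which matches the claim's wording that the log is generated "according to the task and its completion status".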
3. The grid data duplicate management system according to claim 1, characterized in that the log repository server (2) comprises a log operation unit and a log storage unit; the input of the log operation unit is connected to the processing server (1) and the application analysis server (3), and its output is connected to the log storage unit; the log storage unit is connected only to the log operation unit; wherein the log operation unit accepts log operation requests and analysis operation requests from the processing server (1) and the application analysis server (3), and finally operates the log storage unit to carry out the requests.
4. The grid data duplicate management system according to claim 1, characterized in that the application analysis server (3) comprises a log reading unit and a logical inference unit; the input of the log reading unit is connected to the log repository server (2), and its output is connected to the logical inference unit; the input of the logical inference unit is connected to the log reading unit, and its output is connected to the copy maintenance server (7); wherein the log reading unit reads logs from the log repository server (2) and preprocesses them, and the logical inference unit analyzes the logs, generates an optimized copy placement scheme, and transmits the generated scheme to the copy maintenance server (7) for implementation.
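Claim 4 does not say how the placement scheme is derived from the logs. As one possible illustration only, the Python sketch below uses a simple frequency heuristic: if a site has accessed a file at least `min_accesses` times and holds no copy of it, a copy is proposed there. The function name, the record fields `file` and `site`, and the threshold are assumptions, not part of the patent.

```python
from collections import Counter


def propose_placements(log_records, current_locations, min_accesses=2):
    """Return (file, site) pairs where a new copy is proposed.

    Assumed heuristic: place a copy at any site that accessed the file
    at least `min_accesses` times and does not already hold one.
    """
    counts = Counter((r["file"], r["site"]) for r in log_records)
    plan = []
    for (name, site), n in sorted(counts.items()):
        if n >= min_accesses and site not in current_locations.get(name, set()):
            plan.append((name, site))
    return plan


access_log = (
    [{"file": "a.dat", "site": "siteB"}] * 2
    + [{"file": "a.dat", "site": "siteA"}] * 3
    + [{"file": "b.dat", "site": "siteB"}]
)
existing = {"a.dat": {"siteA"}}
plan = propose_placements(access_log, existing)
print(plan)  # [('a.dat', 'siteB')]
```

The resulting plan is what would be handed to the copy maintenance server (7) for implementation.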
5. The grid data duplicate management system according to claim 1, characterized in that the metadata server (4) comprises a metadata query unit, a metadata update unit, and a metadata storage unit; the metadata query unit is connected to the processing server (1) and the metadata storage unit; the metadata update unit is connected to the processing server (1), the copy maintenance server (7), and the metadata storage unit; the metadata storage unit is connected to the metadata query unit and the metadata update unit; wherein the metadata query unit accepts metadata query requests from the processing server (1), queries the metadata storage unit to obtain the metadata information, and returns it to the processing server (1); the metadata update unit accepts metadata update requests from the processing server (1), updates the metadata storage unit, sends a message to the copy maintenance server (7), and finally returns the result to the processing server (1).
6. The grid data duplicate management system according to claim 1, characterized in that the copy location server (5) comprises a copy physical location query unit, a copy physical location update unit, and a copy physical location storage unit; the copy physical location query unit is connected to the processing server (1) and the copy physical location storage unit; the copy physical location update unit is connected to the processing server (1), the copy maintenance server (7), and the copy physical location storage unit; the copy physical location storage unit is connected to the copy physical location query unit and the copy physical location update unit; wherein the copy physical location query unit accepts copy location requests from the processing server (1), queries the copy physical location storage unit to obtain the physical storage locations of the data copy, and returns the query result to the processing server (1); the copy physical location update unit accepts copy physical location update requests from the processing server (1), updates the copy physical location storage unit, sends a message to the copy maintenance server (7), and finally returns the result to the processing server (1).
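The query and update paths of claim 6 can be pictured as a small catalog that maps a logical name to a set of physical locations. The Python sketch below is only an illustration under assumptions: the class name, the callback used to notify the copy maintenance server (7), and the example URLs are all hypothetical.

```python
class CopyLocationCatalog:
    """Hypothetical stand-in for the copy location server (5)."""

    def __init__(self, notify_maintenance):
        self._locations = {}               # logical name -> set of physical URLs
        self._notify = notify_maintenance  # callback standing in for server (7)

    def query(self, logical_name):
        # Query unit: return all known physical locations of the copy.
        return sorted(self._locations.get(logical_name, set()))

    def update(self, logical_name, physical_url, remove=False):
        # Update unit: change the store, notify maintenance, return the result.
        urls = self._locations.setdefault(logical_name, set())
        if remove:
            urls.discard(physical_url)
        else:
            urls.add(physical_url)
        self._notify((logical_name, physical_url, "removed" if remove else "added"))
        return True


messages = []
catalog = CopyLocationCatalog(messages.append)
catalog.update("dataset.dat", "gsiftp://siteA.example.org/data/dataset.dat")
catalog.update("dataset.dat", "ftp://siteB.example.org/pub/dataset.dat")
catalog.update("dataset.dat", "ftp://siteB.example.org/pub/dataset.dat", remove=True)
found = catalog.query("dataset.dat")
```

The metadata server (4) of claim 5 follows the same query/update/notify pattern, so a similar structure would apply there.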
7. The grid data duplicate management system according to claim 1, characterized in that the copy maintenance server (7) comprises a data transmission unit and a data fusion unit; the data transmission unit is connected to the processing server (1) and the copy maintenance server (7) and is used to physically carry out data transmission and data deletion; wherein the data transmission unit is connected to the data storage (8) and transfers data from some storages to other storages; the data fusion unit is connected to the data storage (8) and performs conversion and fusion of the same data copy across multiple heterogeneous data storages.
8. The grid data duplicate management system according to claim 1, characterized in that the copy selection server (6) is used to analyze the access performance of an application with respect to multiple data copies located at different physical locations and to rank the copies by access performance; the processing server (1) sends the multiple physical locations of the same data copy to the copy selection server (6); the copy selection server (6) is connected to the copy maintenance server (7), predicts the performance of the application accessing each data copy, ranks the copies from best to worst predicted performance, and finally returns the ranking result.
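Claim 8 leaves the performance prediction model unspecified. The Python sketch below assumes one simple model, predicted access time = latency + size / bandwidth, purely to show what ranking copies by predicted performance could look like. The function name, the field names, and the numbers are illustrative assumptions.

```python
def rank_copies(size_mb, candidates):
    """Order candidate copies from best to worst predicted access time.

    Assumed model (not from the patent): latency plus transfer time.
    """
    def predicted_seconds(c):
        return c["latency_s"] + size_mb / c["bandwidth_mb_s"]

    return sorted(candidates, key=predicted_seconds)


candidates = [
    {"url": "siteA", "latency_s": 0.05, "bandwidth_mb_s": 10.0},
    {"url": "siteB", "latency_s": 0.20, "bandwidth_mb_s": 50.0},
]
large = [c["url"] for c in rank_copies(100, candidates)]
small = [c["url"] for c in rank_copies(1, candidates)]
print(large, small)  # ['siteB', 'siteA'] ['siteA', 'siteB']
```

Under this assumed model the best copy depends on the file size: the high-bandwidth site wins for the 100 MB file, while the low-latency site wins for the 1 MB file.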
9. The grid data duplicate management system according to claim 1, characterized in that the data storage (8) refers to the physical storage objects of the data, namely GridFTP servers, FTP servers, and databases; one or more data storages exist in the system, and each data storage is connected to the copy maintenance server (7); when a copy maintenance task is carried out, the data storages participating in the copy maintenance are interconnected.
10. The grid data duplicate management system according to claim 1, characterized in that the display terminal (9) comprises a management unit and a display unit, both of which are connected to the processing server (1); the display terminal (9) is presented as a graphical user interface, through which the user accesses the duplicate management system and performs the operations of creating copies, deleting copies, querying copies, and browsing copy locations.
CNB2007100380725A 2007-03-15 2007-03-15 Grid data duplicate management system Expired - Fee Related CN100518131C (en)

Priority Applications (1)

Application Number Priority Date Filing Date Title
CNB2007100380725A CN100518131C (en) 2007-03-15 2007-03-15 Grid data duplicate management system

Applications Claiming Priority (1)

Application Number Priority Date Filing Date Title
CNB2007100380725A CN100518131C (en) 2007-03-15 2007-03-15 Grid data duplicate management system

Publications (2)

Publication Number Publication Date
CN101022396A true CN101022396A (en) 2007-08-22
CN100518131C CN100518131C (en) 2009-07-22

Family

ID=38710052

Family Applications (1)

Application Number Title Priority Date Filing Date
CNB2007100380725A Expired - Fee Related CN100518131C (en) 2007-03-15 2007-03-15 Grid data duplicate management system

Country Status (1)

Country Link
CN (1) CN100518131C (en)

Cited By (17)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN101188521B (en) * 2007-12-05 2010-07-14 北京金山软件有限公司 A method for digging user behavior data and website server
CN101841556A (en) * 2010-02-23 2010-09-22 中国科学院计算技术研究所 Method and system for placing resources replication in CDN-P2P (Content Distribution Network-Peer-to-Peer) network
CN101340458B (en) * 2008-07-09 2011-03-16 南京邮电大学 Grid data copy generation method based on time and space limitation
CN101751309B (en) * 2009-12-28 2011-06-29 北京理工大学 Optimized transcript distributing method in data grid
CN102110139A (en) * 2011-01-27 2011-06-29 浪潮通信信息系统有限公司 Analytic algorithm for geographic grid in telecommunication field
CN102480513A (en) * 2010-11-29 2012-05-30 国际商业机器公司 Apparatus for transmitting update content with assistance in social network and method thereof
CN102497394A (en) * 2011-11-28 2012-06-13 中国科学院研究生院 Duplicate file placement method in content distribution network based on optimized model
CN102801772A (en) * 2012-03-07 2012-11-28 武汉理工大学 DCell network-oriented energy-saving copy placement method for cloud computing environment
CN101599810B (en) * 2008-06-06 2013-06-05 博通集成电路(上海)有限公司 Error concealing device and error concealing method
CN102025758B (en) * 2009-09-18 2014-06-04 华为数字技术(成都)有限公司 Method, device and system for recovering data copy in distributed system
WO2014094502A1 (en) * 2012-12-20 2014-06-26 国家电网公司 Parallel real-time database node locating system for large power grid
CN105574205A (en) * 2016-01-18 2016-05-11 国家电网公司 Dynamic log analyzing system for distributed computing environment
CN107105050A (en) * 2017-05-11 2017-08-29 北京奇艺世纪科技有限公司 A kind of storage of business object, method for down loading and system
CN110192190A (en) * 2017-01-18 2019-08-30 微软技术许可有限责任公司 Divide storage
CN110740168A (en) * 2019-09-24 2020-01-31 安徽大学 Self-adaptive method for multi-tenant server in cloud
US11675666B2 (en) 2017-01-18 2023-06-13 Microsoft Technology Licensing, Llc Including metadata in data resources
US12019684B2 (en) 2017-01-18 2024-06-25 Microsoft Technology Licensing, Llc Application programming interface arranged to interface with a plurality of data sources

Families Citing this family (1)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN101232540B (en) * 2008-02-21 2012-04-04 中兴通讯股份有限公司 Method and system for interacting information among systems

Cited By (25)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN101188521B (en) * 2007-12-05 2010-07-14 北京金山软件有限公司 A method for digging user behavior data and website server
CN101599810B (en) * 2008-06-06 2013-06-05 博通集成电路(上海)有限公司 Error concealing device and error concealing method
CN101340458B (en) * 2008-07-09 2011-03-16 南京邮电大学 Grid data copy generation method based on time and space limitation
CN102025758B (en) * 2009-09-18 2014-06-04 华为数字技术(成都)有限公司 Method, device and system for recovering data copy in distributed system
CN101751309B (en) * 2009-12-28 2011-06-29 北京理工大学 Optimized transcript distributing method in data grid
CN101841556A (en) * 2010-02-23 2010-09-22 中国科学院计算技术研究所 Method and system for placing resources replication in CDN-P2P (Content Distribution Network-Peer-to-Peer) network
CN101841556B (en) * 2010-02-23 2013-01-30 中国科学院计算技术研究所 Method and system for placing resources replication in CDN-P2P (Content Distribution Network-Peer-to-Peer) network
CN102480513A (en) * 2010-11-29 2012-05-30 国际商业机器公司 Apparatus for transmitting update content with assistance in social network and method thereof
CN102480513B (en) * 2010-11-29 2014-09-10 国际商业机器公司 Apparatus for transmitting update content with assistance in social network and method thereof
CN102110139A (en) * 2011-01-27 2011-06-29 浪潮通信信息系统有限公司 Analytic algorithm for geographic grid in telecommunication field
CN102110139B (en) * 2011-01-27 2013-09-25 浪潮通信信息系统有限公司 Analytic algorithm for geographic grid in telecommunication field
CN102497394A (en) * 2011-11-28 2012-06-13 中国科学院研究生院 Duplicate file placement method in content distribution network based on optimized model
CN102497394B (en) * 2011-11-28 2014-01-15 中国科学院研究生院 Duplicate file placement method in content distribution network based on optimized model
CN102801772A (en) * 2012-03-07 2012-11-28 武汉理工大学 DCell network-oriented energy-saving copy placement method for cloud computing environment
CN102801772B (en) * 2012-03-07 2015-05-27 武汉理工大学 DCell network-oriented energy-saving copy placement method for cloud computing environment
WO2014094502A1 (en) * 2012-12-20 2014-06-26 国家电网公司 Parallel real-time database node locating system for large power grid
CN105574205B (en) * 2016-01-18 2019-03-19 国家电网公司 The log dynamic analysis system of distributed computing environment
CN105574205A (en) * 2016-01-18 2016-05-11 国家电网公司 Dynamic log analyzing system for distributed computing environment
CN110192190A (en) * 2017-01-18 2019-08-30 微软技术许可有限责任公司 Divide storage
US11675666B2 (en) 2017-01-18 2023-06-13 Microsoft Technology Licensing, Llc Including metadata in data resources
US12019684B2 (en) 2017-01-18 2024-06-25 Microsoft Technology Licensing, Llc Application programming interface arranged to interface with a plurality of data sources
CN107105050A (en) * 2017-05-11 2017-08-29 北京奇艺世纪科技有限公司 A kind of storage of business object, method for down loading and system
CN107105050B (en) * 2017-05-11 2020-01-31 北京奇艺世纪科技有限公司 Storage and downloading method and system for service objects
CN110740168A (en) * 2019-09-24 2020-01-31 安徽大学 Self-adaptive method for multi-tenant server in cloud
CN110740168B (en) * 2019-09-24 2022-06-03 安徽大学 Self-adaptive method for multi-tenant server in cloud

Also Published As

Publication number Publication date
CN100518131C (en) 2009-07-22

Similar Documents

Publication Publication Date Title
CN100518131C (en) Grid data duplicate management system
CN102855239B (en) A kind of distributed geographical file system
CN100547583C (en) Database automatically and the method that dynamically provides
RU2507567C2 (en) Multiuser network collaboration
CN101866305B (en) Continuous data protection method and system supporting data inquiry and quick recovery
US6182111B1 (en) Method and system for managing distributed data
CN104573068A (en) Information processing method based on megadata
AU2004267742B2 (en) Automatic and dynamic provisioning of databases
EP1932322B1 (en) System and method to maintain coherence of cache contents in a multi-tier software system aimed at interfacing large databases
CN102779185B (en) High-availability distribution type full-text index method
CN105243155A (en) Big data extracting and exchanging system
CN103605698A (en) Cloud database system used for distributed heterogeneous data resource integration
JP2012098934A (en) Document management system, method for controlling document management system and program
CN102999584A (en) Electric GIS (Gas Insulated Switchgear) cross-platform spatial data service method and system
CN114647716B (en) System suitable for generalized data warehouse
CN103581332A (en) HDFS framework and pressure decomposition method for NameNodes in HDFS framework
CN113760453B (en) Container mirror image distribution system and container mirror image pushing, pulling and deleting method
CN102546674A (en) Directory tree caching system and method based on network storage device
CN111459900B (en) Big data life cycle setting method, device, storage medium and server
Ratner et al. Peer replication with selective control
CN105407044A (en) Method for implementing cloud storage gateway system based on network file system (NFS)
CN114265814A (en) Data lake file system based on object storage
Ye Research on the key technology of big data service in university library
CN113312345A (en) Kubernetes and Ceph combined remote sensing data storage system, storage method and retrieval method
Jolfaei et al. Improvement of job scheduling and tow level data replication strategies in data grid

Legal Events

Date Code Title Description
C06 Publication
PB01 Publication
C10 Entry into substantive examination
SE01 Entry into force of request for substantive examination
C14 Grant of patent or utility model
GR01 Patent grant
C17 Cessation of patent right
CF01 Termination of patent right due to non-payment of annual fee

Granted publication date: 20090722

Termination date: 20120315