CN1258921C

CN1258921C - Distributive video interactive system and its data recording and accessing method

Info

Publication number: CN1258921C
Application number: CN 02136350
Authority: CN
Inventors: 夏建洲; 刘湘宇; 王�华; 王怿忻; 张建强; 李加周; 李喜欣; 温央央
Original assignee: ZTE Corp
Current assignee: ZTE Corp
Priority date: 2002-07-30
Filing date: 2002-07-30
Publication date: 2006-06-07
Anticipated expiration: 2022-07-30
Also published as: CN1472963A

Abstract

The present invention discloses a distributed video requesting system and a method for achieving data storage and access thereof. The storage method is characterized in that the method for data storage comprises the steps that data files are divided into data blocks according to a set mode; the data blocks in the set number are calculated to form check data blocks; the data blocks and the check data blocks are stored by turns to a plurality of set buffer servers in a set mode; index information stored by corresponding data blocks is generated in each buffer server. The technical scheme of the present invention achieves the effect of improving storage efficiency and saves much storage space; because the data is stored in a distributed mode, namely that data files are distributed among the buffer servers, the safety of the data is enhanced. Besides, due to the adoption of an optimized data redundancy check method, the reliability of data distribution is enhanced.

Description

The method of distributed video on demand system and realization storage and visit

Technical field

Invention relates to the distributed video on demand system, relates in particular to the storage and the visit of data.

Background technology

Storage is a kind of key technology that relates in the video on-demand system.In the distributed video on demand system, data are stored very special requirement.Because the demand to the data memory space in the video on-demand system is very big, therefore, in the distributed video on demand system, how to improve operating factor of memory space and just becomes a very difficult problem.

In the distributed video on demand system of routine, (Di remembers during by the field as adopting United States Patent (USP) 56491%.Tora, the husky ment sys of mana turns round .having a cache server a.d me and hangs fer than the pseudo-e of note) the middle method of describing, described distributed method is based on the division on a kind of geographic area, in order to allow all video servers share certain or some data flow that need cushion, just must copy data to all buffer servers, this means that there are a plurality of copies in data in whole distributed video on demand system, therefore data space take very greatly, the data space utilance of whole distributed video on demand system is very low.Because the complete copy of data exists, reduce safety of data widely in addition on a plurality of servers, increased the possibility that data are stolen by invadors such as hackers.

Summary of the invention

Technical problem to be solved by this invention is need repeatedly copy and cause the low shortcoming of memory space utilance at the buffer server that distributes in order to overcome data flow in the existing distributed video on-demand system.The technical scheme that realizes technical problem to be solved by this invention and take is summarized as follows: on the one hand, propose to realize in the distributed video on demand system method of storage and visit, it is characterized in that the method for storage may further comprise the steps:

A. the data file is carried out deblocking by the mode of setting;

B. the data block of setting number is calculated and formed the checking data piece;

C. described data block and checking data piece are stored in a plurality of buffer servers of setting in turn by the mode of setting;

D. in described each buffer server, generate the index information of corresponding data block storage.The method of data access may further comprise the steps:

A. client and server connect;

B. client reads the index information of the data file correspondence that will visit: c. according to index information read block in corresponding a plurality of buffer servers from index server;

D. go verification to reading the data chunk that reaches group density;

E. client is recombinated to the data block that reads, the restore data file.

On the other hand, propose a kind of distributed video on demand system, comprise the plurality of video client, the buffer server of several stored data files and index server, described client and server connects and communication by communication network, it is characterized in that:

Client is sent the video file request to index server, to the data block reorganization of reading from buffer server;

Index server, the control data file is divided into data block by setting means, the data block of setting number is calculated formed the checking data piece, controls described data block and checking data piece and stores a plurality of data block store servers respectively in turn into by the mode of setting;

Buffer server, the index information of recording data blocks storage, and storage data block.

Adopt technical solution of the present invention, compared with prior art, obtained the progress on system data storage and the access method, reached the effect that improves storage efficiency, saved a large amount of memory spaces.Simultaneously because the The data distributed storage, data file distributes between a plurality of buffer servers, so illegal invasion person (as the hacker), even broken through certain or certain several servers, also can't obtain the complete file data, therefore, take technical scheme of the present invention to improve safety of data; On the other hand owing to adopted the method for preferred data redundancy verification, even some buffer servers occur unusual, the file data blocks of storing in the buffer server by other also can recover out with impaired data block, thereby realize reorganization and recovery, improve the reliability that data distribute the data file.

Description of drawings

Fig. 1 is an embodiment of distributed video on demand provided by the invention system; Fig. 2 is that the data structure of the primary index information table of the system shown in Fig. 1: Fig. 3 is the data structure of the secondary index information of the system shown in Fig. 1; Fig. 4 is the Stored Procedure figure to the data file of the system shown in Fig. 1; Fig. 5 is the generation method flow diagram of checking data behind the piecemeal among Fig. 4; Fig. 6 is the flow chart that distributed store is handled among Fig. 4;

Fig. 7 is the flow chart that in the system shown in Fig. 1 data is conducted interviews.

Embodiment

Below in conjunction with accompanying drawing, illustrate the mode of an enforcement of the present invention.

Fig. 1 is an embodiment of distributed video on demand provided by the invention system.In the following description, suppose that video on-demand system client 10 sends request of data by communication network 20 from index server 50, data file evenly is stored on the individual secondary server 60 of N (N is a natural number) (present embodiment is based on two-layer configuration, so buffer server is called secondary server).User's request is at first processed at index server 50 places, the distribution situation of the data of search request in N secondary server 60, added by communication network and sent to user side 10 by the data of a plurality of secondary servers 60 with request then, user side 10 is finished the process that reconfigures of data.

Index server 50 is preserved a plurality of catalogue forms.Each data file is all generated a record.Whenever there being data file to be distributed the formula storage once, all generate the primary index information 30 of a correspondence, primary index information is a form that is formed by the multirow data, is called master index tables of data 200, the master index tables of data is as shown in Figure 2.For the data that are distributed on each server, on secondary server to secondary index tables of data 40 should be arranged.The secondary index tables of data as shown in Figure 3.

The data record that master index tables of data 200 comprises has: Data Filename is used for representing the degree of distribution of data distribution situation; First secondary server sign-initial server of distributed store; Be used for identifying the group density of a piece group setting data piece number; The secondary server name sequence of storage data.If client 10 needs certain data file of request, then obtain master index 30 by connecting index server 50, read the master index tables of data 200 that belongs to this demand file, therefrom obtain the distributed intelligence of file, initial secondary server, group density and server name sequence.

After having obtained primary index information, client 10 will connect one by one with the secondary server in the server name sequence in master index tables of data 200, the retrieving information request msg of utilizing secondary index tables of data 210 to provide, connection is from first secondary server initial server one by one, to the last a data block runs through, and whole process finishes.

Comprising first record in the secondary index tables of data 210 is Data Filename, Data Filename be with the master index tables of data in the data file file-name field be associated, second record is the sequence number of data block, be used for the position of unlabeled data piece in source file, the 3rd record is previous server, the 4th record is a back server, the memory location of adjacent data block indicated in these two records, last record is the memory address of data block, and memory address has comprised the memory location of data block in external memory 70.

Fig. 4 is the Stored Procedure figure to the data file of the system shown in Fig. 1.When carrying out the storage of data file, at first carry out and read the step 300 that file is analyzed by index server, carry out the step 310 of file being carried out piecemeal according to the parameter of file block then, in this step, the size of data block can specifically be set, and can be equal and opposite in direction between each data block, can not wait yet, in the present embodiment, the size of each data block equates, promptly adopts the mode of even piecemeal; To set the quantity data piece after piecemeal is finished and form data chunk, and carry out each piece batch total is calculated the step 320 that forms the checking data piece, then

Checking data is put into data chunk as one of piece group data, form new data chunk, carry out at last all data blocks are carried out step 330 in the external memory 70 of distributed N the secondary server that stores appointment in turn into.

Fig. 5 is the generation method flow diagram of checking data behind the piecemeal among Fig. 4, and behind the piece number (promptly organizing density) in configuring data block size and piece group, key step is as follows:

1, reads the step 400 of specific data blocks of data;

2, judged whether to reach the step 410 of last data block, forward step 426 processing to if reach last piece, otherwise, execution in step 420, i.e. whether the piece number of judgment data piece group appointment (group density) reaches, if do not reach, then continue to read next data block, execution in step 400, otherwise a data chunk, execution in step 425 have been read in expression;

3, step 425 to reading to such an extent that the data block batch total is calculated generation checking data piece, continues execution in step 400 then;

4, step 426, whether the piece number (group density) of judgment data piece group appointment reaches, if reach execution in step 440, otherwise execution in step 430;

5, step 430, to the data block portions padding data of lazy weight, to replenish not enough data block, the supplementary data piece generates last data chunk after forming the piece group; 6, step 440 is calculated generation checking data piece to last data chunk; 7, step 450 is carried out distributed store with all data blocks together with the checking data piece and is handled, and specifically sees Fig. 6.

In the present embodiment, the calculating generation method of checking data piece is a lot, enumerates several concrete examples here, suppose group density 200d=M (M is a natural number), correspondingly each piece group is D 1, Dz, ... DM, checking data to be generated are P, can take following several method of calibration:

(1) parity check method: be called the RAID method again, method is to adopt step-by-step to carry out the method for binary addition, according to the method, and P=D1 field DZ.....DM. or P=one (D10DZ ... .. consolidates), wherein ". " expression nonequivalence operation, " one " represents NOT operation.Suppose that the Gong in the piece group makes a mistake, so correspondingly, to data reconstruction method that should verification be: several=D1. is several .... encircle a P.Dk+1... field DM. or Dk two (DI.DZ.... Dk-loP field Dk+1....DM).

(2) modular arithmetic method: method is, regards the piece group as binary number, and several number of bits of supposing each piece are K, and all piece groups are counted addition, and to the ZAK delivery, that is: P==(D1+DZ+...+nM) mod ZAK.Suppose that the Dk in the piece group makes a mistake, to data reconstruction method that should verification be so: Dk=P+2 sighs one (DI+DZ+...+Dk one 1+D foretells 1...+DM) modZAK.

(3) mirror method: this method is that each piece is generated an identical data copy piece, is equivalent to generate in system two identical distributed copys.

Method also has a lot, gives an example no longer one by one here, and all methods all must meet the following conditions: usually, the generation method of checking data P is: to data D1, DZ ..., the corresponding verification generating function f () of DM satisfies P two f (D,, DZ ..., DM), correspondingly, recognize for arbitrary data, a corresponding checking data recovers function g (), satisfies Dk=g (D,, DZ, a Dk-1, P, D foretells a DM), function f () can be identical with g (), also can be different.

Fig. 6 is the flow chart that distributed store is handled among Fig. 4, may further comprise the steps:

1. step 500 is set distributed constant, promptly sets the number of servers of participation distributed store and all server names, generates master index tables of data 200;

2. step 510 is set origin server, promptly determines first server that begins to store;

3. step 520, the read block group;

4. step 530, distributed storage data chunk and generate secondary index promptly from origin server, is stored first data block, generates the secondary index tables of data information 210 of this piece on this server; Connect next server then in turn, store second data block, on this server, generate the secondary index tables of data information 210 of this piece, till all data blocks of this piece group store on the server step by step (comprising the corresponding check data block);

5. step 540 judges whether this piece group is last piece group, if not, then reset origin server, execution in step 510, if, represent so all data chunk distributed store finish, distributed storage finishes;

6, end step 550.In order further to improve the reliability and the flexibility of data, checking data can be according to certain regular assigned address in the piece group, as in the fixed position or dynamic position, wherein the adopted dynamic circulation method of dynamic position storage as checking data is placed on first in first data chunk, is placed on second in second data chunk, ... when checking data piece position reaches group density, from the circulation of first BOB(beginning of block), can in implementation process, select again.

Fig. 7 is the flow chart that in the system shown in Fig. 1 data is conducted interviews, and mainly comprises the steps:

1. step 610, client 10 is connected with index server 50 by communication network 20, asks certain data file;

2. step 620, after connecting foundation, index server is searched primary index information by request, obtains master index 30, obtains this request msg file index tables of data 200 of sign, from 200, extract the distributed intelligence of this document, the include file name, degree of distribution, initial server, group density, information such as server name sequence;

3. step 630 connects origin server;

4. step 640, call and read the secondary index information process, obtain secondary index 40, obtain the secondary index tables of data 210 that belongs to this demand file, from 210, obtain the respective file name of this data block in server, data block number, on information such as server, next server, memory location;

5. step 650, read block in external memory 70;

6. step 660, judge whether the data block number that reads is reached for this document designated groups density value: (1) is if reach the group density value, execution in step 680 so, promptly go the process that reconfigures of verification and data block, the redundancy check data of going the verification regrouping process will carry out the data reliability assurance is originally removed.Continue execution in step 690;

(2) if do not reach the group density value, execution in step 670 promptly connects next server, continues execution in step 640 then;

7. step 690, judgment data piece number, watch whether reaching last data block: (1) if do not reach, execution in step 670 promptly connects next server, continues execution in step 640;

(2) if reached last data block, so whole data retrieval access process finishes.

Although the disclosed method that relates to distributed video on demand system and realization storage and visit has been carried out special description with reference to embodiment, those skilled in the art can understand, under the situation that does not depart from scope and spirit of the present invention, can carry out all conspicuous modification of form and details to it, as can free data verification method and data distribution density, simultaneously the secondary mode of the server distribution among the embodiment can be changed into 3 grades or multistage mode, also can change single-stage into, the index information table also can design voluntarily, increases newer field etc.Therefore, embodiment described above is illustrative and not restrictive, and under the situation that does not break away from the spirit and scope of the present invention, all variations and modification are all within the scope of the present invention.

Claims

1, realize the method for storage in a kind of distributed video on demand system, it is characterized in that the method for storage may further comprise the steps:

A. the data file is carried out deblocking by the mode of setting;

D. in described each buffer server, generate the index information of corresponding data block storage.

2, realize the method for data access in a kind of distributed video on demand system, it is characterized in that the method for data access may further comprise the steps:

A. client and server connect;

B. client reads the index information of the data file correspondence that will visit from index server;

C. according to index information read block in corresponding a plurality of buffer servers;

D. go verification to reading the data chunk that reaches group density;

E. client is recombinated to the data block that reads, the restore data file.

3, storage means according to claim 1 is characterized in that the data file is carried out uniform deblocking.

4, storage means according to claim 1 is characterized in that in described each data chunk, and arbitrary data block can be recovered by checking data piece and other data blocks.

5, storage means according to claim 1 is characterized in that wherein the data block of setting number being calculated in the step that forms the checking data piece, and is further comprising the steps of:

B.1. judge the quantity of the data set of last data block of include file, whether reach the number of setting,, calculate and form the checking data piece if reach;

B.2. if do not reach, the data block of filler quantity not sufficient is calculated then and is formed the checking data piece.

6, storage means according to claim 1 is characterized in that wherein described data block is stored in a plurality of servers in the step in turn by the mode of setting, and is further comprising the steps of:

C.1. set the quantity and the title of distributed store buffer server;

C.2. determine the initial storage buffer server.

7, storage means according to claim 1 is characterized in that the checking data piece memory location in the data chunk is dynamic.

8, storage means according to claim 1 is characterized in that wherein generating in described each buffer server in the step of index information of corresponding storage, and index information comprises following content: institute's data blocks stored corresponding file name;

Institute's data blocks stored sequence number;

The server name of the adjacent data blocks storage of the data block of storing;

The memory address of institute's data blocks stored in server.

9, data access method according to claim 2 is characterized in that wherein in the step according to index information read block in corresponding a plurality of servers further comprising the steps of:

C.1. connect origin server;

C.2. read the data block of file correspondence by index information;

C.3. judge whether the data block quantity that is read equals to organize density, if not, carries out next step, if carry out client and the data block that reads is recombinated the step of restoring data file;

C.4. connect next server, continue read block, execution in step c.2.

10, distributed video on demand system comprises the plurality of video client, the buffer server of several storage video data files and index server, and described client and server connects and communication by communication network, it is characterized in that:

Client is sent file request to index server, to the data block reorganization of reading from buffer server; Index server, the control data file is divided into data block by setting means, the data block of setting number is calculated formed the checking data piece, controls described data block and checking data piece and stores a plurality of data block store servers respectively in turn into by the mode of setting;