CN108540315A - Distributed memory system, method and apparatus - Google Patents

Distributed memory system, method and apparatus Download PDF

Info

Publication number
CN108540315A
CN108540315A CN201810269046.1A CN201810269046A CN108540315A CN 108540315 A CN108540315 A CN 108540315A CN 201810269046 A CN201810269046 A CN 201810269046A CN 108540315 A CN108540315 A CN 108540315A
Authority
CN
China
Prior art keywords
storage
protected field
node
failure
domain
Prior art date
Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
Granted
Application number
CN201810269046.1A
Other languages
Chinese (zh)
Other versions
CN108540315B (en
Inventor
郑乾坤
Current Assignee (The listed assignees may be inaccurate. Google has not performed a legal analysis and makes no representation or warranty as to the accuracy of the list.)
New H3C Technologies Co Ltd Chengdu Branch
Original Assignee
New H3C Technologies Co Ltd Chengdu Branch
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
Filing date
Publication date
Application filed by New H3C Technologies Co Ltd Chengdu Branch filed Critical New H3C Technologies Co Ltd Chengdu Branch
Priority to CN201810269046.1A priority Critical patent/CN108540315B/en
Publication of CN108540315A publication Critical patent/CN108540315A/en
Application granted granted Critical
Publication of CN108540315B publication Critical patent/CN108540315B/en
Active legal-status Critical Current
Anticipated expiration legal-status Critical

Links

Classifications

    • HELECTRICITY
    • H04ELECTRIC COMMUNICATION TECHNIQUE
    • H04LTRANSMISSION OF DIGITAL INFORMATION, e.g. TELEGRAPHIC COMMUNICATION
    • H04L67/00Network arrangements or protocols for supporting network services or applications
    • H04L67/01Protocols
    • H04L67/10Protocols in which an application is distributed across nodes in the network
    • H04L67/1097Protocols in which an application is distributed across nodes in the network for distributed storage of data in networks, e.g. transport arrangements for network file system [NFS], storage area networks [SAN] or network attached storage [NAS]
    • HELECTRICITY
    • H04ELECTRIC COMMUNICATION TECHNIQUE
    • H04LTRANSMISSION OF DIGITAL INFORMATION, e.g. TELEGRAPHIC COMMUNICATION
    • H04L41/00Arrangements for maintenance, administration or management of data switching networks, e.g. of packet switching networks
    • H04L41/06Management of faults, events, alarms or notifications
    • HELECTRICITY
    • H04ELECTRIC COMMUNICATION TECHNIQUE
    • H04LTRANSMISSION OF DIGITAL INFORMATION, e.g. TELEGRAPHIC COMMUNICATION
    • H04L41/00Arrangements for maintenance, administration or management of data switching networks, e.g. of packet switching networks
    • H04L41/50Network service management, e.g. ensuring proper service fulfilment according to agreements
    • H04L41/508Network service management, e.g. ensuring proper service fulfilment according to agreements based on type of value added network service under agreement
    • H04L41/5096Network service management, e.g. ensuring proper service fulfilment according to agreements based on type of value added network service under agreement wherein the managed service relates to distributed or central networked applications

Landscapes

  • Engineering & Computer Science (AREA)
  • Computer Networks & Wireless Communication (AREA)
  • Signal Processing (AREA)
  • Information Retrieval, Db Structures And Fs Structures Therefor (AREA)
  • Retry When Errors Occur (AREA)

Abstract

Present disclose provides a kind of distributed memory system, method and apparatus;The system includes multiple protected fields;Each protected field includes multiple failure domains, and each failure domain includes one group of memory node;Each copy of same data is stored on the memory node in the different faults domain in the same protected field; with in failure memory node; it is safeguarded in protected field belonging to failed storage node, in each protected field, the maximum allowable number of faults of failure domain is less than the quantity of copy.The disclosure can improve the safety and reliability of distributed memory system.

Description

Distributed memory system, method and apparatus
Technical field
This disclosure relates to technical field of data storage, more particularly, to a kind of distributed memory system, method and apparatus.
Background technology
It with the fast development of Internet technology and is widely used, there is mass data continually to generate daily, therefore, Storage system as data storage carrier is also more and more important.Poor, safety that heritage storage system usually has the scalability and The defects of reliability is poor, the storage demand of user cannot be met by leading to it.Distributed memory system is with its good number According to processing capacity and reliability, increasingly approved by user.
But since distributed memory system has the characteristics that distributed hardware structure, if one or more in system Failure has occurred in storage server, can usually influence the data service performance of whole system, makes all services of system sometimes It interrupts, increasing with the scale of system, the service range of influence is also increasing, causes the service of distributed memory system Performance is relatively low.
Invention content
In view of this, the disclosure is designed to provide a kind of distributed memory system, method and apparatus, to improve distribution The service performance of formula storage system.
To achieve the goals above, the technical solution that the disclosure uses is as follows:
In a first aspect, present disclose provides a kind of distributed memory system, which includes multiple protected fields;Each protection Domain includes multiple failure domains, and each failure domain includes one group of memory node;Each copy of same data is stored in the same guarantor It protects on the memory node in the different faults domain in domain, in failure memory node, in the guarantor belonging to failed storage node Shield is safeguarded in domain, and in each protected field, the maximum allowable number of faults of failure domain is less than the quantity of copy.
Second aspect, present disclose provides a kind of distributed storage method, this method is applied to distributed memory system Node is monitored, system includes multiple protected fields;Each protected field includes multiple failure domains, and each failure domain includes one group of storage section Point;Method includes:Receive data to be stored;Generate multiple copies of data;Multiple copies of data are stored respectively to same On the memory node in the different faults domain in one protected field.
The third aspect, present disclose provides a kind of distributed storage devices, including memory and processor, wherein storage Device is for storing one or more computer instruction, and one or more computer instruction is executed by the processor, in realization State distributed storage method.
Above-mentioned distributed memory system, method and apparatus can be by same numbers by the way that system is divided into multiple protected fields According to each copy be stored on the memory node in the different faults domain in the same protected field, in failure memory node When, it is safeguarded in the protected field belonging to failed storage node, which improves the tolerable maximum malfunctioning node of system Quantity, the coverage and system for reducing failed storage node stop the probability servicing, make the service of distributed memory system Performance is more stablized, and the safety and reliability of system is improved.
Other feature and advantage of the disclosure will illustrate in the following description, alternatively, Partial Feature and advantage can be with Deduce from specification or unambiguously determine, or by implement the disclosure above-mentioned technology it can be learnt that.
To enable the above objects, features, and advantages of the disclosure to be clearer and more comprehensible, better embodiment cited below particularly, and match Appended attached drawing is closed, is described in detail below.
Description of the drawings
It, below will be to specific in order to illustrate more clearly of disclosure specific implementation mode or technical solution in the prior art Embodiment or attached drawing needed to be used in the description of the prior art are briefly described, it should be apparent that, in being described below Attached drawing is some embodiments of the disclosure, for those of ordinary skill in the art, before not making the creative labor It puts, other drawings may also be obtained based on these drawings.
Fig. 1 is a kind of structural schematic diagram for distributed memory system that disclosure embodiment provides;
Fig. 2 is the structural schematic diagram for another distributed memory system that disclosure embodiment provides;
Fig. 3 is the structural schematic diagram for another distributed memory system that disclosure embodiment provides;
Fig. 4 is the structural schematic diagram for another distributed memory system that disclosure embodiment provides;
Fig. 5 is the structural schematic diagram for another distributed memory system that disclosure embodiment provides;
Fig. 6 is the structural schematic diagram for another distributed memory system that disclosure embodiment provides;
Fig. 7 is a kind of flow chart for distributed storage method that disclosure embodiment provides;
Fig. 8 is a kind of structural schematic diagram for distributed storage devices that disclosure embodiment provides.
Specific implementation mode
To keep the purpose, technical scheme and advantage of disclosure embodiment clearer, below in conjunction with attached drawing to this public affairs The technical solution opened is clearly and completely described, it is clear that and described embodiment is a part of embodiment of the disclosure, Rather than whole embodiment.Based on the embodiment in the disclosure, those of ordinary skill in the art are not making creation Property labour under the premise of the every other embodiment that is obtained, belong to the range of disclosure protection.
Distributed memory system is that data dispersion is stored in more independent equipment, utilizes more storage servers point Load storage load;Relative to direct-connected additional storage, network attached storage, storage area network these traditional network storage framves Structure, distributed memory system have preferable reliability, availability and access efficiency, are also easy to system scale extension.
For the application platform of enterprise-level, especially Internet Service Provider is (for example, shopping, retrieval, Map Services The supplier of website or application program), with the continuous expansion of business scale, business becomes to become increasingly complex, and concurrent user asks Ask more and more, data to be processed are also more and more;Distributed memory system can constantly expand with the business demand of enterprise Exhibition enhances the processing capacity of distributed system entirety, to support at high concurrent and mass data by increasing number of servers Reason ability meets business event and increases the calculating demand brought.
A kind of structural schematic diagram of distributed memory system shown in Figure 1;Management server (also referred to as monitoring section Point) by data distributed and saved in the storage server (also referred to as memory node) being attached thereto, and record the guarantor of Various types of data Deposit position, at the same monitor each storage server working condition (such as, if break down);Client and management server Communication connection.For example, user sends a data inquiry request, after management server receives the request, root by client The storage location that corresponding data is searched according to the request, for example, the mark of storage server, the mark of storage hard disk in storage server Know etc., to read data from the storage location, and it is back to client.
In distributed memory system, if wherein failure, management service has occurred in one or more storage server Device provides the data service preserved in the storage server without normal direction client, usually reads failure to client returned data Information;Especially for enterprise-class tools, even the storage server failure of very short time, the storage of this low reliability A large amount of business service can be caused to fail, cause economic loss.
In order to make storage system that there is certain safety and reliability, it will usually which storage system is divided into multiple failures One storage server can be divided into a failure domain by domain for example, as in Fig. 1;Data to be stored are generated again multiple Copy, multiple copies are stored in respectively in different failure domains, when the storage server in one of failure domain breaks down When, identical data can be obtained from other failure domains, make storage system that there is higher fault-resistant ability, reliability is higher, To ensure that being smoothed out for data service, while the safety of data will not be affected.
The copy amount of above-mentioned generation is usually determined by the redundancy strategy of the business demand of enterprise or current storage system It is fixed;Copy amount is bigger, and storage system reliability is stronger, but the data volume that can be stored is smaller;Copy amount is smaller, storage System reliability is lower, but the data volume that can be stored is larger.
Furthermore it is also possible to be handled data to be stored using correcting and eleting codes technology;Specifically, first will be to be stored Data are divided into n original data block, obtain m verification data block further according to this n original data block operation, n+m is obtained A data block;When wherein arbitrarily no more than the error of the data block (including original data block and verification data block) of m blocks when, can be with By remaining data block, that n original data block is recovered using corresponding restructing algorithm.In storage system, by this n+m Data block is distributed in different failure domains;It, can be in this way, when the storage server in one of failure domain breaks down Data block is obtained from other failure domains, and reconstructs original data block.
The division range and rank of above-mentioned failure domain, can according to the scale of storage system, business demand of enterprise etc. because Element is adjusted, changes;For example, when storage system scale is smaller, one or more storage server can be divided into one A failure domain is usually provided with multiple storage hard disks in each storage server;When system scale is larger, it will usually use base The form stood, more storage servers of a BTS management, at this point it is possible to by one or more base stations, and its what is be managed deposit Storage server is divided into a failure domain.In failure domain partition process, it will usually keep each failure domain consistent, i.e., each Storage server quantity in failure domain is identical, and the performance configuration of each storage server in each failure domain is identical.
In general, if there are one memory nodes (for example, storage server or storage hard disk) to occur in a failure domain Failure, the failure domain just need to carry out fault recovery;In failover procedure, storage system can continue offer service, but Whole operational efficiency and performance can all be affected, and lead to the reduction of business service efficiency;If had in storage system multiple Failure domain all breaks down node, and the failure domain quantity of failure node has reached the copy amount of above-mentioned data, at this time Memory node where probably there are all copies of certain data all breaks down, and causes these loss of data, in order to The business service failure for being related to these data is avoided, the usual storage system needs to interrupt service, to the failure to break down It is repaired in domain.
When the scale of distributed memory system is increasing, in the failure domain quantity in system, and each failure domain Memory node quantity can all increase;After memory node increase in failure domain, the probability of the failure domain failure node It can increase;For example, when the memory node in failure domain is 4, then the possibility for the node that breaks down is 4;The failure domain is deposited When storing up point spread to 10, then the possibility for the node that breaks down has been increased by 10.After failure domain quantity in system, deposit The failure domain quantity of failure node easily reaches the copy amount of setting in storage system, i.e. storage system needs to interrupt clothes The probability of business greatly increases;Storage system is frequently interrupted, and causes the stability of storage system poor, and service performance is affected, It is difficult to meet the business demand of enterprise customer.
Based on the problem that above-mentioned large-scale storage systems service performance is poor, disclosure embodiment provides a kind of distribution Formula storage system, method and apparatus;The technology can be applied in distributed storage cluster, storage system, it is particularly possible to apply In the distributed memory system of enterprise customer.As shown in Fig. 2, the distributed memory system includes multiple protected fields;It is each to protect Shield domain includes multiple failure domains, and each failure domain includes one group of memory node;Each copy of same data is stored in same On the memory node in the different faults domain in protected field, in failure memory node, belonging to failed storage node It is safeguarded in protected field.
In addition, in disclosure embodiment in each protected field, the maximum allowable number of faults of failure domain is less than copy Quantity.
In present embodiment, failure domain quantity that protected field quantity that system includes, each protected field include and each The quantity for the memory node that failure domain includes is not specifically limited;The multiple copies of data are stored in the same protected field at random In different faults domain;Alternatively, according to the memory state of each failure domain in protected field, multiple copies are stored in the same protected field Specified different faults domain in.With system include two protected fields in Fig. 2, each protected field include for three failure domains into Row explanation;If same data include two copies, the two copies can be stored in two of the same protected field at random In failure domain, or according to the memory state of each failure domain, it is stored in two specified failure domains of the same protected field. If same data include three copies, these three copies can be stored in three failure domains of the same protected field respectively In.
Since each copy of same data is stored in the same protected field, what each protected field preserved has been Complete different data are not present the intersection of data, i.e., can not possibly preserve the pair of same data between protected field two-by-two each other This;When system failure memory node needs to be serviced, the operational efficiency and property of the protected field belonging to the memory node It can be affected;And the operational efficiency of other protected fields and property retention are constant, alleviate system because of a failed storage The problem of node causes systematic entirety that can reduce improves the stability and reliability of system.
It can be seen from the above, in the case where being not provided with protected field, system usually requires that the maximum allowable number of faults of failure domain Amount is less than the quantity of copy, since there may be data cross between each two failure domain in system, if there is failure section The failure domain quantity of point has reached copy amount, and in order to avoid loss of data, the usual storage system needs to interrupt service, to hair The failure domain of raw failure is repaired.If being provided with protected field through the above way, in each protected field, the maximum of failure domain permits Perhaps number of faults again smaller than copy quantity, if the failure domain quantity of failure node has reached copy in a certain protected field Quantity, it is only necessary to interrupt the data service of the protected field, other protected fields are unaffected.In general, only in all protected fields When the failure domain quantity of interior failure node has reached copy amount, system integrally can just stop servicing;It is protected by being arranged Domain greatly reduces the probability that system stops service, improves the reliability of system.
By taking distributed memory system shown in Fig. 2 as an example, it is assumed that the copy amount of data is 2;If one of protection One failure domain in domain breaks down memory node, then the protected field carries out fault restoration, and shadow is not generated to another protected field It rings;The memory node if two failure domains of one of protected field break down, which stops service, to another Protected field does not also have an impact;It is another only when the two or more failure domains of one of protected field break down memory node The two or more failure domains of a protected field also break down memory node when, the system that is equivalent to occur four failure domains occur therefore Hinder memory node, which just stops service.
As shown in Fig. 2, if the system is not provided with protected field, or the function of closing protected field, in the copy amount of data In the case of being similarly 2, the memory node if a failure domain breaks down, systematic entirety can be affected;If two A or more failure domain breaks down memory node, and the system that is equivalent to two failure domains occurs and breaks down memory node, system It will stop servicing.
By above-mentioned comparison it is found that the distributed memory system that disclosure embodiment provides, more by the way that system to be divided into Each copy of same data can be stored in the memory node in the different faults domain in the same protected field by a protected field On, in failure memory node, to be safeguarded in the protected field belonging to failed storage node, which, which improves, is It unites tolerable maximum malfunctioning node quantity, reduces the coverage of failed storage node and system stops the probability of service, Improve the safety and reliability of distributed memory system.
Referring to the structural schematic diagram of Fig. 3 and Fig. 4 another distributed memory system being respectively shown in;In the embodiment, System includes 18 hosts (i.e. above-mentioned memory node), totally 6 base stations (being referred to as Rack or rack), each BTS management 3 hosts;In the system, the host of each base station and its management is a failure domain;In Fig. 4, a protected field includes three Failure domain.
When data copy to be stored is 2, for example, copy A and copy B;Copy A and copy B is stored in protection simultaneously In domain 1 or protected field 2;If being stored in protected field 1 simultaneously, two of which base can be selected in base station 1, base station 2 and base station 3 It stands and preserves copy A and copy B, for example, copy A is stored in base station 1, copy B is stored in base station 3.At this point, even if protection There is hostdown there are one base station in domain 1 and protected field 2, system can still be run;Only protected field 1 and protected field When occurring two base stations generation hostdowns in 2, system break service.
Similarly, as shown in figure 3, if the system is not provided with protected field, copy A and copy B may be maintained in base station 1 Into any two base station in base station 6;When there are mass data, cause the base station two-by-two of base station 1 to base station 6 that may preserve The copy of same data;In 6 base stations, any one failure domain breaks down memory node, then systematic entirety can be by shadow It rings;In 6 base stations, any two or more failure domain breaks down memory node, and system will stop servicing.
It can be seen from the above, the copy amount by data is limited, the failed storage number of nodes that allows in storage system or Failed storage hard disk number in person's difference node is certain;Therefore, the memory node in system or storage hard disk quantity are more, The system is because node failure causes the affected probability of system business bigger, usual linear relation with increase.By by system Protected field is divided, this probability, the event in the failed storage node allowed in each protected field or different nodes can be reduced The event in failed storage number of nodes or different nodes that entire cluster allows when hindering storage hard disk number from without dividing protected field Barrier storage hard disk number is consistent.Therefore, for the distributed memory system of same scale, after having divided protected field, ensureing In the case of continual service, it can hold when the number for the maximum malfunctioning node that system can be tolerated is compared to unallocated protected field The number for the maximum malfunctioning node born has into the increase of multiple, by taking system shown in Fig. 3 and Fig. 4 as an example, when unallocated protected field, The maximum malfunctioning node number that system can be tolerated is 1, and if it is greater than 1, system then interrupts service;After having divided two protected fields, The maximum malfunctioning node number that system can be tolerated is 2, and if it is greater than 2, system then interrupts service.Therefore, which is effectively promoted The safety of the reliability, stability, robustness and data of distributed memory system in abnormal cases.
In another embodiment, it is contemplated that for data itself there are multiple types, different types of data are corresponding Service request is different;For example, for the data type for needing read or write speed fast, block storage service may be used;For needing altogether The high data type of performance is enjoyed, file storage service may be used;For not only needing read or write speed fast but also needing shared performance high Data type, then the larger object storage service of carrying cost may be used.
When distributed memory system is larger, the interface of above-mentioned a variety of storage services may be set, to provide difference Storage service;But the interface distributing position of various storage services is more random, when storing data, by storage location, deposits The limitation of the conditions such as space is stored up, storage system is difficult to select matched storage service according to the service request of data, usually will not Data with service request are stored in same failure domain, same host, even same storage hard disk, lead to different service requests Data on physical resource cross-distribution.
For example, the data type for needing shared performance high is preserved into block storage service, needing read or write speed fast Data type is preserved into file storage service, and the storage mode of this specific admixture prevents the storage performance of various data from meeting Service request causes the storage efficiency of system entirety relatively low.
For these reasons, embodiment further provides another distributed memory systems for the disclosure, as shown in figure 5, should System includes multiple subregions, and the identical protected field of one or more storage performances is respectively arranged in each subregion;Each protection The storage performance of memory node in domain is identical.
In order to make the protected field storage performance having the same in same subregion may be used same in same subregion Kind storage service, the i.e. storage service of the subregion can be one kind in block storage, file storage and object storage;It is every in order to make The storage performance of memory node in a protected field is identical, and it is identical that physical resource model may be used in the memory node in protected field Or similar storage device, for example, SSD (Solid State Drive, solid state disk), which may be used, in memory node provides high property Energy storage service may be used HDD (Hard Disk Drive, hard disk drive) and provide common performance storage service.
To in system memory node carry out subregion when, usually divided from physical layer, with realize each subregion it Between isolation on physical resource, do not share physical resource between each subregion in addition to monitoring node, each subregion is pre- for storing The corresponding data of storage service first configured, to ensure that different storage services use different physical resources.In physical layer On face, the total collection of each memory node in each storage system can be considered as a big storage pool, and in each subregion Memory node can also be considered as the corresponding storage pool of the subregion, each storage pool is used to provide a kind of storage clothes more segmented Business;The storage performance for the memory node that different storage pools includes can be different, can be according to service when executing storage service Requirement select different storage pools.By dividing storage pool, different storage services can be further realized on physical resource Isolation.
System can be divided into multiple points by above-mentioned distributed memory system according to the performance of each memory node first Area, then each subregion is divided into the identical protected field of storage performance, and the storage performance phase of the memory node in each protected field Together;This partitioned mode makes the data distribution of different storage services on different physical resources, by the object for being unable to performance Manage resource provide different performance storage service, so that data type is matched with storage performance, ensure system safety with Under the premise of reliability, system storage efficiency is improved.
Embodiment further provides another distributed memory systems for the disclosure, as shown in fig. 6, system can be according to each The factors such as the practical business demand of subregion, scale choose whether setting protected field;It, directly will be each in subregion 1 as shown in subregion 1 The host of base station and its management is divided into failure domain;And in subregion 2, first by base station 1, base station 2 and base station 3 and its master of management Machine is divided into protected field 1, and the host of base station 4, base station 5 and base station 6 and its management is divided into protected field 2;It is with protected field 1 Example, then the host of each base station and its management is divided into failure domain.It is appreciated that usually each base station manage there are one or Multiple main frames show that the host of portion of base stations is not shown in Fig. 6 for convenience.
Above-mentioned subregion can be as the storage region unit in cluster (i.e. above-mentioned distributed memory system), each subregion packet Containing host in several clusters, different storage service services is provided to the user in different subregions;It can be met not by subregion With between storage service data isolation and user use different performance block storage demand.On the basis of subregion, Mei Gefen Area has further divided multiple protected fields, to improve the reliability of cluster;And failure domain is the minimum list of data distribution in cluster Member, due to having divided failure domain, the several copies or data block of data can be distributed in several different under same subregion In failure domain, when cluster breaks down, the reliability of cluster can be further increased.
Corresponding to above system embodiment, the disclosure additionally provides a kind of distributed storage method;This method is applied to The monitoring node of distributed memory system, the monitoring node connect with each memory node in distributed memory system, are used for Carry out the failure monitoring and Breakdown Maintenance of data storage and memory node.The system includes multiple protected fields;Each protected field packet Multiple failure domains are included, each failure domain includes one group of memory node;As shown in fig. 7, this method comprises the following steps:
Step S702 receives data to be stored;
Step S704 generates multiple copies of data;
Step S706 stores multiple copies of data respectively to the storage section in the different faults domain in the same protected field Point on.
The distributed storage method after receiving data to be stored, generates multiple copies of the data, and then will count again According to multiple copies store respectively to the memory node in the different faults domain in the same protected field;Which can occur It when failed storage node, is safeguarded in the protected field belonging to failed storage node, improves the tolerable most die of system Hinder number of nodes, the coverage and system for reducing failed storage node stop the probability servicing, improve distributed storage The safety and reliability of system.
In above system, the storage performance of the memory node in each protected field is identical;System further includes multiple subregions, often It is respectively arranged with the identical protected field of one or more storage performances in a subregion, multiple copies are stored respectively to the same guarantor The step on the memory node in the different faults domain in domain is protected, including:Judge the corresponding storage service of data, the storage service packet Include block storage, object storage or file storage;Subregion is selected according to the corresponding storage service of data;By multiple copies of data point It does not store to the memory node in the different faults domain in the same protected field of selected subregion.
The above method further includes:Monitor the memory node in each protected field;If monitoring failed storage node, and protect Failure domain quantity in shield domain is not up to preset maximum allowable number of faults, in the protected field belonging to failed storage node into Row is safeguarded;If monitoring failed storage node, and the failure domain quantity in protected field reaches preset maximum allowable number of faults It measures, the storage service of the protected field belonging to outage memory node.
Present embodiments provide for a kind of a kind of distributed storage devices corresponding with above method embodiment.Fig. 8 For the structural schematic diagram of the distributed storage devices, which can be arranged the prison in distributed memory system It controls on node;As shown in figure 8, the device includes memory 100 and processor 101;Wherein, memory 100 is for storing one Or a plurality of computer instruction, one or more computer instruction are executed by processor, it, should to realize above-mentioned distributed storage method Method may include one or more in above method.
Further, distributed storage devices shown in Fig. 8 further include bus 102 and communication interface 103, and processor 101 leads to Letter interface 103 and memory 100 are connected by bus 102.
Wherein, memory 100 may include high-speed random access memory (RAM, Random Access Memory), May further include non-labile memory (non-volatile memory), for example, at least a magnetic disk storage.By extremely A few communication interface 103 (can be wired or wireless) is realized logical between the system network element and at least one other network element Letter connection can use internet, wide area network, local network, Metropolitan Area Network (MAN) etc..Bus 102 can be isa bus, pci bus or Eisa bus etc..The bus can be divided into address bus, data/address bus, controlling bus etc..For ease of indicating, only used in Fig. 8 One four-headed arrow indicates, it is not intended that an only bus or a type of bus.
Processor 101 may be a kind of IC chip, the processing capacity with signal.It is above-mentioned during realization Each step of method can be completed by the integrated logic circuit of the hardware in processor 101 or the instruction of software form.On The processor 101 stated can be general processor, including central processing unit (Central Processing Unit, abbreviation CPU), network processing unit (Network Processor, abbreviation NP) etc.;It can also be digital signal processor (Digital Signal Processing, abbreviation DSP), application-specific integrated circuit (Application Specific Integrated Circuit, abbreviation ASIC), ready-made programmable gate array (Field-Programmable Gate Array, abbreviation FPGA) or Person other programmable logic device, discrete gate or transistor logic, discrete hardware components.It may be implemented or execute sheet Disclosed each method, step and logic diagram in disclosed embodiment.General processor can be microprocessor or this at It can also be any conventional processor etc. to manage device.The step of method in conjunction with disclosed in disclosure embodiment, can direct body Be now that hardware decoding processor executes completion, or in decoding processor hardware and software module combine and execute completion.It is soft Part module can be located at random access memory, and flash memory, read-only memory, programmable read only memory or electrically erasable programmable are deposited In the storage medium of this fields such as reservoir, register maturation.The storage medium is located at memory 100, and processor 101 reads storage Information in device 100, in conjunction with its hardware complete aforementioned embodiments method the step of.
Embodiment of the present invention additionally provides a kind of machine readable storage medium, and machine readable storage medium storage is organic Device executable instruction, for the machine-executable instruction when being called and being executed by processor, machine-executable instruction promotes processor Realize that above-mentioned distributed storage method, specific implementation can be found in method embodiment, details are not described herein.
The technique effect of the distributed storage devices that embodiment of the present invention is provided, realization principle and generation and aforementioned Method embodiment is identical, and to briefly describe, device embodiments part does not refer to place, can refer to preceding method embodiment Middle corresponding contents.
In several embodiments provided herein, it should be understood that disclosed device and method can also lead to Other modes are crossed to realize.Device embodiments described above are only schematical, for example, the flow chart in attached drawing and Block diagram show the device of multiple embodiments according to the disclosure, method and computer program product system in the cards Framework, function and operation.In this regard, each box in flowchart or block diagram can represent a module, program segment or generation A part for code, the part of the module, section or code include one or more for realizing defined logic function Executable instruction.It should also be noted that at some as in the realization method replaced, the function that is marked in box can also be with Occur different from the sequence marked in attached drawing.For example, two continuous boxes can essentially be basically executed in parallel, they Sometimes it can also execute in the opposite order, this is depended on the functions involved.It is also noted that block diagram and or flow chart In each box and the box in block diagram and or flow chart combination, the special of function or action as defined in executing can be used Hardware based system is realized, or can be realized using a combination of dedicated hardware and computer instructions.
In addition, each function module or unit in each embodiment of the disclosure can integrate to form one solely Vertical part can also be modules individualism, can also two or more modules be integrated to form one it is independent Part.
It, can be with if the function is realized in the form of SFU software functional unit and when sold or used as an independent product It is stored in a computer read/write memory medium.Based on this understanding, the technical solution of the disclosure is substantially in other words The part of the part that contributes to existing technology or the technical solution can be expressed in the form of software products, the meter Calculation machine software product is stored in a storage medium, including some instructions are used so that a computer equipment (can be People's computer, server or network equipment etc.) execute all or part of step of each embodiment the method for the disclosure Suddenly.And storage medium above-mentioned includes:USB flash disk, read-only memory (ROM, Read-Only Memory), is deposited mobile hard disk at random The various media that can store program code such as access to memory (RAM, Random Access Memory), magnetic disc or CD.
Finally it should be noted that:Embodiment described above, the only specific implementation mode of the disclosure, to illustrate this public affairs The technical solution opened, rather than its limitations, the protection domain of the disclosure are not limited thereto, although with reference to aforementioned embodiments pair The disclosure is described in detail, it will be understood by those of ordinary skill in the art that:Any technology for being familiar with the art Personnel can still modify to the technical solution recorded in aforementioned embodiments in the technical scope that the disclosure discloses Or variation or equivalent replacement of some of the technical features can be readily occurred in;And these modifications, variation or replacement, The spirit and scope for disclosure embodiment technical solution that it does not separate the essence of the corresponding technical solution, should all cover in this public affairs Within the protection domain opened.Therefore, the protection domain of the disclosure shall be subject to the protection scope of the claim.

Claims (10)

1. a kind of distributed memory system, which is characterized in that the system comprises multiple protected fields;Each the protected field includes Multiple failure domains, each failure domain include one group of memory node;
Each copy of same data is stored on the memory node in the different faults domain in the same protected field, to go out When existing failed storage node, safeguarded in the protected field belonging to the failed storage node;
In each protected field, the maximum allowable number of faults of the failure domain is less than the quantity of the copy.
2. system according to claim 1, which is characterized in that the storage performance of the memory node in each protected field It is identical;
The system also includes multiple subregions, the identical institute of one or more storage performances is respectively arranged in each subregion State protected field.
3. system according to claim 2, which is characterized in that each subregion is for storing preconfigured storage clothes It is engaged in corresponding data;The storage service includes block storage, object storage or file storage.
4. system according to claim 2, which is characterized in that the physical resource of the memory node in each protected field Model is identical;The physical resource model includes solid-state hard disk SSD hard disk or hard disk drive HDD hard disks.
5. system according to claim 1, which is characterized in that multiple copies are stored in the same protected field at random In different faults domain;Alternatively, according to the memory state of each failure domain in the protected field, multiple copies are stored in same In the specified different faults domain of a protected field.
6. system according to claim 2, which is characterized in that the system also includes the prisons being connect with the memory node Node is controlled, failure monitoring and Breakdown Maintenance for carrying out data storage and memory node.
7. a kind of distributed storage method, which is characterized in that the method is applied to the monitoring node of distributed memory system, institute The system of stating includes multiple protected fields;Each protected field includes multiple failure domains, and each failure domain includes one group of storage Node;The method includes:
Receive data to be stored;
Generate multiple copies of the data;
Multiple copies of the data are stored respectively to the memory node in the different faults domain in the same protected field.
8. the method according to the description of claim 7 is characterized in that the storage performance of the memory node in each protected field It is identical;It is identical it to be respectively arranged with one or more storage performances the system also includes multiple subregions, in each subregion The protected field, it is described to store multiple copies to the memory node in the different faults domain in the same protected field respectively The step of, including:
Judge that the corresponding storage service of the data, the storage service include block storage, object storage or file storage;
Subregion is selected according to the corresponding storage service of the data;
Multiple copies of the data are stored into depositing to the different faults domain in the same protected field of selected subregion respectively It stores up on node.
9. the method according to the description of claim 7 is characterized in that the method further includes:
Memory node in each protected field of monitoring;
If monitoring failed storage node, and the failure domain quantity in the protected field is not up to preset maximum allowable failure Quantity is safeguarded in the protected field belonging to the failed storage node;
If monitoring failed storage node, and the failure domain quantity in the protected field reaches preset maximum allowable number of faults Amount, interrupts the storage service of the protected field belonging to the failed storage node.
10. a kind of distributed storage devices, which is characterized in that including memory and processor, wherein the memory is for depositing One or more computer instruction is stored up, one or more computer instruction is executed by the processor, to realize that right is wanted Seek 7 to 9 any one of them methods.
CN201810269046.1A 2018-03-28 2018-03-28 Distributed storage system, method and device Active CN108540315B (en)

Priority Applications (1)

Application Number Priority Date Filing Date Title
CN201810269046.1A CN108540315B (en) 2018-03-28 2018-03-28 Distributed storage system, method and device

Applications Claiming Priority (1)

Application Number Priority Date Filing Date Title
CN201810269046.1A CN108540315B (en) 2018-03-28 2018-03-28 Distributed storage system, method and device

Publications (2)

Publication Number Publication Date
CN108540315A true CN108540315A (en) 2018-09-14
CN108540315B CN108540315B (en) 2021-12-07

Family

ID=63482272

Family Applications (1)

Application Number Title Priority Date Filing Date
CN201810269046.1A Active CN108540315B (en) 2018-03-28 2018-03-28 Distributed storage system, method and device

Country Status (1)

Country Link
CN (1) CN108540315B (en)

Cited By (8)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN110096301A (en) * 2019-05-08 2019-08-06 深信服科技股份有限公司 A kind of hot upgrade method of storage system, system and electronic equipment and storage medium
WO2020082888A1 (en) * 2018-10-25 2020-04-30 华为技术有限公司 Method, system and apparatus for restoring data in storage system
CN111104056A (en) * 2018-10-25 2020-05-05 华为技术有限公司 Data recovery method, system and device in storage system
CN111381770A (en) * 2018-12-30 2020-07-07 浙江宇视科技有限公司 Data storage switching method, device, equipment and storage medium
CN111522675A (en) * 2019-02-05 2020-08-11 希捷科技有限公司 Data allocation within a fault domain tree
CN111625421A (en) * 2020-05-26 2020-09-04 云和恩墨(北京)信息技术有限公司 Method and device for monitoring distributed storage system, storage medium and processor
CN113821165A (en) * 2021-08-20 2021-12-21 济南浪潮数据技术有限公司 Distributed cluster fusion storage method, system and equipment
CN114466030A (en) * 2021-12-27 2022-05-10 天翼云科技有限公司 Management method and device for data distribution storage strategy and distribution storage system

Citations (5)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
WO2010048048A2 (en) * 2008-10-24 2010-04-29 Microsoft Corporation Configuration management in distributed data systems
CN104735107A (en) * 2013-12-20 2015-06-24 中国移动通信集团公司 Recovery method and device for data copies in distributed storage system
CN105706056A (en) * 2013-10-03 2016-06-22 微软技术许可有限责任公司 Fault domains on modern hardware
CN105912612A (en) * 2016-04-06 2016-08-31 中广天择传媒股份有限公司 Distributed file system and data equilibrium distribution method orienting same
CN107085546A (en) * 2016-02-16 2017-08-22 深圳市深信服电子科技有限公司 Data managing method and device based on failure field technique

Patent Citations (5)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
WO2010048048A2 (en) * 2008-10-24 2010-04-29 Microsoft Corporation Configuration management in distributed data systems
CN105706056A (en) * 2013-10-03 2016-06-22 微软技术许可有限责任公司 Fault domains on modern hardware
CN104735107A (en) * 2013-12-20 2015-06-24 中国移动通信集团公司 Recovery method and device for data copies in distributed storage system
CN107085546A (en) * 2016-02-16 2017-08-22 深圳市深信服电子科技有限公司 Data managing method and device based on failure field technique
CN105912612A (en) * 2016-04-06 2016-08-31 中广天择传媒股份有限公司 Distributed file system and data equilibrium distribution method orienting same

Cited By (15)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN111104056B (en) * 2018-10-25 2021-12-31 华为技术有限公司 Data recovery method, system and device in storage system
WO2020082888A1 (en) * 2018-10-25 2020-04-30 华为技术有限公司 Method, system and apparatus for restoring data in storage system
CN111104056A (en) * 2018-10-25 2020-05-05 华为技术有限公司 Data recovery method, system and device in storage system
US12111728B2 (en) 2018-10-25 2024-10-08 Huawei Technologies Co., Ltd. Data recovery method, system, and apparatus in storage system
CN111381770A (en) * 2018-12-30 2020-07-07 浙江宇视科技有限公司 Data storage switching method, device, equipment and storage medium
CN111381770B (en) * 2018-12-30 2021-07-06 浙江宇视科技有限公司 Data storage switching method, device, equipment and storage medium
CN111522675A (en) * 2019-02-05 2020-08-11 希捷科技有限公司 Data allocation within a fault domain tree
CN111522675B (en) * 2019-02-05 2024-04-05 希捷科技有限公司 Data allocation within fault domain trees
CN110096301A (en) * 2019-05-08 2019-08-06 深信服科技股份有限公司 A kind of hot upgrade method of storage system, system and electronic equipment and storage medium
CN111625421A (en) * 2020-05-26 2020-09-04 云和恩墨(北京)信息技术有限公司 Method and device for monitoring distributed storage system, storage medium and processor
CN111625421B (en) * 2020-05-26 2021-07-16 云和恩墨(北京)信息技术有限公司 Method and device for monitoring distributed storage system, storage medium and processor
CN113821165A (en) * 2021-08-20 2021-12-21 济南浪潮数据技术有限公司 Distributed cluster fusion storage method, system and equipment
CN113821165B (en) * 2021-08-20 2023-12-22 济南浪潮数据技术有限公司 Distributed cluster fusion storage method, system and equipment
CN114466030A (en) * 2021-12-27 2022-05-10 天翼云科技有限公司 Management method and device for data distribution storage strategy and distribution storage system
CN114466030B (en) * 2021-12-27 2024-03-12 天翼云科技有限公司 Management method and device of data distributed storage strategy and distributed storage system

Also Published As

Publication number Publication date
CN108540315B (en) 2021-12-07

Similar Documents

Publication Publication Date Title
CN108540315A (en) Distributed memory system, method and apparatus
CN107943421B (en) Partition division method and device based on distributed storage system
CN109725822B (en) Method, apparatus and computer program product for managing a storage system
US20190220379A1 (en) Troubleshooting Method, Apparatus, and Device
CN103929500A (en) Method for data fragmentation of distributed storage system
CN108319618B (en) Data distribution control method, system and device of distributed storage system
WO2017143957A1 (en) Data redistribution method and device
US11442827B2 (en) Policy-based hierarchical data protection in distributed storage
CN110737924B (en) Data protection method and equipment
CN110825704B (en) Data reading method, data writing method and server
US11023141B2 (en) Resiliency schemes for distributed storage systems
US8060773B1 (en) Systems and methods for managing sub-clusters within a multi-cluster computing system subsequent to a network-partition event
CN113625945A (en) Distributed storage slow disk processing method, system, terminal and storage medium
CN106899654A (en) A kind of sequence value generation method, apparatus and system
CN111124264A (en) Method, apparatus and computer program product for reconstructing data
CN105468296A (en) No-sharing storage management method based on virtualization platform
CN105404565B (en) A kind of dual-active data guard method and device
CN107422989A (en) A kind of more copy read methods of Server SAN systems and storage architecture
CN111866210A (en) Virtual IP balance distribution method, system, terminal and storage medium
CN112748867A (en) Method, electronic device and computer program product for storage management
CN115756955A (en) Data backup and data recovery method and device and computer equipment
CN105262640A (en) System and method for improving reliability of cloud platform server, and disposition framework of system
US11886309B2 (en) Cell-based storage system with failure isolation
CN109840051B (en) Data storage method and device of storage system
CN109117292A (en) A kind of method and apparatus and cluster storage system of cluster-based storage

Legal Events

Date Code Title Description
PB01 Publication
PB01 Publication
SE01 Entry into force of request for substantive examination
SE01 Entry into force of request for substantive examination
GR01 Patent grant
GR01 Patent grant