WO2017067402A1 - Dispositif de calcul et procédé et système de gestion de composants de mémoire d'un dispositif de calcul - Google Patents

Dispositif de calcul et procédé et système de gestion de composants de mémoire d'un dispositif de calcul Download PDF

Info

Publication number
WO2017067402A1
WO2017067402A1 PCT/CN2016/101735 CN2016101735W WO2017067402A1 WO 2017067402 A1 WO2017067402 A1 WO 2017067402A1 CN 2016101735 W CN2016101735 W CN 2016101735W WO 2017067402 A1 WO2017067402 A1 WO 2017067402A1
Authority
WO
WIPO (PCT)
Prior art keywords
storage
component
computing
switch
components
Prior art date
Application number
PCT/CN2016/101735
Other languages
English (en)
Chinese (zh)
Inventor
牛功彪
李舒
Original Assignee
阿里巴巴集团控股有限公司
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
Filing date
Publication date
Application filed by 阿里巴巴集团控股有限公司 filed Critical 阿里巴巴集团控股有限公司
Publication of WO2017067402A1 publication Critical patent/WO2017067402A1/fr

Links

Images

Classifications

    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06FELECTRIC DIGITAL DATA PROCESSING
    • G06F15/00Digital computers in general; Data processing equipment in general
    • G06F15/76Architectures of general purpose stored program computers

Definitions

  • the present application relates to the field of computers, and in particular, to a computing device and a method and system for managing storage components of a computing device.
  • the storage and computing components of the data center are configured based on a single computing device, that is, the ratio of storage components to computing components for a computing device is fixed.
  • the configuration of the storage unit and the computing unit based on the single computing device has at least the following problems:
  • One of the technical problems solved by the present application is to provide a management method and system for a computing device and a storage component of a computing device, and to realize the flexible allocation of the storage component and the computing component, and fully utilize the service time limit of each component.
  • a computing device including: a computing component, further comprising:
  • the computing component is coupled to the SAS SWITCH via the interface, the SAS SWITCH being simultaneously coupled to the storage component, the storage component being allocated for the computing component connected to the SAS SWITCH based on an allocation rule.
  • a management method of a computing device storage component comprising a computing component, the computing component being connected to a SAS SWITCH, the SAS SWITCH When connected to the storage component, the method includes:
  • a method for managing a storage component including:
  • a management system for a computing device storage component comprising: a management node, at least one SAS SWITCH connected to the management node, at least one storage component connected to the SAS SWITCH, and At least one computing device computing component, wherein
  • the management node is connected to the SAS SWITCH through a management interface provided by the SAS SWITCH, and is configured to allocate a storage component to a computing component connected to the SAS SWITCH.
  • the storage component in the computing device is separated from the computing device, and the separated storage component is connected to the computing component in the computing device through the SAS SWITCH, which has at least the following advantages:
  • the separation of computing resources and storage resources is realized, which facilitates separate maintenance and management of computing resources and storage resources;
  • the life cycle of the computing component and the storage component are separated to maximize the use of components of different life cycles
  • the decoupling between the computing component and the storage component is realized, and the storage component can be allocated according to the needs of the computing component, thereby realizing the refined and flexible allocation of the computing component and the storage component.
  • FIG. 1 is a schematic structural view of a prior art computing device.
  • FIG. 2 is a block diagram of a computing device in accordance with an embodiment of the present application.
  • FIG. 3 is a flow chart of a method of managing a computing device storage component in accordance with an embodiment of the present application.
  • FIG. 4 is a schematic structural diagram of a management system of a computing device storage component according to an embodiment of the present application.
  • FIG. 5 is a timing diagram of powering up devices in a management system of a computing device storage component according to an embodiment of the present application.
  • the computer device includes a user device and a network device.
  • the user equipment includes, but is not limited to, a computer, a smart phone, a PDA, etc.
  • the network device includes but is not limited to a single network server, a server group composed of multiple network servers, or a cloud computing based computer Or a cloud composed of a network server, wherein cloud computing is a type of distributed computing, a super virtual computer composed of a group of loosely coupled computers.
  • the computer device can be operated separately to implement the present application, and can also access the network and implement the application through interaction with other computer devices in the network.
  • the network in which the computer device is located includes, but is not limited to, the Internet, a wide area network, a metropolitan area network, a local area network, a VPN network, and the like.
  • the user equipment, the network equipment, the network, and the like are only examples, and other existing or future computer equipment or networks may be applicable to the present application, and should also be included in the scope of the present application. It is included here by reference.
  • the computing device described in the embodiment of the present application is a general term for a device that provides a computing service function, and includes, but is not limited to, a server or a PC.
  • the storage components of a computing device in the prior art are fully coupled to the computing components. As shown in FIG. 1 , the structure of the existing computing device is shown.
  • the computing component 11 in the computing device corresponds to a fixed set of storage components 10 .
  • the storage component and the computing component have at least the following disadvantages based on the configuration scheme of the single computing device:
  • the warranty life of the computing device is 3 years.
  • the computing unit of the computing device should have a service life of 3 years, while the storage component has a warranty period of 5 years or longer.
  • the warranty period of the storage component is not fully utilized, but is prematurely eliminated, resulting in storage components. Part of the warranty time is wasted, which ultimately leads to a waste of costs.
  • the embodiment of the present application provides a simplified computing device and a simplified management method and system for the computing device storage component.
  • the technical solution of the present application is further described in detail below with reference to the accompanying drawings.
  • FIG. 2 is a block diagram of a computing device including a computing component 20 and an interface 21 coupled to a SAS SWITCH, in accordance with an embodiment of the present application.
  • the computing component 20 of the computing device is still disposed within the computing device, i.e., the computing component 20 is arranged in the same manner as in the prior art.
  • the computing component 20 is an I/O (input/output) module that retains memory and CPU (or GPU) computing functions.
  • the computing component 20 is connectable to the SAS SWITCH via the interface 21, which is simultaneously connected to the storage component, ie the storage component and the computing component 20 are simultaneously connected to the SAS SWITCH.
  • the SAS SWITCH is an existing IP switch applying the SAS protocol.
  • the computing component 20 is assigned a storage component coupled to the SAS SWITCH based on an allocation rule by a management node or hypervisor.
  • the embodiment of the present application is to implement flexible allocation of the computing device storage component and the computing component 20, and to enable the storage component with different lifecycles and the computing component 20 to implement the best service time, and separate the storage component of the computing device from the computing device. Only the computing component 20 is retained in the computing device. The computing component 20 retained in the computing device and the separated storage components are connected by SAS SWITCH.
  • the separated storage unit is connected to the SAS SWITCH through a unified interface in a storage array manner.
  • One embodiment of the SAS SWITCH has a management interface, and the management node can be connected to the management node, and the management node can implement the connection based on the distribution rule.
  • the storage component of the SAS SWITCH and the computing component 20 are matched, that is, the storage component connected to the SAS SWITCH is assigned a storage component connected to the SAS SWITCH.
  • Another embodiment is to install a hypervisor in one of the computing devices connected to the SAS SWITCH, which can allocate storage components for computing components connected to the SAS SWITCH based on the allocation rules.
  • the storage component 23 When the storage component 23 is allocated to the computing component 20, it may be allocated according to the actual needs of the computing component 20, that is, the number of storage components allocated for the computing component 20 varies with the actual needs of the computing component, wherein the computing component 20 The number of connected storage components can be proportional to the requirements of computing component 20.
  • embodiments of the present application enable flexible allocation of storage components and computing components 20 of computing devices.
  • At least one storage array and computing component 20 can be simultaneously connected to the SAS SWITCH, and the storage array At least one storage component is included, so when the storage component is allocated to the computing component 20, the storage components located in different storage arrays can be allocated to the same computing component 20, that is, the storage components connected to the same computing device can be located in multiple In the storage array, to ensure that the impact on the computing device computing component 20 is reduced in the event of a failure of one of the storage arrays.
  • there are 20 storage arrays connected to a SAS SWITCH each storage array includes 40 storage components, and 5 computing components connected to the SAS SWITCH are allocated for each of the 5 computing components. When the component is stored, one-fifth of the storage components selected from the 20 storage arrays are allocated to one computing component.
  • the computing device of the embodiment of the present application separates the storage component and connects with the computing component by using the SAS SWITCH, so that the storage component is separated from the computing component life cycle, so that the computing component reaches the specified service time limit (ie, the quality assurance is achieved). Period) does not affect the operation of the storage component, that is, the calculation component connected to the storage component can be replaced at this time, so as to maximize the utilization of the storage component and the calculation component of different life cycles, thereby avoiding the storage component being eliminated in advance The resulting resources are wasted.
  • the embodiment of the present application further provides a management method for a storage device of a computing device, as shown in FIG. 3 is an operation flowchart of the method, where the method mainly includes the following steps:
  • An application scenario of the embodiment of the present application may be: the computing device is the same as the computing device described in the foregoing embodiment, that is, the computing device includes a computing component, and the computing component is connected to the SAS SWITCH, the SAS SWITCH is also connected to the storage unit.
  • the method of this embodiment is for implementing a storage component for a computing component of the computing device.
  • the storage component of the embodiment of the present application is located in the storage array, and is connected to the SAS SWITCH by a unified interface of the storage array, that is, the storage array includes at least one storage component.
  • the storage component information acquired in step S320 includes: the number of storage components connected to the SAS SWITCH and the storage capacity, and may also include a state in which the storage component is used by the computing component and/or a service time limit of the storage component.
  • the obtained computing component information of the computing device includes at least: a number of computing components connected to the SAS SWITCH, and may also include a computing time limit of the computing component.
  • the storage component information and the calculation component information can be obtained by:
  • SAS SWITCH scan each interface to get the internal signal of the cable connected to each interface of SAS SWITCH to identify whether it is connected to the interface as a computing component or a storage component (or storage array). That is, by scanning the chip connected to each interface of the SAS SWITCH, it is known whether the computing component or the storage component (or storage array) is connected to each interface of the SAS SWITCH. For example, if the chip of an interface of the SAS SWITCH is scanned, the port number of the interface is obtained, indicating that the interface is connected to the interface; if the chip of the interface of the SAS SWITCH is scanned, the controller signal is obtained, Indicates that the storage unit (or storage array) is connected to the interface.
  • the storage unit information connected to the SAS SWITCH and the calculation part information are obtained in the case where all SAS SWITCH interfaces are scanned.
  • the storage component information and the calculation component information can be obtained by acquiring the scan result of the SAS SWITCH.
  • the execution timing of this step S310 can be during or after the SAS SWITCH startup process.
  • a scan operation is performed to obtain storage part information and calculation part information connected to the SAS SWITCH. It is also possible to set the SAS SWITCH to perform the scanning operation periodically after startup, so as to timely acquire the storage component information and calculate the change of the component information, for example, the change of the usage state of the storage component by the computing component.
  • the purpose of obtaining the storage component information connected to the SAS SWITCH and the computing component information of the computing device is that the storage component can be allocated to the computing component based on the obtained storage component information and the computing component information based on the allocation rule.
  • Step S320 is to allocate a storage component to the computing component based on the obtained storage component information and the computing component information based on the allocation rule.
  • the storage component is allocated to the computing component based on the number of the obtained storage component and the number of the calculated component based on the allocation rule. It is also possible to assign a storage component to the computing component based on the acquired storage component capacity and the number of calculated components.
  • the storage component is adjusted to the storage component allocated by the computing component according to the state of the obtained storage component used by the computing component, and the storage component according to the obtained storage component.
  • the service time limit and/or the service time limit of the computing component adjusts the distribution relationship between the storage component and the computing component.
  • the storage component may be allocated to the computing component according to the configured allocation rule when the storage component is initially allocated to the computing component, and the distribution rule may include:
  • a storage unit other than the specified number of storage units is allocated to the computing unit while retaining the specified number of storage units in an unassigned state.
  • the case of retaining the specified number of storage components includes: retaining a specified number of storage components in each storage array in an unallocated state, that is, retaining the storage array middle finger
  • a fixed number of storage components are not assigned to any computing component, and the reserved specified number of storage components can be used as a backup of the failed storage component, ie, assigned to the computing component, in the event of a storage component failure assigned to the computing component.
  • the reserved number of storage components are used in place of the failed storage component to ensure that the normal operation of the computing component connected to the failed storage component is not affected.
  • the number of storage components reserved in each storage array may be one or two, or the same number of storage components assigned to any of the computing components. Additionally, the number of specified number of storage components retained in each storage array may be the same or different. It will be appreciated that for ease of management, the number of specified number of storage components retained in each storage array is typically set to be the same.
  • the other storage components are all allocated as much as possible to achieve full utilization of the storage components. For example, if one of the storage arrays connected to the SAS SWITCH includes 60 storage components, and it is determined that the specified number of storage components to be retained are in the unallocated state, the remaining 58 storage components are all allocated.
  • the computing component that is connected to the SAS SWITCH includes 60 storage components, and it is determined that the specified number of storage components to be retained are in the unallocated state, the remaining 58 storage components are all allocated.
  • the computing component that is connected to the SAS SWITCH is connected to the SAS SWITCH.
  • the number of storage components included in the storage array connected to the SAS SWITCH can be set to be the same, and the number of storage components initially allocated for each computing component according to the distribution rule can also be set to be the same. It can be understood that the embodiment of the present application also supports the case where the number of storage components included in each storage array connected to the SAS SWITCH is different, and the case where a different number of storage components are initially allocated for each computing component.
  • the allocation rule may further include:
  • the storage components in the storage array are divided into the same number of copies as the number of computing components assigned to the computing components.
  • a storage array a plurality of storage components it contains are divided into the same number of copies as the number of computing components. Since in general, the number of storage arrays connected to the SAS SWITCH and the number of storage components included in the storage array will be greater than the number of computing components connected to the SAS SWITCH, that is, the storage components connected to the SAS SWITCH are sufficient to be allocated to the connection. The computational component on SAS SWITCH, therefore, does not appear to have fewer storage components in the storage array than the number of computational components connected to SAS SWITCH.
  • each storage array contains 60 storage components, and 10 computing units connected to the SAS SWITCH, according to this rule, 60 in each storage array.
  • the storage components are divided into 10 parts, each of which is assigned to one computing unit.
  • the storage in the storage array can be The storage unit is equally divided into the same number of parts as the number of calculation parts, and can also be randomly divided into the same number of parts as the number of calculation parts. That is, the 60 storage units in the storage array can be equally divided into 10 parts, and each storage unit includes 6 storage units, that is, 6 storage units assigned to each calculation unit. Or randomly divide 60 storage components in the storage array into 10 shares, and the number of storage components included in each share may be unequal, for example, may be 5, 10, 6, or 8, etc., and the randomly divided storage may be Parts are assigned to the calculation part.
  • the allocation rule does not conflict with the above-mentioned allocation rule, that is, the storage unit in the storage array can be divided into the same number of copies as the number of computing units, if the specified number of storage units in the reserved storage array are in an unallocated state.
  • the storage components in each storage array can be divided into 11 parts, one of which is reserved and not allocated to any computing part, and the remaining 10 parts are allocated to the computing part; of course, the allocation rule can also be Alone.
  • the rule is to divide the storage components in each storage array into the same number of copies as the computing components, so that the allocation rule can allocate the storage components in the same storage array to the connected components.
  • Each computing component on the SAS SWITCH that is, the storage component assigned to the same computing component, is located in multiple storage arrays. Thus, in the case of a storage array failure, only a small portion of the operation of a computing component is affected, that is, the impact of the storage component failure on the computing component is effectively reduced.
  • the allocation rule can also be:
  • All storage components connected to the SAS SWITCH are allocated to the computing component in the same number of copies as the number of computing components.
  • the allocation rule is to assign the storage units in all the storage arrays connected to the SAS SWITCH to the computing unit by the same number of copies as the number of computing units connected to the SAS SWITCH.
  • the allocation rule differs from the previous allocation rule in that it is possible for storage components in each storage array to be assigned to the same computing component.
  • the 1200 storage components are divided into 10 shares and assigned to 10 computing components.
  • the storage unit may be equally divided into the same number of copies as the number of calculated parts, or may be randomly divided into the same number of parts as the number of calculated parts. According to the present rule, in the case of dividing 1200 storage parts into 10 shares, each of which includes 120 storage parts, 120 storage parts are assigned to one calculation part.
  • the 120 storage elements may be selected in the order of the interface.
  • the interface number of the SAS SWITCH connected to the computing component is the interface. 21 to interface 31. Then, 60 storage components are selected from the interface 1, and then 60 storage components are selected from the interface 2 and assigned to the interface connected to the interface 21.
  • the computing component; the 120 storage components acquired from interface 3 and interface 4 are assigned to computing components connected to interface 22, and so on.
  • the allocation rule can also be:
  • a fixed capacity storage unit is assigned to each computing component.
  • each storage component has a storage capacity of 500G
  • 10 computing components are connected to the SAS SWITCH.
  • the rule stipulates that each computing component is allocated 1000G storage components. Then, a storage component of 1000 G randomly selected from the 20 storage components can be allocated to the computing component, and the allocated storage components are guaranteed not to collide.
  • the selection method can be selected according to the interface order as described in the above rules. Of course, other methods can be used to select the storage components for the computing component, and only need to ensure that the capacity of the storage component allocated for each computing component is equal to the fixed rule. Capacity is fine.
  • the specific representation of the storage component for the computing component is that the computing component can access and use the storage component assigned to it. That is, the process of allocating storage components for computing components is the process of opening the access and usage rights of the corresponding storage components for the computing components, and also the process of setting the matching relationship between the computing components and the storage components. After the storage component is allocated to the computing component according to the distribution rule, the matching relationship between the computing component and the storage component can be recorded. Since each interface of the SAS SWITCH has an address (or number), and each interface is connected to a computing component or a storage array, the address of the computing component or storage array connected to the interface is the address of the interface of the SAS SWITCH.
  • the storage array contains a plurality of storage components having a unique address (or number) for each storage component in the storage array. Then, after the storage component is allocated to the computing component, the correspondence between the computing component and the address of the allocated storage component can be recorded when the matching relationship between the computing component and the storage component is recorded. For example, the interface 1 of the SAS SWITCH is connected to the storage array.
  • the address of the interface 1 is assumed to be address 1, and the address of the storage array 1 is the address 1; the interface 5 is connected to the computing component, and the address of the interface 5 is assumed to be the address 5, then
  • the address of the computing component is the address 5; if the storage component X in the storage array is assigned to the computing component, and the address of the storage component X is the address X, the matching relationship between the computing component and the storage component is: address 5 Corresponding to the address X in address 1, that is, if the computing component wants to access the storage component X allocated to it, it needs to find the address of the storage array where the storage component X is located, that is, the address 1, and then find the address X in the address 1 to find the address X.
  • the storage component allocated for the computing component may be located in multiple storage arrays, when the computing component and the storage component are matched, the same computing component can be simultaneously Storage component matching.
  • the embodiment of the present application may save the matching relationship between the computing component and the storage component by matching the relationship table.
  • the matching relationship between the calculation part and the storage part recorded in the matching relation table.
  • the matching relationship table can be synchronously updated when the matching relationship between the computing component and the storage component changes.
  • the initial allocation process is the process of allocating storage components for the computing component for the first time after the SAS SWITCH is started, and after the initial allocation is completed, the matching relationship table is generated, and the subsequent possible The matching relationship is adjusted, and each time the matching relationship is adjusted, the matching relationship table is updated correspondingly.
  • the storage component information and the calculation component information may be obtained periodically, and the matching relationship needs to be adjusted when the acquired storage component information and the calculation component information change.
  • the specific scenarios include:
  • the acquired storage component information includes a state in which the storage component is used by the computing component; the storage component assigned to the computing component is adjusted according to a state in which the storage component is used by the computing component.
  • the embodiment of the present application acquires the state in which the storage component allocated to the computing component is used by the computing component in real time or periodically after initially allocating the storage component for the computing component.
  • the use state is whether the storage capacity of the storage unit is used, the ratio of use, and the like.
  • the purpose of obtaining the usage state is to adjust the storage component allocated to the computing component according to the usage state, and mainly adjust the number of storage components allocated to the computing component.
  • Specific adjustment methods include:
  • the storage component allocated to the computing component is reduced, that is, a part of the unused storage component is removed from the storage component allocated to the computing component or All. For example, if the usage rate of the calculated component of the storage component is less than 60%, the unused 40% of the storage component is canceled, or 30% of the unused storage component is canceled, and the like. or,
  • the storage component allocated for the computing component is added. For example, if the usage rate of the calculated component of the storage component is higher than 95%, then the computing component is assigned one or two storage components, etc., or all or part of the storage component that is canceled from other computing components is assigned to The calculation unit whose usage rate is higher than the upper limit value.
  • the usage rate of the obtained storage component by the calculation component may be an average usage rate within a predetermined time length range.
  • the scenario in which the matching relationship needs to be adjusted further includes:
  • the acquired storage component information includes a service time limit of the storage component
  • the obtained information of the computing component includes a service time limit of the computing component, a service time limit of the storage component, and a service time limit of the computing component are reached respectively, and a predetermined time limit is reached;
  • the service time limit reaches the specified time limit and/or the service time limit of the computing component reaches the specified time limit.
  • providing a prompt to replace the storage component or the computing component that reaches the specified time limit so that the worker adjusts according to the replacement result according to the storage component or the computing component according to the prompt according to the specified time limit.
  • the matching relationship in particular, providing a prompt to replace the storage component or the computing component that reaches the specified time limit, so that the worker adjusts according to the replacement result according to the storage component or the computing component according to the prompt according to the specified time limit.
  • the above-described operation of determining the storage means and calculating whether or not the time limit of the component reaches the respective predetermined time limit can be periodically performed.
  • the warranty time is reached when the specified time limit is reached.
  • the predetermined time limit of each computing component and the storage component is saved, and whether the current time is a specified time limit of the saved computing component and the storage component, and if the predetermined time limit of any of the computing components is specified, the computing component is prompted to be replaced. .
  • a new computing component can be created to replace the computing component that reaches the specified time limit without affecting the use of the storage component, ie the storage component can continue to be used. Both the computing component and the storage component are utilized to the maximum extent.
  • the embodiment of the present application further provides a management system for a computing device storage component, as shown in FIG. 4 , which is a schematic structural diagram of the system.
  • the system mainly includes the following: a management node 40, at least one SAS SWITCH 41 connected to the management node 40, At least one storage component 420 coupled to the SAS SWITCH 41 and computing component 43 of at least one computing device.
  • storage component 420 can be located in storage array 42, with at least one storage component 420 included in each storage array.
  • the storage array 42 is a combination of storage components 420 separated from the computing device, and each storage array 42 is connected to the SAS SWITCH 41 through a unified interface.
  • the computing component 43 is an I/O (input/output) module that retains memory and CPU (or GPU) computing functions in the computing device, which is internal to the computing device.
  • I/O input/output
  • the storage array 42 and the computing component 43 can be located in the same cabinet, that is, the storage array 42 and the computing component 43 are included in the same cabinet.
  • the storage arrays 42 can be separately disposed in one cabinet, and the computing component 43 can be disposed in another cabinet. This arrangement is more advantageous for storing different storages in different storage arrays 42.
  • Component 420 is assigned to the same computing component 43, providing the benefit of redundancy.
  • the storage array 42 and the computing component 43 are connected by a SAS SWITCH 41, which is an existing IP switch applying the SAS protocol.
  • the SAS SWITCH 41 has a plurality of interfaces including a management interface for connecting to the management node 40, and a plurality of interfaces connecting the storage array 42 and the computing component 43.
  • the SAS SWITCH 41 can obtain the storage array 42 or the computing component 43 connected to each interface by scanning each interface after booting, thereby obtaining the number of storage arrays 42 connected to the SAS SWITCH 41 and the number of computing components 43 for connecting the storage array 42.
  • the interface can be further scanned to obtain the number of storage components 420 included in the storage array 42 and the storage space size of each storage component 420.
  • the management node 40 is configured to allocate, to the computing component 43, a storage component 42 connected to the computing component 43 on the SAS SWITCH 41. It should be noted that at least one SAS SWITCH 41 is connected to one management node 40 in the management system. In FIG. 4, only the management node 40 is connected to one SAS SWITCH 41 as an example. That is, the same management node 40 can simultaneously manage a plurality of storage arrays 42 and computing components 43 connected by SAS SWITCH 41.
  • the management node 40 is connected to the SAS SWITCH 41 through a management interface provided by the SAS SWITCH41.
  • the number of computing components 43 and storage arrays 42 connected to the plurality of SAS SWITCHs 41 connected to the same management node 40 may be the same or different. In the case where the number of connections is the same, the same management policy (i.e., the policy of allocating the storage unit 420 to the computing component 43) may be employed for the plurality of SAS SWITCHs 41 for ease of management.
  • the management node 40 may be an existing computing device, or may be one of the computing devices connected to the SAS SWITCH 41 provided by the embodiment of the present application, or a management program in the computing device.
  • the management node 40 of the embodiment of the present application includes two management devices (dual management nodes) that are mutually active and standby, and two management devices that are mutually active and standby can synchronize the heartbeat through a USB (Universal Serial Bus) signal.
  • the two management devices that are mutually active and standby synchronize the saved information in real time or periodically to ensure that the standby management device can replace the management of the failed management device in the event of a failure of the primary management device.
  • the storage component included in each management device may be set to be a primary-standby relationship, that is, the storage components included in each management device are divided into two groups, one of which is a group.
  • a primary storage component another set serves as a backup of the primary storage component to ensure that in the event of a primary storage component failure, the alternate storage component can replace the primary storage component's operation to reduce the impact of the management node failure on the operation of the computing device.
  • the reliability of the system is ensured by the above-mentioned management devices that are active and standby, and the storage components that are active and standby in the management device. This reduces the probability that the computing device cannot work normally due to the failure of the management device.
  • the system described in this embodiment can control the power-on sequence of each device.
  • the power-on sequence diagram of each device is as shown in FIG. 5.
  • the management node 40 is powered on first in the system. That is, the bypass power supply is powered.
  • the main power is ready, and the main power is the power of other devices in the system except the management node 40.
  • the management node 40 controls the power-on sequence of other devices. That is, after the power-on startup of the management node 40 is completed, the power-on sequence of other devices including the SAS SWITCH 41, the storage array 42, and the computing component 43 can be controlled.
  • the power-on sequence of the other devices can be:
  • the SAS SWITCH 41 is controlled to power up first, after which the storage array 42 and the computing component 43 can be powered up simultaneously.
  • the storage array 42 is powered on to power up the storage component 420.
  • the work of allocating the storage unit 42 to the computing component 43 of the computing device can be performed only after the management node 40 is powered on. If the power-on sequence of the foregoing devices is disordered, the storage array 42 and the computing component 43 are powered on before the SAS SWITCH 41, or before the management node is powered on, or the SAS SWITCH 41 is powered on before the management node 40, and the devices after the power-on are required. Waiting for the management node 40 to be powered on after the boot is completed, this process will increase the power consumption of the system. It can be seen that the power-on sequence of each device provided in this embodiment of the present application can effectively reduce power consumption.
  • the specific representation of the management node 40 assigning the storage component 420 to the computing component 43 is that the computing component 43 can access and use the storage component 420 assigned thereto. That is, the process in which the management node 40 allocates the storage component 420 to the computing component 43 is the process of opening the access and usage rights of the corresponding storage component 420 for the computing component 43, and also the process of setting the matching relationship between the computing component 43 and the storage component 420. After the storage unit 420 is allocated to the calculation unit 43 in accordance with the distribution rule, the matching relationship between the calculation unit 43 and the storage unit 420 can be recorded.
  • each interface of the SAS SWITCH 41 has an address (or number), and each interface is connected to the computing component 43 or the storage array 42, the address of the computing component 43 or the storage array 42 connected to the interface is the interface of the SAS SWITCH. the address of. If the interface to a SAS SWITCH 41 is connected to a storage array 42, the storage array 42 includes a plurality of storage components 420 having a unique address (or number) for each storage component 420 in the storage array. Then, after the storage unit 420 is allocated to the calculation unit 43, the correspondence relationship between the calculation unit 43 and the address of the allocated storage unit 420 can be recorded when the matching relationship between the calculation unit 43 and the storage unit 420 is recorded.
  • management node 40 is configured to allocate the storage component 42 to the computing component 43 as:
  • the storage unit 420 other than the specified number of storage units 420 is assigned to the calculation unit 43 while retaining the specified number of storage units 420 in an unallocated state.
  • the reserved number of storage components 420 is reserved for each storage array 42 to reserve a specified number of storage components 420 in an unallocated state, ie, the specified number of reserved storage arrays 42
  • the storage component 420 is not assigned to any of the computing components 43.
  • the reserved number of reserved storage components 420 can be used as a backup for the failed storage component in the event of a storage component failure assigned to computing component 43. That is, the management node 40 is configured to replace the failed storage component 420 with the specified number of storage components 420 retained in the event of a failure of the storage component 420 assigned to the computing component 43 to ensure that the failure is not affected.
  • the number of storage components 420 reserved in each storage array 42 may be one or two, or the same number as the storage components 420 assigned to any of the computing components. Additionally, the number of designated number of storage components 420 remaining in each storage array 42 may be the same or different. It will be appreciated that the number of designated number of storage components 420 retained in each storage array is typically set to be the same for ease of management.
  • the management node 40 may also be configured to: when allocating the storage component 42 to the computing component 43:
  • the plurality of storage sections 420 in the storage array 42 are divided into the same number of copies as the calculation section 43 and assigned to the calculation section 43.
  • a plurality of storage sections 420 contained therein are divided into the same number of copies as the number of calculation sections 43 and assigned to the calculation section 43. Since in general, the number of storage arrays connected to the SAS SWITCH and the number of storage components included in the storage array will be greater than the number of computing components connected to the SAS SWITCH, that is, the storage components connected to the SAS SWITCH are sufficient to be allocated to the connection. The computational component on SAS SWITCH, therefore, does not appear to have fewer storage components in the storage array than the number of computational components connected to SAS SWITCH.
  • each storage array is configured according to the present rule.
  • the 60 storage units 420 of 42 are divided into 10 shares, one for each of which is assigned to one calculation unit 43.
  • the storage unit 420 in the storage array 42 may be equally divided into the same number of copies as the number of the calculation unit 43, or may be randomly divided into the same number of copies as the number of calculation units. That is, the 60 storage units 420 in the storage array 42 can be equally divided into 10 shares, and each storage unit includes six storage units, that is, six storage units for each calculation unit.
  • the subsequent storage unit 420 is assigned to the calculation unit 43.
  • the rule is that the storage unit 420 in each storage array 42 is divided into the same number of copies as the calculation unit 43 and assigned to the calculation unit 43, so that the allocation rule can assign the storage unit 420 in the same storage array to the connection unit.
  • Each computing component 43 on the SAS SWITCH 41 thus, in the event of a failure of the storage array 42, affects only a small portion of the operation of the computing component 430, i.e., effectively reduces the impact of the storage component 420 failure on the computing component.
  • the management node 40 can also be configured to:
  • the state in which the storage unit 420 is used by the calculation unit 43 is acquired; the storage unit 420 assigned to the calculation unit 43 is adjusted in accordance with the state in which the storage unit 420 is used by the calculation unit 43.
  • the embodiment of the present application acquires the state in which the storage component allocated to the computing component is used by the computing component in real time or periodically after initially allocating the storage component for the computing component.
  • the use state is whether the storage space of the storage component is used, the ratio of use, and the like.
  • the purpose of obtaining the state in which the storage component is used by the computing component is that the number of storage components allocated to the computing component can be adjusted according to the state of the storage component computing component, including:
  • the storage component allocated to the computing component is reduced, that is, part or all of the unoccupied storage component is removed from the storage component allocated to the computing component . For example, if the acquired storage component is used by the computing component below 60%, the unused 40% storage component is cancelled, or 30% of the unused storage component is cancelled, and the like. or,
  • the storage component allocated for the computing component is added. For example, if the acquired storage component is used by the computing component higher than 95%, then the computing component is assigned one or two storage components, etc., or all or part of the storage components that are canceled from other computing components are allocated for use. The rate is higher than the calculated part of the upper limit.
  • the usage rate of the obtained storage component may be an average occupancy rate within a predetermined length of time.
  • the management node 40 can also be configured to:
  • the determining operation may be performed periodically, and it is determined whether the storage time limit of the storage component and the computing component reaches a predetermined time limit, that is, whether the storage component and the computing component meet the warranty time.
  • the predetermined time limit of each computing component and the storage component is saved, and whether the current time is a specified time limit of the saved computing component and the storage component is determined. If the predetermined time limit of any of the computing component or the storage component is reached, the prompting is replaced.
  • the computing component or storage component For example, where the service time limit of one of the computing components reaches a specified time limit, a new computing component can be created to replace the computing component that reaches the specified time limit without affecting the use of the storage component, ie, the storage component can continue to be used. Both the computing component and the storage component are utilized to the maximum extent.
  • the embodiment of the present application separates the storage component in the computing device from the computing device, and connects the separated storage component to the computing component in the computing device through SAS SWITCH, which has at least the following advantages:
  • the decoupling between the computing component and the storage component is realized, and the storage component can be allocated according to the needs of the computing component, thereby realizing the refined and flexible allocation of the computing component and the storage component;
  • the present application can be implemented in software and/or a combination of software and hardware, for example, using an application specific integrated circuit (ASIC), a general purpose computer, or any other similar hardware device.
  • the software program of the present application can be executed by a processor to implement the steps or functions described above.
  • the software programs (including related data structures) of the present application can be stored in a computer readable recording medium such as a RAM memory, a magnetic or optical drive or a floppy disk and the like.
  • some of the steps or functions of the present application may be implemented in hardware, for example, as a circuit that cooperates with a processor to perform various steps or functions.
  • a portion of the present application can be applied as a computer program product, such as computer program instructions, which, when executed by a computer, can invoke or provide a method and/or technical solution in accordance with the present application.
  • the program instructions for invoking the method of the present application may be stored in a fixed or removable recording medium, and/or transmitted by a data stream in a broadcast or other signal bearing medium, and/or stored in a The working memory of the computer device in which the program instructions are run.
  • an embodiment in accordance with the present application includes a device including a memory for storing computer program instructions and a processor for executing program instructions, wherein when the computer program instructions are executed by the processor, triggering
  • the apparatus operates based on the aforementioned methods and/or technical solutions in accordance with various embodiments of the present application.

Landscapes

  • Engineering & Computer Science (AREA)
  • Computer Hardware Design (AREA)
  • Theoretical Computer Science (AREA)
  • Physics & Mathematics (AREA)
  • General Engineering & Computer Science (AREA)
  • General Physics & Mathematics (AREA)
  • Memory System (AREA)
  • Hardware Redundancy (AREA)
  • Debugging And Monitoring (AREA)

Abstract

Dispositif de calcul et procédé et système de gestion de composants de mémoire d'un dispositif de calcul, ledit dispositif de calcul comportant: un composant (20) de calcul et une interface (21) reliée à un commutateur SAS; le composant (20) de calcul est relié au commutateur SAS au moyen de l'interface (21), et le commutateur SAS est également relié à un composant de mémoire; selon une règle d'affectation, un composant de mémoire est affecté au composant (20) de calcul relié au commutateur SAS. La séparation des cycles de vie d'un composant de calcul et d'un composant de mémoire contribue à réaliser l'utilisation la plus large de composants présentant des cycles de vie différents et, en découplant les composants de calcul et les composants de mémoire, il est possible d'affecter de manière plus fine et plus souple les composants de calcul et les composants de mémoire.
PCT/CN2016/101735 2015-10-19 2016-10-11 Dispositif de calcul et procédé et système de gestion de composants de mémoire d'un dispositif de calcul WO2017067402A1 (fr)

Applications Claiming Priority (2)

Application Number Priority Date Filing Date Title
CN201510681163.5 2015-10-19
CN201510681163.5A CN106598908B (zh) 2015-10-19 2015-10-19 计算设备及计算设备存储部件的管理方法及系统

Publications (1)

Publication Number Publication Date
WO2017067402A1 true WO2017067402A1 (fr) 2017-04-27

Family

ID=58555196

Family Applications (1)

Application Number Title Priority Date Filing Date
PCT/CN2016/101735 WO2017067402A1 (fr) 2015-10-19 2016-10-11 Dispositif de calcul et procédé et système de gestion de composants de mémoire d'un dispositif de calcul

Country Status (2)

Country Link
CN (1) CN106598908B (fr)
WO (1) WO2017067402A1 (fr)

Families Citing this family (1)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN109960614A (zh) * 2019-03-27 2019-07-02 英业达科技有限公司 服务器系统与管理方法

Citations (5)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN103092786A (zh) * 2013-02-25 2013-05-08 浪潮(北京)电子信息产业有限公司 一种双控双活存储控制系统及方法
CN103152397A (zh) * 2013-02-06 2013-06-12 浪潮电子信息产业股份有限公司 一种多控存储系统设计方法
CN104486384A (zh) * 2014-11-28 2015-04-01 华为技术有限公司 一种存储系统及交换扩展装置
CN104657316A (zh) * 2015-03-06 2015-05-27 北京百度网讯科技有限公司 服务器
CN104967577A (zh) * 2015-06-25 2015-10-07 北京百度网讯科技有限公司 Sas交换机和服务器

Family Cites Families (3)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
US6977927B1 (en) * 2000-09-18 2005-12-20 Hewlett-Packard Development Company, L.P. Method and system of allocating storage resources in a storage area network
JP2005038071A (ja) * 2003-07-17 2005-02-10 Hitachi Ltd ストレージの容量を最適化する管理方法
CN104461824A (zh) * 2014-12-01 2015-03-25 北京同有飞骥科技股份有限公司 一种磁盘健康信息优化管理方法和装置

Patent Citations (5)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN103152397A (zh) * 2013-02-06 2013-06-12 浪潮电子信息产业股份有限公司 一种多控存储系统设计方法
CN103092786A (zh) * 2013-02-25 2013-05-08 浪潮(北京)电子信息产业有限公司 一种双控双活存储控制系统及方法
CN104486384A (zh) * 2014-11-28 2015-04-01 华为技术有限公司 一种存储系统及交换扩展装置
CN104657316A (zh) * 2015-03-06 2015-05-27 北京百度网讯科技有限公司 服务器
CN104967577A (zh) * 2015-06-25 2015-10-07 北京百度网讯科技有限公司 Sas交换机和服务器

Also Published As

Publication number Publication date
CN106598908A (zh) 2017-04-26
CN106598908B (zh) 2020-09-08

Similar Documents

Publication Publication Date Title
US11023143B2 (en) Node interconnection apparatus, resource control node, and server system
JP5068056B2 (ja) 障害回復方法、計算機システム及び管理サーバ
US9411646B2 (en) Booting secondary processors in multicore system using kernel images stored in private memory segments
US9110717B2 (en) Managing use of lease resources allocated on fallover in a high availability computing environment
JP4842210B2 (ja) フェイルオーバ方法、計算機システム、管理サーバ及び予備サーバの設定方法
JP5352132B2 (ja) 計算機システム及びそのi/o構成変更方法
US20120272243A1 (en) Protecting high priority workloads in a virtualized datacenter
JP2016526735A (ja) 仮想ハドゥープマネジャ
US8677374B2 (en) Resource management in a virtualized environment
JP2009075718A (ja) 仮想i/oパスの管理方法、情報処理システム及びプログラム
US10331581B2 (en) Virtual channel and resource assignment
JP6708928B2 (ja) ストレージ管理装置、ストレージ管理プログラム、およびストレージシステム
WO2017067402A1 (fr) Dispositif de calcul et procédé et système de gestion de composants de mémoire d'un dispositif de calcul
JP2007286975A (ja) 計算機システム及びストレージシステム並びにボリューム割り当て方法
TWI733744B (zh) 計算設備及計算設備儲存部件的管理方法及系統
JP5266347B2 (ja) 引継方法、計算機システム及び管理サーバ
JP6774147B2 (ja) 制御装置
US20150082311A1 (en) Method and system for logging into a virtual environment executing on a host
JP2010026828A (ja) 仮想計算機の制御方法
JP5970846B2 (ja) 計算機システム及び計算機システムの制御方法
US20230118846A1 (en) Systems and methods to reserve resources for workloads
US9778948B2 (en) Information processing apparatus, computer readable storage medium, and collecting method

Legal Events

Date Code Title Description
121 Ep: the epo has been informed by wipo that ep was designated in this application

Ref document number: 16856835

Country of ref document: EP

Kind code of ref document: A1

NENP Non-entry into the national phase

Ref country code: DE

122 Ep: pct application non-entry in european phase

Ref document number: 16856835

Country of ref document: EP

Kind code of ref document: A1