CN113867648B - Server storage subsystem and control method thereof - Google Patents

Server storage subsystem and control method thereof Download PDF

Info

Publication number
CN113867648B
CN113867648B CN202111199036.3A CN202111199036A CN113867648B CN 113867648 B CN113867648 B CN 113867648B CN 202111199036 A CN202111199036 A CN 202111199036A CN 113867648 B CN113867648 B CN 113867648B
Authority
CN
China
Prior art keywords
raid controller
power supply
raid
storage subsystem
server storage
Prior art date
Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
Active
Application number
CN202111199036.3A
Other languages
Chinese (zh)
Other versions
CN113867648A (en
Inventor
义日贵
Current Assignee (The listed assignees may be inaccurate. Google has not performed a legal analysis and makes no representation or warranty as to the accuracy of the list.)
Inspur Power Commercial Systems Co Ltd
Original Assignee
Inspur Power Commercial Systems Co Ltd
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
Filing date
Publication date
Application filed by Inspur Power Commercial Systems Co Ltd filed Critical Inspur Power Commercial Systems Co Ltd
Priority to CN202111199036.3A priority Critical patent/CN113867648B/en
Publication of CN113867648A publication Critical patent/CN113867648A/en
Application granted granted Critical
Publication of CN113867648B publication Critical patent/CN113867648B/en
Active legal-status Critical Current
Anticipated expiration legal-status Critical

Links

Classifications

    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06FELECTRIC DIGITAL DATA PROCESSING
    • G06F3/00Input arrangements for transferring data to be processed into a form capable of being handled by the computer; Output arrangements for transferring data from processing unit to output unit, e.g. interface arrangements
    • G06F3/06Digital input from, or digital output to, record carriers, e.g. RAID, emulated record carriers or networked record carriers
    • G06F3/0601Interfaces specially adapted for storage systems
    • G06F3/0602Interfaces specially adapted for storage systems specifically adapted to achieve a particular effect
    • G06F3/0614Improving the reliability of storage systems
    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06FELECTRIC DIGITAL DATA PROCESSING
    • G06F11/00Error detection; Error correction; Monitoring
    • G06F11/07Responding to the occurrence of a fault, e.g. fault tolerance
    • G06F11/14Error detection or correction of the data by redundancy in operation
    • G06F11/1402Saving, restoring, recovering or retrying
    • G06F11/1446Point-in-time backing up or restoration of persistent data
    • G06F11/1448Management of the data involved in backup or backup restore
    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06FELECTRIC DIGITAL DATA PROCESSING
    • G06F3/00Input arrangements for transferring data to be processed into a form capable of being handled by the computer; Output arrangements for transferring data from processing unit to output unit, e.g. interface arrangements
    • G06F3/06Digital input from, or digital output to, record carriers, e.g. RAID, emulated record carriers or networked record carriers
    • G06F3/0601Interfaces specially adapted for storage systems
    • G06F3/0628Interfaces specially adapted for storage systems making use of a particular technique
    • G06F3/0653Monitoring storage devices or systems
    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06FELECTRIC DIGITAL DATA PROCESSING
    • G06F3/00Input arrangements for transferring data to be processed into a form capable of being handled by the computer; Output arrangements for transferring data from processing unit to output unit, e.g. interface arrangements
    • G06F3/06Digital input from, or digital output to, record carriers, e.g. RAID, emulated record carriers or networked record carriers
    • G06F3/0601Interfaces specially adapted for storage systems
    • G06F3/0628Interfaces specially adapted for storage systems making use of a particular technique
    • G06F3/0655Vertical data movement, i.e. input-output transfer; data movement between one or more hosts and one or more storage devices
    • G06F3/0658Controller construction arrangements
    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06FELECTRIC DIGITAL DATA PROCESSING
    • G06F3/00Input arrangements for transferring data to be processed into a form capable of being handled by the computer; Output arrangements for transferring data from processing unit to output unit, e.g. interface arrangements
    • G06F3/06Digital input from, or digital output to, record carriers, e.g. RAID, emulated record carriers or networked record carriers
    • G06F3/0601Interfaces specially adapted for storage systems
    • G06F3/0668Interfaces specially adapted for storage systems adopting a particular infrastructure
    • G06F3/0671In-line storage system
    • G06F3/0683Plurality of storage devices
    • G06F3/0689Disk arrays, e.g. RAID, JBOD
    • YGENERAL TAGGING OF NEW TECHNOLOGICAL DEVELOPMENTS; GENERAL TAGGING OF CROSS-SECTIONAL TECHNOLOGIES SPANNING OVER SEVERAL SECTIONS OF THE IPC; TECHNICAL SUBJECTS COVERED BY FORMER USPC CROSS-REFERENCE ART COLLECTIONS [XRACs] AND DIGESTS
    • Y02TECHNOLOGIES OR APPLICATIONS FOR MITIGATION OR ADAPTATION AGAINST CLIMATE CHANGE
    • Y02DCLIMATE CHANGE MITIGATION TECHNOLOGIES IN INFORMATION AND COMMUNICATION TECHNOLOGIES [ICT], I.E. INFORMATION AND COMMUNICATION TECHNOLOGIES AIMING AT THE REDUCTION OF THEIR OWN ENERGY USE
    • Y02D10/00Energy efficient computing, e.g. low power processors, power management or thermal management

Landscapes

  • Engineering & Computer Science (AREA)
  • Theoretical Computer Science (AREA)
  • Physics & Mathematics (AREA)
  • General Engineering & Computer Science (AREA)
  • General Physics & Mathematics (AREA)
  • Human Computer Interaction (AREA)
  • Quality & Reliability (AREA)
  • Power Sources (AREA)

Abstract

The application discloses a server storage subsystem, including: the CPU, the first RAID controller, the second RAID controller, the first expansion chip, the second expansion chip and the plurality of hard disk connectors are connected with other elements by arranging the first RAID controller, the second RAID controller, the first expansion chip and the second expansion chip, so that the redundancy of the system is improved, and the single faults of any controller, the expansion chip, the hard disk connectors, the hard disk and even the links do not affect the data transmission between the CPU and the hard disk, thereby ensuring the data safety and meeting the availability requirement of the server for no downtime. Correspondingly, the application also discloses a control method of the server storage subsystem, which has the same technical effects as the server storage subsystem.

Description

Server storage subsystem and control method thereof
Technical Field
The present invention relates to the field of server storage systems, and in particular, to a server storage subsystem and a control method thereof.
Background
In the information age of the current explosive development, various industries have a great deal of data storage requirements, and server users continuously put higher demands on the security of data storage on a server. How to ensure the reliable operation of the server and the reliability and safety of data on the server is a key problem to be solved in the field of the server. The server usually hangs a plurality of hard disks under a RAID (Redundant Array of Independent Disks ) controller to improve the data security level of the server storage subsystem, and when unexpected data happens, the server is ensured to continue to operate normally and the lost data is recovered as much as possible. The RAID technology has a better data protection function when a single hard disk in the server fails, but the design of the server is more and more complex, once other links on a data storage link fail, the safety of data on the server cannot be ensured and the availability requirement of the server for no downtime cannot be met by only hanging a plurality of hard disks through a single RAID.
Therefore, how to provide a solution to the above technical problem is a problem that needs to be solved by those skilled in the art.
Disclosure of Invention
In view of the above, the present invention aims to provide a server storage subsystem with higher redundancy and further improved data security, and a control method thereof. The specific scheme is as follows:
a server storage subsystem, comprising: CPU, first RAID controller, second RAID controller, first extension chip, second extension chip, a plurality of hard disk connectors, wherein:
the first ports of the first RAID controller are connected with the CPU, and the second ports of the first RAID controller are connected with the uplink ports of the first expansion chip and the uplink ports of the second expansion chip;
the first ports of the second RAID controllers are connected with the CPU, and the second ports of the second RAID controllers are connected with the uplink ports of the first expansion chip and the uplink ports of the second expansion chip;
the plurality of downlink ports of the first expansion chip are respectively connected with the first ports of the hard disk connectors one by one, and the plurality of downlink ports of the second expansion chip are respectively connected with the second ports of the hard disk connectors one by one;
the output port of each hard disk connector is used for connecting a hard disk.
Preferably, the server storage subsystem includes one of the CPUs, and the first RAID controller and the second RAID controller are connected to achieve synchronization of operation data.
Preferably, the server storage subsystem includes 2 CPUs interconnected with each other, specifically a first CPU and a second CPU, where the first RAID controller is connected to the first CPU, and the second RAID controller is connected to the second CPU.
Preferably, the first RAID controller and the second RAID controller are connected to achieve synchronization of operation data.
Correspondingly, the application also discloses a control method of the server storage subsystem, which is applied to any one of the server storage subsystems, and comprises the following steps:
monitoring the running states of all RAID controllers; the RAID controller comprises a first RAID controller and a second RAID controller;
if the running state of one RAID controller is abnormal and the running state of the other RAID controller is normal, transferring all task data to the RAID controller with the normal running state so that the RAID controller executes all tasks.
Preferably, before the monitoring of the operation states of all RAID controllers, the method further includes:
acquiring the bit states of the connecting cables of all RAID controllers, the first expansion chip and the second expansion chip;
and determining the data transmission paths of all the hard disk connectors according to the in-place state.
Preferably, the control method further includes:
monitoring a power supply good signal and an on-site signal of a power supply module of the server storage subsystem;
judging whether the operation state of the power supply module is normal or not according to the power supply good signal and the on-site signal;
if not, enabling all RAID controllers to start a power failure protection mechanism.
Preferably, the process of monitoring the power good signal and the in-place signal of the power supply module of the server storage subsystem includes:
monitoring power supply good signals and in-place signals of all power supply modules of the server storage subsystem;
correspondingly, the process of judging whether the operation state of the power supply module is normal according to the power supply good signal and the in-place signal comprises the following steps:
judging whether the operation states of all the power supply modules are normal or not according to the power supply good signals and the on-site signals;
and if the running states of all the power supply modules are abnormal, enabling all the RAID controllers to start a power failure protection mechanism.
Preferably, the process of monitoring the power good signal and the in-place signal of the power supply module of the server storage subsystem includes:
monitoring a power supply good signal and an on-site signal of a power supply module of the server storage subsystem through a CPLD;
the process of judging whether the operation state of the power supply module is normal according to the power supply good signal and the on-site signal comprises the following steps:
judging whether the running state of the power supply module is normal or not according to the power supply good signal and the on-site signal through the CPLD;
if not, enabling all RAID controllers to start a power failure protection mechanism through the CPLD.
The application discloses a server storage subsystem, including: the CPU, the first RAID controller, the second RAID controller, the first expansion chip, the second expansion chip and the plurality of hard disk connectors are connected with other elements by arranging the first RAID controller, the second RAID controller, the first expansion chip and the second expansion chip, so that the redundancy of the system is improved, and the single faults of any controller, the expansion chip, the hard disk connectors, the hard disk and even the links do not affect the data transmission between the CPU and the hard disk, thereby ensuring the data safety and meeting the availability requirement of the server for no downtime.
Drawings
In order to more clearly illustrate the embodiments of the present invention or the technical solutions in the prior art, the drawings that are required to be used in the embodiments or the description of the prior art will be briefly described below, and it is obvious that the drawings in the following description are only embodiments of the present invention, and that other drawings can be obtained according to the provided drawings without inventive effort for a person skilled in the art.
FIG. 1 is a diagram illustrating a structure of a server storage subsystem according to an embodiment of the present invention;
FIG. 2 is a diagram illustrating a structure of a server storage subsystem according to an embodiment of the present invention;
fig. 3 is a flowchart illustrating a control method of a server storage subsystem according to an embodiment of the present invention.
Detailed Description
The following description of the embodiments of the present invention will be made clearly and completely with reference to the accompanying drawings, in which it is apparent that the embodiments described are only some embodiments of the present invention, but not all embodiments. All other embodiments, which can be made by those skilled in the art based on the embodiments of the invention without making any inventive effort, are intended to be within the scope of the invention.
Currently, a server is generally connected with a plurality of hard disks in a hanging manner under a RAID controller to improve the data security level of a server storage subsystem, and when unexpected data happens, the server is ensured to continue to operate normally as much as possible and lost data is recovered as much as possible. The RAID technology has a better data protection function when a single hard disk in the server fails, but the design of the server is more and more complex, once other links on a data storage link fail, the safety of data on the server cannot be ensured and the availability requirement of the server for no downtime cannot be met by only hanging a plurality of hard disks through a single RAID.
The application discloses a server storage subsystem, including: CPU (Central Processing Unit ), first RAID controller, second RAID controller, first expansion chip, second expansion chip, a plurality of hard disk connectors, through setting up the relation of connection of first RAID controller, second RAID controller, first expansion chip and second expansion chip and other components, improved the system redundancy, the single trouble of arbitrary controller, expansion chip, hard disk connector, hard disk even link can not influence the data transmission between CPU and the hard disk to data security has been guaranteed, and the availability demand of server non-downtime has been satisfied.
The embodiment of the invention discloses a server storage subsystem, which is shown in fig. 1 and comprises the following components: CPU, first RAID controller, second RAID controller, first extension chip, second extension chip, a plurality of hard disk connectors, wherein:
the first ports of the first RAID controllers are connected with the CPU, and the second ports of the first RAID controllers are connected with the uplink ports of the first expansion chip and the uplink ports of the second expansion chip;
the first ports of the second RAID controllers are connected with the CPU, and the second ports of the second RAID controllers are connected with the uplink ports of the first expansion chip and the uplink ports of the second expansion chip;
the plurality of downlink ports of the first expansion chip are respectively connected with the first ports of the plurality of hard disk connectors one by one, and the plurality of downlink ports of the second expansion chip are respectively connected with the second ports of the plurality of hard disk connectors one by one;
the output port of each hard disk connector is used for connecting with a hard disk.
It can be understood that, through the connection between the units in this embodiment, redundancy of the RAID controller, the expansion chip, the data link, and the hard disk data can be ensured, and in the whole transmission system of the CPU-RAID controller-expansion chip-hard disk connector, each transmission layer has a plurality of units that are hot standby.
In order to improve the data synchronization hot standby efficiency in this embodiment, in addition to synchronizing data of the first RAID controller and the second RAID controller by the CPU, when the system includes one CPU, the first RAID controller and the second RAID controller may be connected to achieve synchronization of operation data.
Further, the server storage subsystem may include more than one CPU, as shown in fig. 2, where the server storage subsystem includes 2 CPUs with data interconnected, specifically a first CPU and a second CPU, where the first RAID controller is connected to the first CPU, and the second RAID controller is connected to the second CPU. Furthermore, the first RAID controller and the second RAID controller can be connected to realize the synchronization of the operation data.
It will be appreciated that in actual setting, the CPU and the first RAID controller and the second RAID controller are typically arranged on a hard disk motherboard, the first expansion chip, the second expansion chip, and a plurality of hard disk connectors are arranged on a hard disk backplane, and the RAID controller and the expansion chips are connected between the hard disk motherboard and the hard disk backplane by using SAS (Serial Attached SCSI, serial attached SCSI interface) connectors.
When the specific wiring is carried out, CC buses can be selected to be connected between the CPUs; the CPU and the two RAID controllers are interconnected through PCIe x8 signals, the first RAID controller and the second RAID controller can select an onboard controller and also can select PCIe (peripheral component interconnect express, latest bus and interface standard) card insertion, when the RAID controllers are PCIe card insertion type, the RAID controllers are connected with PCIe x16 slots on a main board through PCIe x16 golden fingers, PCIe x8 signals are arranged on a PCIe x16 connector to be connected with the CPU, and 8 SAS signals are arranged to enable the two RAID controllers to be connected; further, the low speed control signals connected between RAID controllers may also include I2C signals and fault alert signals, which may be transmitted through the RSVD pin on the PCIe x16 connector.
Further, two expansion chips, namely SAS expander chips, on the hard disk backboard comprise a first expansion chip and a second expansion chip, each expansion chip is provided with 8 uplink SAS ports, wherein 4 SAS ports are connected with a first RAID controller on the main board through an SAS connector, and the other 4 SAS ports are connected with a second RAID controller on the main board; each expander chip has 8 SAS ports downstream, wherein the 8 SAS ports of the first expander chip are respectively connected with the first ports of the 8 hard disk connectors, and the 8 SAS ports of the second expander chip are respectively connected with the second ports of the 8 hard disk connectors.
When the CPU reads and writes data to the hard disk, the CPU can be carried out through any one RAID controller, the two RAID controllers are in an active state without dividing the main controller from the standby controller. Taking fig. 2 as an example, when the first CPU reads and writes data to and from the hard disk through the first RAID controller, the first RAID controller also sends the read and write data to the second RAID controller in real time through an SAS signal. Similarly, when the second CPU reads and writes data to the hard disk through the second RAID controller, the second RAID controller also sends the read and write data to the first RAID controller in real time through the SAS signal. The control signal is used for communicating the state and the health state of the register between the two RAID controllers, and when one RAID controller is abnormal, the other RAID controller takes over the task of reading and writing the hard disk. Because the hard disk data between the two RAID controllers are kept synchronous in real time, dynamic switching can be realized during task transfer, so that the possibility of data loss or data retransmission is reduced, the operation of shutdown maintenance of the server due to abnormality of the RAID controllers is avoided, and the availability of the server system is improved.
Data transmission between the RAID controller and the expansion chip can be realized through 4 groups of cables and connectors of the MiniSAS HD x4, and each group of cables and connectors of the MiniSAS HD x4 can transmit 4 groups of SAS differential signals and Sideband signals, wherein the Sideband signals comprise I2C signals and cable in-bit detection signals. The I2C signal is used for controlling the hard disk status indicator lamp by the RAID controller, the RAID controller sends a control command to the expansion chip through the I2C, and the expansion chip decodes the I2C signal and then controls the status indicator lamp of the corresponding hard disk. The cable in-place detection signal is pulled up to high level on the main board, pulled down to low level on the hard disk backboard, the cable plays a role of communication, and the low effective in-place state of the cable can be detected through BMC (Baseboard Management Controller ) on the main board after the cable is communicated, and the signal is high if the cable is not successfully installed. The BMC can detect the in-place state of four groups of cables when the system is started each time so as to ensure the redundancy of the SAS link of the storage subsystem.
Besides being connected with the hard disk backboard through an SAS cable, the main board is generally provided with a power supply cable and a hard disk backboard control signal cable, wherein the power supply cable is used for supplying power to the hard disk backboard and the hard disk, and the hard disk backboard control signal cable is used for detecting the power supply state, the health state management and the like of the hard disk backboard.
The application discloses a server storage subsystem, including: the CPU, the first RAID controller, the second RAID controller, the first expansion chip, the second expansion chip and the plurality of hard disk connectors are connected with other elements by arranging the first RAID controller, the second RAID controller, the first expansion chip and the second expansion chip, so that the redundancy of the system is improved, and the single faults of any controller, the expansion chip, the hard disk connectors, the hard disk and even the links do not affect the data transmission between the CPU and the hard disk, thereby ensuring the data safety and meeting the availability requirement of the server for no downtime.
Correspondingly, the embodiment of the application also discloses a control method of the server storage subsystem, which is applied to any one of the server storage subsystems, and is shown in fig. 3, and the control method comprises the following steps:
s1: monitoring the running states of all RAID controllers; the RAID controller comprises a first RAID controller and a second RAID controller;
s2: if the running state of one RAID controller is abnormal and the running state of the other RAID controller is normal, transferring all task data to the RAID controller with normal running state so as to enable the RAID controller to execute all tasks.
Further, in consideration of the wiring between the motherboard and the backplane, before monitoring the operation states of all RAID controllers, the method may further include:
acquiring the bit states of connection cables of all RAID controllers and the first expansion chip and the second expansion chip;
and determining the data transmission paths of all the hard disk connectors according to the in-place state.
Further, in order to further improve the data security of the storage subsystem in this embodiment, this embodiment further sets the data protection of the power supply module when power is lost, and by knowing the running state of the power supply module in advance, the data loss caused by unexpected power failure of the server is reduced by using the power failure protection function of the RAID controller, and the control method further includes:
monitoring a power supply good signal and an on-site signal of a power supply module of a server storage subsystem;
judging whether the operation state of the power supply module is normal or not according to the power supply good signal and the on-site signal;
if not, enabling all RAID controllers to start a power failure protection mechanism.
Further, considering that the power supply module has a standby condition, the process of monitoring the power supply good signal and the on-site signal of the power supply module of the server storage subsystem includes:
monitoring power supply good signals and in-place signals of all power supply modules of a server storage subsystem;
correspondingly, the process of judging whether the running state of the power supply module is normal according to the power supply good signal and the in-place signal comprises the following steps:
judging whether the running states of all power supply modules are normal or not according to the power supply good signals and the in-place signals;
if the running states of all the power supply modules are abnormal, enabling all RAID controllers to start a power failure protection mechanism.
Specifically, regarding the logic control of whether the power-down protection mechanism is started or not corresponding to the power-up good signal and the bit signal of the power supply module, as shown in the following table 1, the power-up good signals of the two power supply modules are pg_1 and pg_2, the bit signals are prsnt_n_1 and prsnt_n_2, the enable signal for enabling the RAID controller to start the power-down protection mechanism is epow_n, and the enable signals of two RAID controllers with the same power supply module are the same.
Table 1 Signal logic relationship of Power supply Module and Enable RAID controller
PRSNT_N_1 PG_1 PRSNT_N_2 PG_2 EPOW_N
0 1 0 X 1
0 X 0 1 1
0 0 0 0 0
1 X 0 1 1
1 X 0 0 0
0 1 1 X 1
0 0 1 X 0
1 X 1 X 0
When EPOW_N is high level, the RAID controller works normally, when EPOW_N is low level, the RAID controller starts emergency storage, and the current state of the system is stored and ready to meet the impending endpoint.
Further, the method for the data power-down protection part in this embodiment is implemented by a CPLD, and a process for monitoring a power supply good signal and an in-place signal of a power supply module of a server storage subsystem includes:
monitoring a power supply good signal and an on-site signal of a power supply module of a server storage subsystem through a CPLD;
the process for judging whether the running state of the power supply module is normal or not according to the power supply good signal and the in-place signal comprises the following steps:
judging whether the running state of the power supply module is normal or not according to the power supply good signal and the on-site signal through the CPLD;
if not, enabling all RAID controllers to start a power failure protection mechanism through the CPLD.
Aiming at the problems of data loss and system downtime caused by hardware link failure in the prior art, the embodiment provides a high-availability and high-redundancy server storage subsystem, and ensures the real-time redundancy backup of data through the full-link redundancy design and the real-time backup technology of a redundant RAID controller, and the system has the capability of real-time switching when any part of the link fails.
Finally, it is further noted that relational terms such as first and second, and the like are used solely to distinguish one entity or action from another entity or action without necessarily requiring or implying any actual such relationship or order between such entities or actions. Moreover, the terms "comprises," "comprising," or any other variation thereof, are intended to cover a non-exclusive inclusion, such that a process, method, article, or apparatus that comprises a list of elements does not include only those elements but may include other elements not expressly listed or inherent to such process, method, article, or apparatus. Without further limitation, an element defined by the phrase "comprising one … …" does not exclude the presence of other like elements in a process, method, article, or apparatus that comprises the element.
The foregoing has described in detail a server storage subsystem and a control method thereof, and specific examples have been applied to illustrate the principles and embodiments of the present invention, and the above description of the examples is only for aiding in understanding the method and core idea of the present invention; meanwhile, as those skilled in the art will have variations in the specific embodiments and application scope in accordance with the ideas of the present invention, the present description should not be construed as limiting the present invention in view of the above.

Claims (7)

1. A server storage subsystem, comprising: CPU, first RAID controller, second RAID controller, first extension chip, second extension chip, a plurality of hard disk connectors, wherein:
the first ports of the first RAID controller are connected with the CPU, and the second ports of the first RAID controller are connected with the uplink ports of the first expansion chip and the uplink ports of the second expansion chip;
the first ports of the second RAID controllers are connected with the CPU, and the second ports of the second RAID controllers are connected with the uplink ports of the first expansion chip and the uplink ports of the second expansion chip;
the plurality of downlink ports of the first expansion chip are respectively connected with the first ports of the hard disk connectors one by one, and the plurality of downlink ports of the second expansion chip are respectively connected with the second ports of the hard disk connectors one by one;
the output port of each hard disk connector is used for connecting a hard disk;
the server storage subsystem comprises 2 CPUs (central processing units) with data interconnection, in particular a first CPU and a second CPU, wherein the first RAID controller is connected with the first CPU, and the second RAID controller is connected with the second CPU;
the third port of the first RAID controller is connected with the third port of the second RAID controller; wherein the third port is an RSVD pin on a PCIe x16 connector; and the third port of the first RAID controller and the third port of the second RAID controller realize mutual state monitoring between the first RAID controller and the second RAID controller through control signals.
2. The server storage subsystem of claim 1, wherein the first RAID controller and the second RAID controller are coupled to achieve synchronization of operational data.
3. A control method of a server storage subsystem, applied to the server storage subsystem of claim 1 or 2, the control method comprising:
monitoring the running states of all RAID controllers; the RAID controller comprises a first RAID controller and a second RAID controller;
if the running state of one RAID controller is abnormal and the running state of the other RAID controller is normal, transferring all task data to the RAID controller with the normal running state so that the RAID controller executes all tasks.
4. A control method according to claim 3, wherein before monitoring the operating states of all RAID controllers, further comprising:
acquiring the bit states of the connecting cables of all RAID controllers, the first expansion chip and the second expansion chip;
and determining the data transmission paths of all the hard disk connectors according to the in-place state.
5. A control method according to claim 3, characterized by further comprising:
monitoring a power supply good signal and an on-site signal of a power supply module of the server storage subsystem;
judging whether the operation state of the power supply module is normal or not according to the power supply good signal and the on-site signal;
if not, enabling all RAID controllers to start a power failure protection mechanism.
6. The control method of claim 5, wherein the process of monitoring the power up signal and the on-bit signal of the power module of the server storage subsystem comprises:
monitoring power supply good signals and in-place signals of all power supply modules of the server storage subsystem;
correspondingly, the process of judging whether the operation state of the power supply module is normal according to the power supply good signal and the in-place signal comprises the following steps:
judging whether the operation states of all the power supply modules are normal or not according to the power supply good signals and the on-site signals;
and if the running states of all the power supply modules are abnormal, enabling all the RAID controllers to start a power failure protection mechanism.
7. The control method of claim 5, wherein the process of monitoring the power up signal and the on-bit signal of the power module of the server storage subsystem comprises:
monitoring a power supply good signal and an on-site signal of a power supply module of the server storage subsystem through a CPLD;
the process of judging whether the operation state of the power supply module is normal according to the power supply good signal and the on-site signal comprises the following steps:
judging whether the running state of the power supply module is normal or not according to the power supply good signal and the on-site signal through the CPLD;
if not, enabling all RAID controllers to start a power failure protection mechanism through the CPLD.
CN202111199036.3A 2021-10-14 2021-10-14 Server storage subsystem and control method thereof Active CN113867648B (en)

Priority Applications (1)

Application Number Priority Date Filing Date Title
CN202111199036.3A CN113867648B (en) 2021-10-14 2021-10-14 Server storage subsystem and control method thereof

Applications Claiming Priority (1)

Application Number Priority Date Filing Date Title
CN202111199036.3A CN113867648B (en) 2021-10-14 2021-10-14 Server storage subsystem and control method thereof

Publications (2)

Publication Number Publication Date
CN113867648A CN113867648A (en) 2021-12-31
CN113867648B true CN113867648B (en) 2023-12-29

Family

ID=78999412

Family Applications (1)

Application Number Title Priority Date Filing Date
CN202111199036.3A Active CN113867648B (en) 2021-10-14 2021-10-14 Server storage subsystem and control method thereof

Country Status (1)

Country Link
CN (1) CN113867648B (en)

Citations (8)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN101488077A (en) * 2009-02-24 2009-07-22 浪潮电子信息产业股份有限公司 Redundancy magnetic disk control server system
CN103488435A (en) * 2013-09-24 2014-01-01 华为技术有限公司 RAID (redundant arrays of independent disks) subsystem and data read-write method thereof
CN207650802U (en) * 2018-01-09 2018-07-24 郑州云海信息技术有限公司 A kind of server system with expanded function
CN208224988U (en) * 2018-04-03 2018-12-11 郑州云海信息技术有限公司 A kind of server input and output I O board card
CN208907999U (en) * 2018-11-28 2019-05-28 贵州浪潮英信科技有限公司 A kind of novel Raid buckle
CN209248436U (en) * 2019-03-01 2019-08-13 浪潮商用机器有限公司 A kind of expansion board clamping and server
CN210129211U (en) * 2019-08-09 2020-03-06 苏州浪潮智能科技有限公司 SD RAID storage device supporting hot plug
CN211427190U (en) * 2019-10-14 2020-09-04 苏州浪潮智能科技有限公司 Server circuit and mainboard based on Feiteng treater 2000+

Family Cites Families (1)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
US7958273B2 (en) * 2006-10-10 2011-06-07 Lsi Corporation System and method for connecting SAS RAID controller device channels across redundant storage subsystems

Patent Citations (8)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN101488077A (en) * 2009-02-24 2009-07-22 浪潮电子信息产业股份有限公司 Redundancy magnetic disk control server system
CN103488435A (en) * 2013-09-24 2014-01-01 华为技术有限公司 RAID (redundant arrays of independent disks) subsystem and data read-write method thereof
CN207650802U (en) * 2018-01-09 2018-07-24 郑州云海信息技术有限公司 A kind of server system with expanded function
CN208224988U (en) * 2018-04-03 2018-12-11 郑州云海信息技术有限公司 A kind of server input and output I O board card
CN208907999U (en) * 2018-11-28 2019-05-28 贵州浪潮英信科技有限公司 A kind of novel Raid buckle
CN209248436U (en) * 2019-03-01 2019-08-13 浪潮商用机器有限公司 A kind of expansion board clamping and server
CN210129211U (en) * 2019-08-09 2020-03-06 苏州浪潮智能科技有限公司 SD RAID storage device supporting hot plug
CN211427190U (en) * 2019-10-14 2020-09-04 苏州浪潮智能科技有限公司 Server circuit and mainboard based on Feiteng treater 2000+

Also Published As

Publication number Publication date
CN113867648A (en) 2021-12-31

Similar Documents

Publication Publication Date Title
US10467110B2 (en) Implementing cable failover in multiple cable PCI Express IO interconnections
US5754804A (en) Method and system for managing system bus communications in a data processing system
US10417166B2 (en) Implementing sideband control structure for PCIE cable cards and IO expansion enclosures
US8020035B2 (en) Expander circuit for a solid state persistent storage device that provides a plurality of interfaces to corresponding storage controllers
CN100541444C (en) The management system of multiple main board system
US20060161714A1 (en) Method and apparatus for monitoring number of lanes between controller and PCI Express device
US20070088978A1 (en) Internal failover path for SAS disk drive enclosure
US8990632B2 (en) System for monitoring state information in a multiplex system
US20040059862A1 (en) Method and apparatus for providing redundant bus control
US6943463B2 (en) System and method of testing connectivity between a main power supply and a standby power supply
CN113867648B (en) Server storage subsystem and control method thereof
JP2002196890A (en) Highly available storage system
CN109995597B (en) Network equipment fault processing method and device
CN112306938B (en) Hot plug method and device for OCP card and multi-host card
WO2001084252A1 (en) Multiplexing control system and multiplexing method therefor
JP2001027972A (en) Disk controller
CN117591457B (en) PCIE expansion box, server, method, device and product for controlling data transmission
JP4779948B2 (en) Server system
CN212009370U (en) Distributed control equipment of power plant
CN117992386A (en) Method and equipment for flexibly configuring signal processing module based on PCIE bus
CN117219133A (en) SAS hard disk power supply device and control method
CN113535472A (en) Cluster server
CN117008462A (en) Redundant controller
CN115309692A (en) Multi-path server device based on single node
CN117667534A (en) Memory abnormality detection system, motherboard, electronic device and abnormality detection method

Legal Events

Date Code Title Description
PB01 Publication
PB01 Publication
SE01 Entry into force of request for substantive examination
SE01 Entry into force of request for substantive examination
GR01 Patent grant
GR01 Patent grant