CN106648947B - Method and apparatus for testing a multi-controller storage device
- Publication number
- CN106648947B
- Authority
- CN
- China
- Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
- Active
Classifications
-
- G—PHYSICS
- G06—COMPUTING; CALCULATING OR COUNTING
- G06F—ELECTRIC DIGITAL DATA PROCESSING
- G06F11/00—Error detection; Error correction; Monitoring
- G06F11/07—Responding to the occurrence of a fault, e.g. fault tolerance
- G06F11/0703—Error or fault processing not based on redundancy, i.e. by taking additional measures to deal with the error or fault not making use of redundancy in operation, in hardware, or in data representation
- G06F11/0706—Error or fault processing not based on redundancy, i.e. by taking additional measures to deal with the error or fault not making use of redundancy in operation, in hardware, or in data representation the processing taking place on a specific hardware platform or in a specific software environment
- G06F11/0727—Error or fault processing not based on redundancy, i.e. by taking additional measures to deal with the error or fault not making use of redundancy in operation, in hardware, or in data representation the processing taking place on a specific hardware platform or in a specific software environment in a storage system, e.g. in a DASD or network based storage system
-
- G—PHYSICS
- G06—COMPUTING; CALCULATING OR COUNTING
- G06F—ELECTRIC DIGITAL DATA PROCESSING
- G06F11/00—Error detection; Error correction; Monitoring
- G06F11/22—Detection or location of defective computer hardware by testing during standby operation or during idle time, e.g. start-up testing
- G06F11/2273—Test methods
Abstract
The present invention provides a method and an apparatus for testing a multi-controller storage device. While the multi-controller storage device is being tested, some of its normally operating controllers serve as working controllers, which are responsible for keeping the storage system running stably, and the other normally operating controllers serve as controllers to be tested and undergo a restart test. A controller to be tested that fails to restart is marked, and a controller to be tested that restarts successfully enters the next round of testing. The invention can improve testing efficiency at a relatively low cost.
Description
Technical Field
The invention relates to the technical field of data storage, and in particular to a method and an apparatus for testing a multi-controller storage device.
Background
When a multi-controller storage device is tested, exception testing of the storage is the most critical part. It focuses on how the controllers handle faults among themselves while the storage system is running, so that high availability is achieved and online services are not affected.
Currently, the commonly used test methods are to manually unplug and re-plug a controller, to manually restart a controller, or to introduce third-party hardware that simulates an interruption of the controller signal. Manually plugging and restarting a controller is relatively close to actual use, but its testing efficiency is very low; simulating signal interruption with third-party hardware is efficient, but incurs a high additional cost.
Disclosure of Invention
In view of the above, the present invention provides a method and an apparatus for testing a multi-controller storage device, which can improve the testing efficiency at a low cost.
To achieve the above object, the invention provides the following technical solution:
A method of testing a multi-controller storage device, comprising:
Step A, selecting a working controller and controllers to be tested from the normally operating controllers of the multi-controller storage device, keeping the working controller operating normally, and restarting all the controllers to be tested;
Step B, after a preset duration has elapsed since all the controllers to be tested were restarted, detecting whether each controller to be tested has started successfully; if all the controllers to be tested have started successfully, returning to step A; otherwise, executing step C;
Step C, marking each controller to be tested that failed to start as an abnormally operating controller; if the number of normally operating controllers is not greater than 1, ending the test flow; otherwise, returning to step A.
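Steps A to C can be read as a simple loop. The following is a minimal Python sketch of that loop under stated assumptions, not the patented implementation: the function name `run_restart_test`, the callables `restart` and `started_ok`, and the parameters `wait_seconds` and `max_rounds` are all hypothetical. In particular, `max_rounds` is an addition that is not in the source; as described, the flow keeps returning to step A for as long as every restart succeeds.

```python
import time
from typing import Callable, Iterable, List, Set


def run_restart_test(
    controllers: Iterable[str],
    restart: Callable[[str], None],
    started_ok: Callable[[str], bool],
    wait_seconds: float = 0.0,
    max_rounds: int = 10,
) -> Set[str]:
    """Run steps A-C and return the controllers marked as abnormal."""
    healthy: List[str] = list(controllers)
    abnormal: Set[str] = set()
    # max_rounds is an assumption added for this sketch (not in the source).
    for _ in range(max_rounds):
        # End condition of step C: stop when at most one healthy controller is left.
        if len(healthy) <= 1:
            break
        # Step A: the first healthy controller keeps working, the rest are restarted.
        working, to_test = healthy[0], healthy[1:]
        for ctrl in to_test:
            restart(ctrl)
        # Step B: wait the preset duration, then check every restarted controller.
        time.sleep(wait_seconds)
        failed = [ctrl for ctrl in to_test if not started_ok(ctrl)]
        if not failed:
            continue  # all started successfully: back to step A
        # Step C: mark the failures and drop them from the healthy set.
        abnormal.update(failed)
        healthy = [ctrl for ctrl in healthy if ctrl not in abnormal]
    return abnormal
```

Checking the end condition at the top of the loop is equivalent to checking it at the end of step C, and it also covers a device that starts with only one normally operating controller.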
An apparatus for testing a multi-controller storage device, comprising: a selection unit, a starting unit, a detection unit and a marking unit;
the selection unit is configured to select a working controller and controllers to be tested from the normally operating controllers of the multi-controller storage device;
the starting unit is configured to keep the working controller operating normally and to restart all the controllers to be tested;
the detection unit is configured to detect, after a preset duration has elapsed since the starting unit restarted all the controllers to be tested, whether each controller to be tested has started successfully; if all the controllers to be tested have started successfully, to instruct the selection unit to select a working controller and controllers to be tested again; and otherwise, to notify the marking unit to mark the controllers to be tested that failed to start as abnormal;
the marking unit is configured to mark, after receiving the notification from the detection unit, each controller to be tested that failed to start as an abnormally operating controller; if the number of normally operating controllers is not greater than 1, to end the test flow; and otherwise, to instruct the selection unit to select a working controller and controllers to be tested again.
According to the above technical solution, in each round of testing a working controller and controllers to be tested are selected from the normally operating controllers of the multi-controller storage device, and the controllers to be tested undergo a restart test. A controller to be tested that restarts successfully is treated as a normally operating controller and enters the next round of testing, which continues until only 1 normally operating controller is left in the multi-controller storage device. The invention thus enables exception testing of the multi-controller storage device under unattended conditions and simulates the handling of controller restart faults in the field, thereby improving the stability of the storage product and the testing efficiency.
Drawings
FIG. 1 is a flow chart of a method of testing a multi-controller storage device according to an embodiment of the present invention;
FIG. 2 is a schematic structural diagram of an apparatus for testing a multi-controller storage device according to an embodiment of the present invention.
Detailed Description
In order to make the objects, technical solutions and advantages of the present invention more apparent, the technical solutions of the present invention are described in detail below with reference to the accompanying drawings according to embodiments.
In the invention, while a multi-controller storage device is being tested, some of its normally operating controllers serve as working controllers, which are responsible for the stable operation of the storage system, and the other normally operating controllers serve as controllers to be tested and undergo a restart test.
Referring to fig. 1, fig. 1 is a flowchart of a method for testing a multi-controller storage device according to an embodiment of the present invention, and as shown in fig. 1, the method mainly includes the following steps:
Step 101, selecting a working controller and controllers to be tested from the normally operating controllers of the multi-controller storage device, keeping the working controller operating normally, and restarting all the controllers to be tested.
In the invention, the multi-controller storage device must keep running stably while it is being tested and must still be able to process front-end IO requests. Therefore, at least one controller in the multi-controller storage device is kept working normally while the remaining controllers undergo a restart test, so that front-end IO is not interrupted during the test and the device as a whole continues to work normally.
Preferably, one controller is selected from the normally operating controllers of the multi-controller storage device as the working controller, and the other normally operating controllers serve as the controllers to be tested. The working controller is kept operating normally so that the multi-controller storage device continues to process front-end IO requests, while all the controllers to be tested are restarted for the restart test.
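The preferred selection of one working controller can be sketched as follows. This is an illustrative Python sketch only: the source does not say which normally operating controller is chosen, so the function name `select_roles` and the round-robin rule based on a hypothetical `round_index` are assumptions.

```python
from typing import List, Sequence, Tuple


def select_roles(healthy: Sequence[str], round_index: int = 0) -> Tuple[str, List[str]]:
    """Pick one working controller; every other healthy controller is tested."""
    if not healthy:
        raise ValueError("no normally operating controller available")
    # Assumption: rotate the working role by round so that, over several
    # rounds, every controller is restarted at least once.
    pick = round_index % len(healthy)
    working = healthy[pick]
    to_test = [ctrl for i, ctrl in enumerate(healthy) if i != pick]
    return working, to_test
```

Rotating the working role is one possible design choice; always keeping the same controller as the working controller would also satisfy the description, but that controller would never be restart-tested.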
Step 102, after a preset duration has elapsed since all the controllers to be tested were restarted, detecting whether each controller to be tested has started successfully; if all the controllers to be tested have started successfully, returning to step 101; otherwise, executing step 103;
Once the preset duration has elapsed after the controllers to be tested were restarted, it can be detected whether each of them has restarted successfully. If all the controllers to be tested have restarted successfully, the multi-controller storage device has passed the current round of testing and the next round can begin; otherwise, further processing is required, as described in step 103.
In this embodiment, a controller to be tested is considered to have restarted successfully only when its operating system has started successfully and all the startup programs in that operating system have started normally. Accordingly, detecting whether each controller to be tested has started (restarted) successfully means detecting both conditions: if the operating system of the controller to be tested has started successfully and all the startup programs in the operating system have started normally, the controller to be tested is determined to have started successfully; otherwise, it is determined not to have started successfully.
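The two-part success criterion can be expressed as a small predicate. The Python sketch below is illustrative; the names `controller_started`, `os_started` and `startup_programs_running` are hypothetical, and the two checks are passed in as callables so that the sketch does not assume how they are implemented.

```python
from typing import Callable


def controller_started(
    ctrl: str,
    os_started: Callable[[str], bool],
    startup_programs_running: Callable[[str], bool],
) -> bool:
    """Return True only if the OS is up and all startup programs are running."""
    # `and` short-circuits: the program check, which needs a login, is
    # skipped when the operating-system check has already failed.
    return os_started(ctrl) and startup_programs_running(ctrl)
```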
In this embodiment, whether the operating system of a controller to be tested has started successfully is detected by sending the controller a ping message whose destination address is the controller's IP address. If the ping succeeds, that is, a response message is received from the controller to be tested, the controller can communicate normally, and its operating system is determined to have started successfully. Otherwise, the controller cannot communicate normally, and its operating system is determined not to have started successfully.
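A ping-based check of this kind might look like the Python sketch below. It assumes a Linux host with the iputils `ping` command, where `-c` sets the number of echo requests and `-W` the reply timeout in seconds; other platforms use different flags. The function name `os_started`, the `timeout_s` default and the injectable `run` parameter are hypothetical and exist only to keep the sketch testable.

```python
import subprocess
from typing import Any, Callable


def os_started(
    ip: str,
    timeout_s: int = 2,
    run: Callable[..., Any] = subprocess.run,
) -> bool:
    """Send one ping to `ip`; treat a reply as 'operating system started'."""
    # Assumption: Linux iputils ping (-c count, -W timeout in seconds).
    cmd = ["ping", "-c", "1", "-W", str(timeout_s), ip]
    result = run(cmd, stdout=subprocess.DEVNULL, stderr=subprocess.DEVNULL)
    # ping exits with status 0 when a reply was received.
    return result.returncode == 0
```

A single lost packet would make this sketch report a failure, so a real test harness would probably send several requests before giving up; the source does not specify a retry policy.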
Under normal conditions, an authorized user, that is, a user who holds a valid user name and password for the controller, can log in to the controller.
To detect whether all the startup programs in the operating system of a controller to be tested have started normally, the user name and login password of each controller in the multi-controller storage device must be obtained in advance. The check itself then only requires logging in to the controller to be tested with its user name and login password and applying an existing (prior-art) method: for example, checking whether the programs already started in the controller's operating system include all the startup programs. If so, all the startup programs in the controller to be tested have started; if not, they have not all started normally.
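The inclusion check described above amounts to a set difference. The Python sketch below is illustrative and makes several assumptions: the source does not name a login protocol or a way of listing programs, so the login step is hidden behind a hypothetical `list_running(host, username, password)` callable, and the function names are invented for this example.

```python
from typing import Callable, Iterable, Set


def missing_startup_programs(expected: Iterable[str], running: Iterable[str]) -> Set[str]:
    """Return the expected startup programs that are not currently running."""
    return set(expected) - set(running)


def startup_programs_running(
    host: str,
    username: str,
    password: str,
    expected: Iterable[str],
    list_running: Callable[[str, str, str], Iterable[str]],
) -> bool:
    """Log in via `list_running` and check that every expected program is up."""
    # `list_running` is a placeholder for whatever login mechanism the
    # controller supports; it must return the names of the started programs.
    running = list_running(host, username, password)
    return not missing_startup_programs(expected, running)
```

Returning the missing names from `missing_startup_programs` rather than a bare boolean makes it possible to log which startup program failed to start.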
Step 103, marking each controller to be tested that failed to start as an abnormally operating controller; if the number of normally operating controllers is not greater than 1, executing step 104; otherwise, returning to step 101.
In this embodiment, a controller to be tested that fails to restart is marked and no longer takes part in subsequent rounds of testing, while a controller to be tested that restarts successfully enters the next round of testing as a normally operating controller.
Step 104, ending the test flow.
As the above method shows, in each round of testing of the multi-controller storage device in this embodiment, some of the controllers are selected as working controllers and the others serve as controllers to be tested and undergo a restart test. As long as more than one normally operating controller remains in the multi-controller storage device, the device can be tested for multiple rounds without manual intervention, and because the working controller keeps operating normally in every round, the device can continue to process front-end IO requests. Therefore, the method of this embodiment enables unattended exception testing of the multi-controller storage device, achieves the goal of simulating the handling of controller restart faults in the field, and improves the stability of the storage product.
The method for testing the multi-controller storage device according to the present invention is described in detail above, and the present invention further provides an apparatus for testing the multi-controller storage device, which is described in detail below with reference to fig. 2.
Referring to fig. 2, fig. 2 is a schematic structural diagram of an apparatus for testing a multi-controller storage device according to an embodiment of the present invention, and as shown in fig. 2, the apparatus includes: a selection unit 201, a starting unit 202, a detection unit 203 and a marking unit 204; wherein,
the selection unit 201 is configured to select a working controller and controllers to be tested from the normally operating controllers of the multi-controller storage device;
the starting unit 202 is configured to keep the working controller selected by the selection unit 201 operating normally and to restart all the controllers to be tested;
the detection unit 203 is configured to detect, after a preset duration has elapsed since the starting unit 202 restarted all the controllers to be tested, whether each controller to be tested has started successfully; if all the controllers to be tested have started successfully, to instruct the selection unit 201 to select a working controller and controllers to be tested again; and otherwise, to notify the marking unit 204 to mark the controllers to be tested that failed to start as abnormal;
and the marking unit 204 is configured to mark, after receiving the notification from the detection unit 203, each controller to be tested that failed to start as an abnormally operating controller; if the number of normally operating controllers is not greater than 1, to end the test flow; and otherwise, to instruct the selection unit 201 to select a working controller and controllers to be tested again so as to enter a new round of testing.
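One way to picture the four units is as small cooperating objects. The Python sketch below is a hypothetical mapping, not the structure of FIG. 2 itself: the class names mirror units 201 to 204, while the helper `one_round`, the injected `restart` and `started_ok` callables, and the omission of the preset wait are simplifications made for this example.

```python
from typing import Callable, List, Set, Tuple


class SelectionUnit:  # corresponds to unit 201
    def select(self, healthy: List[str]) -> Tuple[str, List[str]]:
        # Assumption: the first healthy controller becomes the working one.
        return healthy[0], healthy[1:]


class StartingUnit:  # corresponds to unit 202
    def __init__(self, restart: Callable[[str], None]) -> None:
        self._restart = restart

    def restart_all(self, to_test: List[str]) -> None:
        for ctrl in to_test:
            self._restart(ctrl)


class DetectionUnit:  # corresponds to unit 203
    def __init__(self, started_ok: Callable[[str], bool]) -> None:
        self._started_ok = started_ok

    def failed(self, to_test: List[str]) -> List[str]:
        return [ctrl for ctrl in to_test if not self._started_ok(ctrl)]


class MarkingUnit:  # corresponds to unit 204
    def __init__(self) -> None:
        self.abnormal: Set[str] = set()

    def mark(self, failed: List[str], healthy: List[str]) -> List[str]:
        self.abnormal.update(failed)
        return [ctrl for ctrl in healthy if ctrl not in self.abnormal]


def one_round(
    healthy: List[str],
    selection: SelectionUnit,
    starting: StartingUnit,
    detection: DetectionUnit,
    marking: MarkingUnit,
) -> Tuple[List[str], bool]:
    """Run one round; return (remaining healthy controllers, keep testing?)."""
    _working, to_test = selection.select(healthy)
    starting.restart_all(to_test)
    failed = detection.failed(to_test)  # the preset wait is omitted here
    if not failed:
        return healthy, True  # detection unit asks for a new selection
    remaining = marking.mark(failed, healthy)
    return remaining, len(remaining) > 1
```

Keeping the restart and detection mechanisms as injected callables lets the same wiring run against real controllers or against stubs in a dry run.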
In the apparatus shown in FIG. 2,
the selection unit 201 selects one controller from the normally operating controllers of the multi-controller storage device as the working controller, and the other controllers as the controllers to be tested.
In the apparatus shown in FIG. 2,
the detection unit 203, when detecting whether each controller to be tested has started successfully, is configured to: detect whether the operating system of each controller to be tested has started successfully and whether all the startup programs in that operating system have started normally; if both conditions hold, determine that the controller to be tested has started successfully; otherwise, determine that the controller to be tested has not started successfully.
In the apparatus shown in FIG. 2,
the detection unit 203, when detecting whether the operating system of each controller to be tested has started successfully, is configured to: send to the controller to be tested a ping message whose destination address is the IP address of the controller to be tested; if a response message is received from the controller to be tested, determine that its operating system has started successfully; otherwise, determine that its operating system has not started successfully.
The apparatus shown in FIG. 2 further includes an obtaining unit, configured to obtain in advance the user name and login password of each controller in the multi-controller storage device;
the detection unit 203, when detecting whether all the startup programs in the operating system of a controller to be tested have started normally, is configured to: log in to the controller to be tested using its user name and login password, and determine, based on all the programs already started in its operating system, whether all the startup programs in the controller to be tested have started.
The above description is only for the purpose of illustrating the preferred embodiments of the present invention and is not to be construed as limiting the invention, and any modifications, equivalents, improvements and the like made within the spirit and principle of the present invention should be included in the scope of the present invention.
Claims (8)
1. A method of testing a multi-controller storage device, the method comprising:
Step A, selecting one controller from the normally operating controllers of the multi-controller storage device as a working controller and using the other controllers as controllers to be tested, keeping the working controller operating normally, and restarting all the controllers to be tested;
Step B, after a preset duration has elapsed since all the controllers to be tested were restarted, detecting whether each controller to be tested has started successfully; if all the controllers to be tested have started successfully, returning to step A; otherwise, executing step C;
Step C, marking each controller to be tested that failed to start as an abnormally operating controller; if the number of normally operating controllers is not greater than 1, ending the test flow; otherwise, returning to step A.
2. The method of claim 1,
the method for detecting whether each controller to be tested is started successfully comprises the following steps: detecting whether the operating system of each controller to be tested is successfully started and whether all the startup programs in the operating system of the controller to be tested are normally started, if the operating system of the controller to be tested is successfully started and all the startup programs in the operating system are normally started, determining that the controller to be tested is successfully started, otherwise, determining that the controller to be tested is not successfully started.
3. The method of claim 2,
the method for detecting whether the operating system of each controller to be tested is started successfully comprises the following steps: sending a ping message with the destination address being the IP address of the controller to be tested to the controller to be tested, if receiving the response message of the controller to be tested, determining that the operating system of the controller to be tested is started successfully, otherwise, determining that the operating system of the controller to be tested is not started successfully.
4. The method of claim 2,
the method further comprises obtaining in advance the user name and login password of each controller in the multi-controller storage device;
the method for detecting whether all startup programs in the operating system of the controller to be tested are normally started comprises the following steps: and logging in the controller to be tested by using the user name and the login password of the controller to be tested, and determining whether all startup programs in the controller to be tested are started or not based on all started programs in an operating system of the controller to be tested.
5. An apparatus for testing a multi-controller storage device, the apparatus comprising: a selection unit, a starting unit, a detection unit and a marking unit;
the selection unit is configured to select one controller from the normally operating controllers of the multi-controller storage device as a working controller, and the other controllers as controllers to be tested;
the starting unit is configured to keep the working controller operating normally and to restart all the controllers to be tested;
the detection unit is configured to detect, after a preset duration has elapsed since the starting unit restarted all the controllers to be tested, whether each controller to be tested has started successfully; if all the controllers to be tested have started successfully, to instruct the selection unit to select a working controller and controllers to be tested again; and otherwise, to notify the marking unit to mark the controllers to be tested that failed to start as abnormal;
the marking unit is configured to mark, after receiving the notification from the detection unit, each controller to be tested that failed to start as an abnormally operating controller; if the number of normally operating controllers is not greater than 1, to end the test flow; and otherwise, to instruct the selection unit to select a working controller and controllers to be tested again.
6. The apparatus of claim 5,
the detection unit is used for detecting whether each controller to be tested is started successfully or not, and is used for: detecting whether the operating system of each controller to be tested is successfully started and whether all the startup programs in the operating system of the controller to be tested are normally started, if the operating system of the controller to be tested is successfully started and all the startup programs in the operating system are normally started, determining that the controller to be tested is successfully started, otherwise, determining that the controller to be tested is not successfully started.
7. The apparatus of claim 6,
the detection unit is used for detecting whether the operating system of each controller to be tested is started successfully or not, and is used for: sending a ping message with the destination address being the IP address of the controller to be tested to the controller to be tested, if receiving the response message of the controller to be tested, determining that the operating system of the controller to be tested is started successfully, otherwise, determining that the operating system of the controller to be tested is not started successfully.
8. The apparatus of claim 6,
the apparatus further comprises an obtaining unit, configured to obtain in advance the user name and login password of each controller in the multi-controller storage device;
the detection unit is used for detecting whether all startup programs in the operating system of the controller to be tested are normally started or not, and is used for: and logging in the controller to be tested by using the user name and the login password of the controller to be tested, and determining whether all startup programs in the controller to be tested are started or not based on all started programs in an operating system of the controller to be tested.
Priority Applications (1)
Application Number | Priority Date | Filing Date | Title |
---|---|---|---|
CN201611196787.9A CN106648947B (en) | 2016-12-22 | 2016-12-22 | A kind of method and apparatus of test multi-controller storage equipment |
Publications (2)
Publication Number | Publication Date |
---|---|
CN106648947A | 2017-05-10 |
CN106648947B | 2019-09-13 |
Family
ID=58834274
Family Applications (1)
Application Number | Title | Priority Date | Filing Date |
---|---|---|---|
CN201611196787.9A Active CN106648947B (en) | 2016-12-22 | 2016-12-22 | A kind of method and apparatus of test multi-controller storage equipment |
Country Status (1)
Country | Link |
---|---|
CN (1) | CN106648947B (en) |
Families Citing this family (1)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
CN107220140A (en) * | 2017-06-29 | 2017-09-29 | 郑州云海信息技术有限公司 | The method for testing reliability and system of a kind of dual control storage system |
Citations (6)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
CN101373420A (en) * | 2008-09-09 | 2009-02-25 | 创新科存储技术(深圳)有限公司 | Multi-controller disk array and command processing method thereof |
CN102289398A (en) * | 2010-06-17 | 2011-12-21 | 英业达股份有限公司 | Restart testing method |
CN102457547A (en) * | 2010-10-20 | 2012-05-16 | 英业达股份有限公司 | Upgrading method for storage area network equipment of multiple controllers |
CN103634388A (en) * | 2013-11-22 | 2014-03-12 | 华为技术有限公司 | Method for processing restarting of controllers in storage server, related equipment and communication system |
CN105027080A (en) * | 2013-03-14 | 2015-11-04 | 密克罗奇普技术公司 | Boot sequencing for multi boot devices |
EP3278214A4 (en) * | 2015-03-31 | 2018-11-21 | Zuora, Inc. | Systems and methods for live testing performance conditions of a multi-tenant system |
Also Published As
Publication number | Publication date |
---|---|
CN106648947A (en) | 2017-05-10 |
Legal Events
Date | Code | Title | Description |
---|---|---|---|
PB01 | Publication | ||
SE01 | Entry into force of request for substantive examination | ||
GR01 | Patent grant | ||