CN106648947B - A kind of method and apparatus of test multi-controller storage equipment - Google Patents

A kind of method and apparatus of test multi-controller storage equipment Download PDF

Info

Publication number
CN106648947B
CN106648947B CN201611196787.9A CN201611196787A CN106648947B CN 106648947 B CN106648947 B CN 106648947B CN 201611196787 A CN201611196787 A CN 201611196787A CN 106648947 B CN106648947 B CN 106648947B
Authority
CN
China
Prior art keywords
controller
tested
started
successfully
controllers
Prior art date
Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
Active
Application number
CN201611196787.9A
Other languages
Chinese (zh)
Other versions
CN106648947A (en
Inventor
杨学龙
Current Assignee (The listed assignees may be inaccurate. Google has not performed a legal analysis and makes no representation or warranty as to the accuracy of the list.)
Beihai Shengyun Technology Co Ltd
Original Assignee
Beihai Shengyun Technology Co Ltd
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
Filing date
Publication date
Application filed by Beihai Shengyun Technology Co Ltd filed Critical Beihai Shengyun Technology Co Ltd
Priority to CN201611196787.9A priority Critical patent/CN106648947B/en
Publication of CN106648947A publication Critical patent/CN106648947A/en
Application granted granted Critical
Publication of CN106648947B publication Critical patent/CN106648947B/en
Active legal-status Critical Current
Anticipated expiration legal-status Critical

Links

Classifications

    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06FELECTRIC DIGITAL DATA PROCESSING
    • G06F11/00Error detection; Error correction; Monitoring
    • G06F11/07Responding to the occurrence of a fault, e.g. fault tolerance
    • G06F11/0703Error or fault processing not based on redundancy, i.e. by taking additional measures to deal with the error or fault not making use of redundancy in operation, in hardware, or in data representation
    • G06F11/0706Error or fault processing not based on redundancy, i.e. by taking additional measures to deal with the error or fault not making use of redundancy in operation, in hardware, or in data representation the processing taking place on a specific hardware platform or in a specific software environment
    • G06F11/0727Error or fault processing not based on redundancy, i.e. by taking additional measures to deal with the error or fault not making use of redundancy in operation, in hardware, or in data representation the processing taking place on a specific hardware platform or in a specific software environment in a storage system, e.g. in a DASD or network based storage system
    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06FELECTRIC DIGITAL DATA PROCESSING
    • G06F11/00Error detection; Error correction; Monitoring
    • G06F11/22Detection or location of defective computer hardware by testing during standby operation or during idle time, e.g. start-up testing
    • G06F11/2273Test methods

Landscapes

  • Engineering & Computer Science (AREA)
  • Theoretical Computer Science (AREA)
  • General Engineering & Computer Science (AREA)
  • Quality & Reliability (AREA)
  • Physics & Mathematics (AREA)
  • General Physics & Mathematics (AREA)
  • Computer Hardware Design (AREA)
  • Test And Diagnosis Of Digital Computers (AREA)

Abstract

The present invention provides a kind of method and apparatus of test multi-controller storage equipment, technical solution are as follows: during testing multi-controller storage equipment, multi-controller is stored into the segment controller in equipment in the controller of normal operation as working controller, it is responsible for the even running of multi-controller memory apparatus system, the controller of other normal operations is then used as controller to be tested, it carries out restarting test, the controller to be tested for restarting failure is marked, successful controller to be tested is restarted and then enters next round test.The present invention can improve testing efficiency with lower cost.

Description

Method and device for testing multi-controller storage equipment
Technical Field
The invention relates to the technical field of data storage, in particular to a method and a device for testing multi-controller storage equipment.
Background
For testing of multi-controller storage equipment, the stored abnormal test is the most critical, and the abnormal processing condition among controllers in the operation of a storage equipment system is concerned, so that high availability is realized, and online services are not influenced.
Currently, the commonly used test method is to manually plug in or unplug the controller, restart the controller, or introduce a third-party hardware to simulate the controller signal interruption. The testing method for manually plugging and restarting the controller is relatively close to the actual use condition, but the testing efficiency is very low; the test method for introducing the signal interruption of the third-party hardware analog controller has high test efficiency, but generates additional high cost.
Disclosure of Invention
In view of the above, the present invention provides a method and an apparatus for testing a multi-controller storage device, which can improve the testing efficiency at a low cost.
In order to achieve the purpose, the invention provides the following technical scheme:
a method of testing a multi-controller storage device, comprising:
step A, selecting a working controller and a controller to be tested from controllers of the multi-controller storage device which normally operate, keeping the working controller normally operating, and restarting all the controllers to be tested;
b, after restarting the preset duration of all the controllers to be tested, detecting whether each controller to be tested is started successfully, if all the controllers to be tested are started successfully, returning to execute the step A, otherwise, executing the step C;
and C, marking the unsuccessfully started controller to be tested as a controller with abnormal operation, if the number of the controllers with normal operation is not more than 1, ending the test flow, otherwise, returning to the step A for execution.
An apparatus for testing a multi-controller storage device, comprising: the device comprises a selection unit, a starting unit, a detection unit and a marking unit;
the selection unit is used for selecting a working controller and a controller to be tested from controllers which normally run in the multi-controller storage equipment;
the starting unit is used for keeping the working controller to normally operate and restarting all the controllers to be tested;
the detection unit is used for detecting whether each controller to be tested is started successfully or not after the preset duration of all the controllers to be tested is restarted by the starting unit, if all the controllers to be tested are started successfully, the selection unit is instructed to select the working controller and the controllers to be tested again, and if not, the marking unit is informed to mark the abnormality of the controllers to be tested which are not started successfully;
the marking unit is used for marking the controller to be tested which is not started successfully as the controller with abnormal operation after receiving the notification of the detection unit, if the number of the controllers which normally operate is not more than 1, the test process is ended, otherwise, the selection unit is instructed to select the working controller and the controller to be tested again.
According to the technical scheme, in each round of test, the working controller and the controller to be tested are selected from the controllers which normally operate in the multi-controller storage device, the controller to be tested is restarted and tested, the controller to be tested which is restarted successfully is used as the controller which normally operates, and the next round of test is carried out until only 1 controller which normally operates is left in the multi-controller storage device. The invention can realize the abnormal test of the multi-controller storage device under the unattended condition, and achieve the purpose of simulating the restart fault processing of the controllers on the site, thereby improving the stability and the test efficiency of the storage product.
Drawings
FIG. 1 is a flow chart of a method of testing a multi-controller storage device according to an embodiment of the present invention;
FIG. 2 is a schematic structural diagram of an apparatus for testing a multi-controller storage device according to an embodiment of the present invention.
Detailed Description
In order to make the objects, technical solutions and advantages of the present invention more apparent, the technical solutions of the present invention are described in detail below with reference to the accompanying drawings according to embodiments.
In the invention, in the process of testing the multi-controller storage equipment, part of the controllers in the controllers which normally operate in the multi-controller storage equipment are used as working controllers and are responsible for stable operation of a multi-controller storage equipment system, and other controllers which normally operate are used as controllers to be tested to carry out restart test.
Referring to fig. 1, fig. 1 is a flowchart of a method for testing a multi-controller storage device according to an embodiment of the present invention, and as shown in fig. 1, the method mainly includes the following steps:
step 101, selecting a working controller and controllers to be tested from controllers which normally run in a multi-controller storage device, keeping the working controller normally running, and restarting all the controllers to be tested.
In the invention, in the control of the test of the multi-controller storage equipment, the stable operation of the multi-controller storage equipment needs to be ensured, and the front-end IO request can still be processed. Therefore, at least one controller in the multi-controller storage equipment is kept to normally work, and meanwhile, the rest controllers are subjected to restart testing, so that the front-end IO is not interrupted while the multi-controller storage equipment is tested, and the whole multi-controller storage equipment normally works.
Preferably, one controller is selected from the normally operating controllers in the multi-controller storage device to be used as the working controller, and the other normally operating controllers are used as the controllers to be tested. And keeping the working controller to normally operate, so that the multi-controller storage equipment normally processes the front-end IO request, and simultaneously restarting all the controllers to be tested to perform restart test.
Step 102, after restarting the preset duration of all the controllers to be tested, detecting whether each controller to be tested is started successfully, if all the controllers to be tested are started successfully, returning to execute step 101, otherwise, executing step 103;
after the controller to be tested is restarted, after a preset time period, whether the controller to be tested is restarted successfully or not can be detected, if all the controllers to be tested are restarted successfully, the test of the local wheel to the multi-controller storage device is passed, the next round of test can be continued, otherwise, further processing needs to be executed, and specifically, refer to step 103.
In this embodiment, the operating system of the controller to be tested is successfully started and all the boot-up programs in the operating system are normally started, which are used as the flags that the controller to be tested is successfully restarted, that is, when detecting whether each controller to be tested is successfully started (restarted), it is required to detect whether the operating system of each controller to be tested is successfully started and whether all the boot-up programs in the operating system of the controller to be tested are normally started, if the operating system of the controller to be tested is successfully started and all the boot-up programs in the operating system are normally started, it is determined that the controller to be tested is successfully started, otherwise, it is determined that the controller to be tested is not successfully started.
In this embodiment, whether the operating system of the controller to be tested is successfully started is detected by sending a ping message with a destination address being an IP address of the controller to be tested to the controller to be tested, if the ping message is successful, that is, a response message of the controller to be tested is received, it is indicated that the controller to be tested can normally communicate, and therefore it is determined that the operating system of the controller to be tested is successfully started, otherwise, it is indicated that the controller to be tested cannot normally communicate, and therefore it is determined that the operating system of the controller to be tested is not successfully started.
Under normal conditions, an authorized user can log in the controller, i.e., a user who is authorized to log in the controller by using a certain user name and password.
In order to detect whether all startup programs in an operating system of a controller to be tested are normally started, a user name and a login password of each controller in a multi-controller storage device need to be obtained in advance; when detecting whether all the boot-up programs in the operating system of the controller to be tested are normally started, the method of the prior art can be used to detect whether all the boot-up programs in the controller to be tested are started only by logging in the controller to be tested by using the user name and the login password of the controller to be tested, for example, whether all the boot-up programs are included in the boot-up programs in the operating system of the controller to be tested is checked, if yes, all the boot-up programs in the controller to be tested are started, and if not, all the boot-up programs in the controller to be tested are not normally started.
And 103, marking the unsuccessfully started controllers to be tested as abnormal-operation controllers, executing the step 104 if the number of the normally operated controllers is not more than 1, otherwise, returning to the step 101 for execution.
In this embodiment, the controller to be tested that is not successfully restarted is marked, and no longer participates in the test in the next test, and the controller to be tested that is successfully restarted enters the next test as the controller that normally operates.
And step 104, ending the test flow.
It can be seen from the above method that in this embodiment, in each round of testing of the multi-controller storage device, part of the controllers are selected as the working controllers, and the others are used as the controllers to be tested to perform the restart testing, so long as at least one controller that normally operates in the multi-controller storage device is not less than one, the multi-controller storage device can be subjected to the multi-round testing without manual intervention, and in each round of testing, because the working controller still operates normally, the multi-controller storage device can normally process the front-end IO request. Therefore, by adopting the method of the embodiment, the abnormal test under the unattended condition of the multi-controller storage equipment can be realized, the problem that the restarting fault processing of the controller occurs in a simulation field is solved, and the stability of the storage product is improved.
The method for testing the multi-controller storage device according to the present invention is described in detail above, and the present invention further provides an apparatus for testing the multi-controller storage device, which is described in detail below with reference to fig. 2.
Referring to fig. 2, fig. 2 is a schematic structural diagram of an apparatus for testing a multi-controller storage device according to an embodiment of the present invention, and as shown in fig. 2, the apparatus includes: a selection unit 201, a starting unit 202, a detection unit 203 and a marking unit 204; wherein,
a selecting unit 201, configured to select a working controller and a controller to be tested from controllers in a multi-controller storage device that operate normally;
the starting unit 202 is used for keeping the working controller selected by the selecting unit 201 to normally operate and restarting all the controllers to be tested;
the detection unit 203 is used for detecting whether each controller to be tested is started successfully or not after the preset duration of all the controllers to be tested is restarted by the starting unit 202, if all the controllers to be tested are started successfully, the selection unit is instructed to select the working controller and the controllers to be tested again, and if not, the marking unit is informed to perform abnormal marking on the controllers to be tested which are not started successfully;
and a marking unit 204, configured to mark the controller to be tested that is not successfully started as a controller with abnormal operation after receiving the notification from the detecting unit 203, end the test flow if the number of the controllers that normally operate is not greater than 1, and otherwise instruct the selecting unit 201 to select the working controller and the controller to be tested again to enter a new round of test.
In the device shown in figure 2 of the drawings,
the selection unit 201 selects one controller from the normally operating controllers of the multi-controller storage device as a working controller, and the other controllers as controllers to be tested.
In the device shown in figure 2 of the drawings,
the detecting unit 203, when detecting whether each controller to be tested is successfully started, is configured to: detecting whether the operating system of each controller to be tested is successfully started and whether all the startup programs in the operating system of the controller to be tested are normally started, if the operating system of the controller to be tested is successfully started and all the startup programs in the operating system are normally started, determining that the controller to be tested is successfully started, otherwise, determining that the controller to be tested is not successfully started.
In the device shown in figure 2 of the drawings,
the detecting unit 203, when detecting whether the operating system of each controller to be tested is successfully started, is configured to: sending a ping message with the destination address being the IP address of the controller to be tested to the controller to be tested, if receiving the response message of the controller to be tested, determining that the operating system of the controller to be tested is started successfully, otherwise, determining that the operating system of the controller to be tested is not started successfully.
The apparatus shown in fig. 2 further includes an obtaining unit, configured to obtain in advance a user name and a login password of each controller in the multi-controller storage device;
the detecting unit 203, when detecting whether all the boot programs in the operating system of the controller to be tested are normally started, is configured to: and logging in the controller to be tested by using the user name and the login password of the controller to be tested, and determining whether all startup programs in the controller to be tested are started or not based on all started programs in an operating system of the controller to be tested.
The above description is only for the purpose of illustrating the preferred embodiments of the present invention and is not to be construed as limiting the invention, and any modifications, equivalents, improvements and the like made within the spirit and principle of the present invention should be included in the scope of the present invention.

Claims (8)

1. A method of testing a multi-controller storage device, the method comprising:
step A, selecting one controller from the normally-operated controllers of the multi-controller storage equipment as a working controller, using other controllers as controllers to be tested, keeping the working controller to normally operate and restarting all the controllers to be tested;
b, after restarting the preset duration of all the controllers to be tested, detecting whether each controller to be tested is started successfully or not, if all the controllers to be tested are started successfully, returning to execute the step A, otherwise, executing the step C;
and C, marking the unsuccessfully started controller to be tested as a controller with abnormal operation, if the number of the controllers with normal operation is not more than 1, ending the test flow, otherwise, returning to the step A for execution.
2. The method of claim 1,
the method for detecting whether each controller to be tested is started successfully comprises the following steps: detecting whether the operating system of each controller to be tested is successfully started and whether all the startup programs in the operating system of the controller to be tested are normally started, if the operating system of the controller to be tested is successfully started and all the startup programs in the operating system are normally started, determining that the controller to be tested is successfully started, otherwise, determining that the controller to be tested is not successfully started.
3. The method of claim 2,
the method for detecting whether the operating system of each controller to be tested is started successfully comprises the following steps: sending a ping message with the destination address being the IP address of the controller to be tested to the controller to be tested, if receiving the response message of the controller to be tested, determining that the operating system of the controller to be tested is started successfully, otherwise, determining that the operating system of the controller to be tested is not started successfully.
4. The method of claim 2,
the method comprises the steps that a user name and a login password of each controller in the multi-controller storage device are obtained in advance;
the method for detecting whether all startup programs in the operating system of the controller to be tested are normally started comprises the following steps: and logging in the controller to be tested by using the user name and the login password of the controller to be tested, and determining whether all startup programs in the controller to be tested are started or not based on all started programs in an operating system of the controller to be tested.
5. An apparatus for testing a multi-controller storage device, the apparatus comprising: the device comprises a selection unit, a starting unit, a detection unit and a marking unit;
the selection unit is used for selecting one controller from the controllers which normally run in the multi-controller storage equipment as a working controller, and the other controllers as the controllers to be tested;
the starting unit is used for keeping the working controller to normally operate and restarting all the controllers to be tested;
the detection unit is used for detecting whether each controller to be tested is started successfully or not after the preset duration of all the controllers to be tested is restarted by the starting unit, if all the controllers to be tested are started successfully, the selection unit is instructed to select the working controller and the controllers to be tested again, and if not, the marking unit is informed to mark the abnormality of the controllers to be tested which are not started successfully;
the marking unit is used for marking the controller to be tested which is not started successfully as the controller with abnormal operation after receiving the notification of the detection unit, if the number of the controllers which normally operate is not more than 1, the test process is ended, otherwise, the selection unit is instructed to select the working controller and the controller to be tested again.
6. The apparatus of claim 5,
the detection unit is used for detecting whether each controller to be tested is started successfully or not, and is used for: detecting whether the operating system of each controller to be tested is successfully started and whether all the startup programs in the operating system of the controller to be tested are normally started, if the operating system of the controller to be tested is successfully started and all the startup programs in the operating system are normally started, determining that the controller to be tested is successfully started, otherwise, determining that the controller to be tested is not successfully started.
7. The apparatus of claim 6,
the detection unit is used for detecting whether the operating system of each controller to be tested is started successfully or not, and is used for: sending a ping message with the destination address being the IP address of the controller to be tested to the controller to be tested, if receiving the response message of the controller to be tested, determining that the operating system of the controller to be tested is started successfully, otherwise, determining that the operating system of the controller to be tested is not started successfully.
8. The apparatus of claim 6,
the device also comprises an acquisition unit, a storage unit and a control unit, wherein the acquisition unit is used for acquiring the user name and the login password of each controller in the multi-controller storage equipment in advance;
the detection unit is used for detecting whether all startup programs in the operating system of the controller to be tested are normally started or not, and is used for: and logging in the controller to be tested by using the user name and the login password of the controller to be tested, and determining whether all startup programs in the controller to be tested are started or not based on all started programs in an operating system of the controller to be tested.
CN201611196787.9A 2016-12-22 2016-12-22 A kind of method and apparatus of test multi-controller storage equipment Active CN106648947B (en)

Priority Applications (1)

Application Number Priority Date Filing Date Title
CN201611196787.9A CN106648947B (en) 2016-12-22 2016-12-22 A kind of method and apparatus of test multi-controller storage equipment

Applications Claiming Priority (1)

Application Number Priority Date Filing Date Title
CN201611196787.9A CN106648947B (en) 2016-12-22 2016-12-22 A kind of method and apparatus of test multi-controller storage equipment

Publications (2)

Publication Number Publication Date
CN106648947A CN106648947A (en) 2017-05-10
CN106648947B true CN106648947B (en) 2019-09-13

Family

ID=58834274

Family Applications (1)

Application Number Title Priority Date Filing Date
CN201611196787.9A Active CN106648947B (en) 2016-12-22 2016-12-22 A kind of method and apparatus of test multi-controller storage equipment

Country Status (1)

Country Link
CN (1) CN106648947B (en)

Families Citing this family (1)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN107220140A (en) * 2017-06-29 2017-09-29 郑州云海信息技术有限公司 The method for testing reliability and system of a kind of dual control storage system

Citations (6)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN101373420A (en) * 2008-09-09 2009-02-25 创新科存储技术(深圳)有限公司 Multi-controller disk array and command processing method thereof
CN102289398A (en) * 2010-06-17 2011-12-21 英业达股份有限公司 Restart testing method
CN102457547A (en) * 2010-10-20 2012-05-16 英业达股份有限公司 Upgrading method for storage area network equipment of multiple controllers
CN103634388A (en) * 2013-11-22 2014-03-12 华为技术有限公司 Method for processing restarting of controllers in storage server, related equipment and communication system
CN105027080A (en) * 2013-03-14 2015-11-04 密克罗奇普技术公司 Boot sequencing for multi boot devices
EP3278214A4 (en) * 2015-03-31 2018-11-21 Zuora, Inc. Systems and methods for live testing performance conditions of a multi-tenant system

Patent Citations (6)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN101373420A (en) * 2008-09-09 2009-02-25 创新科存储技术(深圳)有限公司 Multi-controller disk array and command processing method thereof
CN102289398A (en) * 2010-06-17 2011-12-21 英业达股份有限公司 Restart testing method
CN102457547A (en) * 2010-10-20 2012-05-16 英业达股份有限公司 Upgrading method for storage area network equipment of multiple controllers
CN105027080A (en) * 2013-03-14 2015-11-04 密克罗奇普技术公司 Boot sequencing for multi boot devices
CN103634388A (en) * 2013-11-22 2014-03-12 华为技术有限公司 Method for processing restarting of controllers in storage server, related equipment and communication system
EP3278214A4 (en) * 2015-03-31 2018-11-21 Zuora, Inc. Systems and methods for live testing performance conditions of a multi-tenant system

Also Published As

Publication number Publication date
CN106648947A (en) 2017-05-10

Similar Documents

Publication Publication Date Title
WO2018006702A1 (en) Abnormality processing method, apparatus and system in automation test
CN103973858B (en) The Auto-Test System of mobile terminal
WO2016082543A1 (en) Device testing method and testing device
CN111694710A (en) Method, device and equipment for monitoring faults of substrate management controller and storage medium
CN111078484A (en) Power-off test method, device, equipment and storage medium for system upgrading
CN104615519A (en) Method for detecting whether memory capacity of server is lost or not under LINUX system
CN113504932A (en) Firmware data updating method and device
CN106648947B (en) A kind of method and apparatus of test multi-controller storage equipment
US20160147636A1 (en) Enhanced resiliency testing by enabling state level control for request
CN110633221B (en) Fuzzy test automation vulnerability positioning method
CN114338464A (en) Fault diagnosis method, device, equipment and computer readable storage medium
CN116909800B (en) Method and device for locating crash information and storage medium
CN113722181A (en) BMC process monitoring method, device, system and medium of server
CN107105100B (en) Method and system for monitoring mobile terminal game
CN101706752B (en) Method and device for in-situ software error positioning
CN108946370A (en) Elevator faults information processing method, system, equipment and readable storage medium storing program for executing
CN106326089B (en) Automatic testing method, device and system
CN107870840B (en) IPMI-based server multi-test instruction automatic execution method
CN111722997A (en) Abnormality detection method for automated testing and computer-readable storage medium
CN107612786B (en) Method and system for testing router
CN106940647B (en) Code management method and device
CN115794525A (en) BMC (baseboard management controller) pressure testing method, device, equipment and storage medium
TW201516423A (en) System and method for testing boot time of servers
CN110851309B (en) Integrated verification system and method
CN106776182A (en) A kind of automatic identification station and the method for performing correspondence script

Legal Events

Date Code Title Description
PB01 Publication
PB01 Publication
SE01 Entry into force of request for substantive examination
SE01 Entry into force of request for substantive examination
GR01 Patent grant
GR01 Patent grant