CN101819547A - Method for testing stability and reliability of storage subsystem - Google Patents
Method for testing stability and reliability of storage subsystem Download PDFInfo
- Publication number
- CN101819547A CN101819547A CN201010132051A CN201010132051A CN101819547A CN 101819547 A CN101819547 A CN 101819547A CN 201010132051 A CN201010132051 A CN 201010132051A CN 201010132051 A CN201010132051 A CN 201010132051A CN 101819547 A CN101819547 A CN 101819547A
- Authority
- CN
- China
- Prior art keywords
- write
- disk
- script
- test
- stability
- Prior art date
- Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
- Pending
Links
- 238000012360 testing method Methods 0.000 title claims abstract description 33
- 238000000034 method Methods 0.000 title claims abstract description 7
- 238000010998 test method Methods 0.000 claims abstract description 13
- 238000011990 functional testing Methods 0.000 claims description 4
- 238000012163 sequencing technique Methods 0.000 claims description 3
- 238000012956 testing procedure Methods 0.000 claims description 3
- 238000012795 verification Methods 0.000 claims description 3
- 230000007774 longterm Effects 0.000 abstract 2
- 238000005516 engineering process Methods 0.000 description 3
- 230000015572 biosynthetic process Effects 0.000 description 2
- 238000013112 stability test Methods 0.000 description 2
- 230000009286 beneficial effect Effects 0.000 description 1
- 238000013461 design Methods 0.000 description 1
- 230000000694 effects Effects 0.000 description 1
- 230000004927 fusion Effects 0.000 description 1
- 238000013508 migration Methods 0.000 description 1
- 230000005012 migration Effects 0.000 description 1
Landscapes
- Test And Diagnosis Of Digital Computers (AREA)
Abstract
The invention provides a method for testing the stability and reliability of a storage subsystem. Due to a special application environment and client requirements, a single storage server is generally requested to be provided with more than 12 hard disks, the following test method is made and a test script is formed to perform related test on the reliability and stability of the storage server aiming at the user-related application in order to ensure the reliability and stability for long-term running of the system; because many hard disks are used by the storage server, the product has serious hidden troubles in the stability such as out-of-order magnetic disk, drifting, large-pressure reading and writing environment, disconnection of the magnetic disk, IO error and the like before use of the test method; after use of the test method of the invention, the hidden troubles can be found and solved at the beginning of the development so as to solve similar lot size problem during the product supply and greatly improve the stability and reliability for long-term running of the product.
Description
Technical field
The present invention relates to a kind of Computer Applied Technology field, specifically a kind of method of testing at storage subsystem stability and reliability.
Background technology
Because the hard disk that storage server uses is more, does not use before this method of testing, has a lot of stable hidden danger in the product, for example: out of order, the drift of disk, problems such as big pressure is read and write under the environment, and disk goes offline, IO reports an error also do not have good solution at present.
Summary of the invention
The purpose of this invention is to provide a kind of method of testing at storage subsystem stability and reliability.
The objective of the invention is to realize in the following manner, concrete testing procedure is as follows:
1) disk sequence check test: write hard disk positioned in sequence script under the linux system, realize that the order of server configures hard disk is is intermittently read and write, thereby realize the verification of logical order corresponding relation under hard disc physical order and the system;
2) disc hot insert functional test: write hard disk under the linux system and continue pressure read-write script and carry out, when disk is done big pressure read-write operation, pull out certain piece or several hard disks at random, and use script logging related log record and operation note; Again the hard disk that pulls out is turned back to system again according to the physics sequencing afterwards, check and write down the associative operation record, contrast the situation of change of plug front and back device number simultaneously, check hard disc apparatus drift problem whether to exist with this;
IO readwrite performance influence to total system when 3) disk breaks down in the system is tested: write disk under the linux system and continue pressure read-write script and disk I state recording script, when disk is done big pressure read-write operation, pull out a hard disk in the system at random, utilize the system IO information of script logging iostat instrument report simultaneously, thereby whether the IO situation of checking system is normal;
4) disk continues pressure test: write hard disk pressure test script under the linux system, comprise the circular order read-write operation of data blocks such as 4K, 16K, 64K, 128K, 256K, 512K, 1M, with the actual application environment of this analog subscriber.Carry out the pressure test script, 3 working days of circular flow;
The invention has the beneficial effects as follows: the whole even formed product transboundary in fusion design back in part is carried out on the 2-8 road technically can allow the user buy the work that a two-way MP product substitutes low before configuration 4 road MP products with lower cost, necessary if any upgrading, can be at any time be upgraded to 4 tunnel 8 road MP servers even with the mode of expansion module.Avoided purchasing new server once more, reduced space for its deployment, handling cost and purchase cost and risen, other technologies such as combined with virtualization more can realize work such as headachy system applies migration.
Storage server is because of its application circumstances and customer demand, generally require 12 above hard disks of stand-alone configuration, therefore in order to guarantee the long-time running reliability and stability of this system, formulate following method of testing, and the formation test script, at user's related application storage server is done related reliability and stability test:
Embodiment
Storage server has been widely used in the Internet user at present, in order to improve stability and the reliability of this series products in the application process of reaching the standard grade, formulates this method of testing.
Embodiment
One. the environment of test
The tide storage server
The (SuSE) Linux OS environment
Two. testing procedure (implementation method):
Storage server is because of its application circumstances and customer demand, generally require 12 above hard disks of stand-alone configuration, therefore in order to guarantee the long-time running reliability and stability of this system, formulate following method of testing, and the formation test script, at user's related application storage server is done related reliability and stability test:
1) disk sequence check test: write hard disk positioned in sequence script under the linux system, realize that the order of server configures hard disk is is intermittently read and write, thereby realize the verification of logical order corresponding relation under hard disc physical order and the system;
2) disc hot insert functional test: write hard disk under the linux system and continue pressure read-write script and carry out, when disk is done big pressure read-write operation, pull out certain piece or several hard disks at random, and use script logging related log record and operation note; Again the hard disk that pulls out is turned back to system again according to the physics sequencing afterwards, check and write down the associative operation record, contrast the situation of change of plug front and back device number simultaneously, check hard disc apparatus drift problem whether to exist with this;
IO readwrite performance influence to total system when 3) disk breaks down in the system is tested: write disk under the linux system and continue pressure read-write script and disk I state recording script, when disk is done big pressure read-write operation, pull out a hard disk in the system at random, utilize the system IO information of script logging iostat instrument report simultaneously, thereby whether the IO situation of checking system is normal;
4) disk continues pressure test: write hard disk pressure test script under the linux system, comprise the circular order read-write operation of data blocks such as 4K, 16K, 64K, 128K, 256K, 512K, 1M, with the actual application environment of this analog subscriber.Carry out the pressure test script, 3 working days of circular flow;
Three. test data and defining standard:
1) in the disk sequence check process, logical order can be mapped one by one under the physical sequential of disk and the system, then calculate test and pass through, otherwise test is not passed through;
2) in the disc hot insert functional test, disk is after pulling out, and there are relevant daily record and warning message prompting in system, and after hard disk was inserted again, system can normally discern and use, and then calculate this test and pass through, otherwise test is not passed through;
In the IO readwrite performance influence test to total system when 3) disk breaks down in the system, after a hard disk in the system was removed, therefore the IO performance of other disks was not affected in the system, still can normally read and write, how to calculate this test and pass through, otherwise test is not passed through;
4) disk continues in the pressure test, and system journal does not have and reports an error, and disk does not have relevant informations such as the dish of falling, IO report an error, and disk read-write is normal, how to calculate test to pass through, otherwise test is not passed through;
Four. the effect of test
Because the hard disk that storage server uses is more, does not use before this method of testing, has a lot of stable hidden danger in the product, for example: out of order, the drift of disk, under the big pressure read-write environment, problem such as disk goes offline, IO reports an error; After using this method of testing, this type of potential problem just can be found at the beginning of exploitation and in time be solved, and has avoided occurring similar lot-size problem in the subsequent product supply of material process, has greatly improved the stability and the reliability of product long-time running.
Claims (1)
1. the method for testing at storage subsystem stability and reliability is characterized in that, in order to guarantee the long-time running reliability and stability of this system, formulates method of testing, and forms test script, and testing procedure is as follows:
A. disk sequence check test: under the linux system, write hard disk positioned in sequence script, realize that the order of server configures hard disk is is intermittently read and write, thereby realize the verification of logical order corresponding relation under hard disc physical order and the system;
B. disc hot insert functional test: under the linux system, write hard disk and continue pressure read-write script and carry out, when disk is done big pressure read-write operation, pull out certain piece or several hard disks at random, and use script logging related log record and operation note; Again the hard disk that pulls out is turned back to system again according to the physics sequencing afterwards, check and write down the associative operation record, contrast the situation of change of plug front and back device number simultaneously, check hard disc apparatus drift problem whether to exist with this;
IO readwrite performance influence to total system when c. disk breaks down in the system is tested: write disk under the linux system and continue pressure read-write script and disk I state recording script, when disk is done big pressure read-write operation, pull out a hard disk in the system at random, utilize the system IO information of script logging iostat instrument report simultaneously, thereby whether the IO situation of checking system is normal;
D. disk continues pressure test: write hard disk pressure test script under the linux system, the circular order read-write operation that comprises 4K, 16K, 64K, 128K, 256K, 512K, 1M data block, with the actual application environment of this analog subscriber, carry out the pressure test script, 3 working days of circular flow.
Priority Applications (1)
Application Number | Priority Date | Filing Date | Title |
---|---|---|---|
CN201010132051A CN101819547A (en) | 2010-03-25 | 2010-03-25 | Method for testing stability and reliability of storage subsystem |
Applications Claiming Priority (1)
Application Number | Priority Date | Filing Date | Title |
---|---|---|---|
CN201010132051A CN101819547A (en) | 2010-03-25 | 2010-03-25 | Method for testing stability and reliability of storage subsystem |
Publications (1)
Publication Number | Publication Date |
---|---|
CN101819547A true CN101819547A (en) | 2010-09-01 |
Family
ID=42654660
Family Applications (1)
Application Number | Title | Priority Date | Filing Date |
---|---|---|---|
CN201010132051A Pending CN101819547A (en) | 2010-03-25 | 2010-03-25 | Method for testing stability and reliability of storage subsystem |
Country Status (1)
Country | Link |
---|---|
CN (1) | CN101819547A (en) |
Cited By (21)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
CN103000228A (en) * | 2011-09-08 | 2013-03-27 | 上海宝信软件股份有限公司 | Storage device test method and system |
CN103116542A (en) * | 2013-01-24 | 2013-05-22 | 浪潮(北京)电子信息产业有限公司 | Test method of equipment expansion stability |
CN103473158A (en) * | 2013-09-18 | 2013-12-25 | 浪潮电子信息产业股份有限公司 | Disk pressure testing method for Linux server |
CN103744759A (en) * | 2013-12-27 | 2014-04-23 | 浪潮电子信息产业股份有限公司 | Method for verifying unattended disk performance and stability under Linux system |
CN104133749A (en) * | 2014-07-23 | 2014-11-05 | 浪潮电子信息产业股份有限公司 | Verification method for HDD detecting failure and HDD out-of-order defect of server |
CN104392748A (en) * | 2014-10-28 | 2015-03-04 | 浪潮电子信息产业股份有限公司 | Method for testing hard disk reading speed under linux system |
CN104484253A (en) * | 2014-12-29 | 2015-04-01 | 浪潮电子信息产业股份有限公司 | Automatic testing method for human-computer interaction Intel MIC (Many Integrated Core) card |
CN104536865A (en) * | 2015-01-15 | 2015-04-22 | 浪潮电子信息产业股份有限公司 | Method for testing read-write performance of PMC Raid card |
CN104536902A (en) * | 2015-01-28 | 2015-04-22 | 浪潮电子信息产业股份有限公司 | Performance optimization method for IO (Input/Output) subsystem by testing server |
CN104536860A (en) * | 2015-01-16 | 2015-04-22 | 浪潮电子信息产业股份有限公司 | Hard disk sequence marshalling method based on real-time JBOD monitoring mode |
CN105260274A (en) * | 2015-10-23 | 2016-01-20 | 浪潮电子信息产业股份有限公司 | Method for detecting random hot plug stability of hard disk based on linux |
CN105528269A (en) * | 2016-01-29 | 2016-04-27 | 浪潮电子信息产业股份有限公司 | Design method for detecting disorder of hard disks based on Itanium platform |
CN106095634A (en) * | 2016-06-21 | 2016-11-09 | 浪潮电子信息产业股份有限公司 | Automatic test method for verifying disk disorder applied to Windows |
CN106933509A (en) * | 2017-02-17 | 2017-07-07 | 联想(北京)有限公司 | The processing method and electronic equipment of a kind of disk number |
CN107329914A (en) * | 2017-06-29 | 2017-11-07 | 郑州云海信息技术有限公司 | It is a kind of that the out of order method and device of hard disk is detected based on linux system |
CN107391333A (en) * | 2017-08-14 | 2017-11-24 | 郑州云海信息技术有限公司 | A kind of OSD disk failures method of testing and system |
CN108279747A (en) * | 2018-01-25 | 2018-07-13 | 郑州云海信息技术有限公司 | A method of not influencing heat dissipation instead of hard disk bracket |
CN109062749A (en) * | 2018-08-15 | 2018-12-21 | 郑州云海信息技术有限公司 | A kind of server M.3 hard disk hot-plug stability test method and device |
US10671403B2 (en) | 2017-02-17 | 2020-06-02 | Lenovo (Beijing) Co., Ltd. | Method and apparatus for identifying hardware device in operating system |
CN111309535A (en) * | 2020-02-14 | 2020-06-19 | 苏州浪潮智能科技有限公司 | Method and system for testing hard disk in server, electronic equipment and storage medium |
CN112068998A (en) * | 2019-06-10 | 2020-12-11 | 山东华芯半导体有限公司 | Automatic solid state disk testing method based on USB (universal serial bus) flash disk PE (provider edge) system |
-
2010
- 2010-03-25 CN CN201010132051A patent/CN101819547A/en active Pending
Cited By (24)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
CN103000228A (en) * | 2011-09-08 | 2013-03-27 | 上海宝信软件股份有限公司 | Storage device test method and system |
CN103116542A (en) * | 2013-01-24 | 2013-05-22 | 浪潮(北京)电子信息产业有限公司 | Test method of equipment expansion stability |
CN103116542B (en) * | 2013-01-24 | 2015-12-02 | 浪潮(北京)电子信息产业有限公司 | Equipment dilatation stability test method |
CN103473158A (en) * | 2013-09-18 | 2013-12-25 | 浪潮电子信息产业股份有限公司 | Disk pressure testing method for Linux server |
CN103744759A (en) * | 2013-12-27 | 2014-04-23 | 浪潮电子信息产业股份有限公司 | Method for verifying unattended disk performance and stability under Linux system |
CN104133749A (en) * | 2014-07-23 | 2014-11-05 | 浪潮电子信息产业股份有限公司 | Verification method for HDD detecting failure and HDD out-of-order defect of server |
CN104392748A (en) * | 2014-10-28 | 2015-03-04 | 浪潮电子信息产业股份有限公司 | Method for testing hard disk reading speed under linux system |
CN104484253A (en) * | 2014-12-29 | 2015-04-01 | 浪潮电子信息产业股份有限公司 | Automatic testing method for human-computer interaction Intel MIC (Many Integrated Core) card |
CN104536865A (en) * | 2015-01-15 | 2015-04-22 | 浪潮电子信息产业股份有限公司 | Method for testing read-write performance of PMC Raid card |
CN104536860A (en) * | 2015-01-16 | 2015-04-22 | 浪潮电子信息产业股份有限公司 | Hard disk sequence marshalling method based on real-time JBOD monitoring mode |
CN104536902A (en) * | 2015-01-28 | 2015-04-22 | 浪潮电子信息产业股份有限公司 | Performance optimization method for IO (Input/Output) subsystem by testing server |
CN105260274A (en) * | 2015-10-23 | 2016-01-20 | 浪潮电子信息产业股份有限公司 | Method for detecting random hot plug stability of hard disk based on linux |
CN105528269A (en) * | 2016-01-29 | 2016-04-27 | 浪潮电子信息产业股份有限公司 | Design method for detecting disorder of hard disks based on Itanium platform |
CN106095634A (en) * | 2016-06-21 | 2016-11-09 | 浪潮电子信息产业股份有限公司 | Automatic test method for verifying disk disorder applied to Windows |
CN106933509A (en) * | 2017-02-17 | 2017-07-07 | 联想(北京)有限公司 | The processing method and electronic equipment of a kind of disk number |
CN106933509B (en) * | 2017-02-17 | 2019-09-24 | 联想(北京)有限公司 | A kind of processing method and electronic equipment of disk number |
US10671403B2 (en) | 2017-02-17 | 2020-06-02 | Lenovo (Beijing) Co., Ltd. | Method and apparatus for identifying hardware device in operating system |
CN107329914A (en) * | 2017-06-29 | 2017-11-07 | 郑州云海信息技术有限公司 | It is a kind of that the out of order method and device of hard disk is detected based on linux system |
CN107391333A (en) * | 2017-08-14 | 2017-11-24 | 郑州云海信息技术有限公司 | A kind of OSD disk failures method of testing and system |
CN107391333B (en) * | 2017-08-14 | 2020-10-16 | 苏州浪潮智能科技有限公司 | OSD disk fault testing method and system |
CN108279747A (en) * | 2018-01-25 | 2018-07-13 | 郑州云海信息技术有限公司 | A method of not influencing heat dissipation instead of hard disk bracket |
CN109062749A (en) * | 2018-08-15 | 2018-12-21 | 郑州云海信息技术有限公司 | A kind of server M.3 hard disk hot-plug stability test method and device |
CN112068998A (en) * | 2019-06-10 | 2020-12-11 | 山东华芯半导体有限公司 | Automatic solid state disk testing method based on USB (universal serial bus) flash disk PE (provider edge) system |
CN111309535A (en) * | 2020-02-14 | 2020-06-19 | 苏州浪潮智能科技有限公司 | Method and system for testing hard disk in server, electronic equipment and storage medium |
Similar Documents
Publication | Publication Date | Title |
---|---|---|
CN101819547A (en) | Method for testing stability and reliability of storage subsystem | |
TWI674503B (en) | Method and system for testing firmware of solid-state storage device, and electronic apparatus | |
CN103473158A (en) | Disk pressure testing method for Linux server | |
CN102568522B (en) | The method of testing of hard disk performance and device | |
CN112331256B (en) | DRAM test method and device, readable storage medium and electronic equipment | |
WO2018118837A1 (en) | Method to dynamically inject errors in a repairable memory on silicon and a method to validate built-in-self-repair logic | |
US20140372814A1 (en) | Method for testing a memory and memory system | |
CN107797881A (en) | A kind of data coherence tester method, apparatus, equipment and storage medium | |
CN102521092A (en) | Hard disk test method and device | |
CN105573676B (en) | A kind of method of verify data consistency in storage system | |
CN107391333A (en) | A kind of OSD disk failures method of testing and system | |
CN109445691B (en) | Method and device for improving FTL algorithm development and verification efficiency | |
CN104503781A (en) | Firmware upgrading method for hard disk and storage system | |
CN105741883A (en) | Test method and device | |
CN107885624A (en) | Control the method, apparatus and server of BIOS Debugging message output | |
Xu et al. | The research of memory fault simulation and fault injection method for bit software test | |
CN103116542A (en) | Test method of equipment expansion stability | |
TWI514400B (en) | Repairing a memory device | |
CN107273251A (en) | A kind of method of testing of the racks of Rack in a production environment JBOD storages | |
CN105824719B (en) | A kind of detection method and system of random access memory | |
CN105183641A (en) | Data consistency check method and system for kernel module | |
US6910100B2 (en) | Detecting open write transactions to mass storage | |
CN109359001A (en) | Method, device and equipment for testing cold restart of solid-state disk | |
CN109582513A (en) | A kind of JBOD test method and system based on generic server | |
CN107039085A (en) | A kind of method and system for realizing the test of storage subsystem data integrity |
Legal Events
Date | Code | Title | Description |
---|---|---|---|
C06 | Publication | ||
PB01 | Publication | ||
C10 | Entry into substantive examination | ||
SE01 | Entry into force of request for substantive examination | ||
C02 | Deemed withdrawal of patent application after publication (patent law 2001) | ||
WD01 | Invention patent application deemed withdrawn after publication |
Application publication date: 20100901 |