CN101819547A - Method for testing stability and reliability of storage subsystem - Google Patents

Method for testing stability and reliability of storage subsystem Download PDF

Info

Publication number
CN101819547A
CN101819547A CN201010132051A CN201010132051A CN101819547A CN 101819547 A CN101819547 A CN 101819547A CN 201010132051 A CN201010132051 A CN 201010132051A CN 201010132051 A CN201010132051 A CN 201010132051A CN 101819547 A CN101819547 A CN 101819547A
Authority
CN
China
Prior art keywords
write
disk
script
test
stability
Prior art date
Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
Pending
Application number
CN201010132051A
Other languages
Chinese (zh)
Inventor
孙波
蔡积淼
Current Assignee (The listed assignees may be inaccurate. Google has not performed a legal analysis and makes no representation or warranty as to the accuracy of the list.)
Inspur Electronic Information Industry Co Ltd
Original Assignee
Langchao Electronic Information Industry Co Ltd
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
Filing date
Publication date
Application filed by Langchao Electronic Information Industry Co Ltd filed Critical Langchao Electronic Information Industry Co Ltd
Priority to CN201010132051A priority Critical patent/CN101819547A/en
Publication of CN101819547A publication Critical patent/CN101819547A/en
Pending legal-status Critical Current

Links

Landscapes

  • Test And Diagnosis Of Digital Computers (AREA)

Abstract

The invention provides a method for testing the stability and reliability of a storage subsystem. Due to a special application environment and client requirements, a single storage server is generally requested to be provided with more than 12 hard disks, the following test method is made and a test script is formed to perform related test on the reliability and stability of the storage server aiming at the user-related application in order to ensure the reliability and stability for long-term running of the system; because many hard disks are used by the storage server, the product has serious hidden troubles in the stability such as out-of-order magnetic disk, drifting, large-pressure reading and writing environment, disconnection of the magnetic disk, IO error and the like before use of the test method; after use of the test method of the invention, the hidden troubles can be found and solved at the beginning of the development so as to solve similar lot size problem during the product supply and greatly improve the stability and reliability for long-term running of the product.

Description

A kind of method of testing at storage subsystem stability and reliability
Technical field
The present invention relates to a kind of Computer Applied Technology field, specifically a kind of method of testing at storage subsystem stability and reliability.
Background technology
Because the hard disk that storage server uses is more, does not use before this method of testing, has a lot of stable hidden danger in the product, for example: out of order, the drift of disk, problems such as big pressure is read and write under the environment, and disk goes offline, IO reports an error also do not have good solution at present.
Summary of the invention
The purpose of this invention is to provide a kind of method of testing at storage subsystem stability and reliability.
The objective of the invention is to realize in the following manner, concrete testing procedure is as follows:
1) disk sequence check test: write hard disk positioned in sequence script under the linux system, realize that the order of server configures hard disk is is intermittently read and write, thereby realize the verification of logical order corresponding relation under hard disc physical order and the system;
2) disc hot insert functional test: write hard disk under the linux system and continue pressure read-write script and carry out, when disk is done big pressure read-write operation, pull out certain piece or several hard disks at random, and use script logging related log record and operation note; Again the hard disk that pulls out is turned back to system again according to the physics sequencing afterwards, check and write down the associative operation record, contrast the situation of change of plug front and back device number simultaneously, check hard disc apparatus drift problem whether to exist with this;
IO readwrite performance influence to total system when 3) disk breaks down in the system is tested: write disk under the linux system and continue pressure read-write script and disk I state recording script, when disk is done big pressure read-write operation, pull out a hard disk in the system at random, utilize the system IO information of script logging iostat instrument report simultaneously, thereby whether the IO situation of checking system is normal;
4) disk continues pressure test: write hard disk pressure test script under the linux system, comprise the circular order read-write operation of data blocks such as 4K, 16K, 64K, 128K, 256K, 512K, 1M, with the actual application environment of this analog subscriber.Carry out the pressure test script, 3 working days of circular flow;
The invention has the beneficial effects as follows: the whole even formed product transboundary in fusion design back in part is carried out on the 2-8 road technically can allow the user buy the work that a two-way MP product substitutes low before configuration 4 road MP products with lower cost, necessary if any upgrading, can be at any time be upgraded to 4 tunnel 8 road MP servers even with the mode of expansion module.Avoided purchasing new server once more, reduced space for its deployment, handling cost and purchase cost and risen, other technologies such as combined with virtualization more can realize work such as headachy system applies migration.
Storage server is because of its application circumstances and customer demand, generally require 12 above hard disks of stand-alone configuration, therefore in order to guarantee the long-time running reliability and stability of this system, formulate following method of testing, and the formation test script, at user's related application storage server is done related reliability and stability test:
Embodiment
Storage server has been widely used in the Internet user at present, in order to improve stability and the reliability of this series products in the application process of reaching the standard grade, formulates this method of testing.
Embodiment
One. the environment of test
The tide storage server
The (SuSE) Linux OS environment
Two. testing procedure (implementation method):
Storage server is because of its application circumstances and customer demand, generally require 12 above hard disks of stand-alone configuration, therefore in order to guarantee the long-time running reliability and stability of this system, formulate following method of testing, and the formation test script, at user's related application storage server is done related reliability and stability test:
1) disk sequence check test: write hard disk positioned in sequence script under the linux system, realize that the order of server configures hard disk is is intermittently read and write, thereby realize the verification of logical order corresponding relation under hard disc physical order and the system;
2) disc hot insert functional test: write hard disk under the linux system and continue pressure read-write script and carry out, when disk is done big pressure read-write operation, pull out certain piece or several hard disks at random, and use script logging related log record and operation note; Again the hard disk that pulls out is turned back to system again according to the physics sequencing afterwards, check and write down the associative operation record, contrast the situation of change of plug front and back device number simultaneously, check hard disc apparatus drift problem whether to exist with this;
IO readwrite performance influence to total system when 3) disk breaks down in the system is tested: write disk under the linux system and continue pressure read-write script and disk I state recording script, when disk is done big pressure read-write operation, pull out a hard disk in the system at random, utilize the system IO information of script logging iostat instrument report simultaneously, thereby whether the IO situation of checking system is normal;
4) disk continues pressure test: write hard disk pressure test script under the linux system, comprise the circular order read-write operation of data blocks such as 4K, 16K, 64K, 128K, 256K, 512K, 1M, with the actual application environment of this analog subscriber.Carry out the pressure test script, 3 working days of circular flow;
Three. test data and defining standard:
1) in the disk sequence check process, logical order can be mapped one by one under the physical sequential of disk and the system, then calculate test and pass through, otherwise test is not passed through;
2) in the disc hot insert functional test, disk is after pulling out, and there are relevant daily record and warning message prompting in system, and after hard disk was inserted again, system can normally discern and use, and then calculate this test and pass through, otherwise test is not passed through;
In the IO readwrite performance influence test to total system when 3) disk breaks down in the system, after a hard disk in the system was removed, therefore the IO performance of other disks was not affected in the system, still can normally read and write, how to calculate this test and pass through, otherwise test is not passed through;
4) disk continues in the pressure test, and system journal does not have and reports an error, and disk does not have relevant informations such as the dish of falling, IO report an error, and disk read-write is normal, how to calculate test to pass through, otherwise test is not passed through;
Four. the effect of test
Because the hard disk that storage server uses is more, does not use before this method of testing, has a lot of stable hidden danger in the product, for example: out of order, the drift of disk, under the big pressure read-write environment, problem such as disk goes offline, IO reports an error; After using this method of testing, this type of potential problem just can be found at the beginning of exploitation and in time be solved, and has avoided occurring similar lot-size problem in the subsequent product supply of material process, has greatly improved the stability and the reliability of product long-time running.

Claims (1)

1. the method for testing at storage subsystem stability and reliability is characterized in that, in order to guarantee the long-time running reliability and stability of this system, formulates method of testing, and forms test script, and testing procedure is as follows:
A. disk sequence check test: under the linux system, write hard disk positioned in sequence script, realize that the order of server configures hard disk is is intermittently read and write, thereby realize the verification of logical order corresponding relation under hard disc physical order and the system;
B. disc hot insert functional test: under the linux system, write hard disk and continue pressure read-write script and carry out, when disk is done big pressure read-write operation, pull out certain piece or several hard disks at random, and use script logging related log record and operation note; Again the hard disk that pulls out is turned back to system again according to the physics sequencing afterwards, check and write down the associative operation record, contrast the situation of change of plug front and back device number simultaneously, check hard disc apparatus drift problem whether to exist with this;
IO readwrite performance influence to total system when c. disk breaks down in the system is tested: write disk under the linux system and continue pressure read-write script and disk I state recording script, when disk is done big pressure read-write operation, pull out a hard disk in the system at random, utilize the system IO information of script logging iostat instrument report simultaneously, thereby whether the IO situation of checking system is normal;
D. disk continues pressure test: write hard disk pressure test script under the linux system, the circular order read-write operation that comprises 4K, 16K, 64K, 128K, 256K, 512K, 1M data block, with the actual application environment of this analog subscriber, carry out the pressure test script, 3 working days of circular flow.
CN201010132051A 2010-03-25 2010-03-25 Method for testing stability and reliability of storage subsystem Pending CN101819547A (en)

Priority Applications (1)

Application Number Priority Date Filing Date Title
CN201010132051A CN101819547A (en) 2010-03-25 2010-03-25 Method for testing stability and reliability of storage subsystem

Applications Claiming Priority (1)

Application Number Priority Date Filing Date Title
CN201010132051A CN101819547A (en) 2010-03-25 2010-03-25 Method for testing stability and reliability of storage subsystem

Publications (1)

Publication Number Publication Date
CN101819547A true CN101819547A (en) 2010-09-01

Family

ID=42654660

Family Applications (1)

Application Number Title Priority Date Filing Date
CN201010132051A Pending CN101819547A (en) 2010-03-25 2010-03-25 Method for testing stability and reliability of storage subsystem

Country Status (1)

Country Link
CN (1) CN101819547A (en)

Cited By (21)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN103000228A (en) * 2011-09-08 2013-03-27 上海宝信软件股份有限公司 Storage device test method and system
CN103116542A (en) * 2013-01-24 2013-05-22 浪潮(北京)电子信息产业有限公司 Test method of equipment expansion stability
CN103473158A (en) * 2013-09-18 2013-12-25 浪潮电子信息产业股份有限公司 Disk pressure testing method for Linux server
CN103744759A (en) * 2013-12-27 2014-04-23 浪潮电子信息产业股份有限公司 Method for verifying unattended disk performance and stability under Linux system
CN104133749A (en) * 2014-07-23 2014-11-05 浪潮电子信息产业股份有限公司 Verification method for HDD detecting failure and HDD out-of-order defect of server
CN104392748A (en) * 2014-10-28 2015-03-04 浪潮电子信息产业股份有限公司 Method for testing hard disk reading speed under linux system
CN104484253A (en) * 2014-12-29 2015-04-01 浪潮电子信息产业股份有限公司 Automatic testing method for human-computer interaction Intel MIC (Many Integrated Core) card
CN104536865A (en) * 2015-01-15 2015-04-22 浪潮电子信息产业股份有限公司 Method for testing read-write performance of PMC Raid card
CN104536902A (en) * 2015-01-28 2015-04-22 浪潮电子信息产业股份有限公司 Performance optimization method for IO (Input/Output) subsystem by testing server
CN104536860A (en) * 2015-01-16 2015-04-22 浪潮电子信息产业股份有限公司 Hard disk sequence marshalling method based on real-time JBOD monitoring mode
CN105260274A (en) * 2015-10-23 2016-01-20 浪潮电子信息产业股份有限公司 Method for detecting random hot plug stability of hard disk based on linux
CN105528269A (en) * 2016-01-29 2016-04-27 浪潮电子信息产业股份有限公司 Design method for detecting disorder of hard disks based on Itanium platform
CN106095634A (en) * 2016-06-21 2016-11-09 浪潮电子信息产业股份有限公司 Automatic test method for verifying disk disorder applied to Windows
CN106933509A (en) * 2017-02-17 2017-07-07 联想(北京)有限公司 The processing method and electronic equipment of a kind of disk number
CN107329914A (en) * 2017-06-29 2017-11-07 郑州云海信息技术有限公司 It is a kind of that the out of order method and device of hard disk is detected based on linux system
CN107391333A (en) * 2017-08-14 2017-11-24 郑州云海信息技术有限公司 A kind of OSD disk failures method of testing and system
CN108279747A (en) * 2018-01-25 2018-07-13 郑州云海信息技术有限公司 A method of not influencing heat dissipation instead of hard disk bracket
CN109062749A (en) * 2018-08-15 2018-12-21 郑州云海信息技术有限公司 A kind of server M.3 hard disk hot-plug stability test method and device
US10671403B2 (en) 2017-02-17 2020-06-02 Lenovo (Beijing) Co., Ltd. Method and apparatus for identifying hardware device in operating system
CN111309535A (en) * 2020-02-14 2020-06-19 苏州浪潮智能科技有限公司 Method and system for testing hard disk in server, electronic equipment and storage medium
CN112068998A (en) * 2019-06-10 2020-12-11 山东华芯半导体有限公司 Automatic solid state disk testing method based on USB (universal serial bus) flash disk PE (provider edge) system

Cited By (24)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN103000228A (en) * 2011-09-08 2013-03-27 上海宝信软件股份有限公司 Storage device test method and system
CN103116542A (en) * 2013-01-24 2013-05-22 浪潮(北京)电子信息产业有限公司 Test method of equipment expansion stability
CN103116542B (en) * 2013-01-24 2015-12-02 浪潮(北京)电子信息产业有限公司 Equipment dilatation stability test method
CN103473158A (en) * 2013-09-18 2013-12-25 浪潮电子信息产业股份有限公司 Disk pressure testing method for Linux server
CN103744759A (en) * 2013-12-27 2014-04-23 浪潮电子信息产业股份有限公司 Method for verifying unattended disk performance and stability under Linux system
CN104133749A (en) * 2014-07-23 2014-11-05 浪潮电子信息产业股份有限公司 Verification method for HDD detecting failure and HDD out-of-order defect of server
CN104392748A (en) * 2014-10-28 2015-03-04 浪潮电子信息产业股份有限公司 Method for testing hard disk reading speed under linux system
CN104484253A (en) * 2014-12-29 2015-04-01 浪潮电子信息产业股份有限公司 Automatic testing method for human-computer interaction Intel MIC (Many Integrated Core) card
CN104536865A (en) * 2015-01-15 2015-04-22 浪潮电子信息产业股份有限公司 Method for testing read-write performance of PMC Raid card
CN104536860A (en) * 2015-01-16 2015-04-22 浪潮电子信息产业股份有限公司 Hard disk sequence marshalling method based on real-time JBOD monitoring mode
CN104536902A (en) * 2015-01-28 2015-04-22 浪潮电子信息产业股份有限公司 Performance optimization method for IO (Input/Output) subsystem by testing server
CN105260274A (en) * 2015-10-23 2016-01-20 浪潮电子信息产业股份有限公司 Method for detecting random hot plug stability of hard disk based on linux
CN105528269A (en) * 2016-01-29 2016-04-27 浪潮电子信息产业股份有限公司 Design method for detecting disorder of hard disks based on Itanium platform
CN106095634A (en) * 2016-06-21 2016-11-09 浪潮电子信息产业股份有限公司 Automatic test method for verifying disk disorder applied to Windows
CN106933509A (en) * 2017-02-17 2017-07-07 联想(北京)有限公司 The processing method and electronic equipment of a kind of disk number
CN106933509B (en) * 2017-02-17 2019-09-24 联想(北京)有限公司 A kind of processing method and electronic equipment of disk number
US10671403B2 (en) 2017-02-17 2020-06-02 Lenovo (Beijing) Co., Ltd. Method and apparatus for identifying hardware device in operating system
CN107329914A (en) * 2017-06-29 2017-11-07 郑州云海信息技术有限公司 It is a kind of that the out of order method and device of hard disk is detected based on linux system
CN107391333A (en) * 2017-08-14 2017-11-24 郑州云海信息技术有限公司 A kind of OSD disk failures method of testing and system
CN107391333B (en) * 2017-08-14 2020-10-16 苏州浪潮智能科技有限公司 OSD disk fault testing method and system
CN108279747A (en) * 2018-01-25 2018-07-13 郑州云海信息技术有限公司 A method of not influencing heat dissipation instead of hard disk bracket
CN109062749A (en) * 2018-08-15 2018-12-21 郑州云海信息技术有限公司 A kind of server M.3 hard disk hot-plug stability test method and device
CN112068998A (en) * 2019-06-10 2020-12-11 山东华芯半导体有限公司 Automatic solid state disk testing method based on USB (universal serial bus) flash disk PE (provider edge) system
CN111309535A (en) * 2020-02-14 2020-06-19 苏州浪潮智能科技有限公司 Method and system for testing hard disk in server, electronic equipment and storage medium

Similar Documents

Publication Publication Date Title
CN101819547A (en) Method for testing stability and reliability of storage subsystem
TWI674503B (en) Method and system for testing firmware of solid-state storage device, and electronic apparatus
CN103473158A (en) Disk pressure testing method for Linux server
CN102568522B (en) The method of testing of hard disk performance and device
CN112331256B (en) DRAM test method and device, readable storage medium and electronic equipment
WO2018118837A1 (en) Method to dynamically inject errors in a repairable memory on silicon and a method to validate built-in-self-repair logic
US20140372814A1 (en) Method for testing a memory and memory system
CN107797881A (en) A kind of data coherence tester method, apparatus, equipment and storage medium
CN102521092A (en) Hard disk test method and device
CN105573676B (en) A kind of method of verify data consistency in storage system
CN107391333A (en) A kind of OSD disk failures method of testing and system
CN109445691B (en) Method and device for improving FTL algorithm development and verification efficiency
CN104503781A (en) Firmware upgrading method for hard disk and storage system
CN105741883A (en) Test method and device
CN107885624A (en) Control the method, apparatus and server of BIOS Debugging message output
Xu et al. The research of memory fault simulation and fault injection method for bit software test
CN103116542A (en) Test method of equipment expansion stability
TWI514400B (en) Repairing a memory device
CN107273251A (en) A kind of method of testing of the racks of Rack in a production environment JBOD storages
CN105824719B (en) A kind of detection method and system of random access memory
CN105183641A (en) Data consistency check method and system for kernel module
US6910100B2 (en) Detecting open write transactions to mass storage
CN109359001A (en) Method, device and equipment for testing cold restart of solid-state disk
CN109582513A (en) A kind of JBOD test method and system based on generic server
CN107039085A (en) A kind of method and system for realizing the test of storage subsystem data integrity

Legal Events

Date Code Title Description
C06 Publication
PB01 Publication
C10 Entry into substantive examination
SE01 Entry into force of request for substantive examination
C02 Deemed withdrawal of patent application after publication (patent law 2001)
WD01 Invention patent application deemed withdrawn after publication

Application publication date: 20100901