CN105204968A - Method and device for detecting fault memory - Google Patents

Method and device for detecting fault memory Download PDF

Info

Publication number
CN105204968A
CN105204968A CN201510763358.4A CN201510763358A CN105204968A CN 105204968 A CN105204968 A CN 105204968A CN 201510763358 A CN201510763358 A CN 201510763358A CN 105204968 A CN105204968 A CN 105204968A
Authority
CN
China
Prior art keywords
memory
failure
physical address
slot
internal memory
Prior art date
Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
Granted
Application number
CN201510763358.4A
Other languages
Chinese (zh)
Other versions
CN105204968B (en
Inventor
常现超
Current Assignee (The listed assignees may be inaccurate. Google has not performed a legal analysis and makes no representation or warranty as to the accuracy of the list.)
Inspur Beijing Electronic Information Industry Co Ltd
Original Assignee
Inspur Beijing Electronic Information Industry Co Ltd
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
Filing date
Publication date
Application filed by Inspur Beijing Electronic Information Industry Co Ltd filed Critical Inspur Beijing Electronic Information Industry Co Ltd
Priority to CN201510763358.4A priority Critical patent/CN105204968B/en
Publication of CN105204968A publication Critical patent/CN105204968A/en
Application granted granted Critical
Publication of CN105204968B publication Critical patent/CN105204968B/en
Active legal-status Critical Current
Anticipated expiration legal-status Critical

Links

Landscapes

  • Debugging And Monitoring (AREA)
  • Techniques For Improving Reliability Of Storages (AREA)

Abstract

The embodiment of the invention provides a method and a device for detecting a fault memory. The method comprises the steps of monitoring the operation state of memories in real time, generating fault information including a physical address of the fault memory when detecting that the memory fails, acquiring the fault information, acquiring the physical address of the fault memory according to the fault information, acquiring all slots complying with a PCI standard, a PCI-X standard or a PCI-E standard by virtue of a system kernel, analyzing to obtain operation information of memories on the slots, acquiring the variation range of the physical addresses of all the memories on the slots according to the operation information, and positioning to obtain the fault memory according to the physical address of the fault memory and the variation range of the physical addresses of all the memories on the slots. According to the method and the device, the accuracy of a search result is guaranteed, and meanwhile, the working efficiency is relatively high; the memory failure can be timely found out and effectively processed, so that the damage caused by the memory failure to application services is reduced, and the stability and the reliability of a system are improved.

Description

A kind of failure memory detection method and device
Technical field
The present invention relates to computer application field, particularly relate to a kind of failure memory detection method and device.
Background technology
Along with the develop rapidly of computer technology and integrated circuit technique, no matter from software or hardware, computing machine is obtained for lifting at full speed.Due to the increase of computer hardware, also improve the failure rate of computer hardware simultaneously, especially in internal memory, present application program is in order to improve performance, increasing to the demand of internal memory, the number of the memory bar inserted in computing machine also increases thereupon, and this just makes the probability of malfunction of internal memory greatly promote.If the some memory bars in one group of memory bar break down, and service routine may use the memory bar of fault, thus make service become unstable, even occur data corruption, bring about great losses.At present, when internal memory breaks down, from database, memory information is obtained by artificial mode, then the memory information obtained is analyzed, finally search the internal memory obtaining fault, because manual working speed is limited with reflection speed, and there is higher error rate, therefore, from database, memory information is acquired by artificial mode, and analysis obtains failure memory, the accuracy analyzing the failure memory obtained can not be ensured, the efficiency obtaining failure memory is also lower, make to process memory failure timely and effectively, application service is worked the mischief, the stable of system and reliability are had a strong impact on.
Summary of the invention
In view of this, the embodiment of the present invention provides a kind of failure memory detection method and device, to solve in prior art when internal memory breaks down, from database, memory information is obtained by artificial mode, then the memory information obtained is analyzed, finally search the internal memory obtaining fault, the accuracy analyzing the failure memory obtained can not be ensured, the efficiency obtaining failure memory is also lower, make to process memory failure timely and effectively, application service is worked the mischief, has had a strong impact on the problem of the stable of system and reliability.
For achieving the above object, the embodiment of the present invention provides following technical scheme:
A kind of failure memory detection method, comprising:
Real-Time Monitoring internal memory running status, when detecting that internal memory breaks down, generating the failure message comprising failure memory physical address, obtaining described failure message, obtain the physical address of failure memory according to described failure message;
Obtain all slots deferring to PCI standard, PCI-X or PCI-E standard by system kernel, resolve and obtain all operation informations being placed in internal memory on described slot, obtain all physical address variation ranges being placed in internal memory on described slot;
According to physical address and all physical address variation ranges being placed in internal memory on described slot of described failure memory, location obtains failure memory.
Wherein, described location also comprises after obtaining failure memory:
Logic off-line operation is carried out to described failure memory, by Data Migration in described failure memory in other normal running memories.
Wherein, when described failure memory detection method is used for linux system, by mcelog program Real-Time Monitoring internal memory running status in described linux system, when detecting that internal memory breaks down, comprised the failure message of failure memory physical address by described mcelog Program Generating.
Wherein, described generation comprises after comprising the failure message of failure memory physical address: preserved in a register by described failure message.
Wherein, the described failure message of described acquisition comprises:
Judge to store failure message in described register;
If store, be then stored in the failure message in described register described in obtaining.
Wherein, described obtained the slot of all PCI of deferring to standards, PCI-X or PCI-E standard by system kernel after also comprise:
Obtain the operation information of all described slots, determine the current operation slot used in all described slots;
Resolve the operation information obtaining and running internal memory on slot described in all being placed in, obtain the physical address variation range running internal memory on slot described in all being placed in.
Wherein, described location also comprises after obtaining failure memory:
Give the alarm, and generate journal file, wherein, described alarm is audible alarm and/or flashlamp alarm.
A kind of failure memory pick-up unit, comprising: monitoring acquisition module, slot acquiring unit and positioning unit; Wherein,
Described monitoring acquiring unit, for Real-Time Monitoring internal memory running status, when detecting that internal memory breaks down, generating the failure message comprising failure memory physical address, obtaining described failure message, obtain the physical address of failure memory according to described failure message;
Described slot acquiring unit, for being obtained all slots deferring to PCI standard, PCI-X or PCI-E standard by system kernel, resolve and obtain all operation informations being placed in internal memory on described slot, obtain all physical address variation ranges being placed in internal memory on described slot;
Described positioning unit, for according to the physical address of described failure memory and all physical address variation ranges being placed in internal memory on described slot, location obtains failure memory.
Wherein, described failure memory pick-up unit, also comprises: transferring module, for carrying out logic off-line operation to described failure memory, by Data Migration in described failure memory in other normal running memories.
Wherein, described failure memory pick-up unit, also comprises: memory module, for generate comprise failure memory physical address failure message after, described failure message is preserved in a register.
Based on technique scheme, the failure memory detection method that the embodiment of the present invention provides and device, Real-Time Monitoring internal memory running status, when detecting that internal memory breaks down, generate the failure message comprising failure memory physical address, the failure message generated during by obtaining memory failure, the physical address of failure memory is obtained according to this failure message, then obtained by system kernel and allly defer to PCI standard, the slot of PCI-X or PCI-E standard, resolve and obtain all operation informations being placed in internal memory on described slot, obtain all physical address variation ranges being placed in internal memory on described slot, according to physical address and all physical address variation ranges being placed in internal memory on described slot of the failure memory obtained, location obtains failure memory.Real-Time Monitoring internal memory running status, when detecting that internal memory breaks down, generate the failure message comprising failure memory physical address, located by physical address in the failure message of this generation and all physical address variation ranges being placed in internal memory on slot and obtain failure memory, from database, memory information is obtained than in artificial mode, then the memory information obtained is analyzed, finally search the internal memory obtaining fault, the physical location of failure memory can be obtained very accurately, ensure that lookup result correctness, simultaneously, there is higher work efficiency, enable Timeliness coverage also processes memory failure effectively, reduce because memory failure works the mischief to application service, improve the stable of system and reliability.
Accompanying drawing explanation
In order to be illustrated more clearly in the embodiment of the present invention or technical scheme of the prior art, be briefly described to the accompanying drawing used required in embodiment or description of the prior art below, apparently, accompanying drawing in the following describes is only embodiments of the invention, for those of ordinary skill in the art, under the prerequisite not paying creative work, other accompanying drawing can also be obtained according to the accompanying drawing provided.
The process flow diagram of the failure memory detection method that Fig. 1 provides for the embodiment of the present invention;
Another process flow diagram of the failure memory detection method that Fig. 2 provides for the embodiment of the present invention;
The method flow diagram obtaining failure message is obtained in the failure memory detection method that Fig. 3 provides for the embodiment of the present invention;
Fig. 4 shows the method flow diagram obtaining being placed in the physical address variation range of internal memory on slot in the failure memory detection method that the embodiment of the present invention provides;
The system chart of the failure memory pick-up unit that Fig. 5 provides for the embodiment of the present invention;
Fig. 6 shows another system chart of the failure memory pick-up unit that the embodiment of the present invention provides;
Fig. 7 shows another system chart of the failure memory pick-up unit that the embodiment of the present invention provides.
Embodiment
Below in conjunction with the accompanying drawing in the embodiment of the present invention, be clearly and completely described the technical scheme in the embodiment of the present invention, obviously, described embodiment is only the present invention's part embodiment, instead of whole embodiments.Based on the embodiment in the present invention, those of ordinary skill in the art, not making the every other embodiment obtained under creative work prerequisite, belong to the scope of protection of the invention.
The process flow diagram of the failure memory detection method that Fig. 1 provides for the embodiment of the present invention, Real-Time Monitoring internal memory running status, when detecting that internal memory breaks down, generate the failure message comprising failure memory physical address, located by physical address in the failure message of this generation and all physical address variation ranges being placed in internal memory on slot and obtain failure memory, the physical location of failure memory can be obtained very accurately, ensure that lookup result correctness, simultaneously, there is higher work efficiency, enable Timeliness coverage also processes memory failure effectively, reduce because memory failure works the mischief to application service, improve the stable of system and reliability, with reference to Fig. 1, this failure memory detection method can comprise:
Step S100: Real-Time Monitoring internal memory running status, when detecting that internal memory breaks down, generating the failure message comprising failure memory physical address, obtaining described failure message, obtain the physical address of failure memory according to described failure message;
Optionally, when the failure memory detection method that the embodiment of the present invention provides is for linux system, can by mcelog program Real-Time Monitoring internal memory running status in linux system, when detecting that internal memory breaks down, comprised the failure message of failure memory physical address by this mcelog Program Generating.
Optionally, generating the failure message comprising failure memory physical address, also this failure message is being preserved in a register.
Optionally, can by storing failure message in criterion register time, obtain the failure message that is stored in this register, obtain failure message.
Step S110: obtain all slots deferring to PCI standard, PCI-X or PCI-E standard by system kernel, resolves and obtains all operation informations being placed in internal memory on described slot, obtain all physical address variation ranges being placed in internal memory on described slot;
Optionally, can obtained by system kernel all defer to the slot of PCI standard, PCI-X or PCI-E standard after, obtain the operation information of all slots, determine the current operation slot used in all slots, namely current interior existence placed on it is not by the slot used, only parsing acquisition is all is placed in the operation information running internal memory on slot, obtain allly being placed in the physical address variation range running internal memory on slot, can avoid obtaining useless physical address conversion range data.
Step S120: according to physical address and all physical address variation ranges being placed in internal memory on described slot of described failure memory, location obtains failure memory.
While location obtains failure memory, the slot inserted described in this failure memory can be obtained, that is, obtain failed storage and be the physical location obtaining this failure memory.
Optionally, after location obtains failure memory, logic off-line operation can also be carried out, by Data Migration in this failure memory in other normal running memories to locating the failure memory obtained, its no longer serviced program and operating system are used, ensures the normal operation of system.
Optionally, after the failure memory obtained location carries out logic off-line operation, this failure memory can be changed, and the normal internal memory after this replacing is carried out on-line running by rear flank at any time in replacing fault.
Optionally, if obtained by system kernel all defer to the slot of PCI standard, PCI-X or PCI-E standard after, obtain the operation information of all slots, determine the current operation slot used in all slots, then only according to the physical address of the failure memory obtained be allly placed in the physical address variation range running internal memory on slot, can locate and obtain failure memory.
Optionally, after location obtains failure memory, can also give the alarm, and generate journal file.
Optionally, after location obtains failure memory, the alarm sent can be audible alarm, can be flashlamp alarm, also can be the combination of audible alarm and flashlamp alarm.
Based on technique scheme, the failure memory detection method that the embodiment of the present invention provides, Real-Time Monitoring internal memory running status, when detecting that internal memory breaks down, generate the failure message comprising failure memory physical address, the failure message generated during by obtaining memory failure, the physical address of failure memory is obtained according to this failure message, then obtained by system kernel and allly defer to PCI standard, the slot of PCI-X or PCI-E standard, resolve and obtain all operation informations being placed in internal memory on described slot, obtain all physical address variation ranges being placed in internal memory on described slot, according to physical address and all physical address variation ranges being placed in internal memory on described slot of the failure memory obtained, location obtains failure memory.Real-Time Monitoring internal memory running status, when detecting that internal memory breaks down, generate the failure message comprising failure memory physical address, located by physical address in the failure message of this generation and all physical address variation ranges being placed in internal memory on slot and obtain failure memory, from database, memory information is obtained than in artificial mode, then the memory information obtained is analyzed, finally search the internal memory obtaining fault, the physical location of failure memory can be obtained very accurately, ensure that lookup result correctness, simultaneously, there is higher work efficiency, enable Timeliness coverage also processes memory failure effectively, reduce because memory failure works the mischief to application service, improve the stable of system and reliability.
Optionally, Fig. 2 shows another process flow diagram of the failure memory detection method that the embodiment of the present invention provides, and with reference to Fig. 2, this failure memory detection method can comprise:
Step S200: Real-Time Monitoring internal memory running status, when detecting that internal memory breaks down, generating the failure message comprising failure memory physical address, obtaining described failure message, obtain the physical address of failure memory according to described failure message;
Step S210: obtain all slots deferring to PCI standard, PCI-X or PCI-E standard by system kernel, resolves and obtains all operation informations being placed in internal memory on described slot, obtain all physical address variation ranges being placed in internal memory on described slot;
Step S220: according to physical address and all physical address variation ranges being placed in internal memory on described slot of described failure memory, location obtains failure memory,
Step S230: carry out logic off-line operation to described failure memory, by Data Migration in described failure memory in other normal running memories.
After location obtains failure memory, logic off-line operation can also be carried out to locating the failure memory obtained, by Data Migration in this failure memory in other normal running memories, its no longer serviced program and operating system are used, ensures the normal operation of system.
Optionally, after the failure memory obtained location carries out logic off-line operation, this failure memory can be changed, and the normal internal memory after this replacing is carried out on-line running by rear flank at any time in replacing fault.
Optionally, Fig. 3 shows in the failure memory detection method that the embodiment of the present invention provides the method flow diagram obtaining obtaining failure message, and with reference to Fig. 3, the method for this acquisition failure message can comprise:
Step S300: described failure message is preserved in a register;
Step S310: judge to store failure message in described register;
If after generation comprises the failure message of failure memory physical address, this failure message is preserved in a register, then can by judging that storing failure message in register judges whether system occurs memory failure.
Step S320: if store, be then stored in the failure message in described register described in obtaining.
If store failure message in criterion register, then there is memory failure in illustrative system, then obtain this storage failure message in a register, then obtain the physical address of failure memory according to this failure message, continues subsequent operation; Anyway, if do not store failure message in criterion register, then not there is memory failure in illustrative system, then continue monitoring internal memory running status.
Optionally, Fig. 4 shows the method flow diagram obtaining being placed in the physical address variation range of internal memory on slot in the failure memory detection method that the embodiment of the present invention provides, with reference to Fig. 4, this method obtaining being placed in the physical address variation range of internal memory on slot can comprise:
Step S400: obtain all slots deferring to PCI standard, PCI-X or PCI-E standard by system kernel;
PCI (PeripheralComponentInterconnect, peripheral component interconnect) standard is a kind of standard for defining local bus released for 1991 by Intel (Intel) company, uses the slot of PCI standard to use 32 bit widths transmission data.
PCI-X interface is and the renewal version of the pci bus connected, and uses the slot of PCI-X standard to adopt 64 bit widths to transmit data.
PCI-E (PCI-Express) standard is standard third generation I/O (I/O) the bussing technique standard for defining local bus that Intel (Intel) recommends by general acclaim out.Use the slot of PCI-E standard according to bus bit wide different and difference to some extent.
Step S410: the operation information obtaining all described slots, determines the current operation slot used in all described slots;
And be all inserted with internal memory in not all slot, and also the internal memory of not all insertion slot is all used at any time, therefore, after all slots of acquisition, by obtaining the operation information of all slots, determine the current operation slot used in all slots by this operation information, be placed in this internal memory run on slot and be the internal memory run when there is memory failure.
Step S420: resolve the operation information obtaining and running internal memory on slot described in all being placed in, obtain the physical address variation range running internal memory on slot described in all being placed in.
Just memory failure can be detected when memory failure runs because only have, therefore failure memory must be present in the internal memory run, therefore, only can resolve to obtain and allly be placed in the operation information running internal memory on slot, obtain allly being placed in the physical address variation range running internal memory on slot, avoid obtaining useless physical address conversion range data, the physical address of the failure memory that last basis obtains and be allly placed in the physical address variation range running internal memory on slot, location obtains failure memory.
The failure memory detection method that the embodiment of the present invention provides, Real-Time Monitoring internal memory running status, when detecting that internal memory breaks down, generate the failure message comprising failure memory physical address, located by physical address in the failure message of this generation and all physical address variation ranges being placed in internal memory on slot and obtain failure memory, ensure that lookup result correctness, simultaneously, there is higher work efficiency, enable Timeliness coverage also processes memory failure effectively, reduce because memory failure works the mischief to application service, improve the stable of system and reliability.
Be introduced the failure memory pick-up unit that the embodiment of the present invention provides below, failure memory pick-up unit described below can mutual corresponding reference with above-described failure memory detection method.
The system chart of the failure memory pick-up unit that Fig. 5 provides for the embodiment of the present invention, with reference to Fig. 5, this failure memory pick-up unit can comprise: monitoring acquisition module 100, slot acquiring unit 200 and positioning unit 300; Wherein,
Monitoring acquiring unit 100, for Real-Time Monitoring internal memory running status, when detecting that internal memory breaks down, generating the failure message comprising failure memory physical address, obtaining described failure message, obtain the physical address of failure memory according to described failure message;
Slot acquiring unit 200, for being obtained all slots deferring to PCI standard, PCI-X or PCI-E standard by system kernel, resolve and obtain all operation informations being placed in internal memory on described slot, obtain all physical address variation ranges being placed in internal memory on described slot;
Positioning unit 300, for according to the physical address of described failure memory and all physical address variation ranges being placed in internal memory on described slot, location obtains failure memory.
Optionally, Fig. 6 shows another system chart of the failure memory pick-up unit that the embodiment of the present invention provides, and with reference to Fig. 6, this failure memory pick-up unit can comprise: transferring module 400.
Transferring module 400, for carrying out logic off-line operation to described failure memory, by Data Migration in described failure memory in other normal running memories.
Optionally, Fig. 7 shows another system chart of the failure memory pick-up unit that the embodiment of the present invention provides, and with reference to Fig. 7, this failure memory pick-up unit can comprise: memory module 500.
Memory module 500, for generate comprise failure memory physical address failure message after, described failure message is preserved in a register.
The failure memory pick-up unit that the embodiment of the present invention provides, Real-Time Monitoring internal memory running status, when detecting that internal memory breaks down, generate the failure message comprising failure memory physical address, located by physical address in the failure message of this generation and all physical address variation ranges being placed in internal memory on slot and obtain failure memory, ensure that lookup result correctness, simultaneously, there is higher work efficiency, enable Timeliness coverage also processes memory failure effectively, reduce because memory failure works the mischief to application service, improve the stable of system and reliability.
In this instructions, each embodiment adopts the mode of going forward one by one to describe, and what each embodiment stressed is the difference with other embodiments, between each embodiment identical similar portion mutually see.For device disclosed in embodiment, because it corresponds to the method disclosed in Example, so description is fairly simple, relevant part illustrates see method part.
Professional can also recognize further, in conjunction with unit and the algorithm steps of each example of embodiment disclosed herein description, can realize with electronic hardware, computer software or the combination of the two, in order to the interchangeability of hardware and software is clearly described, generally describe composition and the step of each example in the above description according to function.These functions perform with hardware or software mode actually, depend on application-specific and the design constraint of technical scheme.Professional and technical personnel can use distinct methods to realize described function to each specifically should being used for, but this realization should not thought and exceeds scope of the present invention.
To the above-mentioned explanation of the disclosed embodiments, professional and technical personnel in the field are realized or uses the present invention.To be apparent for those skilled in the art to the multiple amendment of these embodiments, General Principle as defined herein can without departing from the spirit or scope of the present invention, realize in other embodiments.Therefore, the present invention can not be restricted to these embodiments shown in this article, but will meet the widest scope consistent with principle disclosed herein and features of novelty.

Claims (10)

1. a failure memory detection method, is characterized in that, comprising:
Real-Time Monitoring internal memory running status, when detecting that internal memory breaks down, generating the failure message comprising failure memory physical address, obtaining described failure message, obtain the physical address of failure memory according to described failure message;
Obtain all slots deferring to PCI standard, PCI-X or PCI-E standard by system kernel, resolve and obtain all operation informations being placed in internal memory on described slot, obtain all physical address variation ranges being placed in internal memory on described slot;
According to physical address and all physical address variation ranges being placed in internal memory on described slot of described failure memory, location obtains failure memory.
2. failure memory detection method according to claim 1, is characterized in that, described location also comprises after obtaining failure memory:
Logic off-line operation is carried out to described failure memory, by Data Migration in described failure memory in other normal running memories.
3. failure memory detection method according to claim 1, it is characterized in that, when described failure memory detection method is used for linux system, by mcelog program Real-Time Monitoring internal memory running status in described linux system, when detecting that internal memory breaks down, comprised the failure message of failure memory physical address by described mcelog Program Generating.
4. failure memory detection method according to claim 1, is characterized in that, described generation comprises after comprising the failure message of failure memory physical address: preserved in a register by described failure message.
5. failure memory detection method according to claim 4, is characterized in that, the described failure message of described acquisition comprises:
Judge to store failure message in described register;
If store, be then stored in the failure message in described register described in obtaining.
6. failure memory detection method according to claim 1, is characterized in that, described obtained the slot of all PCI of deferring to standards, PCI-X or PCI-E standard by system kernel after also comprise:
Obtain the operation information of all described slots, determine the current operation slot used in all described slots;
Resolve the operation information obtaining and running internal memory on slot described in all being placed in, obtain the physical address variation range running internal memory on slot described in all being placed in.
7. failure memory detection method according to claim 1, is characterized in that, described location also comprises after obtaining failure memory:
Give the alarm, and generate journal file, wherein, described alarm is audible alarm and/or flashlamp alarm.
8. a failure memory pick-up unit, is characterized in that, comprising: monitoring acquisition module, slot acquiring unit and positioning unit; Wherein,
Described monitoring acquiring unit, for Real-Time Monitoring internal memory running status, when detecting that internal memory breaks down, generating the failure message comprising failure memory physical address, obtaining described failure message, obtain the physical address of failure memory according to described failure message;
Described slot acquiring unit, for being obtained all slots deferring to PCI standard, PCI-X or PCI-E standard by system kernel, resolve and obtain all operation informations being placed in internal memory on described slot, obtain all physical address variation ranges being placed in internal memory on described slot;
Described positioning unit, for according to the physical address of described failure memory and all physical address variation ranges being placed in internal memory on described slot, location obtains failure memory.
9. failure memory pick-up unit according to claim 8, is characterized in that, also comprise: transferring module, for carrying out logic off-line operation to described failure memory, by Data Migration in described failure memory in other normal running memories.
10. failure memory pick-up unit according to claim 8, is characterized in that, also comprise: memory module, for generate comprise failure memory physical address failure message after, described failure message is preserved in a register.
CN201510763358.4A 2015-11-10 2015-11-10 A kind of failure memory detection method and device Active CN105204968B (en)

Priority Applications (1)

Application Number Priority Date Filing Date Title
CN201510763358.4A CN105204968B (en) 2015-11-10 2015-11-10 A kind of failure memory detection method and device

Applications Claiming Priority (1)

Application Number Priority Date Filing Date Title
CN201510763358.4A CN105204968B (en) 2015-11-10 2015-11-10 A kind of failure memory detection method and device

Publications (2)

Publication Number Publication Date
CN105204968A true CN105204968A (en) 2015-12-30
CN105204968B CN105204968B (en) 2019-05-10

Family

ID=54952662

Family Applications (1)

Application Number Title Priority Date Filing Date
CN201510763358.4A Active CN105204968B (en) 2015-11-10 2015-11-10 A kind of failure memory detection method and device

Country Status (1)

Country Link
CN (1) CN105204968B (en)

Cited By (8)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN106055438A (en) * 2016-05-27 2016-10-26 深圳市国鑫恒宇科技有限公司 Method and system for rapidly locating anomaly of memory banks on mainboard
CN106126368A (en) * 2016-08-22 2016-11-16 浪潮电子信息产业股份有限公司 A kind of method of memory failure address resolution under LINUX
CN106126364A (en) * 2016-06-28 2016-11-16 浪潮(北京)电子信息产业有限公司 A kind of fault event memory collection method based on Linux system and system
CN106201750A (en) * 2016-06-28 2016-12-07 浪潮(北京)电子信息产业有限公司 A kind of processing method and processing device based on linux EMS memory error
CN107092549A (en) * 2017-04-26 2017-08-25 郑州云海信息技术有限公司 A kind of automatic monitoring and the instrument and method for parsing memory failure
CN109408273A (en) * 2018-11-13 2019-03-01 郑州云海信息技术有限公司 A kind of failure memory of eliminating is to the method and device of systematic influence
CN115292113A (en) * 2022-09-30 2022-11-04 新华三信息技术有限公司 Method and device for fault detection of internal memory of server and electronic equipment
CN115932532A (en) * 2023-03-09 2023-04-07 长鑫存储技术有限公司 Method, apparatus, device and storage medium for testing semiconductor device

Citations (4)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN102799506A (en) * 2012-06-29 2012-11-28 浪潮电子信息产业股份有限公司 Method for positioning fault memory
CN103197999A (en) * 2013-03-22 2013-07-10 北京百度网讯科技有限公司 Method and device for automatically positioning internal memory fault
CN103198000A (en) * 2013-04-02 2013-07-10 浪潮电子信息产业股份有限公司 Method for positioning faulted memory in linux system
CN103514068A (en) * 2012-06-28 2014-01-15 北京百度网讯科技有限公司 Method for automatically locating internal storage faults

Patent Citations (4)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN103514068A (en) * 2012-06-28 2014-01-15 北京百度网讯科技有限公司 Method for automatically locating internal storage faults
CN102799506A (en) * 2012-06-29 2012-11-28 浪潮电子信息产业股份有限公司 Method for positioning fault memory
CN103197999A (en) * 2013-03-22 2013-07-10 北京百度网讯科技有限公司 Method and device for automatically positioning internal memory fault
CN103198000A (en) * 2013-04-02 2013-07-10 浪潮电子信息产业股份有限公司 Method for positioning faulted memory in linux system

Cited By (9)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN106055438A (en) * 2016-05-27 2016-10-26 深圳市国鑫恒宇科技有限公司 Method and system for rapidly locating anomaly of memory banks on mainboard
CN106055438B (en) * 2016-05-27 2019-12-03 深圳市同泰怡信息技术有限公司 The method and system of memory bar exception on a kind of quick positioning mainboard
CN106126364A (en) * 2016-06-28 2016-11-16 浪潮(北京)电子信息产业有限公司 A kind of fault event memory collection method based on Linux system and system
CN106201750A (en) * 2016-06-28 2016-12-07 浪潮(北京)电子信息产业有限公司 A kind of processing method and processing device based on linux EMS memory error
CN106126368A (en) * 2016-08-22 2016-11-16 浪潮电子信息产业股份有限公司 A kind of method of memory failure address resolution under LINUX
CN107092549A (en) * 2017-04-26 2017-08-25 郑州云海信息技术有限公司 A kind of automatic monitoring and the instrument and method for parsing memory failure
CN109408273A (en) * 2018-11-13 2019-03-01 郑州云海信息技术有限公司 A kind of failure memory of eliminating is to the method and device of systematic influence
CN115292113A (en) * 2022-09-30 2022-11-04 新华三信息技术有限公司 Method and device for fault detection of internal memory of server and electronic equipment
CN115932532A (en) * 2023-03-09 2023-04-07 长鑫存储技术有限公司 Method, apparatus, device and storage medium for testing semiconductor device

Also Published As

Publication number Publication date
CN105204968B (en) 2019-05-10

Similar Documents

Publication Publication Date Title
CN105204968A (en) Method and device for detecting fault memory
CN107872528B (en) Message pushing method and device
CN111414268B (en) Fault processing method and device and server
CN108683528B (en) Data transmission method, central server, server and data transmission system
CN102479138A (en) System and method for detecting error by utilizing image
CN110995851B (en) Message processing method, device, storage medium and equipment
CN102904685A (en) Method and device for processing hardware table entry checking error
CN106648968A (en) Data recovery method and device when ECC correction failure occurs on chip
CN104685474A (en) Notification of address range including non-correctable error
CN107423171A (en) The detection method and device of insertion slot type function expansion card based on PCIE standards
CN116049249A (en) Error information processing method, device, system, equipment and storage medium
CN110362435B (en) PCIE fault positioning method, device, equipment and medium for Purley platform server
CN111709452B (en) Method for evaluating surface defect model of wine bottle, electronic device and storage medium
CN106201753A (en) A kind of based on the processing method of PCIE mistake in linux and system
CN115883340B (en) HPLC (high Performance liquid chromatography) and HRF (high performance liquid chromatography) based dual-mode communication fault processing method and device
CN117055496A (en) Multi-station product processing method and device, electronic equipment and storage medium
CN111124818A (en) Monitoring method, device and equipment for Expander
CN116723206A (en) Vehicle fault information processing method and device, electronic equipment and storage medium
CN109710187A (en) Read command accelerated method, device, computer equipment and the storage medium of NVMe SSD main control chip
CN112134933A (en) Method and device for realizing OpenStack high-availability cache cluster and storage medium
CN106776169A (en) A kind of method and device of the PSU of testing service device
CN113900914A (en) Exception handling method and device, electronic equipment and computer storage medium
CN107562553B (en) Data center management method and equipment
CN112905602B (en) Data comparison method, computing device and computer storage medium
CN111240956A (en) Memory leakage monitoring method and device, electronic equipment and computer storage medium

Legal Events

Date Code Title Description
C06 Publication
PB01 Publication
C10 Entry into substantive examination
SE01 Entry into force of request for substantive examination
GR01 Patent grant
GR01 Patent grant