CN105204968A - Method and device for detecting fault memory - Google Patents
- Publication number
- CN105204968A (application CN201510763358.4A)
- Authority
- CN
- China
- Prior art keywords
- memory
- failure
- physical address
- slot
- internal memory
- Prior art date
- Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
- Granted
Landscapes
- Debugging And Monitoring (AREA)
- Techniques For Improving Reliability Of Storages (AREA)
Abstract
The embodiment of the invention provides a method and a device for detecting a fault memory. The method comprises the steps of monitoring the operation state of memories in real time, generating fault information including a physical address of the fault memory when detecting that the memory fails, acquiring the fault information, acquiring the physical address of the fault memory according to the fault information, acquiring all slots complying with a PCI standard, a PCI-X standard or a PCI-E standard by virtue of a system kernel, analyzing to obtain operation information of memories on the slots, acquiring the variation range of the physical addresses of all the memories on the slots according to the operation information, and positioning to obtain the fault memory according to the physical address of the fault memory and the variation range of the physical addresses of all the memories on the slots. According to the method and the device, the accuracy of a search result is guaranteed, and meanwhile, the working efficiency is relatively high; the memory failure can be timely found out and effectively processed, so that the damage caused by the memory failure to application services is reduced, and the stability and the reliability of a system are improved.
Description
Technical field
The present invention relates to the field of computer applications, and in particular to a method and device for detecting faulty memory.
Background
With the rapid development of computer technology and integrated circuit technology, computers have advanced quickly in both software and hardware. As the amount of hardware in a computer grows, so does the hardware failure rate, and memory is especially affected: applications demand ever more memory to improve performance, the number of memory modules installed in a computer rises accordingly, and the probability of a memory fault rises with it. If one module in a group of memory modules fails and a service program uses the faulty module, the service may become unstable or even suffer data corruption, causing heavy losses. At present, when memory fails, memory information is retrieved from a database manually and then analyzed to find the faulty memory. Because manual work is slow and error-prone, this approach cannot guarantee that the faulty memory is identified accurately, and it is inefficient. As a result, memory faults cannot be handled promptly and effectively, application services are harmed, and the stability and reliability of the system are seriously affected.
Summary of the invention
In view of this, embodiments of the present invention provide a method and device for detecting faulty memory, to solve the following problem in the prior art: when memory fails, memory information is retrieved from a database manually and then analyzed to find the faulty memory, so the accuracy of the result cannot be guaranteed and the process is inefficient; memory faults therefore cannot be handled promptly and effectively, application services are harmed, and the stability and reliability of the system are seriously affected.
To achieve the above object, embodiments of the present invention provide the following technical solution:
A method for detecting faulty memory, comprising:
monitoring the running state of memory in real time; when a memory fault is detected, generating fault information containing the physical address of the faulty memory; acquiring the fault information; and obtaining the physical address of the faulty memory from the fault information;
acquiring, through the system kernel, all slots that comply with the PCI, PCI-X, or PCI-E standard; parsing the operation information of all memory installed in the slots; and obtaining the physical address ranges of all memory installed in the slots;
locating the faulty memory according to the physical address of the faulty memory and the physical address ranges of all memory installed in the slots.
After the faulty memory is located, the method further comprises:
performing a logical offline operation on the faulty memory, and migrating the data in the faulty memory to other normally running memory.
When the method is used on a Linux system, the running state of memory is monitored in real time by the mcelog program of the Linux system, and when a memory fault is detected, the mcelog program generates the fault information containing the physical address of the faulty memory.
After the fault information containing the physical address of the faulty memory is generated, the method further comprises: storing the fault information in a register.
Acquiring the fault information comprises:
determining whether fault information is stored in the register;
if so, acquiring the fault information stored in the register.
After all slots that comply with the PCI, PCI-X, or PCI-E standard are acquired through the system kernel, the method further comprises:
acquiring the operation information of all the slots, and determining which of the slots are running slots currently in use;
parsing the operation information of all memory installed in the running slots, and obtaining the physical address ranges of all memory installed in the running slots.
After the faulty memory is located, the method further comprises:
raising an alarm and generating a log file, wherein the alarm is an audible alarm and/or a flashing-light alarm.
A device for detecting faulty memory, comprising: a monitoring acquisition unit, a slot acquisition unit, and a positioning unit; wherein
the monitoring acquisition unit is configured to monitor the running state of memory in real time; when a memory fault is detected, generate fault information containing the physical address of the faulty memory; acquire the fault information; and obtain the physical address of the faulty memory from the fault information;
the slot acquisition unit is configured to acquire, through the system kernel, all slots that comply with the PCI, PCI-X, or PCI-E standard; parse the operation information of all memory installed in the slots; and obtain the physical address ranges of all memory installed in the slots;
the positioning unit is configured to locate the faulty memory according to the physical address of the faulty memory and the physical address ranges of all memory installed in the slots.
The device further comprises a migration module, configured to perform a logical offline operation on the faulty memory and migrate the data in the faulty memory to other normally running memory.
The device further comprises a storage module, configured to store the fault information in a register after the fault information containing the physical address of the faulty memory is generated.
Based on the above technical solution, the method and device provided by embodiments of the present invention monitor the running state of memory in real time and, when a memory fault is detected, generate fault information containing the physical address of the faulty memory. The fault information is acquired and the physical address of the faulty memory is obtained from it. All slots that comply with the PCI, PCI-X, or PCI-E standard are then acquired through the system kernel, the operation information of all memory installed in the slots is parsed, and the physical address ranges of that memory are obtained. The faulty memory is located by comparing its physical address with those ranges. Compared with retrieving memory information from a database manually and analyzing it to find the faulty memory, this approach obtains the physical location of the faulty memory accurately, ensures a correct lookup result, and is more efficient. Memory faults can be found and handled promptly and effectively, which reduces the harm they cause to application services and improves the stability and reliability of the system.
Brief description of the drawings
To describe the technical solutions of the embodiments of the present invention or of the prior art more clearly, the drawings needed for that description are briefly introduced below. The drawings described below are merely embodiments of the present invention; a person of ordinary skill in the art can derive other drawings from them without creative effort.
Fig. 1 is a flowchart of the method for detecting faulty memory provided by an embodiment of the present invention;
Fig. 2 is another flowchart of the method for detecting faulty memory provided by an embodiment of the present invention;
Fig. 3 is a flowchart of the method for acquiring fault information in the method for detecting faulty memory provided by an embodiment of the present invention;
Fig. 4 is a flowchart of the method for obtaining the physical address ranges of memory installed in the slots in the method for detecting faulty memory provided by an embodiment of the present invention;
Fig. 5 is a system block diagram of the device for detecting faulty memory provided by an embodiment of the present invention;
Fig. 6 is another system block diagram of the device for detecting faulty memory provided by an embodiment of the present invention;
Fig. 7 is a further system block diagram of the device for detecting faulty memory provided by an embodiment of the present invention.
Detailed description
The technical solutions in the embodiments of the present invention are described clearly and completely below with reference to the accompanying drawings. The described embodiments are only some of the embodiments of the present invention, not all of them. All other embodiments obtained by a person of ordinary skill in the art based on these embodiments without creative effort fall within the scope of protection of the present invention.
Fig. 1 is a flowchart of the method for detecting faulty memory provided by an embodiment of the present invention. The method monitors the running state of memory in real time and, when a memory fault is detected, generates fault information containing the physical address of the faulty memory. The faulty memory is located from the physical address in that fault information and the physical address ranges of all memory installed in the slots. The physical location of the faulty memory is obtained accurately and efficiently, so memory faults can be found and handled promptly and effectively, the harm they cause to application services is reduced, and the stability and reliability of the system are improved. Referring to Fig. 1, the method may comprise:
Step S100: monitor the running state of memory in real time; when a memory fault is detected, generate fault information containing the physical address of the faulty memory; acquire the fault information; and obtain the physical address of the faulty memory from the fault information;
Optionally, when the method provided by the embodiment is used on a Linux system, the running state of memory may be monitored in real time by the mcelog program of the Linux system, and when a memory fault is detected, the mcelog program generates the fault information containing the physical address of the faulty memory.
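As an illustration of obtaining the physical address from the fault information, the following is a minimal sketch rather than the patent's implementation. It assumes that the fault record is plain text in which the physical address appears as a hexadecimal value after the token `ADDR`; that layout, the function name `extract_fault_address`, and the sample record are assumptions made for this example.

```python
import re
from typing import Optional

# Assumed record layout: a hexadecimal value following the token "ADDR",
# with or without a leading "0x".
ADDR_PATTERN = re.compile(r"\bADDR\s+(?:0x)?([0-9a-fA-F]+)\b")


def extract_fault_address(record: str) -> Optional[int]:
    """Return the physical address found in a fault record, or None if absent."""
    match = ADDR_PATTERN.search(record)
    return int(match.group(1), 16) if match else None


# Hypothetical fault record used only to exercise the parser.
sample_record = "CPU 0 BANK 8\nMISC 21401c0086 ADDR 36d0f5f40\n"
fault_address = extract_fault_address(sample_record)
```

Returning `None` when no address is present lets the caller treat "no fault recorded" and "fault recorded" as distinct cases before moving on to slot lookup.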
Optionally, after the fault information containing the physical address of the faulty memory is generated, the fault information is also stored in a register.
Optionally, the fault information may be acquired by determining whether fault information is stored in the register and, if so, reading the fault information stored in that register.
Step S110: acquire, through the system kernel, all slots that comply with the PCI, PCI-X, or PCI-E standard; parse the operation information of all memory installed in the slots; and obtain the physical address ranges of all memory installed in the slots;
Optionally, after all slots that comply with the PCI, PCI-X, or PCI-E standard are acquired through the system kernel, the operation information of all slots may be acquired to determine which slots are running slots currently in use, that is, slots whose installed memory is currently being used. Only the operation information of memory installed in the running slots is then parsed, and only the physical address ranges of that memory are obtained, which avoids collecting useless address range data.
Step S120: locate the faulty memory according to the physical address of the faulty memory and the physical address ranges of all memory installed in the slots.
Locating the faulty memory also identifies the slot into which it is inserted; in other words, locating the faulty memory yields its physical location.
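The lookup in step S120 can be sketched as a range check. This is a minimal sketch under stated assumptions, not the patent's implementation: each slot is assumed to map to one contiguous, inclusive physical address range, and the slot names and address values are hypothetical.

```python
from typing import Dict, Optional, Tuple

# Inclusive (start, end) physical address range per slot; values are hypothetical.
SlotRanges = Dict[str, Tuple[int, int]]


def locate_faulty_slot(fault_addr: int, slot_ranges: SlotRanges) -> Optional[str]:
    """Return the slot whose address range contains fault_addr, or None."""
    for slot, (start, end) in slot_ranges.items():
        if start <= fault_addr <= end:
            return slot
    return None


example_ranges: SlotRanges = {
    "slot0": (0x0000_0000, 0x3FFF_FFFF),
    "slot1": (0x4000_0000, 0x7FFF_FFFF),
}
faulty_slot = locate_faulty_slot(0x4000_1000, example_ranges)
```

A linear scan is enough for the handful of slots in a typical machine. If the ranges of one module were not contiguous, the mapping would need a list of ranges per slot instead of a single pair.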
Optionally, after the faulty memory is located, a logical offline operation may be performed on it and its data migrated to other normally running memory, so that it is no longer used by service programs or the operating system and the system continues to run normally.
Optionally, after the logical offline operation has been performed on the located faulty memory, the faulty memory can be replaced at any time, and the normal replacement memory can then be brought online.
Optionally, if the operation information of all slots was acquired and the running slots currently in use were determined after the slots were acquired through the system kernel, the faulty memory can be located using only the physical address of the faulty memory and the physical address ranges of the memory installed in the running slots.
Optionally, after the faulty memory is located, an alarm may be raised and a log file generated.
Optionally, the alarm raised after the faulty memory is located may be an audible alarm, a flashing-light alarm, or a combination of the two.
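The alarm and logging step can be sketched as follows. This is only an illustration: the `Alarm` flag type, the `report_fault` function, and the message format are hypothetical names chosen for the example, and driving a real buzzer or indicator light is outside its scope.

```python
import logging
from enum import Flag, auto


class Alarm(Flag):
    """Alarm kinds named in the text; they may be combined with '|'."""
    SOUND = auto()
    LIGHT = auto()


def report_fault(slot: str, fault_addr: int, alarm: Alarm,
                 logger: logging.Logger) -> str:
    """Log the located fault and return the message that was logged."""
    kinds = [kind.name.lower() for kind in (Alarm.SOUND, Alarm.LIGHT) if kind in alarm]
    message = f"faulty memory in {slot} at {fault_addr:#x}; alarm: {'+'.join(kinds)}"
    logger.error(message)
    return message


fault_logger = logging.getLogger("memory_fault_example")
logged = report_fault("slot1", 0x4000_1000, Alarm.SOUND | Alarm.LIGHT, fault_logger)
```

To produce an actual log file, a `logging.FileHandler` can be attached to the logger before `report_fault` is called.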
Based on the above technical solution, the method provided by the embodiment monitors the running state of memory in real time and, when a memory fault is detected, generates fault information containing the physical address of the faulty memory. The fault information is acquired and the physical address of the faulty memory is obtained from it. All slots that comply with the PCI, PCI-X, or PCI-E standard are then acquired through the system kernel, the operation information of all memory installed in the slots is parsed, and the physical address ranges of that memory are obtained. The faulty memory is located by comparing its physical address with those ranges. Compared with retrieving memory information from a database manually and analyzing it, this approach obtains the physical location of the faulty memory accurately, ensures a correct lookup result, and is more efficient, so memory faults can be found and handled promptly and effectively, the harm to application services is reduced, and the stability and reliability of the system are improved.
Optionally, Fig. 2 shows another flowchart of the method for detecting faulty memory provided by an embodiment of the present invention. Referring to Fig. 2, the method may comprise:
Step S200: monitor the running state of memory in real time; when a memory fault is detected, generate fault information containing the physical address of the faulty memory; acquire the fault information; and obtain the physical address of the faulty memory from the fault information;
Step S210: acquire, through the system kernel, all slots that comply with the PCI, PCI-X, or PCI-E standard; parse the operation information of all memory installed in the slots; and obtain the physical address ranges of all memory installed in the slots;
Step S220: locate the faulty memory according to the physical address of the faulty memory and the physical address ranges of all memory installed in the slots;
Step S230: perform a logical offline operation on the faulty memory, and migrate the data in the faulty memory to other normally running memory.
After the faulty memory is located, a logical offline operation may be performed on it and its data migrated to other normally running memory, so that it is no longer used by service programs or the operating system and the system continues to run normally.
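Step S230 can be modeled with a small simulation. This is a sketch under stated assumptions, not how an operating system actually offlines memory: the `MemoryModule` class, its page-count capacity, and the `offline_and_migrate` function are hypothetical, and memory contents are represented as a plain list of byte strings.

```python
from dataclasses import dataclass, field
from typing import List


@dataclass
class MemoryModule:
    slot: str
    capacity: int                          # number of pages the module can hold
    pages: List[bytes] = field(default_factory=list)
    online: bool = True


def offline_and_migrate(faulty: MemoryModule, others: List[MemoryModule]) -> None:
    """Take the faulty module offline, then move its pages to healthy online modules."""
    faulty.online = False                  # no new allocations land on the faulty module
    for target in others:
        if not target.online:
            continue
        while faulty.pages and len(target.pages) < target.capacity:
            target.pages.append(faulty.pages.pop())
    if faulty.pages:
        raise RuntimeError("not enough free capacity to migrate all data")


bad = MemoryModule("slot0", capacity=4, pages=[b"a", b"b"])
good = MemoryModule("slot1", capacity=4, pages=[b"c"])
offline_and_migrate(bad, [good])
```

Marking the module offline before moving data mirrors the order in the text: the faulty memory first stops being available for use, and only then is its content relocated.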
Optionally, after the logical offline operation has been performed on the located faulty memory, the faulty memory can be replaced at any time, and the normal replacement memory can then be brought online.
Optionally, Fig. 3 shows a flowchart of the method for acquiring fault information in the method for detecting faulty memory provided by an embodiment of the present invention. Referring to Fig. 3, the method for acquiring fault information may comprise:
Step S300: store the fault information in a register;
Step S310: determine whether fault information is stored in the register;
If the fault information is stored in a register after it is generated, whether a memory fault has occurred in the system can be determined by checking whether the register holds fault information.
Step S320: if so, acquire the fault information stored in the register.
If the register holds fault information, a memory fault has occurred in the system; the stored fault information is acquired, the physical address of the faulty memory is obtained from it, and the subsequent operations continue. Conversely, if the register holds no fault information, no memory fault has occurred, and monitoring of the memory running state continues.
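Steps S300 to S320 can be sketched with a software stand-in for the register. The `FaultRegister` class and `check_once` function are hypothetical, and clearing the register after it is read is an assumption of this sketch that the text does not state.

```python
from typing import Optional


class FaultRegister:
    """Software stand-in for the register that holds fault information."""

    def __init__(self) -> None:
        self._fault_addr: Optional[int] = None

    def store(self, fault_addr: int) -> None:      # step S300
        self._fault_addr = fault_addr

    def has_fault(self) -> bool:                   # step S310
        return self._fault_addr is not None

    def take(self) -> Optional[int]:               # step S320
        # Assumption: reading clears the register so a fault is handled once.
        fault_addr, self._fault_addr = self._fault_addr, None
        return fault_addr


def check_once(register: FaultRegister) -> Optional[int]:
    """One monitoring pass: return the fault address if present, else None."""
    if register.has_fault():
        return register.take()
    return None                                    # no fault: keep monitoring


register = FaultRegister()
first_pass = check_once(register)      # nothing stored yet
register.store(0x4000_1000)
second_pass = check_once(register)     # fault information is picked up
```

A monitoring loop would call `check_once` repeatedly and hand any returned address to the slot lookup step.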
Optionally, Fig. 4 shows a flowchart of the method for obtaining the physical address ranges of memory installed in the slots in the method for detecting faulty memory provided by an embodiment of the present invention. Referring to Fig. 4, the method may comprise:
Step S400: acquire, through the system kernel, all slots that comply with the PCI, PCI-X, or PCI-E standard;
The PCI (Peripheral Component Interconnect) standard is a standard for defining a local bus introduced by Intel in 1991; slots using the PCI standard transfer data at a width of 32 bits.
PCI-X is an updated version of the PCI bus; slots using the PCI-X standard transfer data at a width of 64 bits.
The PCI-E (PCI Express) standard is the third-generation I/O bus standard promoted by Intel. Slots using the PCI-E standard differ according to bus width.
Step S410: acquire the operation information of all the slots, and determine which of the slots are running slots currently in use;
Not every slot has memory inserted, and not every inserted memory module is in use at all times. Therefore, after all slots are acquired, their operation information is acquired and used to determine the running slots currently in use; the memory installed in those running slots is the memory that was running when the memory fault occurred.
Step S420: parse the operation information of all memory installed in the running slots, and obtain the physical address ranges of all memory installed in the running slots.
A memory fault can be detected only while the faulty memory is running, so the faulty memory must be among the running memory. It is therefore sufficient to parse the operation information of memory installed in the running slots and obtain the physical address ranges of that memory, which avoids collecting useless address range data. Finally, the faulty memory is located according to its physical address and the physical address ranges of the memory installed in the running slots.
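The filtering in steps S410 and S420 can be sketched as follows. The `SlotInfo` record and its fields are hypothetical stand-ins for the operation information that the text says is obtained through the system kernel, and the sample values are invented for the example.

```python
from dataclasses import dataclass
from typing import Dict, List, Tuple


@dataclass
class SlotInfo:
    name: str
    populated: bool     # a memory module is inserted in the slot
    in_use: bool        # the inserted module is currently being used
    start: int          # first physical address served by the module
    end: int            # last physical address served by the module (inclusive)


def running_slot_ranges(slots: List[SlotInfo]) -> Dict[str, Tuple[int, int]]:
    """Keep only running slots and return their physical address ranges."""
    return {
        slot.name: (slot.start, slot.end)
        for slot in slots
        if slot.populated and slot.in_use
    }


all_slots = [
    SlotInfo("slot0", True, True, 0x0000_0000, 0x3FFF_FFFF),
    SlotInfo("slot1", True, False, 0x4000_0000, 0x7FFF_FFFF),   # inserted but idle
    SlotInfo("slot2", False, False, 0, 0),                      # empty slot
]
active_ranges = running_slot_ranges(all_slots)
```

Only `slot0` survives the filter here, so the later address comparison has a single candidate range to check instead of three.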
The method for detecting faulty memory provided by the embodiment monitors the running state of memory in real time and, when a memory fault is detected, generates fault information containing the physical address of the faulty memory. The faulty memory is located from the physical address in that fault information and the physical address ranges of all memory installed in the slots. This ensures a correct lookup result and is efficient, so memory faults can be found and handled promptly and effectively, the harm to application services is reduced, and the stability and reliability of the system are improved.
The device for detecting faulty memory provided by an embodiment of the present invention is introduced below. The device described below and the method described above may be read with reference to each other.
Fig. 5 is a system block diagram of the device for detecting faulty memory provided by an embodiment of the present invention. Referring to Fig. 5, the device may comprise: a monitoring acquisition unit 100, a slot acquisition unit 200, and a positioning unit 300; wherein
the monitoring acquisition unit 100 is configured to monitor the running state of memory in real time; when a memory fault is detected, generate fault information containing the physical address of the faulty memory; acquire the fault information; and obtain the physical address of the faulty memory from the fault information;
the slot acquisition unit 200 is configured to acquire, through the system kernel, all slots that comply with the PCI, PCI-X, or PCI-E standard; parse the operation information of all memory installed in the slots; and obtain the physical address ranges of all memory installed in the slots;
the positioning unit 300 is configured to locate the faulty memory according to the physical address of the faulty memory and the physical address ranges of all memory installed in the slots.
Optionally, Fig. 6 shows another system block diagram of the device for detecting faulty memory provided by an embodiment of the present invention. Referring to Fig. 6, the device may further comprise: a migration module 400.
The migration module 400 is configured to perform a logical offline operation on the faulty memory and migrate the data in the faulty memory to other normally running memory.
Optionally, Fig. 7 shows a further system block diagram of the device for detecting faulty memory provided by an embodiment of the present invention. Referring to Fig. 7, the device may further comprise: a storage module 500.
The storage module 500 is configured to store the fault information in a register after the fault information containing the physical address of the faulty memory is generated.
The device for detecting faulty memory provided by the embodiment monitors the running state of memory in real time and, when a memory fault is detected, generates fault information containing the physical address of the faulty memory. The faulty memory is located from the physical address in that fault information and the physical address ranges of all memory installed in the slots. This ensures a correct lookup result and is efficient, so memory faults can be found and handled promptly and effectively, the harm to application services is reduced, and the stability and reliability of the system are improved.
In this specification, the embodiments are described progressively: each embodiment focuses on its differences from the others, and identical or similar parts may be cross-referenced. Because the device disclosed in the embodiments corresponds to the method disclosed in the embodiments, its description is brief; see the method description for the relevant details.
Those skilled in the art will further appreciate that the units and algorithm steps of the examples described in connection with the embodiments disclosed herein can be implemented in electronic hardware, computer software, or a combination of the two. To illustrate the interchangeability of hardware and software clearly, the composition and steps of each example have been described above in general terms of their function. Whether these functions are performed in hardware or software depends on the particular application and the design constraints of the technical solution. Skilled persons may implement the described functions in different ways for each particular application, but such implementations should not be considered beyond the scope of the present invention.
The above description of the disclosed embodiments enables those skilled in the art to implement or use the present invention. Various modifications to these embodiments will be apparent to those skilled in the art, and the general principles defined herein may be implemented in other embodiments without departing from the spirit or scope of the present invention. The present invention is therefore not limited to the embodiments shown herein, but is to be accorded the widest scope consistent with the principles and novel features disclosed herein.
Claims (10)
1. a failure memory detection method, is characterized in that, comprising:
Real-Time Monitoring internal memory running status, when detecting that internal memory breaks down, generating the failure message comprising failure memory physical address, obtaining described failure message, obtain the physical address of failure memory according to described failure message;
Obtain all slots deferring to PCI standard, PCI-X or PCI-E standard by system kernel, resolve and obtain all operation informations being placed in internal memory on described slot, obtain all physical address variation ranges being placed in internal memory on described slot;
According to physical address and all physical address variation ranges being placed in internal memory on described slot of described failure memory, location obtains failure memory.
2. failure memory detection method according to claim 1, is characterized in that, described location also comprises after obtaining failure memory:
Logic off-line operation is carried out to described failure memory, by Data Migration in described failure memory in other normal running memories.
3. failure memory detection method according to claim 1, it is characterized in that, when described failure memory detection method is used for linux system, by mcelog program Real-Time Monitoring internal memory running status in described linux system, when detecting that internal memory breaks down, comprised the failure message of failure memory physical address by described mcelog Program Generating.
4. failure memory detection method according to claim 1, is characterized in that, described generation comprises after comprising the failure message of failure memory physical address: preserved in a register by described failure message.
5. failure memory detection method according to claim 4, is characterized in that, the described failure message of described acquisition comprises:
Judge to store failure message in described register;
If store, be then stored in the failure message in described register described in obtaining.
6. failure memory detection method according to claim 1, is characterized in that, described obtained the slot of all PCI of deferring to standards, PCI-X or PCI-E standard by system kernel after also comprise:
Obtain the operation information of all described slots, determine the current operation slot used in all described slots;
Resolve the operation information obtaining and running internal memory on slot described in all being placed in, obtain the physical address variation range running internal memory on slot described in all being placed in.
7. The faulty memory detection method according to claim 1, characterized in that, after locating the faulty memory, the method further comprises:
Raising an alarm and generating a log file, wherein the alarm is an audible alarm and/or a flashing-light alarm.
8. A faulty memory detection device, characterized by comprising: a monitoring acquisition unit, a slot acquisition unit and a positioning unit; wherein,
The monitoring acquisition unit is configured to monitor the memory running state in real time, generate fault information containing the physical address of the faulty memory when a memory fault is detected, obtain the fault information, and obtain the physical address of the faulty memory from the fault information;
The slot acquisition unit is configured to obtain, through the system kernel, all slots conforming to the PCI, PCI-X or PCI-E standard, parse the operation information of all memory installed in the slots, and obtain the physical address ranges of all memory installed in the slots;
The positioning unit is configured to locate the faulty memory according to the physical address of the faulty memory and the physical address ranges of all memory installed in the slots.
9. The faulty memory detection device according to claim 8, characterized by further comprising: a migration module, configured to perform a logical offline operation on the faulty memory and migrate the data in the faulty memory to other normally operating memory.
10. The faulty memory detection device according to claim 8, characterized by further comprising: a storage module, configured to save the fault information in a register after the fault information containing the physical address of the faulty memory is generated.
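The locating step recited in claims 1 and 8 can be illustrated with a minimal sketch. It assumes that the fault information is a text record containing the token `ADDR` followed by a hexadecimal physical address, and that each slot's memory covers one contiguous, inclusive address range. The names `SlotRange`, `parse_fault_address` and `locate_faulty_slot`, the slot labels, and the sample addresses are all hypothetical and are not taken from the patent.

```python
from dataclasses import dataclass
from typing import Iterable, Optional


@dataclass(frozen=True)
class SlotRange:
    """Physical address range of the memory installed in one slot (hypothetical layout)."""
    slot: str
    start: int  # first physical address, inclusive
    end: int    # last physical address, inclusive


def parse_fault_address(fault_info: str) -> Optional[int]:
    """Return the address following an 'ADDR' token, or None if absent (format is assumed)."""
    tokens = fault_info.split()
    for i, tok in enumerate(tokens[:-1]):
        if tok == "ADDR":
            return int(tokens[i + 1], 16)
    return None


def locate_faulty_slot(fault_addr: int, ranges: Iterable[SlotRange]) -> Optional[str]:
    """Return the slot whose address range contains fault_addr, or None if no range matches."""
    for r in ranges:
        if r.start <= fault_addr <= r.end:
            return r.slot
    return None


# Illustrative data only: two slots with 1 GiB of memory each.
ranges = [
    SlotRange("DIMM_A1", 0x0000_0000, 0x3FFF_FFFF),
    SlotRange("DIMM_B1", 0x4000_0000, 0x7FFF_FFFF),
]
addr = parse_fault_address("CPU 0 BANK 8 ADDR 5a3c1000")
if addr is not None:
    print(locate_faulty_slot(addr, ranges))
```

A linear scan is used here because a machine has only a handful of slots; with interleaved memory a single slot may not map to one contiguous range, in which case this simple containment test would not be sufficient.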
Priority Applications (1)
Application Number | Priority Date | Filing Date | Title |
---|---|---|---|
CN201510763358.4A CN105204968B (en) | 2015-11-10 | 2015-11-10 | A kind of failure memory detection method and device |
Applications Claiming Priority (1)
Application Number | Priority Date | Filing Date | Title |
---|---|---|---|
CN201510763358.4A CN105204968B (en) | 2015-11-10 | 2015-11-10 | A kind of failure memory detection method and device |
Publications (2)
Publication Number | Publication Date |
---|---|
CN105204968A (en) | 2015-12-30 |
CN105204968B (en) | 2019-05-10 |
Family
ID=54952662
Family Applications (1)
Application Number | Priority Date | Filing Date | Title |
---|---|---|---|
CN201510763358.4A Active CN105204968B (en) | 2015-11-10 | 2015-11-10 | A kind of failure memory detection method and device |
Country Status (1)
Country | Link |
---|---|
CN (1) | CN105204968B (en) |
Patent Citations (4)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
CN103514068A (en) * | 2012-06-28 | 2014-01-15 | 北京百度网讯科技有限公司 | Method for automatically locating internal storage faults |
CN102799506A (en) * | 2012-06-29 | 2012-11-28 | 浪潮电子信息产业股份有限公司 | Method for positioning fault memory |
CN103197999A (en) * | 2013-03-22 | 2013-07-10 | 北京百度网讯科技有限公司 | Method and device for automatically positioning internal memory fault |
CN103198000A (en) * | 2013-04-02 | 2013-07-10 | 浪潮电子信息产业股份有限公司 | Method for positioning faulted memory in linux system |
Cited By (9)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
CN106055438A (en) * | 2016-05-27 | 2016-10-26 | 深圳市国鑫恒宇科技有限公司 | Method and system for rapidly locating anomaly of memory banks on mainboard |
CN106055438B (en) * | 2016-05-27 | 2019-12-03 | 深圳市同泰怡信息技术有限公司 | The method and system of memory bar exception on a kind of quick positioning mainboard |
CN106126364A (en) * | 2016-06-28 | 2016-11-16 | 浪潮(北京)电子信息产业有限公司 | A kind of fault event memory collection method based on Linux system and system |
CN106201750A (en) * | 2016-06-28 | 2016-12-07 | 浪潮(北京)电子信息产业有限公司 | A kind of processing method and processing device based on linux EMS memory error |
CN106126368A (en) * | 2016-08-22 | 2016-11-16 | 浪潮电子信息产业股份有限公司 | A kind of method of memory failure address resolution under LINUX |
CN107092549A (en) * | 2017-04-26 | 2017-08-25 | 郑州云海信息技术有限公司 | A kind of automatic monitoring and the instrument and method for parsing memory failure |
CN109408273A (en) * | 2018-11-13 | 2019-03-01 | 郑州云海信息技术有限公司 | A kind of failure memory of eliminating is to the method and device of systematic influence |
CN115292113A (en) * | 2022-09-30 | 2022-11-04 | 新华三信息技术有限公司 | Method and device for fault detection of internal memory of server and electronic equipment |
CN115932532A (en) * | 2023-03-09 | 2023-04-07 | 长鑫存储技术有限公司 | Method, apparatus, device and storage medium for testing semiconductor device |
Also Published As
Publication number | Publication date |
---|---|
CN105204968B (en) | 2019-05-10 |
Similar Documents
Publication | Title | Publication Date |
---|---|---|
CN105204968A (en) | Method and device for detecting fault memory | |
CN107872528B (en) | Message pushing method and device | |
CN111414268B (en) | Fault processing method and device and server | |
CN108683528B (en) | Data transmission method, central server, server and data transmission system | |
CN102479138A (en) | System and method for detecting error by utilizing image | |
CN110995851B (en) | Message processing method, device, storage medium and equipment | |
CN102904685A (en) | Method and device for processing hardware table entry checking error | |
CN106648968A (en) | Data recovery method and device when ECC correction failure occurs on chip | |
CN104685474A (en) | Notification of address range including non-correctable error | |
CN107423171A (en) | The detection method and device of insertion slot type function expansion card based on PCIE standards | |
CN116049249A (en) | Error information processing method, device, system, equipment and storage medium | |
CN110362435B (en) | PCIE fault positioning method, device, equipment and medium for Purley platform server | |
CN111709452B (en) | Method for evaluating surface defect model of wine bottle, electronic device and storage medium | |
CN106201753A (en) | A kind of based on the processing method of PCIE mistake in linux and system | |
CN115883340B (en) | HPLC and HRF based dual-mode communication fault processing method and device | |
CN117055496A (en) | Multi-station product processing method and device, electronic equipment and storage medium | |
CN111124818A (en) | Monitoring method, device and equipment for Expander | |
CN116723206A (en) | Vehicle fault information processing method and device, electronic equipment and storage medium | |
CN109710187A (en) | Read command accelerated method, device, computer equipment and the storage medium of NVMe SSD main control chip | |
CN112134933A (en) | Method and device for realizing OpenStack high-availability cache cluster and storage medium | |
CN106776169A (en) | A kind of method and device of the PSU of testing service device | |
CN113900914A (en) | Exception handling method and device, electronic equipment and computer storage medium | |
CN107562553B (en) | Data center management method and equipment | |
CN112905602B (en) | Data comparison method, computing device and computer storage medium | |
CN111240956A (en) | Memory leakage monitoring method and device, electronic equipment and computer storage medium |
Legal Events
Code | Title | Description |
---|---|---|
C06 | Publication | |
PB01 | Publication | |
C10 | Entry into substantive examination | |
SE01 | Entry into force of request for substantive examination | |
GR01 | Patent grant | |